The Timing Mega-study: Comparing a Range of Experiment Generators, Both Lab-based and Online

Overview

Journal PeerJ

Specialties Biology
Environmental Health
General Medicine

Date 2020 Oct 2

PMID 33005482

Citations 169

Authors

David Bridges

Alain Pitiot

Michael R MacAskill

Jonathan W Peirce

Affiliations

Soon will be listed here.

Abstract

Many researchers in the behavioral sciences depend on research software that presents stimuli, and records response times, with sub-millisecond precision. There are a large number of software packages with which to conduct these behavioral experiments and measure response times and performance of participants. Very little information is available, however, on what timing performance they achieve in practice. Here we report a wide-ranging study looking at the precision and accuracy of visual and auditory stimulus timing and response times, measured with a Black Box Toolkit. We compared a range of popular packages: PsychoPy, E-Prime®, NBS Presentation®, Psychophysics Toolbox, OpenSesame, Expyriment, Gorilla, jsPsych, Lab.js and Testable. Where possible, the packages were tested on Windows, macOS, and Ubuntu, and in a range of browsers for the online studies, to try to identify common patterns in performance. Among the , Psychtoolbox, PsychoPy, Presentation and E-Prime provided the best timing, all with mean precision under 1 millisecond across the visual, audio and response measures. OpenSesame had slightly less precision across the board, but most notably in audio stimuli and Expyriment had rather poor precision. Across , the pattern was that precision was generally very slightly better under Ubuntu than Windows, and that macOS was the worst, at least for visual stimuli, for all packages. did not deliver the same level of precision as lab-based systems, with slightly more variability in all measurements. That said, PsychoPy and Gorilla, broadly the best performers, were achieving very close to millisecond precision on several browser/operating system combinations. For response times (measured using a high-performance button box), most of the packages achieved precision at least under 10 ms in all browsers, with PsychoPy achieving a precision under 3.5 ms in all. There was considerable variability between OS/browser combinations, especially in audio-visual synchrony which is the least precise aspect of the browser-based experiments. Nonetheless, the data indicate that online methods can be suitable for a wide range of studies, with due thought about the sources of variability that result. The results, from over 110,000 trials, highlight the wide range of timing qualities that can occur even in these dedicated software packages for the task. We stress the importance of scientists making their own timing validation measurements for their own stimuli and computer configuration.

Citing Articles

Looking for cues over time: A study on self-initiated monitoring in event-based and time-based prospective memory.

Laera G, Del Missier F, Laloli S, Zuber S, Kliegel M, Hering A Mem Cognit. 2025; .

PMID: 40080255 DOI: 10.3758/s13421-025-01700-5.

Faster implicit motor sequence learning of new sequences compatible in terms of movement transitions.

Dyck S, Klaes C NPJ Sci Learn. 2025; 10(1):3.

PMID: 39820476 PMC: 11739496. DOI: 10.1038/s41539-025-00296-4.

The cost of perspective switching: Constraints on simultaneous activation.

Segal D Psychon Bull Rev. 2025; .

PMID: 39806243 DOI: 10.3758/s13423-024-02633-x.

The Role of Morphological Information in Processing Pseudo-words in Italian L2 Learners: It's a Matter of Experience.

Amenta S, Foppolo F, Badan L J Cogn. 2025; 8(1):14.

PMID: 39803176 PMC: 11720858. DOI: 10.5334/joc.420.

Do the effects of sleep problems on cognitive function differ according to age in daytime workers?.

Asaoka S, Nishimura R, Nozoe K, Yamamoto R Sleep Biol Rhythms. 2025; 23(1):13-20.

PMID: 39801932 PMC: 11717744. DOI: 10.1007/s41105-024-00546-9.

References

Peirce J, Gray J, Simpson S, Macaskill M, Hochenberger R, Sogo H . PsychoPy2: Experiments in behavior made easy. Behav Res Methods. 2019; 51(1):195-203. PMC: 6420413. DOI: 10.3758/s13428-018-01193-y. View

Pronk T, Wiers R, Molenkamp B, Murre J . Mental chronometry in the pocket? Timing accuracy of web applications on touchscreen and keyboard devices. Behav Res Methods. 2019; 52(3):1371-1382. PMC: 7280355. DOI: 10.3758/s13428-019-01321-2. View

Forster K, Forster J . DMDX: a windows display program with millisecond accuracy. Behav Res Methods Instrum Comput. 2003; 35(1):116-24. DOI: 10.3758/bf03195503. View

Plant R, Hammond N, Turner G . Self-validating presentation and response timing in cognitive paradigms: how and why?. Behav Res Methods Instrum Comput. 2004; 36(2):291-303. DOI: 10.3758/bf03195575. View

Garaizar P, Vadillo M, Lopez-de-Ipina D . Presentation accuracy of the web revisited: animation methods in the HTML5 era. PLoS One. 2014; 9(10):e109812. PMC: 4193832. DOI: 10.1371/journal.pone.0109812. View

Henninger F, Shevchenko Y, Mertens U, Kieslich P, Hilbig B . lab.js: A free, open, online study builder. Behav Res Methods. 2021; 54(2):556-573. PMC: 9046347. DOI: 10.3758/s13428-019-01283-5. View

de Leeuw J . jsPsych: a JavaScript library for creating behavioral experiments in a Web browser. Behav Res Methods. 2014; 47(1):1-12. DOI: 10.3758/s13428-014-0458-y. View

de Leeuw J, Motz B . Psychophysics in a Web browser? Comparing response times collected with JavaScript and Psychophysics Toolbox in a visual search task. Behav Res Methods. 2015; 48(1):1-12. DOI: 10.3758/s13428-015-0567-2. View

Miller R, Schmidt K, Kirschbaum C, Enge S . Comparability, stability, and reliability of internet-based mental chronometry in domestic and laboratory settings. Behav Res Methods. 2018; 50(4):1345-1358. DOI: 10.3758/s13428-018-1036-5. View

10.

Garaizar P, Vadillo M . Accuracy and precision of visual stimulus timing in PsychoPy: no timing errors in standard usage. PLoS One. 2014; 9(11):e112033. PMC: 4218832. DOI: 10.1371/journal.pone.0112033. View

11.

Plant R, Turner G . Millisecond precision psychological research in a world of commodity computers: new hardware, new problems?. Behav Res Methods. 2009; 41(3):598-614. DOI: 10.3758/BRM.41.3.598. View

12.

Reimers S, Stewart N . Presentation and response timing accuracy in Adobe Flash and HTML5/JavaScript Web experiments. Behav Res Methods. 2014; 47(2):309-27. PMC: 4427652. DOI: 10.3758/s13428-014-0471-1. View

13.

Garaizar P, Vadillo M, Lopez-de-Ipina D, Matute H . Measuring software timing errors in the presentation of visual stimuli in cognitive neuroscience experiments. PLoS One. 2014; 9(1):e85108. PMC: 3883681. DOI: 10.1371/journal.pone.0085108. View

14.

Plant R . A reminder on millisecond timing accuracy and potential replication failure in computer-based psychology experiments: An open letter. Behav Res Methods. 2015; 48(1):408-11. DOI: 10.3758/s13428-015-0577-0. View

15.

Neath I, Earle A, Hallett D, Surprenant A . Response time accuracy in Apple Macintosh computers. Behav Res Methods. 2011; 43(2):353-62. DOI: 10.3758/s13428-011-0069-9. View

16.

Anwyl-Irvine A, Massonnie J, Flitton A, Kirkham N, Evershed J . Gorilla in our midst: An online behavioral experiment builder. Behav Res Methods. 2019; 52(1):388-407. PMC: 7005094. DOI: 10.3758/s13428-019-01237-x. View

17.

Schubert T, Murteira C, Collins E, Lopes D . ScriptingRT: A Software Library for Collecting Response Latencies in Online Studies of Cognition. PLoS One. 2013; 8(6):e67769. PMC: 3689727. DOI: 10.1371/journal.pone.0067769. View

18.

Reimers S, Stewart N . Adobe Flash as a medium for online experimentation: a test of reaction time measurement capabilities. Behav Res Methods. 2007; 39(3):365-70. DOI: 10.3758/bf03193004. View

19.

Plant R, Quinlan P . Could millisecond timing errors in commonly used equipment be a cause of replication failure in some neuroscience studies?. Cogn Affect Behav Neurosci. 2013; 13(3):598-614. DOI: 10.3758/s13415-013-0166-6. View