Mouse movements - theDebbister/cognitiveNLP-dataCollection GitHub Wiki
Mouse movement datasets for NLP
Mouse or cursor movements, including clicks and scrolling logs, are behavioral biometrics and have been extensively used in psycholinguistic and reading research to gain insights into cognitive processing. Cursor dynamics represent the user's reading patterns and correlate with eye movements.
This collection contains datasets in the following languages:
English
Scrolling Interactions to Predict Readability
Stimulus: advanced and elementary texts from the OneStopEnglish corpus
Subjects: 598 participants (native speakers and English L2 speakers)
Data: https://github.com/siangooding/readability_scroll
Reference: Gooding et al. 2021
VQA-HAT (Human Attention on Visual Question Answering)
Stimulus: Cursor movements for 58475 train and 1374 val question-image pairs from the VQA dataset
Subjects: 20000 Human Intelligence Tasks (HITs) on AMT from 800 unique workers
Data: https://computing.ece.vt.edu/~abhshkdz/vqa-hat/
Reference: Das et al. (2016)
German
Multimodal Duolingo Bio-Signal Dataset
Stimulus: German language lessons using the web-based Duolingo
Subjects: 22 participants (either native English speakers or fluent in English)
Data: https://figshare.com/s/688e387fbfdc000f4e90
Reference: Notaro et al. (2018)
This dataset also contains eye-tracking and EEG recordings.