Mouse movements - norahollenstein/cognitiveNLP-dataCollection GitHub Wiki

Mouse movement datasets for NLP

Mouse or cursor movements, including clicks and scrolling logs, are behavioral biometrics and have been extensively used in psycholinguistic and reading research to gain insights into cognitive processing. Cursor dynamics represent the user's reading patterns and correlate with eye movements.

This collection contains datasets in the following languages:

English

Scrolling Interactions to Predict Readability

Stimulus: advanced and elementary texts from the OneStopEnglish corpus
Subjects: 598 participants (native speakers and English L2 speakers)
Data: https://github.com/siangooding/readability_scroll
Reference: Gooding et al. 2021

VQA-HAT (Human Attention on Visual Question Answering)

Stimulus: Cursor movements for 58475 train and 1374 val question-image pairs from the VQA dataset
Subjects: 20000 Human Intelligence Tasks (HITs) on AMT from 800 unique workers
Data: https://computing.ece.vt.edu/~abhshkdz/vqa-hat/
Reference: Das et al. (2016)

German

Multimodal Duolingo Bio-Signal Dataset

Stimulus: German language lessons using the web-based Duolingo
Subjects: 22 participants (either native English speakers or fluent in English)
Data: https://figshare.com/s/688e387fbfdc000f4e90
Reference: Notaro et al. (2018)

This dataset also contains eye-tracking and EEG recordings.