B CSV files - StatisticalReinforcementLearningLab/HeartstepsV1Code GitHub Wiki
Here we list all of the CSV files for data from the intake/exit interviews, HeartSteps, Jawbone and Google Fit. The files are loaded into data frames, which are saved in the R workspace csv.RData. These data require considerable processing before it can be reasonably analyzed, so only the analysis data frames should be used. Tables that played a role in constructing the analysis data frames are documented further on individual wiki pages, linked from the table name.
The descriptions also give limitations or complications of the data available from each file for analysis. Underlying reasons for these issues are documented on the data issues wiki page. We emphasize "available" because essentially every app-based source is subject to missingness; this topic is covered in the data issues page and the data summary.
Interviews
These CSV files are primarily exported manually from Excel spreadsheets, which are maintained by the study coordinators.
Table | Data frame | Description |
---|---|---|
HeartSteps Participant Directory | participants | User names, identifiers, intake and exit dates, and notes on study participation (e.g. study dropout, international travel, technical issues). |
Survey_Intake | intake | User answers to the intake interview questions. |
Survey_Exit | exit | User answers to the exit interview questions; these re-administer the activity choice index, self-efficacy for physical activity and the IPAQ. See Survey_Intake for the variable format. |
TestX_AppUsageData | tracker | The exit interview also involved downloading data from an app usage tracker, which is installed on the phone at intake. This file records the time and duration of use for any application installed on the user's phone. |
HeartSteps
These CSV files generally contain data captured by the HeartSteps app. Files are exported using Google App Engine tools, following the steps documented on the exporting data wiki page.
Table | Data frame | Description |
---|---|---|
EMA_Completed | complete | EMA completion status and time, but does not always reflect the data in EMA_Response; EMA_Completed thus is used only for time zone information. |
EMA_Context_Engaged | engage | Context in which the user engaged with the EMA. What constitutes an engagement is not clear, but generally corresponds to interactions with the EMA questions after the initial interaction via the EMA notification. |
EMA_Context_Notified | notify | Context at the time of EMA notification and administered planning/EMA questions. The latter are recorded under a race condition; we instead infer from EMA_Response, Structured_Planning_Response and Unstructured_Planning_Response. |
EMA_Response | ema | Answers to each EMA question. Time stamps for each answer have no time zone. |
Heartsteps_Usage_History | usage | Interaction with the 4 main pages of the HeartSteps app interface. |
Momentary_Decision | decision | Context and decision result for the decision on whether or not an activity suggestion is provided. |
Motivational_Message | Motivational messages the user entered into the home page of the HeartSteps app. | |
Response | response | Context and interaction result (thumbs up/down, snooze, timed out) of the activity suggestion notification. |
Reviewed_Heartsteps_Messages | messages | Unique suggestion messages with all relevant tags. Used for post-hoc tagging, since the messages and the tags relevant to the context were not captured by HeartSteps. |
Snooze_FromInApp | snooze | Times at which the user enabled snooze from the HeartSteps app (as opposed to interaction with an activity suggestion). |
Structured_Planning_Response | plan | Structured planning options selected by the user. |
Unstructured_Planning_Response | plan | Text the user entered when prompted to complete unstructured planning. |
User_Addresses | address | Addresses for home and work, provided by the user through the HeartSteps app settings. Time stamps have no time zone. |
User_Calendars | calendar | Google calendar accounts, provided by the user through the HeartSteps app settings. Data for the calendar accounts were not retrieved due to privacy issues. |
User_Data | User name, identifier and time of HeartSteps account creation. | |
User_Decision_Times | timeslot | Time slots for suggestion and EMA notifications chosen by the user, apart from some constraints imposed by the app settings menu. |
Weather_History | weather | Morning temperature, high temperature and total preciptation for each city represented in the users' GPS coordinates encounted in each day. Time stamps do not have time zone. |
Physical activity tracking
These CSV files are exported from a web interface, following the steps documented on the exporting data wiki page.
Table | Data frame | Description |
---|---|---|
google_fit_data | googlefit | Cumulative step counts per bout of physical activity. What defines a "bout" is not apparent from the Google Fit API. |
jawbone_step_count_data | jawbone | Cumulative step counts per minute in each bout of physical activity. Here a bout is defined differently from Google Fit, but the exact definition is similarly unclear from the Jawbone API. |