Information & Resources - Starryz/fivethirtyeight GitHub Wiki
1. naming
- no more 20 char long
- lower case char, replace space w/ underscores (snake_case instead of camelCase)
- variable name: underscore instead of spaces
2. observational units
- observations in the left-hand columns
3. dates
- if only
year β numerical
- if
year and month β convert to Date (year-month-01) β associate all observations from the same month to have a day of 01
- if
year, month, day β convert to Date (year-month-day)
4. other class variables
- ordinal β
ordered
- categorical w/ a fixed and known set of levels β
factor
- categorical w/ unknown/very large number of possible levels β
characters
- βyes/noβ character, binary variables β TRUE/FALSE
logical
5. tidy format
- save all data frames in tidy format
- each variable forms a column
- each observation forms a row
- each type of observational unit forms a table
- if converting alerts the dataset too much, make the code to convert easily accessible (i.e.
Examples in the documentation)