Data Pipeline - psrc/shiny-fixie GitHub Wiki
- make sure
trip_id
stays the same throughout data pipeline
Data Delivery and Prep
-
materials:
- raw data tables
- data cleaning codebook
-
data notes:
- NULL codes: unify all NULL codes to -995 (not 995)
Rulesy
Step 1:Rulesy cleans data programmatically, performs trip linking and preps tables for Shiny-Fixie
Rulesy Processes
- Prep trip table
- Procedure to recalculate derived fields (This is used both in Rulesy and Fixie.)
- Data Corrections
- Trip Linking
- Mode number standardization, including access and egress characterization
- Harmonize trips where possible
- Revise travel times for excessive speed trips
- Flag inconsistencies for further scrutiny
Prep Tables for Shiny-Fixie
- save tables as temporal tables right after the second task in Rulesy "2. Procedure to recalculate derived fields"
- double check if values in all variables match with codebook (example: -995 in hhmember variables)
Shiny-Fixie App
Step 2:The Shiny-Fixie app collects data edits and executes update query/ stored procedures to update tables in database
Shiny-Fixie Main features
- Edit trip
- Add trip (create blank trip, add reverse trip, add return home trip)
- Dismiss flag
- Delete trip
- Trip linking
- Trip unlinking
Step 3: Post Fixie data finalization
- recalculate all derived fields: make sure we updated every variable in the final dataset (e.g., number of trips per day/person/household, number of complete days)
- weighting