Synthia File Formats - midas-isg/data-format-repository GitHub Wiki

https://www2.census.gov/acs2011_5yr/summaryfile/ACS_2007-2011_SF_Tech_Doc.pdf

Synthia Household file format

| Variable | From | Original Variable | Synthia Description | Categories |
| --- | --- | --- | --- | --- |
| sp_id | Synthia | | A numeric identifier that uniquely identifies households, persons, schools, workplaces, group quarters locations, and group quarters residents throughout the entire 2010 U.S. Synthetic Population. | Numeric values |
| serialno | PUMS | serialno | The PUMS standard serialno field, which is the PUMS unique identifier for households within states. | |
| stcotrbg | Synthia | | The state, county, tract, and block group FIPS code of the household. | |
| hh_race | PUMS | RAC1P | The coded race of the householder. PUMS Description: Recoded detailed race code. Notes: The RAC1P variable for the householder (RELATE = 01) in each selected household is taken as the race of the household. Even though this is a person-level characteristic, it was used at the household level to enable the use of race as a selection category in the IPF procedure. | See inset ("Race of the householder" table) |
| hh_income | PUMS | HINCP | The household income. PUMS Description: Household income (past 12 months) | bbbbbbbb = N/A (GQ/vacant); -59999 = Loss of $59,999 or more; 1 = $1 or break even; 000000002-999999999 = Total household income in dollars (components are rounded) |
| hh_size | PUMS | NP | The number of persons in the household. PUMS Description: Number of person records following | 00 = Vacant unit; 01 = One person record (one person in household or any person in group quarters); 02-20 = Number of person records (number of persons in household) |
| hh_age | PUMS | AGEP | The age of the head of household. PUMS Description: age. Notes: The AGEP variable for the householder (RELATE = 01) in each selected household is the householder age. It is attached to the [prefix]_synth_households.txt file because this field was used in selecting households in the IPF procedure. | 00 = Under 1 year; 01-99 = 1 to 99 years (top-coded) |
| latitude | Synthia | | The latitude of the household, based on geocoding. | |
| longitude | Synthia | | The longitude of the household, based on geocoding. | |

Race of the householder CATEGORY SUPPLEMENTARY TABLE

Code Meaning
1 White alone
2 Black or African American alone
3 American Indian alone
4 Alaska Native alone
5 American Indian and Alaska Native tribes specified; or American Indian or Alaska Native
6 Asian alone
7 Native Hawaiian or Other Pacific Islander alone
8 Some other race alone
9 Two or more major race groups

Sample of Synthia Household records

sp_id serialno stcotrbg hh_race hh_income hh_size hh_age latitude longitude
11043889 2008000608832 420659501001 1 13800 1 71 41.2504738 -78.8020092
11043890 2007001074348 420659501001 1 50100 6 50 41.2489389 -78.7920583
11043891 2010000874376 420659501001 1 17900 1 81 41.2501511 -78.8012828
11043892 2007000830641 420659501001 1 23000 1 72 41.2499543 -78.801268
11043893 2009000734635 420659501001 1 39500 2 29 41.2486304 -78.7875176
11043894 2007001165135 420659501001 1 17300 2 19 41.2502748 -78.8012257
11043895 2010000368147 420659501001 1 36300 2 75 41.2516576 -78.7851551
11043896 2011000458267 420659501001 1 12400 1 81 41.2479113 -78.7871405
11043897 2009000953698 420659501001 1 17100 1 78 41.2487364 -78.7918694
11043898 2010001231190 420659501001 1 15000 5 22 41.251952 -78.7856109
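
The stcotrbg value concatenates the standard FIPS components: a 2-digit state code, a 3-digit county code, a 6-digit census tract code, and a 1-digit block group. The following is a minimal sketch of splitting it, assuming the value is handled as a 12-character string so that leading zeros survive. The names `BlockGroup` and `split_stcotrbg` are illustrative and not part of Synthia.

```python
from typing import NamedTuple


class BlockGroup(NamedTuple):
    state: str        # 2-digit state FIPS code
    county: str       # 3-digit county FIPS code
    tract: str        # 6-digit census tract code
    block_group: str  # 1-digit block group


def split_stcotrbg(code) -> BlockGroup:
    """Split a 12-digit stcotrbg value into its FIPS components."""
    code = str(code).strip()
    if len(code) != 12 or not code.isdigit():
        raise ValueError(f"expected 12 digits, got {code!r}")
    return BlockGroup(code[:2], code[2:5], code[5:11], code[11])


# stcotrbg taken from the first sample household record
print(split_stcotrbg("420659501001"))
```

Reading the column as text rather than as an integer matters for states whose FIPS code starts with 0, since an integer parse would drop the leading digit and shift every component.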

Synthia Person file format

| Variable | From | Original Variable | Synthia Description | Categories |
| --- | --- | --- | --- | --- |
| sp_id | Synthia | | A numeric identifier that uniquely identifies households, persons, schools, workplaces, group quarters locations, and group quarters residents throughout the entire 2010 U.S. Synthetic Population. | Numeric values |
| sp_hh_id | Synthia | | Identifies the household in which the person resides. This identifier links to the sp_id field in the synth_households.txt file. | |
| serialno | PUMS | serialno | The original PUMS serial number (unique identifier). This code is used to link persons in the synth_people.txt file to the pums_p.txt file. | |
| stcotrbg | Synthia | | The state, county, tract, and block group FIPS code of the person. | |
| age | PUMS | AGEP | The person’s age. PUMS Description: age | 00 = Under 1 year; 01-99 = 1 to 99 years (top-coded) |
| sex | PUMS | SEX | The person’s sex (duplicate of the sex attribute in the pums_p.txt file). PUMS Description: Sex of person | 1 = male; 2 = female |
| race | PUMS | RAC1P | The person’s coded race. | See inset ("Race of the person" table) |
| sporder | Synthia | | A unique serial number assigned to persons within each household. | |
| relate | PUMS | RELATE | The relationship of the person to the household (see Appendix B for codes). PUMS Description: Household relationship | See inset ("Household relationship" table) |
| sp_school_id | Synthia | | Identifier of the school to which this person is assigned. If the person is not assigned to a school, this field is blank. | |
| sp_work_id | Synthia | | Identifier of the workplace to which this person is assigned. If the person is not assigned to a workplace, this field is blank. This identifier consists of state, county, tract, and block group FIPS codes and a unique serial number added as a suffix. | |

Race of the person CATEGORY SUPPLEMENTARY TABLE

Code Meaning
1 White alone
2 Black or African American alone
3 American Indian alone
4 Alaska Native alone
5 American Indian and Alaska Native tribes specified; or American Indian or Alaska Native
6 Asian alone
7 Native Hawaiian or Other Pacific Islander alone
8 Some other race alone
9 Two or more major race groups

Household relationship CATEGORY SUPPLEMENTARY TABLE

Code Meaning
00 Reference Person
01 Husband/wife
02 Son/daughter
03 Brother/sister
04 Father/mother
05 Grandchild
06 In-law
07 Other relative
08 Roomer/boarder
09 Housemate/roommate
10 Unmarried partner
11 Foster child
12 Other nonrelative
13 Institutionalized group quarters population
14 Noninstitutionalized group quarters population
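
The relationship codes are listed as two-digit values (00 to 14), while the sample person records show the relate field without zero padding (0, 1). The following is a small sketch of a lookup that accepts both forms. The names `RELATE_LABELS` and `relate_label` are illustrative only.

```python
# Labels copied from the Household relationship table.
RELATE_LABELS = {
    "00": "Reference Person",
    "01": "Husband/wife",
    "02": "Son/daughter",
    "03": "Brother/sister",
    "04": "Father/mother",
    "05": "Grandchild",
    "06": "In-law",
    "07": "Other relative",
    "08": "Roomer/boarder",
    "09": "Housemate/roommate",
    "10": "Unmarried partner",
    "11": "Foster child",
    "12": "Other nonrelative",
    "13": "Institutionalized group quarters population",
    "14": "Noninstitutionalized group quarters population",
}


def relate_label(code) -> str:
    """Return the label for a relate code, padding to two digits first."""
    key = str(code).strip().zfill(2)
    return RELATE_LABELS.get(key, f"Unknown code {key}")


print(relate_label(0))     # unpadded, as in the sample records
print(relate_label("01"))  # padded, as in the table
```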

Sample of Synthia Person records

| sp_id | sp_hh_id | serialno | stcotrbg | age | sex | race | sporder | relate | sp_school_id | sp_work_id |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 164281596 | 11048181 | 2007000951052 | 420659503005 | 42 | 1 | 1 | 1 | 0 | | 514269692 |
| 164281599 | 11049475 | 2007000951052 | 420659505001 | 41 | 2 | 1 | 2 | 1 | | 514267988 |
| 164281603 | 11051921 | 2007000951052 | 420659506005 | 41 | 2 | 1 | 2 | 1 | | 514267791 |
| 164281604 | 11053415 | 2007000951052 | 420659507003 | 42 | 1 | 1 | 1 | 0 | | 514270908 |
| 164281606 | 11055298 | 2007000951052 | 420659509001 | 42 | 1 | 1 | 1 | 0 | | 514268126 |
| 164281607 | 11055298 | 2007000951052 | 420659509001 | 41 | 2 | 1 | 2 | 1 | | 514267974 |
| 164281608 | 11056301 | 2007000951052 | 420659509004 | 42 | 1 | 1 | 1 | 0 | | 514269445 |
| 164281610 | 11056740 | 2007000951052 | 420659510001 | 42 | 1 | 1 | 1 | 0 | | 514269279 |
| 164281611 | 11056740 | 2007000951052 | 420659510001 | 41 | 2 | 1 | 2 | 1 | | 514270454 |
| 164281612 | 11057213 | 2007000951052 | 420659510002 | 42 | 1 | 1 | 1 | 0 | | 514268812 |
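
Because sp_hh_id links each person to a household's sp_id, person records can be grouped into households and ordered by sporder. The following is a minimal sketch using three of the sample records. It assumes the file is comma-delimited with a header row, which this page does not state, and it assumes the blank column in the sample is sp_school_id. The name `group_by_household` is illustrative.

```python
import csv
import io
from collections import defaultdict

# Three sample person records, written as comma-delimited text (assumed format).
SAMPLE = """\
sp_id,sp_hh_id,serialno,stcotrbg,age,sex,race,sporder,relate,sp_school_id,sp_work_id
164281607,11055298,2007000951052,420659509001,41,2,1,2,1,,514267974
164281606,11055298,2007000951052,420659509001,42,1,1,1,0,,514268126
164281608,11056301,2007000951052,420659509004,42,1,1,1,0,,514269445
"""


def group_by_household(text):
    """Map each sp_hh_id to its person rows, sorted by sporder."""
    households = defaultdict(list)
    for row in csv.DictReader(io.StringIO(text)):
        households[row["sp_hh_id"]].append(row)
    for members in households.values():
        members.sort(key=lambda r: int(r["sporder"]))
    return dict(households)


households = group_by_household(SAMPLE)
for hh_id, members in households.items():
    print(hh_id, [(m["sporder"], m["age"], m["sex"]) for m in members])
```

All values stay as strings here, so identifiers such as stcotrbg keep any leading zeros. A blank sp_school_id or sp_work_id arrives as an empty string.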

Synthia School file format

| Variable | From | # of chars | Synthia Description | Categories |
| --- | --- | --- | --- | --- |
| sp_id | Synthia | | A numeric identifier that uniquely identifies households, persons, schools, workplaces, group quarters locations, and group quarters residents throughout the entire 2010 U.S. Synthetic Population. | Numeric values |
| name | Synthia | | The name of the school. | Character values |
| stabbr | Synthia | 2 | The two-letter abbreviation of the state in which the school is located. | Character values |
| address | Synthia | | The physical address of the school, if known. | Character values |
| city | Synthia | | The city where the school is located. | Character values |
| county | Synthia | | The name of the county where the school is located. | Character values |
| zipcode | Synthia | 5 | The five-digit zip code in which the school is located. | Character values |
| zip4 | Synthia | 9 | The nine-digit zip code (i.e., zip code plus four digits) in which the school is located. | Character values |
| nces_id | Synthia | | A unique identifier for each school in the National Center for Education Statistics (NCES) database. | Character values |
| total | Synthia | | The total number of students enrolled in the school. | Numeric values |
| prek | Synthia | | The total number of pre-kindergarteners enrolled in the school. | Numeric values |
| kinder | Synthia | | The total number of kindergarteners enrolled in the school. | Numeric values |
| gr01_gr12 | Synthia | | The total number of students in grades one through twelve. | Numeric values |
| ungraded | Synthia | | The total number of students enrolled in the school whose specific grade level is unknown. | Numeric values |
| latitude | Synthia | | The latitude of the school, based on geocoding. | Numeric values |
| longitude | Synthia | | The longitude of the school, based on geocoding. | Numeric values |
| source | Synthia | | The source of the school’s information (either NCES [for public schools] or schoolinformation.com [for private schools]). | Character values |
| stco | Synthia | | The state and county FIPS codes of the county in which the school is located. | Character values |

Sample of Synthia School records

sp_id name stabbr address city county zipcode zip4 nces_id total prek kinder gr01_gr12 ungraded latitude longitude source stco
450111632 C G JOHNSON EL SCH PA JEFFERSON COUNTY 420783002385 547 0 59 488 41.090196 -78.884381 NCES 42065
450110845 BROOKVILLE JSHS PA JEFFERSON COUNTY 420432006162 885 0 0 885 41.167228 -79.085844 NCES 42065
450057407 HILLSDALE MENNONITE SCHOOL PA PO BOX 52 HILLSDALE INDIANA 15746 0 A0701937 26 0 0 26 40.7618373 -78.8775199 schoolinformation.com 42063
450132458 JENKS HILL EL SCH PA JEFFERSON COUNTY 421980002373 115 0 18 97 40.949353 -78.968361 NCES 42065
450063600 OAK HILL CHRISTIAN SCHOOL MA PO BOX 277 OXFORD WORCESTER 0 0 A0501924 428896 0 0 428896 0 0 schoolinformation.com 25027
450066817 SEEDS OF FAITH CHRISTIAN ACADEMY PA 640 CHURCH ST INDIANA INDIANA 15701 2756 A0903172 158 33 14 111 40.6215091 -79.1521905 schoolinformation.com 42063
450151186 RAYNE EL SCH PA INDIANA COUNTY 421473007338 1178 713 49 416 40.702717 -79.100597 NCES 42063
450108468 BELL TWP EL SCH PA JEFFERSON COUNTY 421980002377 133 0 28 105 40.947169 -78.92369 NCES 42065
450129160 HICKORY GROVE EL SCH PA JEFFERSON COUNTY 420432005231 605 0 0 605 41.167313 -79.085924 NCES 42065
450120306 EAST FOREST EL SCH PA FOREST COUNTY 420828006160 552 440 10 102 41.472626 -79.121872 NCES 42053
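
In the sample records, total matches the sum of the prek, kinder, and gr01_gr12 counts, and one record (OAK HILL CHRISTIAN SCHOOL) has both coordinates set to 0. The following is a hedged sketch of a per-record sanity check built on those two observations. It is not a documented Synthia rule. Treating a blank count as 0 and reading 0/0 coordinates as a missing geocode are assumptions of this sketch. The field values are read off the flattened sample rows, so their column positions (in particular the blank ungraded column) are inferred. The name `check_school` is illustrative.

```python
def check_school(rec):
    """Return a list of human-readable problems found in one school record."""
    problems = []
    grade_fields = ("prek", "kinder", "gr01_gr12", "ungraded")
    # Blank counts are treated as 0 (an assumption of this sketch).
    parts = sum(int(rec.get(name) or 0) for name in grade_fields)
    if int(rec["total"]) != parts:
        problems.append(f"total {rec['total']} != sum of grade counts {parts}")
    # 0/0 coordinates are read as a missing geocode (also an assumption).
    if float(rec["latitude"]) == 0.0 and float(rec["longitude"]) == 0.0:
        problems.append("latitude and longitude are both 0")
    return problems


rayne = {"name": "RAYNE EL SCH", "total": "1178", "prek": "713", "kinder": "49",
         "gr01_gr12": "416", "ungraded": "",
         "latitude": "40.702717", "longitude": "-79.100597"}
oak_hill = {"name": "OAK HILL CHRISTIAN SCHOOL", "total": "428896", "prek": "0",
            "kinder": "0", "gr01_gr12": "428896", "ungraded": "",
            "latitude": "0", "longitude": "0"}

for rec in (rayne, oak_hill):
    print(rec["name"], check_school(rec))
```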

Synthia Workplace file format

| Variable | From | Synthia Description | Categories |
| --- | --- | --- | --- |
| sp_id | Synthia | A numeric identifier that uniquely identifies schools, workplaces, persons, and group quarters throughout the entire 2010 U.S. Synthetic Population. | |
| workers | Synthia | The number of workers assigned to the workplace. | |
| latitude | Synthia | The latitude of the workplace, based on geocoding. | Numeric values |
| longitude | Synthia | The longitude of the workplace, based on geocoding. | Numeric values |

Sample of Synthia Workplace records

sp_id workers latitude longitude
514265741 25 41.19023 -79.3677
514402646 2 41.43171 -78.7416
514270578 2 41.09753 -78.8623
514265933 7 41.19014 -79.3971
514268461 2 40.91255 -79.0005
513935092 6 40.65783 -79.3205
513933820 376 40.60668 -79.188
514567179 15 42.12717 -80.0789
514271525 49 41.13363 -78.7614
514267835 2 40.99523 -78.9965

Synthia Group Quarters file format

| Variable | From | Synthia Description | Categories |
| --- | --- | --- | --- |
| sp_id | Synthia | A numeric identifier that uniquely identifies households, persons, schools, workplaces, group quarters locations, and group quarters residents throughout the entire 2010 U.S. Synthetic Population. | Numeric values |
| gq_type | Synthia | A code indicating the type of group quarters facility. | M = military; P = prison; N = nursing home; C = college |
| persons | Synthia | The number of synthesized persons who live in this facility. | |
| stcotrbg | Synthia | The facility’s 2010 Census block group identifier. | |
| latitude | Synthia | The latitude of the facility, based on geocoding. | Numeric values |
| longitude | Synthia | The longitude of the facility, based on geocoding. | Numeric values |

Sample of Synthia Group Quarters records

sp_id gq_type persons stcotrbg latitude longitude
450004650 C 148 420659512002 40.9474 -78.9881
450004651 C 20 420659513002 40.94201 -78.9714
450008460 P 79 420659506001 41.16818 -79.0549
450022046 P 17 420659506001 41.16234 -79.0117
450030495 N 141 420659505002 41.1537 -79.0955
450030496 N 189 420659506001 41.16277 -79.057
450036345 N 34 420659513002 40.94414 -78.9687
450040313 N 45 420659501002 41.24599 -78.7876
450042411 N 33 420659513002 40.9427 -78.9769
450047487 N 18 420659503001 41.19774 -78.8345
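
The gq_type codes can be decoded and used to total the persons field by facility type. The following is a minimal sketch over the sample records above, with each record reduced to (sp_id, gq_type, persons). The names `GQ_TYPES` and `persons_by_type` are illustrative.

```python
from collections import Counter

# Code meanings from the gq_type description.
GQ_TYPES = {"M": "military", "P": "prison", "N": "nursing home", "C": "college"}

# (sp_id, gq_type, persons) taken from the sample Group Quarters records.
SAMPLE = [
    ("450004650", "C", 148), ("450004651", "C", 20),
    ("450008460", "P", 79), ("450022046", "P", 17),
    ("450030495", "N", 141), ("450030496", "N", 189),
    ("450036345", "N", 34), ("450040313", "N", 45),
    ("450042411", "N", 33), ("450047487", "N", 18),
]


def persons_by_type(records):
    """Sum the persons field for each decoded gq_type."""
    totals = Counter()
    for _sp_id, gq_type, persons in records:
        totals[GQ_TYPES.get(gq_type, "unknown")] += persons
    return dict(totals)


print(persons_by_type(SAMPLE))
```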

Synthia Group Quarters People file format

| Variable | From | Synthia Description | Categories |
| --- | --- | --- | --- |
| sp_id | Synthia | A numeric identifier that uniquely identifies households, persons, schools, workplaces, group quarters locations, and group quarters residents throughout the entire 2010 U.S. Synthetic Population. | Numeric values |
| sp_gq_id | Synthia | The sp_id (from the [prefix]_synth_gq.txt file) of the group quarters facility in which each person resides. | |
| sporder | Synthia | A unique serial number assigned to persons within each group quarters facility. | |
| age | Synthia | The age of this synthesized group quarters agent. | |
| sex | Synthia | The sex of this group quarters agent. | 1 = male; 2 = female |

Sample of Synthia Group Quarters People records

sp_id sp_gq_id sporder age sex
940001069 450040313 1 86 2
940001070 450040313 2 77 2
940001071 450040313 3 70 2
940001072 450040313 4 84 2
940001073 450040313 5 90 2
940001074 450040313 6 75 2
940001075 450040313 7 89 2
940001076 450040313 8 98 2
940001077 450040313 9 86 2
940001087 450040313 10 90 2
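
Since sporder is described as unique within each group quarters facility, the pair (sp_gq_id, sporder) should never repeat. The following is a small sketch of that check over the sample records above. The name `duplicate_sporders` is illustrative.

```python
from collections import defaultdict

# (sp_id, sp_gq_id, sporder, age, sex) from the sample records.
SAMPLE = [
    ("940001069", "450040313", 1, 86, 2), ("940001070", "450040313", 2, 77, 2),
    ("940001071", "450040313", 3, 70, 2), ("940001072", "450040313", 4, 84, 2),
    ("940001073", "450040313", 5, 90, 2), ("940001074", "450040313", 6, 75, 2),
    ("940001075", "450040313", 7, 89, 2), ("940001076", "450040313", 8, 98, 2),
    ("940001077", "450040313", 9, 86, 2), ("940001087", "450040313", 10, 90, 2),
]


def duplicate_sporders(records):
    """Return (sp_gq_id, sporder, sp_id) for every repeated sporder in a facility."""
    seen = defaultdict(set)
    dupes = []
    for sp_id, sp_gq_id, sporder, _age, _sex in records:
        if sporder in seen[sp_gq_id]:
            dupes.append((sp_gq_id, sporder, sp_id))
        seen[sp_gq_id].add(sporder)
    return dupes


print(duplicate_sporders(SAMPLE))
```

The same sp_gq_id values can be looked up against sp_id in the Group Quarters file to attach the facility's gq_type and location to each resident.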