HCUP KID Healthcare Cost and Utilization Project, Kids Inpatient Database - onetomapanalytics/Meta_Data GitHub Wiki
HCUP KID - Healthcare Cost and Utilization Project, Kids' Inpatient Database
General description
- Database primary purpose - Provide national estimates of hospital inpatient stays for patients younger than 21 years of age.
- Overall data type - Health outcomes
- Dataset type - Cross-sectional
- Data source - Claims
- Data level - Patient level, but includes a hospital-level file
- Geographic location of the data collection sites - United States
- Sponsor, manager, or home institution - Agency for Healthcare Research and Quality's (AHRQ)
- Date range - 2006, 2009, and 2012
- Geolocation data - 2006 and 2009: hospital state postal code, modified FIPS state/county code, urban/rural code, region, zip code and address from AHA Annual Survey; 2012: location (urban/rural) of both hospital and patient
- Dates - Year; admission day is on a weekend and month; discharge quarter
- Hospital identifiers - 2006 and 2009: AHA hospital identifier, HCUP hospital identification number, and Hospital name from AHA Annual Survey; 2012: KID hospital number
- Physicians identifiers - Provides de-identified physician identifiers, which can be used to distinguish between physicians. If the original physician identifier is based on a state license number or Universal Physician Identification Number (UPIN), then Physician number can be used to track a physician across hospitals. If the original physician identifier is based on hospital-specific identifiers, then it can only be used to track physicians within a hospital.
- Longitudinal tracking - Track providers (see identifiers above)
- Financial variables - Contains charge information and provides supplemental files containing cost-to-charge ratios.
- Clinical areas of interest - all; Pediatric care
- Number of records - Two to three million hospital stays each year.
- Variables that are uniquely present in this dataset - The largest publicly available all-payer pediatric inpatient care database in the United States. KID's data enables analyses of rare conditions, such as congenital anomalies, as well as uncommon treatments, such as organ transplantation.
- Database caveats and limitations - Not all data elements are available from every state, and not in all the years. Also, not all data elements are uniformly coded across states.
Applicable methods
- Association methods, such as linear regression models (1, 2), logistic regression models (3, 4), hierarchical models (5), ANOVA (6)
- Univariate statistical analysis (7, 8)
- Propensity score (9, 10)
- Sensitivity Analysis (11)
High-impact designs
-
Assess temporal trends in condition-related hospitalizations (12)
-
Association between regional factors and resources utilization for congenital disease (13)
Data dictionary
To access the data dictionary, click here
Variable categories
- Patient demographics (e.g., age, neonatal age, sex, race, ethnicity, median household income, and median income for patient's ZIP Code)
- Hospital discharge records (e.g., diagnosis-related group (DRG) in use on discharge date, elective and non-elective admission, number of diagnoses on discharge, LOS)
- Charges (e.g., expected payer, total charges)
- Diagnosis codes (ICD-9; including injury type)
- Procedure codes
- Hospital characteristics (e.g., ownership, bed size, STRATA, teaching status, number of hospital discharges, and hospital markets file defined by geopolitical boundaries, fixed radius, variable radius, and patient flow)
- Disposition at discharge (e.g., discharge to home, transferred to [type of facility], left against medical advice)
- Died during hospitalization
- Severity and comorbidity measures (e.g., chronic condition indicators, AHRQ comorbidity measure)
Linkage to other datasets
- Linkages can be established with AHA for the years 2006 and 2009