BCSC Risk Factors Dataset Breast Cancer Surveillance Consortium - onetomapanalytics/Meta_Data GitHub Wiki
BCSC Risk Factors Dataset
General description
- Database primary purpose - Provide data regarding the distribution of breast cancer risk factors in US women, aiming to help describe the distribution of breast cancer risk in the general population and to explore relationships among breast cancer risk factors
- Overall data type - Health outcomes
- Dataset type - Cross-sectional
- Data source - Registry, survey
- Data level - Patient level
- Geographic location of the data collection sites - United States
- Sponsor, manager, or home institution - Breast Cancer Surveillance Consortium
- Date range - January 2005 - December 2017
- Dates - Year of observation
- Clinical areas of interest - Breast Cancer
- Number of records - The dataset includes information from 6,788,436 mammograms
- Variables that are uniquely present in this dataset - The dataset includes participant characteristics previously shown to be associated with breast cancer risk, including age, race/ethnicity, family history of breast cancer, age at menarche, age at first birth, breast density, use of hormone replacement therapy, menopausal status, body mass index, history of biopsy, and history of breast cancer
- Other - This dataset was created by selecting one exam per woman per calendar year and year of age. When both screening and diagnostic mammograms exist for a given woman and year, screening mammograms were preferentially selected.
Applicable methods
- Association methods, such as logistic regression (1), Cox proportional hazards models (2, 3, 4)
- Descriptive analysis (1, 5, 6)
- Area under the curve (7, 8, 9)
High-impact designs
-
Determine the population-attributable risk proportion for breast cancer associated with clinical breast cancer risk factors (1)
-
Evaluate the association between risk factors and breast cancer incidence (2)
Data dictionary
To access the BCSC Risk Factors Dataset dictionary, click here
Variable categories
- Patient demographics (e.g., age group, race, ethnicity)
- Patient overall characteristics (e.g., BMI group, menopausal status)
- Cancer-related data (e.g., BI-RADS breast density, hormone replacement therapy, breast biopsy, cancer diagnosis)