Truth Data Format - european-modelling-hubs/RespiCast-SyndromicIndicators GitHub Wiki
The ground truth data for the forecasting targets are stored in the folder `target-data`. To access the latest data files, use this link for ERVISS data and this link for FluID data. Historical data files are stored in the `snapshots` folders and are named `YYYY-MM-DD-[ILI/ARI]_incidence.csv`, where `YYYY-MM-DD` is the date of the last data update (updates occur every Friday). Note that the latest file contains the entire available history, not only the new data points.
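As a rough sketch of how the snapshot naming scheme can be used, the helper below parses the update date from a file name and picks the most recent snapshot for one indicator. The function names and the example file names are hypothetical and chosen only for illustration; the pattern assumes the indicator is a short run of letters placed before `_incidence.csv`.

```python
import re
from datetime import date

# Assumed pattern derived from the naming scheme described above:
# an ISO date, a hyphen, an indicator label, then "_incidence.csv".
SNAPSHOT_RE = re.compile(r"^(\d{4}-\d{2}-\d{2})-([A-Za-z]+)_incidence\.csv$")


def parse_snapshot_name(filename):
    """Return (update_date, indicator), or None if the name does not match."""
    match = SNAPSHOT_RE.match(filename)
    if match is None:
        return None
    try:
        update_date = date.fromisoformat(match.group(1))
    except ValueError:
        # Digits in the right shape but not a real calendar date.
        return None
    return update_date, match.group(2)


def latest_snapshot(filenames, indicator):
    """Return the most recent snapshot file name for one indicator, or None."""
    dated = []
    for name in filenames:
        parsed = parse_snapshot_name(name)
        if parsed is not None and parsed[1] == indicator:
            dated.append((parsed[0], name))
    return max(dated)[1] if dated else None


# Hypothetical file names, for illustration only.
example_files = [
    "2024-04-05-ILI_incidence.csv",
    "2024-04-12-ILI_incidence.csv",
    "README.md",
]
newest = latest_snapshot(example_files, "ILI")
```

Because each file contains the full history, loading only the newest snapshot is usually enough; older snapshots matter when the data as it was known on a past date is needed.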
Each ground truth CSV file contains the following columns:
column | column type | description |
---|---|---|
target | string | The forecast target; one of: "ILI incidence", "ARI incidence" |
location | string | ISO-2 code identifying the country |
truth_date | date | Date in format YYYY-MM-DD: the last day of the truth week (Sunday) |
year_week | string | A string denoting the year and week to which the truth data corresponds |
value | decimal | ILI or ARI incidence per $100,000$ |
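A minimal sketch of reading a file with this schema, using only the Python standard library. The function name `read_truth_rows` is hypothetical, and treating an empty `value` field as missing is an assumption rather than documented behaviour of the hub's files. The sample text is a single row in the documented column order.

```python
import csv
import io
from datetime import date

EXPECTED_COLUMNS = ["target", "location", "truth_date", "year_week", "value"]


def read_truth_rows(text):
    """Parse ground truth CSV text into a list of typed dictionaries."""
    reader = csv.DictReader(io.StringIO(text))
    if reader.fieldnames != EXPECTED_COLUMNS:
        raise ValueError(f"unexpected columns: {reader.fieldnames}")
    rows = []
    for raw in reader:
        rows.append({
            "target": raw["target"],
            "location": raw["location"],
            "truth_date": date.fromisoformat(raw["truth_date"]),
            "year_week": raw["year_week"],
            # Assumption: an empty field means the value is missing.
            "value": float(raw["value"]) if raw["value"] else None,
        })
    return rows


sample = (
    "target,location,truth_date,year_week,value\n"
    "ILI incidence,AT,2024-04-07,2024-W14,1512.6\n"
)
rows = read_truth_rows(sample)
```

Checking the header before converting any values makes a schema mismatch fail early with a clear message instead of surfacing later as a missing key.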
The following rows illustrate the format:
target,location,truth_date,year_week,value
ILI incidence,AT,2024-04-07,2024-W14,1512.6
ILI incidence,AT,2024-03-31,2024-W13,1629.1
ILI incidence,AT,2024-03-24,2024-W12,2179.8
ILI incidence,AT,2024-03-17,2024-W11,1798.0
From the first row, for instance, we can read that in Austria (AT), during week $14$ of the year $2024$, ending on Sunday, April 7, 2024, the reported ILI incidence per $100,000$ was $1512.6$.
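The relationship between `truth_date` and `year_week` can be checked programmatically. The sketch below assumes that `year_week` follows ISO 8601 week numbering (weeks run Monday to Sunday), which agrees with the example rows but is not stated explicitly in this page; the function names are hypothetical.

```python
from datetime import date


def week_ending_sunday(year_week):
    """Return the Sunday that closes an ISO week written as 'YYYY-Www'."""
    year_str, week_str = year_week.split("-W")
    # ISO weekday 7 is Sunday, the last day of an ISO week.
    return date.fromisocalendar(int(year_str), int(week_str), 7)


def is_consistent(truth_date, year_week):
    """True if truth_date (YYYY-MM-DD) is the Sunday ending year_week."""
    return date.fromisoformat(truth_date) == week_ending_sunday(year_week)


# (truth_date, year_week) pairs taken from the example rows above.
SAMPLE_PAIRS = [
    ("2024-04-07", "2024-W14"),
    ("2024-03-31", "2024-W13"),
    ("2024-03-24", "2024-W12"),
    ("2024-03-17", "2024-W11"),
]
all_consistent = all(is_consistent(d, w) for d, w in SAMPLE_PAIRS)
```

A check like this is a cheap way to catch rows where the two columns drift apart, for example around the turn of the year, when the ISO year of a week can differ from the calendar year of its dates.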
The countries are divided between the two ground truth data sources as follows:
Data Source | Countries (ISO-2 code) |
---|---|
ERVISS | AT, BE, HR, CZ, DK, EE, FI, FR, GR, HU, IS, IE, IT, LV, LT, LU, MT, NL, NO, PL, PT, RO, SK, SI |
FluID | CH, GB-ENG, GB-WLS, GB-NIR, GB-SCT |
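For scripts that need to know which source a location belongs to, the table above can be written as a small lookup. This is only an illustrative sketch: the names `SOURCE_LOCATIONS` and `data_source` are hypothetical, and the lists are copied from the table as it stands here, so they would need updating if the hub's coverage changes.

```python
# Location codes per data source, copied from the table above.
SOURCE_LOCATIONS = {
    "ERVISS": [
        "AT", "BE", "HR", "CZ", "DK", "EE", "FI", "FR", "GR", "HU", "IS", "IE",
        "IT", "LV", "LT", "LU", "MT", "NL", "NO", "PL", "PT", "RO", "SK", "SI",
    ],
    "FluID": ["CH", "GB-ENG", "GB-WLS", "GB-NIR", "GB-SCT"],
}

# Inverted mapping: location code -> data source name.
LOCATION_TO_SOURCE = {
    location: source
    for source, locations in SOURCE_LOCATIONS.items()
    for location in locations
}


def data_source(location):
    """Return the data source for a location code, or None if unknown."""
    return LOCATION_TO_SOURCE.get(location)
```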