Truth Data Format - european-modelling-hubs/RespiCast-SyndromicIndicators GitHub Wiki

The ground truth data for forecasting targets can be found in the folder target-data. To access the latest data files, use this link for ERVISS data and this link for FluID data. Alternatively, historical data files are stored in the snapshots folders and are named YYYY-MM-DD-[ILI/AIR]_incidence.csv, with YYYY-MM-DD representing the date of the last data update (which occurs every Friday). It's important to note that the latest file not only includes new data points but also the entire available history.

Each ground truth CSV file contains the following columns:

column column type description
target string The forecast target; one of: "ILI incidence", "ARI incidence"
location string ISO-2 code identifying the country
truth_date date Date in format YYYY-MM-DD: the last day of the truth week (Sunday)
year_week string A string denoting the year and week to which the truth data corresponds
value decimal ILI or ARI incidence per $100,000$

Below are illustrative rows as examples:

target,location,truth_date,year_week,value
ILI incidence,AT,2024-04-07,2024-W14,1512.6
ILI incidence,AT,2024-03-31,2024-W13,1629.1
ILI incidence,AT,2024-03-24,2024-W12,2179.8
ILI incidence,AT,2024-03-17,2024-W11,1798.0

From the first row, for instance, we can read that in Austria (AT), during week $47$ of the year $2023$, ending on Sunday, November 26, 2023, the reported ILI/ARI incidence per $100,000$ was approximately $2778.61$.

The countries are divided into the two data ground truth data sources as follows:

Data Source Countries (ISO-2 code)
ERVISS AT, BE, HR, CZ, DK, EE, FI, FR, GR, HU, IS, IE, IT, LV, LT, LU, MT, NL, NO, PL, PT, RO, SK, SI
FluID CH, GB-ENG, GB-WLS, GB-NIR, GB-SCT