7. Minimum viable product - os-data/eu-structural-funds GitHub Wiki
Requirements
Datasets
All available datasets describing the beneficiaries of the European Structural and Investment Funds
Fields
All datasets should have the following set of fields:
fund_name
(string, codelist)funding_period_number
(string, codelist)beneficiary_name
(string, raw)project_name
(string, raw)starting_date
(date, cleaned*)approval_date
(date, cleaned*)beneficiary_country_code
(string, codelist)beneficiary_nuts_code
(string, codelist)total_amount
(number, cleaned*)
(*) cleaned = parsed, casted and verified
Requirements:
- Query per Fund ** Nomenclatura needed - check
- Clean Geographic Data ** Hierarchies for NUTS-codes (existing) ** Country, state, region, municipality, address ** Constants needed to be added to the data
- Clean Amounts ** Remodelling needed from differentiated and detailed amounts to one ‘total amount’ = fixed by remodeling?
- Clean Dates.
Make Sure dimensions and hierarchies are correctly modelled for the open spending viewer:
- Drill down for: Beneficiary, Activity.
- Drill down for: geographic regions, cities etc.
- Drill down for: Funds, Priorities (if possible)
- Drill down for: Dates: Period, Year, Exact date.
Basic normalisation operations needed: Currencies converted to Euro Dates to one standard format. Fund names to nomenclatura. Grouping of Beneficiary Names
Enrichment possibilities - Low Priority:
CCI Codes (lower priority) Type of company Translations
Back-end
The above data should be available on Open-Spending as:
- Individual datasets (
.csv
data and.json
datapackage) - Unified dataset (
.csv
data and.json
datapackage)
Front-end
The website should embed Open-Spending iframes.