7. Minimum viable product - os-data/eu-structural-funds GitHub Wiki

Requirements

Datasets

All available datasets describing the beneficiaries of the European Structural and Investment Funds

Fields

All datasets should have the following set of fields:

  • fund_name (string, codelist)
  • funding_period_number (string, codelist)
  • beneficiary_name (string, raw)
  • project_name (string, raw)
  • starting_date (date, cleaned*)
  • approval_date (date, cleaned*)
  • beneficiary_country_code (string, codelist)
  • beneficiary_nuts_code (string, codelist)
  • total_amount (number, cleaned*)

(*) cleaned = parsed, casted and verified

Requirements:

  • Query per Fund ** Nomenclatura needed - check
  • Clean Geographic Data ** Hierarchies for NUTS-codes (existing) ** Country, state, region, municipality, address ** Constants needed to be added to the data
  • Clean Amounts ** Remodelling needed from differentiated and detailed amounts to one ‘total amount’ = fixed by remodeling?
  • Clean Dates.

Make Sure dimensions and hierarchies are correctly modelled for the open spending viewer:

  • Drill down for: Beneficiary, Activity.
  • Drill down for: geographic regions, cities etc.
  • Drill down for: Funds, Priorities (if possible)
  • Drill down for: Dates: Period, Year, Exact date.

Basic normalisation operations needed: Currencies converted to Euro Dates to one standard format. Fund names to nomenclatura. Grouping of Beneficiary Names

Enrichment possibilities - Low Priority:

CCI Codes (lower priority) Type of company Translations

Back-end

The above data should be available on Open-Spending as:

  1. Individual datasets (.csv data and .json datapackage)
  2. Unified dataset (.csv data and .json datapackage)

Front-end

The website should embed Open-Spending iframes.