technical fields - procure-project/EU-Contract-Hub GitHub Wiki

OpenSearch Index Field Documentation

Overview

The EU Contract Hub uses OpenSearch as its document-oriented storage solution. This page documents the fields indexed in the main procure_v4 index, which stores procurement contract data from various sources. Each field is described with its data type, a brief description of its purpose, and its provenance (source).

Field Reference Table

Field Name Type Description Provenance
Awarded Contracts.Awarded Contract Title text The official title of the awarded contract as provided in the contract award notice. TED_XML
Awarded Contracts.Conclusion Date date The date when the contract was officially concluded between parties. TED_XML
Awarded Contracts.Contractors.Address.Address text The street address of the contractor who won the contract. TED_XML
Awarded Contracts.Contractors.Address.Country keyword The country where the contractor is located. TED_XML
Awarded Contracts.Contractors.Address.Postal Code keyword The postal code of the contractor's address. TED_XML
Awarded Contracts.Contractors.Address.Territorial Unit (NUTS3) keyword The NUTS3 territorial classification code for the contractor's location. TED_XML
Awarded Contracts.Contractors.Address.Town text The town or city where the contractor is located. TED_XML
Awarded Contracts.Contractors.Contact.Email keyword The email address of the contractor. TED_XML
Awarded Contracts.Contractors.Contact.Phone keyword The phone number of the contractor. TED_XML
Awarded Contracts.Contractors.Contact.URL keyword The website URL of the contractor. TED_XML
Awarded Contracts.Contractors.Name text The official name of the business or organization awarded the contract. TED_XML
Awarded Contracts.Contractors.National ID keyword The national identification number of the contractor. TED_XML
Awarded Contracts.Corresponding Lot text Indicates which lot of the tender the awarded contract corresponds to. TED_XML
Awarded Contracts.Number of Tenders integer The total number of bids received for this contract. TED_XML
CPV keyword The Common Procurement Vocabulary code that categorizes the goods, works, or services being procured. TED_XML
CPV Description text The textual description of the CPV code. TED_XML
Contract Nature keyword The nature of the contract (supplies, services, or works). TED_XML
Contracting Authority.Activity keyword The main activity of the contracting authority (e.g., health, education). TED_XML
Contracting Authority.Address.Address text The street address of the contracting authority. TED_XML
Contracting Authority.Address.Country keyword The country where the contracting authority is located. TED_XML
Contracting Authority.Address.Postal Code keyword The postal code of the contracting authority's address. TED_XML
Contracting Authority.Address.Territorial Unit (NUTS3) keyword The NUTS3 territorial classification code for the contracting authority's location. TED_XML
Contracting Authority.Address.Town text The town or city where the contracting authority is located. TED_XML
Contracting Authority.CA Type keyword The type of contracting authority (e.g., ministry, regional authority). TED_XML
Contracting Authority.Contact.Email keyword The email address of the contracting authority. TED_XML
Contracting Authority.Contact.Phone keyword The phone number of the contracting authority. TED_XML
Contracting Authority.Contact.URL keyword The website URL of the contracting authority. TED_XML
Contracting Authority.Name text The official name of the contracting authority. TED_XML
Contracting Authority.National ID keyword The national identification number of the contracting authority. TED_XML
Country keyword The country code of the contracting authority. TED_XML
Critical Services CPV boolean Indicates if the contract is for critical services based on its CPV code. Derived
Description text The detailed description of the contract or procurement. TED_XML
Description (Translation) text The English translation of the contract description. Derived
Dispatch Date date The date when the notice was dispatched for publication. TED_XML
Health Authority Class keyword Classification of the contracting authority as a healthcare entity. Derived
Healthcare Authority Class text Sub-classification of the healthcare authority type. Derived
Healthcare CPV boolean Indicates if the contract is healthcare-related based on its CPV code. Derived
Lots.CPV Codes keyword The CPV codes assigned to a specific lot. TED_XML
Lots.Criteria.Cost.Criterion text The description of a cost criterion used for evaluating bids. TED_XML
Lots.Criteria.Cost.Criterion (Translation) text The English translation of the cost criterion. Derived
Lots.Criteria.Cost.Weight float The weight or importance assigned to the cost criterion in bid evaluation. TED_XML
Lots.Criteria.Price.Criterion text The description of a price criterion used for evaluating bids. TED_XML
Lots.Criteria.Price.Weight float The weight or importance assigned to the price criterion in bid evaluation. TED_XML
Lots.Criteria.Quality.Criterion text The description of a quality criterion used for evaluating bids. TED_XML
Lots.Criteria.Quality.Criterion (Translation) text The English translation of the quality criterion. Derived
Lots.Criteria.Quality.Weight float The weight or importance assigned to the quality criterion in bid evaluation. TED_XML
Lots.Criteria.Type text The type of criterion used (price, quality, or cost). TED_XML
Lots.Criteria.Weight float The overall weight of the criterion in the evaluation process. TED_XML
Lots.Lot Number keyword The identifier for a specific lot within a contract. TED_XML
Lots.Main Criterion keyword The main criterion used for award decision (e.g., price, MEAT). TED_XML
Lots.Short Description text A brief description of the lot. TED_XML
Lots.Short Description (Translated) text The English translation of the lot's short description. Derived
Lots.Short Description (Translation) text The English translation of the lot's short description (alternative field). Derived
Lots.Title text The title of the specific lot. TED_XML
Lots.Title (Translated) text The English translation of the lot's title. Derived
Lots.Title (Translation) text The English translation of the lot's title (alternative field). Derived
Number of Lots integer The total number of lots in the tender. TED_XML
Procurement Route keyword The procurement route used (e.g., above threshold, below threshold). Derived
Procurement Techniques.Centralised Purchasing Activities and Central Purchasing Bodies boolean Indicates if centralized purchasing was used. TED_XML
Procurement Techniques.Centralized Purchasing Activities and Central Purchasing Bodies boolean Alternative field for indicating centralized purchasing. TED_XML
Procurement Techniques.Dynamic Purchasing System boolean Indicates if a dynamic purchasing system was used. TED_XML
Procurement Techniques.Dynamic Purchasing Systems boolean Alternative field for indicating dynamic purchasing systems. TED_XML
Procurement Techniques.Electronic Auction boolean Indicates if an electronic auction was used. TED_XML
Procurement Techniques.Electronic Catalogue boolean Indicates if an electronic catalogue was used. TED_XML
Procurement Techniques.Framework Agreement boolean Indicates if a framework agreement was used. TED_XML
Procurement Techniques.Occasional Joint Procurement boolean Indicates if occasional joint procurement was used. TED_XML
Procurement Techniques.Procurement Involving Contracting Authorities from Different Member States boolean Indicates if the procurement involved authorities from multiple EU member states. TED_XML
Procurement Techniques.Unknown boolean Indicates if the procurement technique is unknown. Derived
Procurement Type keyword The type of procurement procedure used (e.g., open, restricted). TED_XML
Tags.Process Date date The date when the record was processed in the system. Derived
Tags.Source.TED-CSV boolean Indicates if the data was sourced from TED CSV format. Derived
Tags.Source.TED-EFORMS boolean Indicates if the data was sourced from TED eForms format. Derived
Tags.Source.TED-EForms boolean Alternative field indicating if the data was sourced from TED eForms format. Derived
Tags.Source.TED-XML boolean Indicates if the data was sourced from TED XML format. Derived
Title text The official title of the contract or procurement. TED_XML
Title (Translation) text The English translation of the contract title. Derived
Value float The estimated or awarded monetary value of the contract in euros. TED_XML

Data Types Explanation

  • text: Analyzed string fields that are searchable by their content. Used for full-text search.
  • keyword: Non-analyzed string fields optimized for exact matching and aggregations.
  • date: Date and time values stored in a standardized format.
  • integer: Whole number values.
  • float: Decimal number values.
  • boolean: True/false values.
  • nested: Complex objects within documents that can be queried independently.

Field Provenance

  • TED_XML: Fields sourced directly from TED XML format (legacy or eForms).
  • TED_CSV: Fields sourced from TED CSV dataset.
  • Derived: Fields that are calculated, translated, or derived from other source fields.

Field Relationships

The index uses a nested structure to represent complex relationships:

  • Contracting Authority: Contains information about the organization issuing the contract.
  • Lots: Details about individual parts of the contract that can be bid on separately.
  • Awarded Contracts: Information about the outcomes of the procurement process, including who won each lot.
  • Contractors: Details about the organizations that won contracts.

Querying Fields

Fields can be queried in OpenSearch using various query types:

  • Full-text fields (type: text) can be searched using match, match_phrase, or query_string queries.
  • Exact-match fields (type: keyword) can be searched using term or terms queries.
  • Numeric fields can be queried using range queries.
  • Boolean fields can be filtered using term queries.

For details on query syntax, refer to the OpenSearch DQL documentation page.