Annual Report: TDWG Data Quality Interest Group for 2023 - tdwg/bdq GitHub Wiki
Phase of work:
The Interest Group currently has 2 Task Groups - 2 were requested to be wound up during the year
[TG1 – Framework on Data Quality - CLOSED]
TG2 – Data Quality Tests and Assertions
[TG3 – Data Quality Use Cases - CLOSED]
TG4 – Best Practices for Development of Vocabularies of Value
Activities:
A lot of work has been carried out during the year - especially wrt Task Group 2 (see under report for that Task Group below). Requests for the winding up of Task Groups 1 and 3 have been made to the executive as the work of those two groups has been completed and outcomes folded into Task Group 2 in the lead up to a new Biodiversity Data Quality Standard (tentatively named BDQ Core).
Accomplishments:
- A lot of progress has been made in Task Group 2 leading to the start of a draft Standards document (https://github.com/tdwg/bdq/wiki/TG2-Tests-and-Assertions-Standards-Document)
- The winding up of Task Group 1: Framework on Data Quality (see https://github.com/tdwg/bdq/wiki/Winding-up-of-Task-Group-1-%E2%80%93-Framework-on-Data-Quality)
- The winding up of Task Group 3: Data Quality Use Cases (see https://github.com/tdwg/bdq/wiki/Final-Report-of-Task-Group-3:-Data-Quality-Use-Cases)
- No real progress on Task Group 4.
Impediments to progress:
- Inability to meet ‘face-to-face’ to finalize the writing of the standard document. Zoom and equivalents are useful but far from optimal for collaboration and the different time zones do not help.
Changes in goals or scope:
- Zero
Plans for next calendar year:
- We hope to complete the implementation of the outstanding tests, the test data and the standards document.
- Submission of the work as a TDWG standard.
TG1: Framework on Data Quality Task Group
Task group wound up - see final report at https://github.com/tdwg/bdq/wiki/Winding-up-of-Task-Group-1-%E2%80%93-Framework-on-Data-Quality
TG2: Data Quality Tests and Assertions Task Group
Phase of work:
- Completing test data suite (last task before submission of Standard)
Activities:
- Finalized 99 CORE tests and documented them against a standard template: https://github.com/tdwg/bdq/issues?q=is%3Aissue+is%3Aopen+label%3ATes
- The generation of test datasets that can be used to validate the isntallation of the test code is approaching completion with some issues still to be worked out on the final structure of the test data. See https://github.com/tdwg/bdq/tree/master/tg2/core/testdata for a subset.
Accomplishments:
- Tests and specifications (based on a single template) are final
- Test data template agreed
- Code has been written to extract the parameters of each of the tests to RDF and we believe that this will form the basis of the proposed TDWG standard for the Tests and Assertions. Finalized 99 CORE tests and documented them against a standard template: https://github.com/tdwg/bdq/issues?q=is%3Aissue+is%3Aopen+label%3ATest.
Impediments to progress:
- Inability to meet ‘face-to-face’.
- Busy TG2 members
- ‘Burnt out’ TG2 members. This work has taken much longer than anyone in the group anticipated. This has largely been due to the complexity of the task. COVID-19
Changes in goals or scope:
- Zero
Plans for next calendar year:
- Proof and finalise Test Data and make available for public review
- Develop a technical specification
- Submit the work of TG2 as a TDWG standard.
TG3: Data Quality Use Cases Task Group
Task Group wound up - see final report at https://github.com/tdwg/bdq/wiki/Final-Report-of-Task-Group-3:-Data-Quality-Use-Cases
TG4: Best Practices for development of Vocabularies of Value Task Group
Phase of work:
Preparing best practices document
Activities:
Accomplishments:
Very slow progress during 2023 due to convener personal circumstances.
Impediments to progress:
Changes in goals or scope:
None
Plans for next calendar year:
The Task Group does not plan to propose a new data standard or any modification to existing ones but intends to provide a best current practice for building TDWG vocabularies of values.