Label Definitions - psu-libraries/library_data_services GitHub Wiki
The intended use of the issue labels is described here. Not all labels are described. If you think a label needs a description here, feel free to add one or ask that it be added.
Data Provenance
This label covers two distinct uses of the term provenance.
The first use is chain of custody: who has touched the file or files, who owns them, and so on. This can be tracked with good version control software.
The second use refers to the workflow of the data: how files were created and by what processes. This would reference the scripts that manipulated the data, documenting the inputs, processes, and outputs. This can be documented using the PROV ontology (https://www.w3.org/TR/prov-o/).
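As an illustration of the workflow sense of provenance, the sketch below records one processing step (a script, its inputs, and its outputs) as a plain dictionary. The key names borrow real PROV-O terms (`prov:Entity`, `prov:Activity`, `prov:Agent`, `prov:used`, `prov:wasGeneratedBy`, `prov:wasAssociatedWith`, `prov:endedAtTime`), but the function name, the identifier prefixes, and the overall layout are assumptions made for this example. The output is not validated RDF or JSON-LD.

```python
import json
from datetime import datetime, timezone


def build_provenance_record(script, inputs, outputs, agent):
    """Describe one processing step using PROV-O term names as plain dict keys.

    The structure is a hypothetical layout for illustration only; it is not
    a validated PROV-O serialization.
    """
    activity_id = f"activity:{script}"
    agent_id = f"agent:{agent}"
    return {
        "agent": {"id": agent_id, "type": "prov:Agent"},
        "activity": {
            "id": activity_id,
            "type": "prov:Activity",
            "prov:wasAssociatedWith": agent_id,
            # Files the script read.
            "prov:used": [f"entity:{name}" for name in inputs],
            # Time the record was written, in UTC.
            "prov:endedAtTime": datetime.now(timezone.utc).isoformat(),
        },
        "entities": (
            # Input files are plain entities.
            [{"id": f"entity:{name}", "type": "prov:Entity"} for name in inputs]
            # Output files also point back at the activity that made them.
            + [
                {
                    "id": f"entity:{name}",
                    "type": "prov:Entity",
                    "prov:wasGeneratedBy": activity_id,
                }
                for name in outputs
            ]
        ),
    }


if __name__ == "__main__":
    # Hypothetical file and user names.
    record = build_provenance_record("clean.py", ["raw.csv"], ["clean.csv"], "jdoe")
    print(json.dumps(record, indent=2))
```

A record like this could be saved next to the output files so that each derived file carries a description of how it was produced.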
Research Data Planning
This label refers to activities that typically occur before research begins, often during grant writing. Activities include, but are not limited to:
- Writing Data Management Plans
- Planning file organization
- Planning naming conventions
- Organizing personnel and assigning roles
Data Integrity
This label refers to issues concerning maintaining the data during the research project. It includes things like storage backups, version control, and access control. It is distinct from data preservation, which refers to activities that ensure the persistence of the data beyond the project.
Data Preservation
This label refers to activities that are intended to preserve the data long term. Typically these processes are performed by the repository and include activities such as generating checksums, file validation, and preservation-level storage.
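To make the checksum step concrete, here is a minimal sketch using Python's standard `hashlib` module. The choice of SHA-256, the chunk size, and the function names `sha256_of_file` and `matches_checksum` are assumptions for this example; a given repository may use a different algorithm or tool.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_hex):
    """Return True if the file's current digest equals a stored digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

A digest recorded at deposit time can later be compared against a freshly computed one; a mismatch indicates that the file has changed or been corrupted.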
Data Assurance
This label can be used in two senses. The first is quality assurance of the data itself, such as identifying incorrect entries and outliers. Typically that falls outside the realm of library services. The second is assuring that the data is properly organized, for example (but not limited to):
- spreadsheets are rectangular
- file types are properly used
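One way to check the "rectangular" property is sketched below for a spreadsheet exported as CSV, using the standard `csv` module. It assumes the first row is a header that defines the expected width; the function name `find_ragged_rows` is hypothetical.

```python
import csv


def find_ragged_rows(path):
    """Return (row_number, field_count) for rows whose width differs from the header.

    Row numbers are 1-based and count the header as row 1. An empty list
    means every row has the same number of fields as the first row.
    """
    ragged = []
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, None)
        if header is None:
            # An empty file has no rows to compare.
            return ragged
        expected = len(header)
        for row_number, row in enumerate(reader, start=2):
            if len(row) != expected:
                ragged.append((row_number, len(row)))
    return ragged
```

This only checks row widths. It does not detect merged cells, multiple tables on one sheet, or notes placed outside the data region, which would need to be checked in the original spreadsheet.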