Data Curation Session 1 - OscarO-0/BAT102_oscarosuna GitHub Wiki

#Goals and aims

Review data collected by myself and peers to ensure a global harmonization

#Methods

Followed Protocol #4 (Data Review)

#Results and Notes

##Corrections applied

*Data set 2, on column 41, was marked as unavailable and under maintenance in error message tab. Database is back up and running --> corrected from "FALSE" > "TRUE". Error message also removed. + added update to available date as 10/6/24

*Data set 3, column 86, database marked as unavailable. But data base is working for me --> Corrected unavailable "FALSE" > "TRUE". + added update to available date as 10/6/24

##Missing data

No missing data from my analysis.

#Suggestions

I noticed the other two datasets had no notes for any of their data. I think there could be some more emphasis or explanation on the notes section. Possibly to frame it more as a mini journaling outlet to show the researcher's general thought process behind their data collection. Or maybe more emphasis on writing notes about any doubts, even if minimal, just to give as much supplementary information about our data collection because we are doing such a large-scale collaborative project. Maybe In the video explanation of protocol, there could be some additional examples for things to possibly take notes on in our brief analysis of each paper we look at. For example, I took a note that a database said it was active since 1995, but it had no trace in our DB_JL software, little discrepancies like that can add a lot more useful supplementary data.