Home - Jefflier/covid-drdb-payload GitHub Wiki
We're curating a database containing the antibody resistance data of SARS-CoV-2. The data are extracted from publications (most of time are papers) and include treatments of monoclonal antibodies (mAb), plasma or serum samples (convalescent and vaccinee plasma).
This document is about how to contribute to the database, how to enter data, and how we manage the database.
The pipeline of curating data
- Read the paper you have received.
- Extract key data points, and organize them in several Excel spreadsheets with corresponding table headers.
- Save all CSV files into corresponding folders.
Because of the heterogeneity of data, we provide different documents to explain them separately.
- Enter neutralization data (metadata tables)
- Enter neutralization data (plasma-or-serum-samples)
- Enter-neutralization-data-(mAb).md
Comment: if you find the iso_name
already exists in the database, you can skip the isolates
and isolate_mutations
tables. This also applies to other tables like antibodies
, vaccines
, etc. The goal of the template is to standardize the process, you don't need to fill in duplicated information that already exists in the database.
Suggested tools
- Please use PDF readers to view the paper
- You may use Adobe Illustrator to extract data from figures
- You may convert PDF to word to get data from PDF tables
- Please use Excel to format data
- use Excel to open CSV files
- convert all dates to the "YYYY-MM-DD" format
- Excel may convert names with
-
to dates, please convert them to text by prefix'
- save the CSV files in "CSV UTF-8 (Comma Delimited)" format