Home - Jefflier/covid-drdb-payload GitHub Wiki

We're curating a database containing the antibody resistance data of SARS-CoV-2. The data are extracted from publications (most of time are papers) and include treatments of monoclonal antibodies (mAb), plasma or serum samples (convalescent and vaccinee plasma).

This document is about how to contribute to the database, how to enter data, and how we manage the database.

The pipeline of curating data

Read the paper you have received.
Extract key data points, and organize them in several Excel spreadsheets with corresponding table headers.
Save all CSV files into corresponding folders.

Because of the heterogeneity of data, we provide different documents to explain them separately.

Comment: if you find the iso_name already exists in the database, you can skip the isolates and isolate_mutations tables. This also applies to other tables like antibodies, vaccines, etc. The goal of the template is to standardize the process, you don't need to fill in duplicated information that already exists in the database.

Suggested tools

Please use PDF readers to view the paper
You may use Adobe Illustrator to extract data from figures
You may convert PDF to word to get data from PDF tables
Please use Excel to format data
- use Excel to open CSV files
- convert all dates to the "YYYY-MM-DD" format
- Excel may convert names with - to dates, please convert them to text by prefix '
- save the CSV files in "CSV UTF-8 (Comma Delimited)" format