Roadmap - HobnobMancer/cazy_webscraper GitHub Wiki

Roadmap

This page describes planned updates and enhancements to cazy_webscraper ahead of future releases

v1.0

All this work is being done on the currently active update_expand branch.

  • Unit/Integration Tests

    • We aim to cover at least 95% of the entire package with tests
  • expand subcommand

    • This subcommand is used to retrieve protein sequences from GenBank and PDB structures from RCSB/PDB, and include an additional EC number filter for selecting CAZymes to retrieve structural and sequence data.
    • unit tests for the Expand module
  • Documentation

    • step-by-step tutorials for using cazy_webscraper
    • tidy documentation for publication

v1.1

  • Add video tutorials

  • Extend the expand subcommand to retrieve protein sequences from the local CAZyme database and write them to FASTA

  • Allow users to provide their own GenBank accessions for retrieval of protein sequences, in addition to other query filters

No planned version

  • GUI (Flask interface) for cazy_webscraper database