Project Settings - giffordlabcvr/Hepadnaviridae-GLUE GitHub Wiki

Project settings tailor the analysis and handling of sequence data within GLUE projects by defining how sequences are processed, translated, and normalized.

The Hepadnavirus-GLUE project settings are defined in this project build file, and are as follows:

1. Ignore Nucleotide Sequence Hyphens

  • Setting: ignore-nt-sequence-hyphens = true
  • Description: This setting ensures that any hyphens (-) in nucleotide sequences are ignored during processing. Hyphens, often representing alignment gaps, do not interfere with analyses such as translation or sequence comparisons.

2. Translate Beyond Possible Stop Codons

  • Setting: translate-beyond-possible-stop = true
  • Description: Enables translation of nucleotide sequences past ambiguous codons (e.g., NNN) that might represent a possible stop codon. This setting is useful for sequences with uncertain or incomplete data, ensuring continuity in protein translations.

3. Translate Beyond Definite Stop Codons

  • Setting: translate-beyond-definite-stop = true
  • Description: Allows translation of nucleotide sequences beyond known stop codons. This feature is essential for exploring downstream open reading frames (ORFs) and non-standard translation scenarios.

4. Sequence Reverse Complement Tracking

  • Setting: sequence-reverse-complement-boolean-field = reverse_complement
  • Description: Specifies a custom boolean field (reverse_complement) in the sequence table to track whether sequences should be reverse-complemented during processing. This ensures accurate handling of sequence orientation, particularly for viral genomes with reverse-strand transcription.

5. Sequence Rotation Normalization

  • Setting: sequence-rotation-integer-field = rotation
  • Description: Defines a custom integer field (rotation) used to apply rotation transformations to circular HBV genomes. This normalizes sequence positions, aligning them to a defined reference point for consistent analysis.