Internal organization of ASE - molgenis/systemsgenetics GitHub Wiki
Internal organisation of CTS ASE
The cell type specific (CTS) allele specific expression (ASE) module is separated into three sub-modules:
ASreads
This sub-module is able to quantify the number sequencing reads that overlap an allele at heterozygous SNP sites, by integrating any quantitative sequencing experiment (like RNA-seq or proseq) with genotype information.
The data created in this module can be used in the later modules for the quantification of allele specific expression.
ASEperSNP
This sub-module combines multiple ASfiles (Produced previously in the ASreads sub-module) of samples, and determines the probability of allelic imbalance per SNP. This module uses multiple methods of finding the allelic imbalance using a likelihood ratio test (LRT):
- Binomial LRT
- Beta binomial LRT
When dealing with heterogeneous tissue, and cell proportions are available:
- CTS Binomial LRT
- CTS beta binomial LRT
Per SNP, a P value is determined. The P value is the probability of finding no ASE. Lower P values mean more evidence for ASE at this SNP. Usually the Beta Binomial (BB) test is better at correction for sample specific biases, thus, the results of the Beta Binomial should usually be considered to be more accurate.
ASEperRegion
This sub-module combines multiple ASfiles (Produced previously in the ASreads submodule) of samples, and determines the probability of allelic imbalance per genomic region. This module uses multiple methods of finding the allelic imbalance using a likelihood ratio test (LRT):
- Binomial LRT
- Beta binomial LRT
Per Region, and test SNP, a P-value is determined, in the same was as the ASEperSNP module.