ScoreBigwig - loosolab/TOBIAS GitHub Wiki
Background
The main task in footprinting is to identify regions of protein binding across the genome. Using single basepair cutsite tracks (as produced by ATACorrect), TOBIAS ScoreBigwig
is used to calculate a continuous footprinting score across regions. The calculation of the TOBIAS footprint score is as follows:
This score takes into account the depletion of signal (negative counts), while also including the general accessibility of the nearby region. This helps to improve binding prediction for transcription factors with low footprintability (i.e. showing weak footprints). It is therefore important to note is that if there is no depletion, the score will converge to the accessibility of the flanking regions. A clear shape of a footprint is therefore not a necessity for a high footprint score to occur.
Example command
$ TOBIAS FootprintScores --signal test_data/Bcell_corrected.bw --regions test_data/merged_peaks.bed --output Bcell_footprints.bw --cores 8
~1 minute
Input parameters
-
--signal
Signal bigwig containing cutsites per basepair - e.g. {prefix}_corrected.bw from ATACorrect -
--regions
Limits the computation of footprints to regions found in--regions
.bed-file (in most cases the peaks of accessible chromatin). -
--score The score to apply to the input --signal. The default is "footprint", which calculates the footprint score depicted above. Other options include "sum" and "mean", which can be used to calculate other types of scores on the input
--signal
.
Full input parameters can be found by running TOBIAS ScoreBigwig --help
.
Output
- Bigwig containing calculated scores. Positions with no score assigned can be interpreted as 0.