AB Consensus - mendessoares/BuddySuite GitHub Wiki
--consensus, -con
Description
Condense each alignment down to a single majority-rule consensus sequence. If two or more residues are tied for the highest frequency in a given column, an ambiguity character is used at that position: 'X' for protein or 'N' for nucleotide.
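The majority rule described above can be sketched in a few lines of Python. This is only an illustration, not BuddySuite's actual implementation: the function name `majority_consensus` and its `ambiguous` parameter are hypothetical. It also assumes that gap characters are counted like any other symbol, which matches the example output on this page but is not confirmed for ties that involve a gap.

```python
from collections import Counter


def majority_consensus(seqs, ambiguous="N"):
    """Return a majority-rule consensus for equal-length aligned sequences.

    Hypothetical helper for illustration only. Gaps ('-') are counted
    like any other symbol (an assumption, see the lead-in).
    """
    if not seqs:
        return ""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("all aligned sequences must have the same length")

    consensus = []
    for column in zip(*seqs):
        # Symbols in this column, sorted from most to least frequent.
        counts = Counter(column).most_common()
        top_symbol, top_count = counts[0]
        # A tie exists when the runner-up is as frequent as the leader.
        tied = len(counts) > 1 and counts[1][1] == top_count
        consensus.append(ambiguous if tied else top_symbol)
    return "".join(consensus)


# First 11 columns of the nucleotide alignment shown in the example below.
example = majority_consensus(["---atgttaga", "atgggggtgga", "atggttattga"])
print(example)
```

For a protein alignment, pass `ambiguous="X"` so that tied columns are reported with the protein ambiguity character instead of 'N'.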
Example(s)
Input file: Panx_C-terms.stklm
# STOCKHOLM 1.0
#=GF SQ 3
Mle-Panxα9 ---atgttaga------catactttcaaagtttaaaggagttactccttttaaaggtataacgatag
Mle-Panxα7A atgggggtggaaattctgtttcccataatcaacagagccaccgctccgatcaagtctgttaacatcg
Mle-Panxα4 atggttattga------gctgctagctggatacaaaggtctgtccccgtttaaagacgcgactgttg
//
# STOCKHOLM 1.0
#=GF SQ 3
Mle-Panxα9 -mldilskf--kgvtpfkgitiddgwdqlnrsfmfvllvvmgttvtvr-qytgsviscdgfkkfg--stfaedycwtqg
Mle-Panxα7A mgveilfpiinratapiksvniddlssqlnrtfmfylsltfaititirqqlggayiacdgfsrdeeyerfaeewcwssg
Mle-Panxα4 mviellagy--kglspfkdatvddswdqinrcyvfiamvvmgavttmr-qysgtliacdgftkfh--pqfaedycwsig
//
Usage example
$: alb Panx_C-terms.stklm -con
Output
# STOCKHOLM 1.0
#=GF SQ 1
consensus atggtgNtNga------gNtNctNNcaaNNtacaaaggNNtNNctccgtttaaagNtgtNacNatNg
#=GS consensus AC consensus
#=GS consensus DE Original sequences: Mle-Panxα9, Mle-Panxα7A, Mle-Panxα4
//
# STOCKHOLM 1.0
#=GF SQ 1
consensus mXXeilXXX--kgXXpfkXXtiddXwdqlnrXfmfXlXvvmgXtXtXr-qyXgXXiacdgfXkfX--XXfaedycwsXg
#=GS consensus AC consensus
#=GS consensus DE Original sequences: Mle-Panxα9, Mle-Panxα7A, Mle-Panxα4
//