tra1 - dictyBase/community-annotations GitHub Wiki
The tra1 gene model derived from the genomic sequence contains a 2 nt gap in the 3rd exon, introduced to create the best possible open reading frame. ESTs show that the chromosomal sequence has a 1 nt deletion, which causes a frame shift. The deleted nucleotide and the 2 nt gap together remove exactly one codon, so the protein sequence predicted from the genomic sequence is missing one asparagine residue at position 3856.
MSTNPPQPPPSSTIASNPPQPIATPMSTTNPSQPTITSSSAASSSSSSSSGNPVNFESYA RRCFELNNNNEQTQLLALVTEIRDNIELVHTVEYPTFLNFLFPVFYNILRQGAVQFNDGP EQKIRNTILDILNKLPNNELLRPHILVLLQLSMYLLEVDNEENALVCLRIIIELHKNYRN ALESEIQPFLNIVLKLYTDLPSTIEKTFSSSSSASLSTTTTAISPTTTTTTTPATATTPA TTTATGNTITTPPPATPPSTTATAISPTSSTTTTTTATTAAAATIATTTATTTITPPLPP YMIKSIESFKILTECPIVVILLFQLYNSYMSSNVPKFIPLIIETLSLQAPANSTVTHHSQ YVDFIAAQVKTLYLLAYVLKWHIEQIKQYSDRFPRSVIQLLQNCPAHSSAIRKELLVTLR HILSSDFKSKFIVYLDLLLDEKIILGTSRTSYESLRSMAYGSLADFIHNMRNELNINQIS KVVAIYSRHLHDQTNPVSIQIMSVKLIISLMDVIQRKQDPPEYKSRSIIYKVIESFINKF SSLKRSIPKLLADQQKEKEKELKDPQSLKDKLDGLSSANTTTSSTGEIIILDPVKDTRTL IKTMTSSLRNIFWSLSACPINKPGTGITTGAGATTTTTTNTNNTIIPPVRIALPSIEESL LFIKLFKSTVKCFPIYGGCNPSPQEEKEMIENFTASFMMLDQRTFQEVSTFILPFLYQRS LNNPSLLLIPQGFLSVTQMNPTGVQINRVFLEVLTPFLYEKIRNLQPTDKPDICMIKLIK LIFNAIQPNNNSGVGGSGGSNSSGGGGGGGSNSSNNSTNSNTTTNIDSTCVQQVLSSMIL ILLKLITESKQIDSIQYLLLLKTIFKSCTRPDQSKEITLLFPIILETLNDLLLSSSHSTM IPAVQQLLIELSLSIPVQIATLLPSLHLLVKPLMLALDSSSSELLSTTFRILELIVDNAT GDFLLFTFRDNKSEFLQILSKHLRPAPYFYGPHAIRILGKMAGKSRSFSVLSPILSIDST SNSRSIPSSNKNNNNNNYYYNGSCSNSENYSKVFKLLLPCETGDDKTKSIPLDKSIQSIK NILLYQLDDSYLQSNAYSLLKYYISLYLSSQDFLINQQSLLNELLNNLKQSNNNNNNNSS TVNLNIIELDNENENENDNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNIKTFKT KEEYLNEIKNFKDLVYCLFLSITNDHLKEKFDSLKFLNNFIYHFVLYLSTFKFNYSIISM KELDPKIFLEALVDVMSMSSHNIINQSNIEDLQTISTSKFNKAHITSLLDMIFNCSNQIF SENSNSKKNNELMTSSTDVKDGEKVEMETEDSLKKDEMSAAATSEIKKETNVVVENEQDK DTVLISPIFKYLVKLFIKCCYDKDFSVKGAGLIGIEYIIENVKLSWIQPFQHLILKSLLF VCEDLSYSGYQPTIDYASEIIINLIKLCVPNLNIVPDSMEIDQSTTTASTTETAATTTTT ETATPMVTESTAIVTEPTATTPTATPTSTPTSTSTPTPTPIPTATTSSTTTAPTTTTTTT TNLSSSSTINQKPHCKLNQLKLKDRELLKLILEILMERITSWSGHTRSLAQRMLTMISVE ITKIPMSQLIEDLKMTVQKLLPKTPLKSLSISLQTGVIDGLTFCLSQKPSPLIEIGADTV RVLQECLNVAGDESSPTQQSQIKSSSAKSISATNNLRVCGVEMVATAMTCPDFLQFECLE FKNRIIRMFFKVVTARNKEMAMAAKRGLANSIQQQRLHRDLLQTCLRPVLSNITDPKSLS VPFLQGLSRLLELLSNCFNAALGEKLFEYLKKFEEAGKLSYLANKYRDSEEVKICASIID IFHLLPPAAKLLDSTIILTIRLEQSLCKEVTSPYREPLIRFLAKYPQRTIEIFMGQLPQF 
NLIFRLILKHQPLSKPIVEELANTYSIWLEAHLKSPSADIRFHTLSMVSIIRKQLPNWLP ENRKVLDILIEYWRPLSHMIQSASNPLDISNQTLRETKIIVKCFLQYCKAHSEETDLYFY MLSVLTLRASMDFNFLRDYYQHDLAPSSTIEQKKKIIQTFLIFFKDQTIPSDNKVQAIQN LITPILTNYFHQTDRNSSSGGGIIEDSLFIQLTKQTLETEVKASYDDTLLIELLQLETLL VKNLSSVLVDCRKELIKFAWNHLKNEDLTCKQSAYILACGFIEAYETPHKIVLQVYVPLL RAYQPESKHLVKQALDILMPCFKTRLPGGDPKNSTWVKWTKKIIVEEGHTTAQLVHIIQL IVRHPQLFYPSRSQFVPHIILLLPKIALGSNLTAENKKLSIDIADTIIIWEKMRMSNLQQ SIKTSSSSLPTTTTTTTSSNKPTDSSSLPPNTPIAEGSITTPSQGGVATPNVSDSTPTPG IHHGATNIDDEYRPPLSAIEHISLFLIRMASNWYHINEKCSELLRQTLVIWPETNIKFSV FEKPMNTDQPQMISTCLSMLNLIAEYQVNTFIPNNVVALQQSLLQALNSDNAKISSLLGS LFKKILAAFPLPTNNTTTTTPVSSTTTTEQSSDSSSLPPPPPVQVTKPIPNEMVSFYTFI GTQFEMILGAFDKNYNLSILSNIKVFSDHSESFIDPYISLIVKVLIRLTRNYLSQDSDGG TGSLANKPLSSSGSTSQTGGASQTATSASNVVLKKSNSEIISGLCKTYGFLKTKTTKLNS DQRNAFIQSLLVLIERSNDVELLSEIIKVVDYLISISPSPSPSTTPVVTETTIPSTTTTT TTAATTTTTTTPSTTTTAATTTTAPTTTETTTTAATTTITPFLTIKEKINFLIKLGRVDQ LSNAELSLSYYKLVLSFYSESNSSSKQELSQLEPCFMMGLRNTVDQGMRKSLFNILHKSI GTTPYQRLNYIIGVQQWDILGTTYWIKHALDLLLAILPNDKFVKISNFCSKLPTSLKFAN RNGNDINQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHHQQEQPMEIDENLV VEQSSSVNKGNEEFKKSLKLHTQWLESLKNEESLKFSEFNENLRELIFIDSHLVNDLWCH LFSDMWSDLTKEEQFKLSKSLTLLLSKDYTKKVPLVSKPIIPPTSIPISKPIITTTTSTS SSTSTTTPITTIPLINNSQLITIVTLTQQQTNPIIVPSLREPNVIKTWMETLGMCKPIPK VPIEVISFLGENYNCWYYAIRMIEQQLIDRQKLLDSTDINWDYLSYLYGAIGEKDLLYGI YRKRYQCDETKLGLLLEQFYMFQSSQEVFLSAMNKYSAVGCKPTPRSENLLWEDHWLECA KRLNQWNFVHEFSKEKNMYDLTIESAWKIPQWNSVKENMKKMMSQGDTSIRKILQGYFLT NEKRYHEVDPAIVTSNQLILDKWVSLPERSFRSHTNSLVEMQQVVELQESVHILKEISNI TLSQQPADLSRSFLTSNYIKSIFNIWRERLPNKDEDLLIWFELMAWRQQVFNIIGTPSMN GGIGANPVTPTNTTTTITNPDGTTTTTTTPLPPPQQPINQIEFASPRYMVLEMAWTMNKY SHIVRKHNIIEVCLNSLSKMFDLQIELHDIFLNLKEQIKCYLQLPTHYDTGISIINSTNL DFFTPMQKGEFLQLKGEFLNRLGRYDEANQSFASSVSQYENSAKNWISWAHFCDNQFTNH SSSSITPSSTPTTYDIKTQWAESAISCYIQGIKCDPKYGSRYVPRIFWLLYLNGSGEVPQ QIQTQQQQQQAAAQGGLPPQPRKLTPAQSVFQSFLNSWTILPQWIWLNYMPQLISGAANL LNFPGYGFLCWQMIGKICYLFPNSSYYHFRKLVLEMKSNASKFTTSPPTTTTTATTTTTA 
TTTITTATTTSTPTQNTTPTQNTTTPIKEESSTTTATTTPAVPSTSTPTSTSAPAPISTS TNTPPNATTTTPQANTTSPPPPSSTFSPLKMTETLSLGLHQYHSCLINEIDMMLGSFSIL SGSIPAVYQFNGSLNQILLEAFKLNKIEDSIYNSIRSLYKHYFVNEIKYQNSKEFLEAYK SEFKVDFIEFNLDDIVSDTIKKESDQETNVVEGTTNVSKELETTSEKQQQQQQQLPTISI LLLIEKLIKWIDRKPDNSLITVVNTDQSTNYYGIDTMICLESICPQLVNFKPSILEIPGQ YNTNRDPNIENNVKVEKVGMFAKLIKHSNGMVCPRITLYGGNGKAYQFLIESSPSLINGI TNSNNNNVARVYERKNQLLGSINSMLIKNRETRRRGLTLNSYPTVVPIKNSLTMIQNIGN DSIKQLAEVWYTHSNQSNLFKPMLKYKEMLLNSNLHTELLSKKDQDGDLEFTNITEDNNI SSSSSSSSSSGSNSGENSPIIDSSKLVVFREMSKEIGDELMINYIQSTLLPTNYQDQYEF KLNFSNQFGLHSLLQYILFSDIGDIDPSKIYLTKSTGSVYYNDWSLKLTNRKLGFDLLQD NPYNQQQLLRLSPNIRNYLGPLYLEGSYLSSMISTCICLSDLKDQLVNSINLFIFDEYMC MNNVEPLQQSEQNKDRNIHYEFIDKTTATVHQMLENRIDSLTPSSQPDKTCFISPIVKKV NQLIQNSLSSNISQLDQLSCPWL
pfey October 2009
Alignment of the correct protein from ESTs (top) and the protein from the genomic sequence with the 2 nt gap (bottom). Because of the frame shift, the bottom sequence is missing the Asn residue at position 3856.
Correct:  301 TTTITTATTTSTPTQNTTPTQNTTTPIKEESSTTTATTTPAVPSTSTPTSTSAPAPISTS 360
              TTTITTATTTSTPTQ TTPTQNTTTPIKEESSTTTATTTPAVPSTSTPTSTSAPAPISTS
Genomic: 3841 TTTITTATTTSTPTQ-TTPTQNTTTPIKEESSTTTATTTPAVPSTSTPTSTSAPAPISTS 3899

Correct:  361 TNTPPNATTTTPQANTTSPPPPSSTFSPLKMTETLSLGLHQYHSCLINEIDMMLGSFSIL 420
              TNTPPNATTTTPQANTTSPPPPSSTFSPLKMTETLSLGLHQYHSCLINEIDMMLGSFSIL
Genomic: 3900 TNTPPNATTTTPQANTTSPPPPSSTFSPLKMTETLSLGLHQYHSCLINEIDMMLGSFSIL 3959
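The coordinates in the protein alignment can be checked with a few lines of Python. This is a minimal sketch: the function name `gap_coordinates` is hypothetical, and it assumes that the start coordinate 3841 on the Genomic row counts residues of the full-length protein.

```python
def gap_coordinates(aligned, start):
    """Return (gaps, end) for one alignment row.

    gaps: coordinate at which each '-' falls (the position the missing
          residue would occupy), counting from `start`.
    end:  coordinate of the last residue in the row.
    """
    position = start
    gaps = []
    for symbol in aligned:
        if symbol == "-":
            gaps.append(position)  # gap does not consume a coordinate
        else:
            position += 1
    return gaps, position - 1


# Genomic row of the first alignment block, copied from above.
genomic_row = "TTTITTATTTSTPTQ-TTPTQNTTTPIKEESSTTTATTTPAVPSTSTPTSTSAPAPISTS"
gaps, end = gap_coordinates(genomic_row, 3841)
print(gaps, end)  # prints [3856] 3899
```

The single gap falls at coordinate 3856, and the row ends at 3899, in agreement with the numbers shown in the alignment.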
Alignment of the genomic sequence (top) and the EST sequence (bottom); note the 1 nt deletion in the genomic sequence.
Genomic: 396 cagcaacaacaacaacaacagcaacaacaacaataacaacagctactacaacctcaacac 455
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
EST:     301 cagcaacaacaacaacaacagcaacaacaacaataacaacagctactacaacctcaacac 360

Genomic: 456 caactc-aaatacaacaccaactcaaaatacaactacacctattaaagaagaatcatcta 514
             |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
EST:     361 caactcaaaatacaacaccaactcaaaatacaactacacctattaaagaagaatcatcta 420
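The effect of the deletion on the reading frame can be illustrated by translating the second block of the nucleotide alignment. The sketch below is an illustration only, not the procedure used for the annotation. It assumes the standard genetic code and a reading frame that starts at the third nucleotide of the block (chosen because it reproduces the residues shown in the protein alignment). The placement of the 2 nt gap is an arbitrary choice made for this example, and `translate` and the variable names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, offset=0):
    """Translate complete codons of `dna`, starting at 0-based `offset`."""
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# Second alignment block, copied from above (EST 361-420, genomic 456-514).
est = "caactcaaaatacaacaccaactcaaaatacaactacacctattaaagaagaatcatcta"
genomic = "caactc-aaatacaacaccaactcaaaatacaactacacctattaaagaagaatcatcta".replace("-", "")

# Assumption: skip 2 more nucleotides right after the "caa" codon to mimic
# the 2 nt gap; the actual gap placement in the gene model may differ.
genomic_gapped = genomic[:8] + genomic[10:]

print(translate(est, 2))             # TQNTTPTQNTTTPIKEESS
print(translate(genomic, 2))         # TQIQHQLKIQLHLLKKNHL
print(translate(genomic_gapped, 2))  # TQTTPTQNTTTPIKEESS
```

Under these assumptions, the EST translation matches the correct protein, the ungapped genomic sequence goes out of frame after the first two residues, and skipping two further nucleotides restores the frame at the cost of the single asparagine.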
pfey August 2022
return to tra1 Gene Page