PANDIT
Protein and Associated Nucleotide Domains with Inferred Trees
FAM  PF05707
PID  Zot
DES  Zonular occludens toxin (Zot)
IPR  IPR008900
ANO  19
ALN  322
AID  0.263437
APH  ((((((((Q9JY47_NEIMB/1-190:0.81979,Q38198_9VIRU/7-180:0.70789):0.26688,(Q9PCA6_XYLFA/1-188:0.64734,VG30_BPPF3/1-179:0.29902):0.45674):0.08937,O82966_RALSO/6-189:0.72172):0.11657,(Q9JTF4_NEIMA/3-196:0.56774,Q9F3Z6_NEIME/1-180:0.41179):0.13606):0.51254,((VG46_BPPF1/2-213:0.69509,(Q9MBV9_9VIRU/2-255:1.00086,ZOT_VIBCH/2-211:0.49207):0.13528):0.54197,((O86426_PSEAE/1-203:0.53121,(O56839_9VIRU/1-195:0.36339,O88128_9VIRU/1-195:0.14419):0.15693):0.34424,(Q8NL40_XANCP/1-200:0.92456,Q8ZEA3_YERPE/2-200:0.89144):0.24215):0.14246):0.20565):0.91424,O80263_9VIRU/3-188:0.62106):0.07479,VG1_BPIKE/2-187:0.27995):0.16615,VG1_BPIF1/2-187:0.08770,VG1_BPFD/2-187:0.17980);
ATP  ((((((((Q9JY47_NEIMB/1-190,Q38198_9VIRU/7-180),(Q9PCA6_XYLFA/1-188,VG30_BPPF3/1-179)),O82966_RALSO/6-189),(Q9JTF4_NEIMA/3-196,Q9F3Z6_NEIME/1-180)),((VG46_BPPF1/2-213,(Q9MBV9_9VIRU/2-255,ZOT_VIBCH/2-211)),((O86426_PSEAE/1-203,(O56839_9VIRU/1-195,O88128_9VIRU/1-195)),(Q8NL40_XANCP/1-200,Q8ZEA3_YERPE/2-200)))),O80263_9VIRU/3-188),VG1_BPIKE/2-187),VG1_BPIF1/2-187,VG1_BPFD/2-187);
ATL  14.917
DNO  19
DLN  966
DID  0.392979
DPH  ((Q8ZEA3_YERPE/2-200:0.66362,((O80263_9VIRU/3-188:0.44304,(VG1_BPIKE/2-187:0.25531,(VG1_BPIF1/2-187:0.20553,VG1_BPFD/2-187:0.20622):0.10723):0.05311):0.51346,((Q9PCA6_XYLFA/1-188:0.50900,VG30_BPPF3/1-179:0.28215):0.21990,((Q9JY47_NEIMB/1-190:0.56188,(Q38198_9VIRU/7-180:0.49442,O82966_RALSO/6-189:0.46846):0.09424):0.15389,(Q9JTF4_NEIMA/3-196:0.39896,Q9F3Z6_NEIME/1-180:0.31577):0.17880):0.13015):0.13828):0.14676):0.04250,((VG46_BPPF1/2-213:0.48860,(ZOT_VIBCH/2-211:0.44138,Q9MBV9_9VIRU/2-255:0.55980):0.08575):0.18982,Q8NL40_XANCP/1-200:0.61083):0.16327,(O56839_9VIRU/1-195:0.34455,(O86426_PSEAE/1-203:0.48300,O88128_9VIRU/1-195:0.22743):0.09474):0.18422);
DTP  ((Q8ZEA3_YERPE/2-200,((O80263_9VIRU/3-188,(VG1_BPIKE/2-187,(VG1_BPIF1/2-187,VG1_BPFD/2-187))),((Q9PCA6_XYLFA/1-188,VG30_BPPF3/1-179),((Q9JY47_NEIMB/1-190,(Q38198_9VIRU/7-180,O82966_RALSO/6-189)),(Q9JTF4_NEIMA/3-196,Q9F3Z6_NEIME/1-180))))),((VG46_BPPF1/2-213,(ZOT_VIBCH/2-211,Q9MBV9_9VIRU/2-255)),Q8NL40_XANCP/1-200),(O56839_9VIRU/1-195,(O86426_PSEAE/1-203,O88128_9VIRU/1-195)));
DTL  10.4525
RID  0.263437
RPH  ((((((((Q9JY47_NEIMB/1-190:0.81979,Q38198_9VIRU/7-180:0.70789):0.26688,(Q9PCA6_XYLFA/1-188:0.64734,VG30_BPPF3/1-179:0.29902):0.45674):0.08937,O82966_RALSO/6-189:0.72172):0.11657,(Q9JTF4_NEIMA/3-196:0.56774,Q9F3Z6_NEIME/1-180:0.41179):0.13606):0.51254,((VG46_BPPF1/2-213:0.69509,(Q9MBV9_9VIRU/2-255:1.00086,ZOT_VIBCH/2-211:0.49207):0.13528):0.54197,((O86426_PSEAE/1-203:0.53121,(O56839_9VIRU/1-195:0.36339,O88128_9VIRU/1-195:0.14419):0.15693):0.34424,(Q8NL40_XANCP/1-200:0.92456,Q8ZEA3_YERPE/2-200:0.89144):0.24215):0.14246):0.20565):0.91424,O80263_9VIRU/3-188:0.62106):0.07479,VG1_BPIKE/2-187:0.27995):0.16615,VG1_BPIF1/2-187:0.08770,VG1_BPFD/2-187:0.17980);
RTP  ((((((((Q9JY47_NEIMB/1-190,Q38198_9VIRU/7-180),(Q9PCA6_XYLFA/1-188,VG30_BPPF3/1-179)),O82966_RALSO/6-189),(Q9JTF4_NEIMA/3-196,Q9F3Z6_NEIME/1-180)),((VG46_BPPF1/2-213,(Q9MBV9_9VIRU/2-255,ZOT_VIBCH/2-211)),((O86426_PSEAE/1-203,(O56839_9VIRU/1-195,O88128_9VIRU/1-195)),(Q8NL40_XANCP/1-200,Q8ZEA3_YERPE/2-200)))),O80263_9VIRU/3-188),VG1_BPIKE/2-187),VG1_BPIF1/2-187,VG1_BPFD/2-187);
RTL  14.917
LNK  O56839_9VIRU/1-195:O56839:D89074
LNK  O80263_9VIRU/3-188:O80263:AB002632
LNK  O82966_RALSO/6-189:O82966:AB015669
LNK  O86426_PSEAE/1-203:O86426:L06240
LNK  O88128_9VIRU/1-195:O88128:AB012574
LNK  Q38198_9VIRU/7-180:Q38198:M57538
LNK  Q8NL40_XANCP/1-200:Q8NL40:AE012313
LNK  Q8ZEA3_YERPE/2-200:Q8ZEA3:AJ414151
LNK  Q9F3Z6_NEIME/1-180:Q9F3Z6:AJ277475
LNK  Q9JTF4_NEIMA/3-196:Q9JTF4:AL162757
LNK  Q9JY47_NEIMB/1-190:Q9JY47:AE002524
LNK  Q9MBV9_9VIRU/2-255:Q9MBV9:AB043679
LNK  Q9PCA6_XYLFA/1-188:Q9PCA6:AE004008
LNK  VG1_BPFD/2-187:P03655:J02451
LNK  VG1_BPIF1/2-187:O80299:U02303
LNK  VG1_BPIKE/2-187:P03658:X02139
LNK  VG30_BPPF3/1-179:P03626:M11912
LNK  VG46_BPPF1/2-213:P25131:X52107
LNK  ZOT_VIBCH/2-211:P38442:AF175708
AMK  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...xxxxxxxxxxxxxx.xx......xxxxxxxxxxxxxxxxxxxxxxxx..xxxxxxxxxxx....xxxxxxxxxxxxx.xxx.xxxxxxxxxxxxxxxxx..xxxxxxxxxxxxxxxxxxx..........................................xxxxxxxxxxxxxxxxxxxxx.....x..xxxxxxxxxxxxxxxxxx.x.xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.......xxxxxxxxxxxxxxxxxxxxxx
DMK  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.........xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...xxxxxx..................xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx......xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx............xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...xxxxxxxxx...xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx......xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx..............................................................................................................................xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...............xxx......xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...xxx...xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.....................xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
NAM  Q9F3Z6_NEIME/1-180
ASQ  MITLITGVPGSGKTLSVVSD-LAK----KINNEWK...-DRKIFIEGIPELT.IE......TTPIPEGHSIN-------------..-----------....--------DMHVW.LQ-.-YPENNGSVVIIDEAQN..VFPPRSP-AVKA-------..........................................-------PPLV-EWLHVHRHS.....G..IDIILITQMPQRIDKHVR.D.LVGAHYHIHKT-PLGIFMRYFWDY--CANNPRSEFANARP---.......-EVYKLDKKAFGLYK-SAEILL
DSQ  ATGATTACATTAATTACAGGCGTTCCCGGTTCGGGCAAAACTTTATCCGTCGTTTCGGAC---TTGGCGAAA------------AAGATAAATAATGAGTGGAAA.........---GACAGGAAGATTTTTATTGAAGGCATTCCGGAATTAACG...ATAGAA..................ACAACGCCCATTCCCGAAGGTCATTCCATTAAT---------------------------------------......---------------------------------............------------------------GATATGCACGTTTGG...CTTCAA---...---TATCCCGAAAATAACGGCTCTGTCGTCATCATAGACGAAGCGCAAAAT......GTTTTCCCGCCGCGCTCCCCT---GCCGTAAAAGCA---------------------..............................................................................................................................---------------------CCGCCTCTTGTC---GAATGGTTGCACGTTCACAGGCATTCG...............GGC......ATAGATATTATCCTGATAACGCAGATGCCCCAACGCATAGATAAGCACGTCCGC...GAT...TTGGTAGGGGCGCATTATCATATCCATAAAACC---CCGCTGGGCATTTTTATGCGTTATTTTTGGGACTAT------TGTGCGAATAATCCAAGGTCAGAGTTCGCCAATGCACGCCCC---------.....................---GAAGTCTATAAGTTGGATAAAAAGGCGTTCGGGCTTTATAAA---TCAGCGGAAATCCTACTA
TRN  10010101100011000
NAM  Q9JTF4_NEIMA/3-196
ASQ  EICLITGTPGSGKTLKMVSM-MANDEMFKPDENGI...-RRKVFT-NIKGLK.IP......HTYIETDAKKL-------------..--------PKS....TDEQLSAHDMYEW.IK-.-KPENIGSIVIVDEAQD..VWPARSA-GSKI-------..........................................-------PENV-QWLNTHRHQ.....G..IDIFVLTQGSKLLDQNLR.T.LVRKHYHIASN-KMGMRTLLEWKI--CADDPVKMASSAFS---.......-SIYTLDKKVYDLYE-SAEVHT
DSQ  GAGATCTGTTTGATAACCGGCACGCCCGGTTCAGGGAAAACATTAAAAATGGTTTCCATG---ATGGCAAACGATGAAATGTTTAAGCCGGATGAAAACGGCATA.........---CGCCGTAAAGTATTTACG---AACATCAAAGGCTTGAAG...ATACCG..................CACACCTACATAGAAACGGACGCGAAAAAGCTG---------------------------------------......------------------------CCGAAATCG............ACAGATGAGCAGCTTTCGGCGCATGATATGTACGAATGG...ATAAAG---...---AAGCCCGAAAATATCGGGTCTATTGTCATTGTAGATGAAGCTCAAGAC......GTATGGCCGGCACGCTCGGCA---GGTTCAAAAATC---------------------..............................................................................................................................---------------------CCTGAAAATGTC---CAATGGCTGAATACGCACAGACATCAG...............GGC......ATTGATATATTTGTTTTGACTCAAGGCTCTAAGCTTCTAGATCAAAATCTTAGA...ACG...CTTGTACGGAAACATTACCACATCGCTTCAAAC---AAGATGGGTATGCGTACGCTTTTAGAATGGAAAATA------TGCGCGGACGATCCCGTAAAAATGGCATCAAGCGCATTCTCC---------.....................---AGTATCTATACACTGGATAAAAAAGTTTATGACTTGTACGAA---TCAGCGGAAGTTCATACC
TRN  10010101100011000
NAM  O82966_RALSO/6-189
ASQ  PITLITATPGGGKTALAVQM-MKA----AVD---Q...-GRPLFVMGIPELK.LP......YIPTPAVSDWTEL-----------..-----------....REDPENPGMMLPY.FT-.-FP--PNSLIVLDEAQR..VFRVRTA-GSKV-------..........................................-------PDHV-AAFETVRHT.....G..VTFVLITQNPTFLDSHIR.K.LVGQHVHLRDA-GLLGRWYYEWPE--CANP-----ETFNTAPI.......KKKWSLPKSSFGLYK-SSSLHI
DSQ  CCGATCACGCTGATCACGGCGACGCCTGGTGGCGGGAAGACCGCGTTGGCGGTCCAGATG---ATGAAGGCA------------GCCGTCGAC---------CAG.........---GGGCGTCCGCTCTTCGTCATGGGCATCCCGGAGCTGAAG...CTGCCG..................TATATCCCGACGCCGGCGGTTTCGGACTGGACGGAGCTG---------------------------------......---------------------------------............CGCGAAGACCCTGAAAACCCAGGGATGATGCTGCCGTAC...TTCACC---...---TTCCCG------CCGAACTCGCTGATCGTGCTGGATGAGGCGCAGCGG......GTGTTTCGCGTCCGCACGGCC---GGGTCTAAGGTG---------------------..............................................................................................................................---------------------CCTGACCACGTG---GCGGCGTTTGAGACGGTGCGGCATACC...............GGC......GTGACGTTTGTTCTGATCACGCAGAACCCGACGTTCCTGGACAGTCATATCCGC...AAG...CTGGTGGGGCAGCATGTCCACCTGCGGGACGCT---GGCTTGCTCGGACGCTGGTACTACGAATGGCCGGAG------TGCGCGAACCCG---------------GAGACGTTCAACACTGCGCCGATC.....................AAGAAGAAGTGGAGCCTGCCGAAGTCCAGCTTTGGCCTGTACAAG---TCCTCCAGTCTGCACATC
TRN  11011111101111111
NAM  Q38198_9VIRU/7-180
ASQ  SISLLTGLPGSGKSLRIIQA-IRY----LMD---K...-GAHVYVCNIDGIS.VP......GTTPW-------------------..-----------....-------ADPHKW.QD-.-LP--AGSILFVDEAQH..FFPARRG-GDPV-------..........................................--------ETI-KAMSTIRHD.....G..VRLVLATQQPNYLDTYLR.G.LVGYHEHLLRQS--GKQKTFIFRNSQIIEEVRSPLPRIKKLYD.......YEVWKQPTECFKFYK-SAEVHT
DSQ  TCTATTTCACTGCTCACCGGCTTGCCAGGATCTGGCAAGAGCTTGCGCATTATCCAGGCG---ATTCGCTAT------------CTCATGGAC---------AAG.........---GGTGCGCACGTCTACGTGTGCAACATCGACGGCATCTCC...GTGCCC..................GGCACGACGCCGTGG---------------------------------------------------------......---------------------------------............---------------------GCTGATCCGCATAAGTGG...CAAGAT---...---CTACCG------GCTGGGTCAATTCTTTTCGTTGATGAGGCGCAGCAT......TTTTTCCCCGCACGTCGTGGC---GGGGATCCGGTC---------------------..............................................................................................................................------------------------GAAACGATC---AAGGCGATGTCCACGATTCGACACGAC...............GGC......GTGCGTTTGGTGCTTGCCACGCAGCAGCCGAACTACCTCGACACCTATCTGCGT...GGA...TTGGTCGGCTATCACGAACACCTGCTGCGTCAGAGC------GGCAAACAGAAGACCTTTATTTTCCGCAATAGTCAGATCATCGAAGAGGTGCGGTCGCCGTTGCCGCGCATCAAAAAGCTCTACGAC.....................TACGAAGTGTGGAAACAGCCAACAGAGTGCTTCAAGTTCTACAAG---TCGGCTGAGGTCCACACG
TRN  11011101101011001
NAM  VG30_BPPF3/1-179
ASQ  MITLITAVPGSGKTLYAIGL-IEA----ALS---E...-GRPVFT-NISGLV.KD......KFSNP-------------------..-----------....---HLLSDAPDDW.RD-.-TP--EGSLVVYDEAQQahLYPSNAQRGPVT-------..........................................-------DERL-TAMETHRHT.....G..HDLVFITQAPTFVHHHIR.K.LVGLHIHLYRSRGLQAASKYEWSH--VCDSP--NDRKEQQRAD.......FVLWKFPKEHFAFYT-SAVMHT
DSQ  ATGATTACGTTGATAACAGCGGTGCCGGGTTCCGGTAAAACTCTTTACGCTATAGGTTTG---ATTGAGGCT------------GCCCTTTCC---------GAA.........---GGGCGGCCAGTTTTCACA---AACATAAGTGGGTTGGTT...AAGGAT..................AAATTCTCTAATCCT---------------------------------------------------------......---------------------------------............---------CACCTTTTGTCTGATGCGCCTGATGATTGG...CGGGAC---...---ACTCCA------GAGGGATCGTTGGTGGTTTATGACGAGGCCCAGCAAgcgcacTTGTACCCTAGCAATGCCCAGCGTGGGCCGGTCACG---------------------..............................................................................................................................---------------------GACGAGCGCCTA---ACGGCAATGGAAACGCACCGGCATACC...............GGT......CATGATCTGGTTTTCATTACCCAGGCTCCCACGTTTGTGCATCATCACATTCGT...AAG...CTGGTTGGGCTGCATATCCATCTTTATCGTTCGCGTGGCCTGCAAGCGGCTTCTAAGTACGAATGGAGCCAT------GTTTGTGATTCACCT------AACGACCGGAAGGAACAACAAAGGGCTGAT.....................TTCGTTCTATGGAAGTTCCCGAAAGAACATTTTGCTTTTTACACC---AGTGCGGTTATGCACACG
TRN  10010101100011001
NAM  Q9PCA6_XYLFA/1-188
ASQ  MLNLVTGAPGNGKTLYAVDWLIRQIEIDKSLVKSG..aVPRSFYT-DIEGFD.VE......AVRRLTGYVVQ-------------..-----------....-------SAPEDW.RT-.-TP--QGSVIVYDEAHR..MFPAGRP-GRSD-------..........................................-------DPRV-CDLDTHRHG.....G..YDLMFVTQWPTKIHHELR.R.LVGEHVHLNRAMGLQTAGLYRWSR--AQDDP--YDIHQREKAE.......EEVWKFPKDRYALYA-SSTLHT
DSQ  ATGTTAAATCTTGTGACTGGTGCTCCAGGCAATGGTAAGACTTTATATGCTGTTGATTGGTTAATTAGACAGATTGAGATTGATAAATCTCTTGTTAAGTCTGGT......gctGTTCCTCGTTCTTTTTATACT---GATATTGAGGGGTTTGAT...GTTGAG..................GCAGTTCGTCGTCTGACTGGTTATGTTGTCCAG---------------------------------------......---------------------------------............---------------------TCTGCCCCTGAGGATTGG...CGGACA---...---ACGCCT------CAAGGGAGTGTGATTGTTTACGATGAAGCTCACCGT......ATGTTTCCTGCTGGTCGCCCT---GGTCGCTCTGAT---------------------..............................................................................................................................---------------------GATCCTAGGGTG---TGTGATTTAGATACACATCGACATGGT...............GGT......TATGATCTTATGTTTGTCACTCAGTGGCCGACGAAGATTCATCATGAGTTACGT...CGT...TTGGTGGGTGAGCATGTGCATTTGAACCGTGCGATGGGGTTGCAAACCGCTGGTTTATATCGTTGGTCTCGT------GCTCAGGATGATCCC------TATGATATTCATCAACGTGAAAAAGCTGAG.....................GAGGAGGTTTGGAAGTTTCCTAAGGATCGTTATGCTTTATATGCC---TCGAGCACTCTTCATACT
TRN  10010101100011010
NAM  Q9JY47_NEIMB/1-190
ASQ  MIYLFTGNMGTGKTSRVVSMILNNEDGLFKMKLEDgteVDRPLYFCHIDGLD.KR......QFKA--------------------..----------H....ELTEEQIMSAPLR.DV-.-IP--EGAVLIVDEAHY..TYPVRAA-GRPV-------..........................................-------PPYI-QELTELRHH.....G..HTVILMTQHPSQLDIFVR.N.LVSKHVHLERK-AIGMKQYY---WYKCVTS--LDNPAGVSGVE.......VASWKPPKEAFKYYK-SASQHQ
DSQ  ATGATTTATCTGTTTACAGGAAACATGGGGACAGGCAAAACCTCCCGCGTCGTCTCTATGATTTTGAACAACGAAGACGGATTGTTCAAAATGAAATTGGAAGACggcacagagGTAGACAGACCGCTTTATTTCTGCCATATCGACGGATTGGAT...AAACGG..................CAGTTTAAAGCC------------------------------------------------------------......------------------------------CAC............GAACTGACGGAAGAGCAAATCATGTCCGCCCCGCTTCGT...GATGTC---...---ATACCG------GAAGGCGCAGTGCTGATTGTTGACGAAGCGCACTAC......ACTTATCCGGTACGCGCGGCA---GGCCGTCCCGTT---------------------..............................................................................................................................---------------------CCGCCTTATATT---CAGGAACTGACAGAACTCCGCCATCAC...............GGG......CATACCGTTATTTTGATGACGCAGCACCCGAGCCAACTTGATATATTCGTCCGC...AAC...CTTGTTTCAAAGCATGTACACCTTGAACGCAAG---GCAATCGGAATGAAACAGTATTAT---------TGGTATAAATGCGTAACCTCG------TTGGACAATCCCGCAGGCGTAAGCGGCGTAGAA.....................GTCGCAAGTTGGAAACCGCCGAAAGAAGCCTTTAAATACTATAAA---TCAGCAAGCCAGCACCAA
TRN  10010101100011001
NAM  Q8ZEA3_YERPE/2-200
ASQ  AISAYIGIPGSGKSYEAVCNVIIP----AFT---S...-GRRVVT-NIYGLQ.KD......KITERYPDATGEII----------..------VVDND....DVLKADFFPFKGG.EG-.-SFCQFGDLIVIDEAWR..IFGSDKDMTAEK-------..........................................-----------KSFIAEHRHFthpetGisCDLVIVNQSLSNIARFLK.D.KIETTYRMRKLKALGLNNHYCIDVYSGHK--IY----KSNLVT.......SYRNKYNPDIFELYK-SYEGNN
DSQ  GCTATTTCTGCATATATTGGCATACCTGGCTCAGGAAAAAGTTATGAAGCCGTTTGCAATGTCATTATTCCG------------GCATTTACC---------AGC.........---GGCCGGAGAGTTGTGACG---AACATTTATGGTTTACAA...AAAGAT..................AAAATCACCGAACGTTATCCTGATGCAACGGGAGAAATTATT------------------------------......------------------GTTGTGGATAATGAT............GATGTGCTTAAAGCAGATTTCTTTCCTTTTAAAGGTGGG...GAAGGG---...---AGCTTTTGCCAGTTTGGTGATTTAATTGTTATTGATGAAGCATGGCGA......ATCTTCGGTAGCGATAAGGATATGACGGCTGAGAAG---------------------..............................................................................................................................---------------------------------AAATCATTTATTGCTGAACATCGTCATTTTacgcaccctgaaacgGGTattagcTGTGATTTGGTTATTGTAAATCAGTCACTTTCTAATATTGCTCGCTTTCTGAAA...GAC...AAAATAGAAACAACTTACCGGATGCGCAAGCTGAAAGCGTTGGGCCTGAATAATCATTACTGCATTGACGTATATTCAGGCCACAAA------ATCTAT------------AAAAGCAACCTCGTCACC.....................AGTTATCGCAATAAATATAACCCTGATATTTTTGAACTTTACAAA---AGTTATGAAGGAAATAAC
TRN  10010101100011000
NAM  O88128_9VIRU/1-195
ASQ  MIYAIAGRPGGGKTYEAVAYHIIP----AIK---D...-GRKVIT-NIT-LN.ID......WFVKVFGEDVRELIKIVDGR----..---LTDFGSTT....RPFSQIEDYSDEW.RN-.--EKGQGPLYVVDEAHM..SLPSRGL-AA---------..........................................--------PIL-EWYSIHRHY.....G..VDIILLTQNIRKVHRDIK.D.MIEVTYRCTKNTAMGSTSSYTKKVQDGC---------AGEVVN.......TSTRFYKSEYFPFYK-SHSQSN
DSQ  ATGATATACGCCATAGCAGGGAGACCAGGTGGCGGTAAAACCTATGAGGCTGTCGCCTATCACATCATTCCG------------GCCATTAAA---------GAT.........---GGCCGCAAAGTCATCACC---AATATCACC---TTAAAC...ATTGAT..................TGGTTCGTTAAGGTGTTTGGTGAAGACGTTCGAGAACTCATCAAAATCGTGGATGGACGT------------......---------TTAACGGATTTCGGTTCGACTACG............CGCCCGTTCAGCCAGATTGAAGACTACTCCGACGAATGG...CGTAAT---...------GAAAAAGGACAAGGGCCACTTTATGTGGTCGATGAGGCGCACATG......AGCTTGCCAAGTCGAGGCTTG---GCCGCG---------------------------..............................................................................................................................------------------------CCGATTCTA---GAATGGTACTCAATACACCGTCACTAC...............GGT......GTCGATATCATCTTGCTCACGCAGAACATCCGCAAAGTGCATCGAGACATTAAG...GAC...ATGATTGAAGTGACCTACCGATGCACAAAGAACACGGCCATGGGCTCAACCAGTTCTTACACCAAGAAAGTGCAAGATGGTTGT---------------------------GCCGGTGAAGTGGTGAAC.....................ACCTCTACCCGATTTTATAAGTCGGAATACTTCCCGTTCTATAAG---AGTCATTCGCAATCCAAC
TRN  10010101110011000
NAM  O56839_9VIRU/1-195
ASQ  MIYAIVGRPRSGKSYESVVYHIIP----AIQ---S...-GRKVIT-NIP-LN.IP......MFEKVFGESAKYLIKVIDAQ----..---FTEYGSMN....RPFSKVEHYLDDW.RD-.--GKNRAPLYVIDEAHM..VIPTRLGD-----------..........................................-------PKIL-EFYSMHGHY.....G..IDIIILTQNLRKIHSDIR.A.MIEMTYYCAKNTAFGSKKTYTKKV---------RIGDTREDIN.......IEQRTYKEHYFGFYQ-SHTQSS
DSQ  ATGATTTATGCAATTGTTGGCCGTCCTCGCTCTGGTAAGTCATACGAGTCTGTTGTTTACCATATTATCCCT------------GCTATCCAA---------TCT.........---GGCAGAAAGGTTATTACT---AACATCCCC---CTTAAT...ATTCCC..................ATGTTTGAAAAAGTATTTGGCGAAAGTGCAAAATATTTGATTAAGGTTATTGATGCTCAA------------......---------TTTACAGAGTATGGCTCTATGAAT............CGTCCCTTCTCTAAAGTTGAGCATTATCTCGATGATTGG...CGAGAC---...------GGCAAGAATAGAGCCCCTCTCTACGTCATTGATGAGGCTCACATG......GTTATTCCTACACGCCTTGGCGAC---------------------------------..............................................................................................................................---------------------CCTAAAATACTT---GAATTCTATTCAATGCACGGTCACTAC...............GGC......ATTGATATTATTATTCTCACTCAAAATCTAAGAAAGATTCATTCTGATATTAGA...GCA...ATGATTGAGATGACTTATTACTGTGCTAAAAATACCGCATTCGGCAGTAAAAAGACTTATACAAAAAAGGTT---------------------------CGCATCGGTGATACCAGAGAAGATATAAAC.....................ATCGAGCAACGCACCTATAAAGAACACTATTTCGGTTTTTATCAA---TCTCACACCCAAAGCTCT
TRN  10010101110011001
NAM  O86426_PSEAE/1-203
ASQ  MINLILGQPGGGKSHEAVVYHVVP----ALN---Q...-GRKVIT-NL-ALD.MD......KFKAFFPESWHLIELRDSTVEVFN..NESGEEESRVV....RPFSRVDHYADPW.RH-.-PDEGFGPLYVIDECHL..SIPLRGTP-----------..........................................-------VPVE-EWYSLHRHE.....L..ADVLLITQSYGKINRAIR.D.LVQVVYRCKKATAFGTNDRYIRKVQDGL---------RGEVVN.......TSIREYQKQFYGFWK-SHTRSS
DSQ  ATGATTAACTTGATCCTGGGGCAGCCTGGTGGCGGAAAGTCTCATGAGGCTGTTGTCTATCATGTTGTTCCT------------GCGTTGAAT---------CAA.........---GGGCGAAAGGTCATCACG---AACTTG---GCTTTGGAT...ATGGAT..................AAGTTCAAGGCGTTTTTCCCGGAGTCTTGGCATTTGATCGAGCTTCGGGATTCTACTGTTGAGGTGTTCAAC......AATGAGAGTGGTGAGGAGGAGAGTAGGGTAGTA............CGCCCGTTTAGCCGGGTTGATCATTATGCGGACCCTTGG...CGGCAT---...---CCTGATGAAGGATTCGGGCCGCTGTATGTGATCGATGAATGTCACCTT......TCGATACCGCTGCGCGGCACGCCT---------------------------------..............................................................................................................................---------------------GTGCCGGTTGAA---GAGTGGTATTCGCTTCATCGTCACGAA...............CTG......GCCGATGTGCTGTTGATTACTCAGAGTTACGGCAAGATCAACCGTGCAATTCGT...GAC...CTTGTCCAGGTCGTGTATCGCTGTAAGAAAGCCACAGCTTTCGGCACCAATGATCGCTATATCCGCAAGGTCCAGGACGGTCTG---------------------------CGCGGCGAGGTCGTGAAT.....................ACCAGCATCCGGGAGTATCAGAAACAGTTCTACGGATTCTGGAAG---TCGCATACGCGGTCCTCG
TRN  10010101100011011
NAM  Q8NL40_XANCP/1-200
ASQ  MLVFNEGVPRAGKSYDAVKNHILP----ALK---K...-GRRVFA-RLNGLR.FD......RIAK---------HLGMAENDVQH..LLVLVDTKDVS....KLFACTQDESGKW.CI-.-PDEFKDALVVIDEVHE..FYVNERKPLA---------..........................................-------PAVE-NFWALLGQN.....G..GDAVIMTQWINRLHSAVK.A.RIEKKNTFQKLTAIGMKGRYRVTYFHTTSPG------KFEKVG.......GQTLKYDPAIFPLYD-GYAPGA
DSQ  ATGCTCGTATTCAATGAAGGTGTGCCGCGTGCCGGCAAGAGTTACGACGCGGTAAAGAATCACATTCTTCCG------------GCGCTCAAA---------AAG.........---GGGCGTCGTGTGTTTGCG---CGATTGAATGGGTTGCGC...TTTGAC..................CGCATCGCCAAG---------------------------CACCTGGGCATGGCGGAGAATGATGTTCAGCAC......TTGCTTGTGCTGGTGGACACCAAGGACGTGTCG............AAGTTGTTTGCGTGCACGCAGGACGAGTCGGGCAAGTGG...TGTATC---...---CCGGACGAGTTCAAAGACGCATTGGTTGTGATCGATGAGGTGCATGAG......TTCTACGTCAACGAGCGCAAGCCGCTCGCG---------------------------..............................................................................................................................---------------------CCGGCCGTCGAG---AATTTTTGGGCGCTGCTTGGCCAGAAC...............GGC......GGCGATGCCGTGATCATGACGCAATGGATCAACCGCTTGCACTCGGCGGTCAAG...GCG...CGCATCGAGAAGAAAAACACGTTCCAGAAGCTCACCGCTATCGGCATGAAGGGCCGCTATCGCGTGACGTATTTCCACACGACCTCACCGGGC------------------AAGTTTGAGAAGGTCGGC.....................GGTCAAACGCTCAAGTACGACCCGGCCATTTTTCCGCTCTATGAC---GGCTATGCGCCTGGTGCC
TRN  11011101101011001
NAM  ZOT_VIBCH/2-211
ASQ  SIFIHHGAPGSYKTSGALWLRLLP----AIK---S...-GRHIIT-NVRGLN.LE......RMAKYLKMDVSDIS----------..---IEFIDTDH....PDGRLTMARFWHW.AR-.-----KDAFLFIDECGR..IWPPRLT-VTNLKALDTPP..........................................DLVAEDRPESFEVAFDMHRHH.....G..WDICLTTPNIAKVHNMIR.E.AAEIGYRHFNRATVGLGAKFTLTTHDAANSGQMDSHALTR---.......-QVKKIPSPIFKMYA-STTTGK
DSQ  AGTATCTTTATTCATCACGGCGCGCCAGGCTCTTATAAAACGTCAGGGGCATTATGGCTTCGTCTGCTGCCG------------GCGATTAAG---------TCA.........---GGCCGTCACATCATCACG---AATGTGCGAGGCTTAAAC...CTTGAA..................CGCATGGCTAAGTACTTAAAAATGGATGTCTCGGACATCAGT------------------------------......---------ATCGAGTTTATTGATACAGACCAT............CCTGACGGTCGCTTAACGATGGCGCGTTTTTGGCACTGG...GCGAGA---...---------------AAGGACGCGTTTCTCTTTATCGATGAATGTGGTCGC......ATCTGGCCGCCGAGACTGACG---GTCACCAATTTAAAGGCGCTCGACACGCCGCCG..............................................................................................................................GATTTGGTCGCAGAGGATAGGCCGGAGAGCTTTGAGGTGGCTTTTGACATGCATCGTCACCAC...............GGC......TGGGATATCTGCCTAACCACGCCTAACATTGCCAAAGTGCACAACATGATAAGA...GAG...GCGGCGGAGATAGGGTATCGCCACTTTAACCGCGCCACCGTGGGGCTAGGGGCAAAGTTTACCCTGACCACCCATGATGCAGCCAACTCTGGACAGATGGACTCGCACGCGCTGACACGC---------.....................---CAAGTCAAAAAAATTCCAAGTCCGATTTTTAAGATGTACGCA---AGCACCACGACAGGCAAA
TRN  10010101100011000
NAM  VG46_BPPF1/2-213
ASQ  SIKIHHGPNGSYKTSGAIQDDAVP----ALK---D...-GRVIIT-NVRGFT.LE......RAYQVFPDLPN-------TAEIINldLESLEDLE---....--------KMRTW.FQ-.-WA-PRGAFLIFDETQL..LFPKSWRE--KDLERFDYPgg.....................................peaAHAADRPMGWL-DAWTRHRHF.....N..WDIVLTTPNISYIRDDIR.M.TCEMAYKHSNLAVIGIPGRY--KE--AQHDAQLNRPPADG-TI.......IEYKRIRKQTFALYQ-STATGK
DSQ  TCGATCAAGATCCATCACGGCCCCAATGGCTCCTACAAAACCTCCGGCGCGATCCAGGATGACGCCGTGCCC------------GCGCTGAAA---------GAC.........---GGGCGGGTGATCATCACC---AACGTGCGCGGCTTCACC...CTGGAG..................CGGGCCTATCAGGTCTTCCCGGACCTGCCCAAC---------------------ACGGCGGAAATCATCAACctcgatCTGGAGTCGCTGGAAGACCTCGAA---------............------------------------AAGATGCGCACGTGG...TTTCAG---...---TGGGCG---CCCCGCGGTGCCTTCCTGATCTTCGACGAAACCCAACTG......CTGTTTCCCAAGTCCTGGCGGGAA------AAAGACCTCGAGCGCTTCGACTACCCCggtgga...............................................................................................................ccggaagcgGCCCACGCGGCCGACCGCCCCATGGGCTGGCTC---GACGCTTGGACCCGGCACCGGCATTTC...............AAC......TGGGACATTGTCCTCACCACGCCGAACATCTCCTACATCCGCGACGATATCCGC...ATG...ACCTGCGAGATGGCCTACAAGCATTCCAACCTCGCGGTGATCGGCATCCCTGGCCGCTAC------AAGGAG------GCCCAGCATGACGCCCAACTCAACCGTCCGCCCGCCGATGGC---ACCATC.....................ATCGAATACAAGCGGATCCGAAAGCAGACCTTCGCCCTCTACCAG---TCCACGGCCACCGGCAAG
TRN  11011101101011011
NAM  Q9MBV9_9VIRU/2-255
ASQ  ATSFRYGHGGSYKSACAVWFDLLP----ALR---E...-GRICIT-NIHGMQpLE......VIEQRLGEKFP-----DTAR----..---LIRISSRN....PEGFELWKYFFCW.AP-.-----IGAFILIDECQQ..IFSVNAGF--KMANIHKRPftdfephlpegfselfhsrwltidtssldngeiddcqrtrfdEQGRIIYPENFNNAFMEHRHY.....N..WDIVLLTPDFAQIPKELK.G.VAELAKQHKGKD--GIFFSNRKPR--ILE------HDPTRTVTkpskddvVYNLKVPLDVHLLYA-STVTGQ
DSQ  GCTACTTCATTTCGATACGGTCACGGTGGCTCTTACAAATCGGCTTGCGCCGTGTGGTTTGACTTACTGCCT------------GCACTGCGT---------GAA.........---GGTCGAATTTGCATTACG---AATATTCATGGCATGCAGccaCTTGAA..................GTGATTGAACAACGCCTTGGTGAAAAGTTTCCT---------------GATACGGCTCGG------------......---------CTCATTCGCATTAGCTCTCGCAAC............CCTGAAGGCTTCGAGCTTTGGAAATACTTCTTCTGTTGG...GCTCCT---...---------------ATTGGCGCGTTCATCCTCATTGATGAGTGTCAGCAA......ATCTTCTCGGTCAATGCAGGTTTC------AAAATGGCGAACATACACAAGCGCCCTttcactgactttgagcctcacttgccggaaggattctctgagctgtttcactctcgctggctaacgattgatacgtccagtttggacaatggcgagatagacgattgccaacgtacacgttttgatGAGCAAGGACGCATCATCTATCCGGAGAACTTTAACAACGCCTTTATGGAGCACCGGCACTAC...............AAC......TGGGACATTGTGTTACTCACACCTGACTTTGCTCAAATCCCGAAAGAGTTAAAA...GGT...GTCGCGGAGTTGGCCAAGCAACATAAGGGTAAAGAT------GGGATCTTCTTTTCCAACCGCAAACCGCGC------ATCTTGGAA------------------CATGACCCAACTCGAACGGTCACCaaaccaagcaaagacgatgtgGTTTATAACCTCAAGGTGCCGCTTGACGTCCACCTCCTCTACGCC---TCGACTGTCACGGGGCAA
TRN  10010101100011000
NAM  VG1_BPIKE/2-187
ASQ  AVYVVTGKLGAGKTLVAVSR-IQR----TLA---K...-GGIVAT-NLN-LK.LH......HFPQVGRYAKQ--CRVMRIA----..----------D....KPTLEDLESIGRGnLT-.-YDESKNGLLVLDECGT..WFNSRNW-SDKS-------..........................................------RQPVI-DWCLHARKL.....G..WDIIFIIQDISLMDKQAR.DaLAEHVVYCRRL------DKLNIPI----------IGGLISVLS.......GGRLPLPKVHFGIVKYGDNPQS
DSQ  GCTGTTTATGTTGTTACAGGTAAATTAGGTGCTGGTAAAACTCTGGTTGCTGTATCACGT---ATTCAGCGC------------ACATTAGCG---------AAA.........---GGTGGCATTGTTGCCACC---AATTTAAAT---CTTAAA...CTGCAT..................CATTTTCCTCAAGTTGGAAGATATGCAAAACAA------TGCCGTGTTATGCGTATTGCT------------......------------------------------GAT............AAGCCAACTCTGGAGGATTTAGAATCTATTGGTCGCGGTaatTTAACT---...---TATGATGAATCAAAGAATGGCTTATTAGTTCTTGATGAATGTGGTACT......TGGTTTAACTCAAGAAACTGG---AGCGATAAATCA---------------------..............................................................................................................................------------------AGGCAGCCTGTTATT---GATTGGTGTTTACATGCACGTAAATTA...............GGT......TGGGATATTATCTTTATTATTCAGGATATTTCTCTGATGGATAAGCAAGCGCGT...GATgctTTGGCTGAGCACGTTGTCTATTGTCGTCGCTTG------------------GATAAATTGAATATACCAATC------------------------------ATCGGTGGGTTAATATCTGTATTATCT.....................GGTGGTAGATTACCGTTACCAAAAGTGCATTTTGGTATTGTTAAATATGGTGATAATCCTCAATCT
TRN  10010101100011000
NAM  VG1_BPFD/2-187
ASQ  AVYFVTGKLGSGKTLVSVGK-IQD----KIV---A...-GCKIAT-NLD-LR.LQnlpqvgRFAK-TPRVLR-------------..----------I....PDKPSISDLLAIG.RGNdSYDENKNGLLVLDECGT..WFNTRSWNDKER-------..........................................-------QPII-DWFLHARKL.....G..WDIIFLVQDLSIVDKQARsA.LAEHVVYCRRL------DRITLP-FVGT---------LYSLVT.......GSKMPLPKLHVGVVKYGDSQLS
DSQ  GCTGTTTATTTTGTAACTGGCAAATTAGGCTCTGGAAAGACGCTCGTTAGCGTTGGTAAG---ATTCAGGAT------------AAAATTGTA---------GCT.........---GGGTGCAAAATAGCAACT---AATCTTGAT---TTAAGG...CTTCAAaacctcccgcaagtcgggAGGTTCGCTAAA---ACGCCTCGCGTTCTTAGA---------------------------------------......------------------------------ATA............CCGGATAAGCCTTCTATTTCTGATTTGCTTGCTATTGGT...CGTGGTAATgatTCCTACGACGAAAATAAAAACGGTTTGCTTGTTCTTGATGAATGCGGTACT......TGGTTTAATACCCGTTCATGGAATGACAAGGAAAGA---------------------..............................................................................................................................---------------------CAGCCGATTATT---GATTGGTTTCTTCATGCTCGTAAATTG...............GGA......TGGGATATTATTTTTCTTGTTCAGGATTTATCTATTGTTGATAAACAGGCGCGTtctGCA...TTAGCTGAACACGTTGTTTATTGTCGCCGTCTG------------------GACAGAATTACTTTACCC---TTTGTCGGCACT---------------------------TTATATTCTCTTGTTACT.....................GGCTCAAAAATGCCTCTGCCTAAATTACATGTTGGTGTTGTTAAATATGGTGATTCTCAATTAAGC
TRN  10010101100011000
NAM  VG1_BPIF1/2-187
ASQ  AVYVVTGKLGAGKTLVAVGK-IQD----KIV---S...-GCRVAT-NLD-LR.IH......KLPR--------------------..--VGIFAKSPDviriPDKPSLDDLLAIG.RGNnSYDENKNGLLVLDECGT..WFNSRSW-ADKE-------..........................................-------RQSVINWFLHARKL.....G..WDIIFLIQDLSIMDKQARvA.LAEHVVYCRRL------DKITIPF----------IGSIYSVIT.......GSKLPLPKVHVGIVKYGDSPQS
DSQ  GCTGTATATGTCGTAACGGGAAAACTTGGTGCAGGTAAAACACTTGTCGCTGTCGGTAAA---ATCCAGGAC------------AAAATTGTT---------TCT.........---GGTTGCAGGGTGGCTACG---AATCTCGAT---TTGCGC...ATTCAT..................AAGCTACCGAGA------------------------------------------------------------......------GTGGGTATATTTGCAAAGTCACCGGATgtaatacggataCCTGATAAACCATCACTGGATGACTTACTGGCAATCGGG...AGAGGCAATaatTCCTATGATGAAAATAAAAATGGATTACTGGTTCTTGATGAATGTGGGACA......TGGTTTAACTCACGCTCATGG---GCTGATAAAGAA---------------------..............................................................................................................................---------------------AGACAGTCCGTAATTAACTGGTTTTTACATGCCAGAAAACTG...............GGC......TGGGATATTATTTTTCTTATTCAGGATTTATCCATCATGGATAAGCAAGCTCGTgtgGCA...CTGGCAGAGCATGTCGTTTATTGCCGACGCCTC------------------GATAAAATCACAATTCCATTT------------------------------ATTGGCTCAATTTATAGTGTAATAACG.....................GGATCAAAACTTCCGCTACCAAAAGTACATGTCGGGATTGTTAAATATGGTGACTCTCCGCAGTCA
TRN  10010101100011000
NAM  O80263_9VIRU/3-188
ASQ  SVYFVTGKLGSGKSLIAVSR-IRD----ALM---R...-GVPVAT-NLN-IN.LK......EMIGRDKRNTRLYRLPDKPT----..----------V....EDIEILGYAISPY.DT-.-SK---DGLIVLDECGT..WFNSRTWNDKNR-------..........................................-------QALL-DRFLHIRKL.....G..WDVIFIVQNISMVDKQAR.EgLAEHVVHCKRLDRMQIPYLSTIVW--ILT--------LGQ---.......-LKIPMPKLHIGIVKYGDTINA
DSQ  AGTGTTTATTTTGTAACGGGTAAACTGGGCTCTGGAAAGTCCTTGATTGCTGTCAGTCGC---ATTCGTGAC------------GCTCTCATG---------AGA.........---GGCGTACCAGTTGCCACT---AATCTCAAT---ATCAAC...CTAAAG..................GAAATGATAGGCCGTGATAAACGGAACACCCGTTTATATCGTCTGCCTGACAAGCCTACG------------......------------------------------GTC............GAAGATATTGAGATACTTGGCTATGCAATAAGTCCTTAC...GATACG---...---TCTAAG---------GACGGTCTAATCGTGCTTGATGAGTGCGGTACT......TGGTTTAACTCTCGCACATGGAACGATAAGAACCGA---------------------..............................................................................................................................---------------------CAAGCCTTACTC---GATAGATTTTTGCATATCCGCAAATTG...............GGA......TGGGATGTCATTTTCATCGTTCAGAACATTTCAATGGTTGATAAACAAGCCCGT...GAGggtCTTGCTGAGCACGTTGTTCACTGCAAGCGATTAGACCGGATGCAAATCCCTTATCTATCTACTATTGTTTGG------ATACTTACG------------------------CTTGGTCAG---------.....................---CTCAAAATCCCTATGCCAAAACTTCACATCGGCATAGTCAAATACGGCGATACGATTAACGCT
TRN  10010101100011000
//