ID HQ607518; SV 1; linear; genomic RNA; STD; VRL; 4674 BP. XX AC HQ607518; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 2 strain TVV2-OC3, complete genome. XX KW . XX OS Trichomonas vaginalis virus 2 OC Viruses; Riboviria; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4674 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical isolates of Trichomonas vaginalis concurrently infected by RT strains of up to four Trichomonasvirus species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4674 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the INSDC. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX DR MD5; eb7410747e41c1a12ffcae60a4edbb71. DR EuropePMC; PMC3126235; 21345965. XX FH Key Location/Qualifiers FH FT source 1..4674 FT /organism="Trichomonas vaginalis virus 2" FT /host="Trichomonas vaginalis" FT /strain="TVV2-OC3" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Nov-2009" FT /db_xref="taxon:674954" FT gene 298..4607 FT /gene="pol" FT CDS join(298..2380,2380..4607) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:F5B2U2" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:F5B2U2" FT /protein_id="AED99808.1" FT /translation="MASTLISSDNSATSGTVGEVINNTDTSPPDTPPSDHSNPRLTKIL FT DEMSKKPCVNINEIRKVIRNFQPQIIQPRNGNRPGAQPRTVNSFEWVVRIQSTVNTQLL FT GATNTIPEQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGN FT LQDTAQLNSPQYPQLLAYLFGQLIAIKDRLDLFRPSNPLSFADALFGFTLAQNARPRYD FT DHIHAKACTGPIVIPAATNADCGPCGFVQINANQGLTLPLGACLFVNPDTVNDQSFQDF FT LWLIFATHHRMPNQMQNNWPFALNIVSTCAAPGRQVPQAGQLTDARFHAALDTGHRILL FT SMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLSGLYYYNGASSYI FT VTPIHTDAHPGITAAIESFVDIMVLQAVFSFTGPKVVAARADANQVDASSVFGPAVAEG FT DGFVYDPRRPAPPLSAFYSEFIHRPAEQRIFQMAMSQIYGSHAPLIIANVINSIHNCKT FT KIVNNKLRATFVRRPPGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGTHEYNIL FT DELDYLFNGGDIRNCFGLNALNTRGLGQIVHVRPKRDPGKKPRRGFYTTLDGQVHPITH FT DAPLDEIYQWRDHGNLTRPYSCHILDSEGLEFADVSNGRSRGKLLVVVTTPLKTSAAYQ FT GPQLRAKAGQRYVERINQCGGNVIYPRLSTARACSSSAIDSKTLTLANQLCYLYKSSDL FT HQQLDFTIPQSYLTFLEWLLRMNPDNQKSSIRHFPNYDNQEVITCNLRNLTKEQEIELF FT PIKDIIQANRRVNAYARNLLDASPLPDFALQQMLLPKTANDVVCGILLLGEVLWLLRCP FT ISIIVGISRAICRNDSFIKDLSDFNKMLGLTKIPIANCLTELNSLQGRGVTSSDAKRDL FT THRIDDVNPHEAKISREILKEAINHIYKEEINKTEVPDTFKQHVFSSPLWVKKGAHHHP FT HFKSYDNRLEFVENVDLDRVLQSHPAVYITQASKLEHGKTRYIYNCDTVSYIYFDYILN FT YVESVWSNKHVLLNPDYMNPVIFSSLNYDEYCMLDYTDFNSQHSIASMKLVFSCLMPFL FT PYSMHSVLQWCLTSFDNMYINNVHWKSTLPSGHRATTFINSILNRAYLLPFLQVSNAFH FT TGDDVLLCGKADYATLINTVPYELNKTKQSFGSSAEFLRLHKHNNQVSGYPARAISSLV FT SGNWLSYDNPLWQPSLLSIMQQLYTISARSGLLPTLPVTMKLEVRRRYDLPTRLTNGLF FT SGDIVPSGCPCYKSNAALLSAVIPDTVLKAQPKHYDLRTLDILKHTSPWINSESKYLDL FT LDRRHMESNKKNVLYNIQYLPSKMLPMIDVDPSEALPPQKRYHPRSHIAHPLPRDAHLK FT ELRFATCRVGPATAIRLGSLWPANRINLIRPVYV" FT CDS 298..2427 FT /codon_start=1 FT /gene="pol" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:F5B2U3" FT /protein_id="AED99807.1" FT /translation="MASTLISSDNSATSGTVGEVINNTDTSPPDTPPSDHSNPRLTKIL FT DEMSKKPCVNINEIRKVIRNFQPQIIQPRNGNRPGAQPRTVNSFEWVVRIQSTVNTQLL FT GATNTIPEQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGN FT LQDTAQLNSPQYPQLLAYLFGQLIAIKDRLDLFRPSNPLSFADALFGFTLAQNARPRYD FT DHIHAKACTGPIVIPAATNADCGPCGFVQINANQGLTLPLGACLFVNPDTVNDQSFQDF FT LWLIFATHHRMPNQMQNNWPFALNIVSTCAAPGRQVPQAGQLTDARFHAALDTGHRILL FT SMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLSGLYYYNGASSYI FT VTPIHTDAHPGITAAIESFVDIMVLQAVFSFTGPKVVAARADANQVDASSVFGPAVAEG FT DGFVYDPRRPAPPLSAFYSEFIHRPAEQRIFQMAMSQIYGSHAPLIIANVINSIHNCKT FT KIVNNKLRATFVRRPPGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGTHEYNIL FT DELDYLFNGGDIRNCFGLNALNTRGLGQIVHVRPKRDPGKKPRRGFYTTLDGQVHPITH FT DAPLDEIYQWRDHGNLTRPYSCHILDSEGLEFADVSNGRSRGKLLVVVTTPLKTSAAYQ FT GPSFAPKPGSAMWNE" XX SQ Sequence 4674 BP; 1341 A; 1237 C; 860 G; 1236 T; 0 other; gctttaaaag gagtgacgac ctttaaagcc caggcctaac cagcctggtc agaaactcct 60 ggggtccatc aggagacgga tcgctaacgc gaactggata ttagagtcaa aagccttgtg 120 cgccatggat acttggtaca cttcgcggga gtaggtgacc cgcaagccgt acggttttta 180 gtggattgaa aattgtactg ttaactgcta gccttatgcg gttgctgtgt attgagaggg 240 cttatatgta gcttttatgc ttccttattg tataattgtg tacatctaat tacgataatg 300 gcttcgacgc taatatcgtc tgataattct gccacgtcag gcactgttgg tgaagttata 360 aacaacacag atacttcacc tcccgacact ccacctagtg atcactcaaa tccacgttta 420 acaaagattc tagatgaaat gtccaaaaaa ccttgtgtaa atattaatga aataagaaaa 480 gttattagaa atttccaacc tcaaattatt caacctcgta atggtaatcg cccaggcgct 540 caaccacgca cagtaaattc tttcgaatgg gttgtccgta ttcaaagcac tgtcaatact 600 caactacttg gtgcaactaa tacgattcct gaacagactc tcaacctcga tatctcgttt 660 acagatgatt ctactacaat tactccagct tccattccag gctctatttc gatgctcgac 720 aactcacgcc atatccctgc gatccagagc atgatccaga atttcaaagc ccgttactta 780 ggtaatctcc aagatacagc ccaactcaat tctccgcagt atcctcaact tcttgcctat 840 ctattcggac aattaatcgc catcaaggac cgcctcgatc ttttccgacc atcgaaccca 900 ctttcatttg ctgatgcttt atttggcttt actttagccc agaacgcacg ccctcgctac 960 gatgaccaca tacatgctaa ggcatgtaca ggacctatcg tcatcccagc agctacaaat 1020 gcagattgcg gtccttgcgg cttcgtccaa atcaatgcta atcagggcct cactttaccc 1080 ctcggcgctt gcctttttgt taatccagat acagttaacg atcaatcctt ccaagatttc 1140 ctttggctca ttttcgcaac acaccatcgc atgccaaacc agatgcaaaa taactggcca 1200 tttgctctca atatcgtctc gacatgtgcc gctccaggtc gtcaggttcc tcaagccggc 1260 caactcaccg atgccaggtt tcacgctgcc ctcgatacag gtcatcgcat cttactctca 1320 atgttcaacg atgatgaaga aacactccgc tactatcaac gcaaaggaat agaaacgatg 1380 ttcagaccat gttgcttcta cactgaaggc ggtttgctca gaaaagctac aagatatgtt 1440 tcaatggttc cactcagcgg cttatattac tacaatggcg cctcttcata cattgttact 1500 ccaatccaca ctgatgcaca cccaggaatc acagctgcaa tcgaatcctt cgtcgatatc 1560 atggtcttac aagcggtctt ttctttcacg ggccctaaag ttgtcgctgc tagagccgac 1620 gcaaaccaag ttgatgcttc ctcagtcttc ggccctgctg tcgctgaagg agatggtttt 1680 gtctacgatc ctcgccgtcc agcccctccg ctctccgcat tctacagtga attcatccac 1740 agaccagccg aacaacgcat cttccagatg gcgatgagcc agatttacgg atcccatgcc 1800 cccctcatca tcgctaacgt catcaattcc atccacaatt gcaagacaaa gatcgtcaac 1860 aacaaattac gcgccacttt cgttcgtcgt ccacccggtg ctccccatct caaggcggac 1920 accgcaatca tcaaccgctt ccatgatcca gaactcgcct atgccctcgg cattctcgct 1980 gatggtatag ctccactcga tggaacacac gaatacaaca tcctcgacga actcgattac 2040 ttgttcaacg gtggtgacat ccgcaattgc ttcggcctca acgcgctcaa cactcgcgga 2100 ttaggccaaa tcgtccacgt tcgtccgaag cgcgatccag gaaagaaacc tcgccgcggt 2160 ttctacacca ccctcgatgg acaagtccac cccatcacac acgatgctcc actcgatgag 2220 atttaccagt ggcgtgacca cggaaatctt acacgtccat attcgtgcca catcctcgac 2280 agcgaaggat tggaattcgc tgacgtctcc aacggacggt cacggggcaa gctcctcgtt 2340 gtcgtcacca caccgctcaa gacaagcgct gcctaccagg gccccagctt cgcgccaaag 2400 ccgggcagcg ctatgtggaa cgaataaacc aatgcggggg caacgtcatc tatccgcgct 2460 tgagcactgc tcgcgcctgc agctcgtctg ccattgatag taagacgttg accctcgcaa 2520 accaactgtg ctacctatac aagtcttctg acttgcacca acagttggac ttcacgatac 2580 cacaatccta cttaacattt cttgaatggc tactcagaat gaatcccgat aatcaaaaaa 2640 gttcaattcg ccacttccca aattacgata atcaagaagt tattacatgc aatttacgaa 2700 acctcacaaa agaacaagaa attgaactat ttccaataaa agacataatc caagctaacc 2760 gccgcgtaaa tgcctacgcg cgaaatctcc ttgacgcctc accacttcct gattttgctc 2820 tacagcagat gctcttacca aaaacagcta atgatgtagt ttgcggaatc ctgttacttg 2880 gtgaagtatt atggttactt cgctgtccca tctcgatcat cgtaggtatt tcccgtgcaa 2940 tatgtcgcaa tgatagtttc ataaaagatt tatcggattt caacaaaatg ttaggcttaa 3000 ctaagatccc aatagctaac tgtttgactg aattaaacag tctacagggt agaggagtca 3060 cttcaagcga tgccaaaaga gatttgaccc accgaataga cgatgtcaat cctcatgagg 3120 cgaaaataag tagagaaatc ctcaaagaag caatcaatca tatatacaaa gaagaaatca 3180 acaaaacaga agtacccgat acattcaaac aacatgtatt ctcatctcca ttatgggtta 3240 agaaaggcgc acaccatcac cctcacttca aatcttacga taaccgacta gagtttgtcg 3300 agaatgttga cttagacagg gttttacaat cacaccctgc tgtctacatt acgcaagcct 3360 caaaacttga acacggtaag acacgataca tttacaattg tgatacagtc agttacatat 3420 attttgatta tatcttgaac tacgtcgaga gtgtttggtc caataaacat gtgcttctaa 3480 accctgatta catgaaccca gtcattttta gtagtcttaa ttatgatgag tactgtatgt 3540 tggattacac tgacttcaat tcccaacatt ctattgcaag tatgaagcta gtgttttcct 3600 gtctcatgcc atttttacct tactcaatgc attcagtatt acaatggtgc cttacatcat 3660 tcgataacat gtacatcaat aatgtccact ggaaatcgac attaccatca ggacacagag 3720 caacaacatt tatcaattct atactaaaca gggcatactt gttgccgttt ctgcaagttt 3780 caaatgcatt ccacacaggt gatgacgttc tgctatgcgg aaaggctgac tatgcaacct 3840 taattaacac ggtaccttat gaactcaata aaactaagca atcgttcggg tcttcagctg 3900 aatttctacg tcttcataaa cataacaacc aggtctcggg ctacccagct cgtgcaatta 3960 gtagtctcgt aagcggaaac tggttgtcat acgataatcc actatggcaa ccttcactac 4020 tgtcaattat gcaacaattg tacaccatat cagcgagatc aggcttgtta ccaactcttc 4080 ctgtcacgat gaagttagag gttcggcgac ggtacgactt acctacacgg ctcacaaatg 4140 gattgttctc tggtgatatt gtcccaagtg gttgcccatg ttataagtca aatgctgcct 4200 tactaagcgc ggtcatccct gatacagtac ttaaggcaca accaaagcat tacgacttac 4260 gtactttgga catcctaaaa catacttcac cttggatcaa ttctgaatcc aaatatttgg 4320 atctcctgga tcgccgtcac atggaatcga ataagaaaaa tgtattatat aatattcagt 4380 atttaccttc caagatgtta ccaatgattg atgttgaccc atctgaggct cttcctccac 4440 aaaaacggta tcatccacgt tcccacatcg cacacccact cccacgtgat gctcatctca 4500 aggaattgag attcgccacg tgtagagtgg gcccggctac agcgataaga ttaggatcgc 4560 tttggcctgc gaacagaata aacctaatca ggccagtgta cgtctaagta caactgccaa 4620 aaatcttaca aaactactcg gctagagtag gagagtaata tcaactctta tgtc 4674 //