ID HQ607513; SV 1; linear; genomic RNA; STD; VRL; 4684 BP. XX AC HQ607513; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 1 strain TVV1-UR1, complete genome. XX KW . XX OS Trichomonas vaginalis virus 1 OC Viruses; Riboviria; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4684 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical isolates of Trichomonas vaginalis concurrently infected by RT strains of up to four Trichomonasvirus species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4684 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the INSDC. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX DR MD5; a4ab8cc3ea1ffcb27b747c614a24dd89. DR EuropePMC; PMC3126235; 21345965. XX FH Key Location/Qualifiers FH FT source 1..4684 FT /organism="Trichomonas vaginalis virus 1" FT /host="Trichomonas vaginalis" FT /strain="TVV1-UR1" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Jun-1999" FT /db_xref="taxon:674953" FT gene 326..4616 FT /gene="pol" FT CDS join(326..2352,2354..4616) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:F5B2T2" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:F5B2T2" FT /protein_id="AED99812.1" FT /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN FT DFFFNFLRTSTSTHISDSPGVSFVSKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN FT NTLEVDYGFGQDVSRATGTITIPIFDGEKYKEVARALSLVFSKKGMALDVTSQTVQDTL FT MNSDLTIATVAAGYYTALAARHELTKEASVAAHRIPFVTALSDTFTAADNAQRSSHVIS FT SCLRCPASNNAQRQVTVGTNMWTNVSVENLAVQGAAIPNPNDVSFFIPNKALPSSWWCA FT IWLLNAFLHSFVAQTRFHIFITPGETYNLAPFTDADIYEAIPVLLAMSKSSRPVPESVE FT SMLYAYGTQMVIQPHSLYTEGGIIRKMIFTVPHLPAHGYFVTNAEYSRYMNIAVPNDPR FT TAKDYIIGVGTGLLQVILAYQAAFSCGGPIALHWHANDAISHGMDTVAAAYLEGRYFTI FT PMAINVATNIAQYTTGVRADPQYKHSLDRILPRIFGPSTDTVFNFIESAITSSWVSINA FT TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR FT NCPILRTLKAAEAEETVTFMCTGKIGSIFAIDGTMRTFKRYQTIDLAELGWTSHGKVMK FT PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLLSKAVRCGPIIPSVKHHF FT NIRRIITVKRNGNEYVFIPGYGWVLQDDYLVNSVKMTGEDQLPPNQLPYGDDLLLIYSE FT ILLYNYISLFPKFRYKNPDLLNQETELQLFPLKTDSAARNKANFYARSLWNEAKTDKTA FT FKPGTYNDTVAGLLMWQQCALMWSLPRSVINRTISGVCDALTERTSLALLKRISDWLKQ FT LGLACSPIHRLFIELPTLLGRGAIPGDSVKDMKHRLKFDPSITVDVPRDQLHDLIYRLL FT SRNLHITNVESFDHHLEERLLWSKSGSHYYPDEEVNRLLPNQPTRKEFLDVVTVDYIKE FT CKPQVFIRQSRKLEHGKERFIYNCDTVSYVYFDFILKLFEAGWQDSEAILSPGDYTGER FT LHARISSYKYKAMLDYTDFNSQHTIRSMRLIFETMKELLPPETTFALDWCIASFDNMYT FT SDGHKWVSTLPSGHRATTFINTVLNWCYTQMVGLKFNSFMCAGDDVILLSQEPISLVPI FT LTSHFKFNPSKQSTGTRGEFLRKHYTSEGVFAYPARAIASLVSGNWLSQSLRENTPILV FT PIQNGIDRLRSRAGLLGVPWILGLSELTEREAVPRDVSMALLNSHAAGPGLITRNYSSF FT TVTPKPPTLTSTLEYTATRYGVQDLSKHVPWEQLTLEERNKLGKQIKKMSHRHCSQAKI FT TYTCVHEVYKPSGLPKVLSGASQPSLSMVWWQAMLKEAMQDNSTKKIDAQMFASSACTD FT RVSGDAFLQASAKAAGVLITSLIQSSS" FT gene 326..2362 FT /gene="cap" FT CDS 326..2362 FT /codon_start=1 FT /gene="cap" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:F5B2T3" FT /protein_id="AED99811.1" FT /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN FT DFFFNFLRTSTSTHISDSPGVSFVSKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN FT NTLEVDYGFGQDVSRATGTITIPIFDGEKYKEVARALSLVFSKKGMALDVTSQTVQDTL FT MNSDLTIATVAAGYYTALAARHELTKEASVAAHRIPFVTALSDTFTAADNAQRSSHVIS FT SCLRCPASNNAQRQVTVGTNMWTNVSVENLAVQGAAIPNPNDVSFFIPNKALPSSWWCA FT IWLLNAFLHSFVAQTRFHIFITPGETYNLAPFTDADIYEAIPVLLAMSKSSRPVPESVE FT SMLYAYGTQMVIQPHSLYTEGGIIRKMIFTVPHLPAHGYFVTNAEYSRYMNIAVPNDPR FT TAKDYIIGVGTGLLQVILAYQAAFSCGGPIALHWHANDAISHGMDTVAAAYLEGRYFTI FT PMAINVATNIAQYTTGVRADPQYKHSLDRILPRIFGPSTDTVFNFIESAITSSWVSINA FT TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR FT NCPILRTLKAAEAEETVTFMCTGKIGSIFAIDGTMRTFKRYQTIDLAELGWTSHGKVMK FT PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLFE" XX SQ Sequence 4684 BP; 1356 A; 1204 C; 943 G; 1181 T; 0 other; gcaaaaagag ggagtcaccc actttcctct ttttgcaccc caacattgtc acctcatcat 60 gacgaactca taacgcggac acataacaag cgtagtgtcc tcgacgattg ccatcctcgt 120 gtgaattccg ggctccgctt gcactgatgg tacctcttac gaaacttgga gagacttcgg 180 cctcgaagag cggtaatgta ccctctgcgc ctgggaccta atggcgtttt tgctgtaggg 240 atttttcagt ggtagaaaga ttaggggtta aacatcctgg ttcgctaggt ttgtccttaa 300 ctttactcag ctgatgggaa tacccatgga ggcttctgct aatgggttat cacatgatga 360 taatgcgaat aaatcgcaaa atgttggacc ttctactctt ccggggtcag ataaacaagg 420 aggagaaaac cacgaaaatt cttttaattc tttttcaaat gatttctttt ttaatttttt 480 acgcacatct acgagtactc atatttcaga cagtccagga gtttctttcg tctcaaagga 540 tggaacacct tacacatcag ccaccatcca gtccgcagtc ggtcgtctta cacataatgt 600 cgtcgcatca gcagtccagc tcaacattac agccaacaat acgttagagg tggactacgg 660 ttttggccaa gacgtctcaa gagctacagg aaccatcaca atcccgatct ttgatggtga 720 aaagtacaaa gaggtagctc gcgccttatc attagttttc agcaagaagg gtatggcgct 780 cgacgttaca tctcaaactg ttcaagacac cctcatgaac tccgatctca caattgctac 840 cgtcgctgct ggatattaca cagctttagc cgctcgccat gaactcacaa aagaagcaag 900 cgtcgcagcc catcgtattc ctttcgttac agccttatca gacacgttca cagctgcaga 960 caacgcgcaa cgctcaagcc acgtcatctc ttcttgcttg cgctgccctg cctcgaataa 1020 cgctcaacgc caagtcacag tcggtacgaa tatgtggacg aatgtttccg tcgaaaatct 1080 tgcagtacaa ggcgctgcaa tcccaaatcc aaacgatgta tcattcttca ttccgaacaa 1140 agccctccca tcttcttggt ggtgcgcaat ctggcttctc aacgcttttc ttcacagctt 1200 cgtcgcgcag actcgtttcc acatcttcat tacaccaggt gaaacctaca atcttgcgcc 1260 gttcacagat gccgatatct acgaagcgat ccctgttttg ctcgcgatgt caaaatcatc 1320 gcgtcctgtt ccagagagcg tcgaaagtat gctctacgcc tacggcaccc agatggttat 1380 ccaaccacac tcgctctata cggagggcgg cataattagg aaaatgatct ttaccgtccc 1440 acaccttcca gcacacggct atttcgtaac aaacgctgaa tactcgagat acatgaacat 1500 cgctgttcca aacgacccac gtacagccaa agattacata attggagtcg gcaccggtct 1560 cttacaggtc atacttgctt accaagccgc cttcagctgt ggtggtccaa tcgctctcca 1620 ctggcacgcc aacgacgcta tctcacatgg tatggatact gttgcagctg cttacctcga 1680 aggaagatac ttcaccatcc caatggctat caacgttgcg acgaatattg cccaatatac 1740 tacaggagtt agggcagatc cacagtacaa acattcactc gaccggatct taccacgcat 1800 tttcggtccg tcgacagaca cagttttcaa cttcatcgaa tccgcgatca catcttcttg 1860 ggtctcaatc aatgctacga aacgcaacgg ccgtgccaga aagttcagga cggctttcat 1920 caaccgcttt catgatccag aattcgctta catgttcggc attactggca atggcatcga 1980 gcggatggaa ggtaaagtca cgtcgaacat cgcacaggaa gttgaatacc tcaccaacgg 2040 tggtgacctt cgcaactgcc caattcttcg caccttaaag gctgctgaag cagaagagac 2100 cgtcactttc atgtgtacgg gaaagatcgg ttccatcttc gcgatcgatg gtacaatgcg 2160 cacattcaaa cggtaccaaa cgatcgacct cgctgaactc ggatggacgt cacacggcaa 2220 ggtcatgaaa ccatacgctt tcagagcccc agtcatccaa ggaatcaccg tctgcaagac 2280 agcttacaca tccacagcta tcgacatcgt cacaacagtc ttcggcccct tacgccttcg 2340 cgtaggcacc ctttttgagt aaggctgtac gttgtggccc tataatacca tccgtcaagc 2400 atcacttcaa cataagacgc atcataacag ttaaacgtaa tggtaatgaa tacgtattta 2460 tcccaggtta cggatgggta ttacaggatg attatttggt gaattccgtc aagatgactg 2520 gtgaagatca actacctcct aaccagttac cttatggcga tgatctttta cttatatatt 2580 cagaaatttt actttataat tacatatctc tttttcccaa gttcagatac aagaatccag 2640 acttattaaa tcaagaaaca gaattacaac ttttcccact taaaaccgac tcagctgcca 2700 gaaataaagc caatttttat gctagatcac tatggaatga agcaaaaaca gacaaaacag 2760 ctttcaaacc aggaacctac aatgacacag tagcaggtct attgatgtgg caacaatgtg 2820 ctctcatgtg gtcactgcct cgctcagtta tcaacagaac aattagcggc gtttgtgatg 2880 cgttaaccga aaggacttca ctcgcgctat taaaacgtat ctcagattgg ttgaaacaac 2940 tcgggctggc ttgctcacca atccatcgct tattcataga actcccaaca ctattaggac 3000 gcggagcaat tccaggcgat agtgtaaagg atatgaagca cagactcaag ttcgacccat 3060 ctataacagt agatgtccca agagaccagt tacacgatct aatctacaga ctcttatcaa 3120 gaaatcttca cataaccaac gttgagagct tcgatcacca tctagaagag cgtctgcttt 3180 ggtctaaatc cggaagtcat tactatcccg acgaggaagt caatagatta cttcccaatc 3240 aacccacaag gaaagaattc ttagacgtcg taaccgtaga ctacatcaag gaatgcaagc 3300 ctcaggtttt cataagacaa tcacgtaagc tagaacacgg caaagaacgt ttcatctaca 3360 actgcgacac agtctcatac gtctattttg attttatcct gaaactcttt gaggcaggat 3420 ggcaagatag cgaagcaata ctgtcgccag gtgactacac tggtgaacgc ttacacgcaa 3480 gaatttctag ctacaaatac aaggctatgc tcgattacac ggatttcaat tctcagcata 3540 caatccgaag catgcgactg atattcgaaa ctatgaagga gttactacca cctgaaacca 3600 cctttgctct cgactggtgt atcgcctcat tcgataacat gtacacatcc gatggccaca 3660 aatgggtctc gactctccca agcggacatc gagctactac cttcatcaac acagtattaa 3720 attggtgcta cacacaaatg gtcggtctca agttcaacag ttttatgtgc gccggtgatg 3780 atgtcatttt attgtctcaa gagccaatat cactggtccc cattcttaca tcacatttca 3840 agttcaatcc cagtaaacaa agtacaggta ctagaggtga attcttacgc aagcattaca 3900 cctcagaagg cgtgtttgca tacccggcac gagcaattgc aagcttagta agcggaaatt 3960 ggttaagcca atctttaaga gagaacactc caattttggt cccaatacaa aacggaatcg 4020 acagacttcg aagcagagct ggtttactcg gagtcccttg gatcttaggc ctctcggagc 4080 tcacagagcg agaagccgtt cctagggatg tcagcatggc tctgttaaat tcacacgctg 4140 caggaccagg tttgatcaca cggaattaca gttctttcac cgttaccccg aaaccaccta 4200 cgctaactag tacactcgag tacacagcaa cccgttacgg tgtccaagac ctgtccaaac 4260 acgtaccatg ggaacaactt acattggaag aacgtaataa gttaggaaaa caaattaaga 4320 aaatgagtca caggcattgt agccaggcaa agataacata cacttgtgtt cacgaagttt 4380 acaaaccaag tggcctcccc aaggtgttat ctggtgccag ccaaccatcg ttgtcgatgg 4440 tgtggtggca ggcaatgctt aaggaagcaa tgcaagacaa ctctactaag aagatagatg 4500 cacaaatgtt cgcttcgagt gcatgtacag accgcgtcag cggtgatgca ttcttgcaag 4560 cgagcgcaaa agctgctggt gtactaatca ctagcttgat tcaatcttct tcataacgta 4620 cagcaaaaaa agtctctata gttgctcaag acatatatga gccagatggc cctgctatac 4680 cttc 4684 //