ID   HQ607523; SV 1; linear; genomic RNA; STD; VRL; 4680 BP.
XX
AC   HQ607523;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 1 strain TVV1-OC5, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; Riboviria; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4680
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical isolates of Trichomonas vaginalis concurrently infected by
RT   strains of up to four Trichomonasvirus species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4680
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the INSDC.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
DR   MD5; b89c65c649b9f6107d5d8712427786db.
DR   EuropePMC; PMC3126235; 21345965.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4680
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV1-OC5"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Jan-2010"
FT                   /db_xref="taxon:674953"
FT   gene            323..4613
FT                   /gene="pol"
FT   CDS             join(323..2349,2351..4613)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /db_xref="GOA:F5B2V2"
FT                   /db_xref="InterPro:IPR001795"
FT                   /db_xref="UniProtKB/TrEMBL:F5B2V2"
FT                   /protein_id="AED99820.1"
FT                   /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN
FT                   DFFFNFLRTSTSTHISDSPGVSFVLKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN
FT                   NTLEVDYGFGQDVSRATGTITIPVFDGEKYKEVARALSLIFSKKGTAIDTTSQTIQDTL
FT                   KNSDLTIATVAAGYYTALAARHELTKAESTAAHRIPFATALSDTFSAAGDALRSSHVIS
FT                   SCLRCPASNNAQRQVTVGTNMWTNVSVENIAVQGLIIPNPNDVSFFIPNKSLPPSWWCA
FT                   IWLLNAFLHSFIAQTRIHIFITPGETYNLAPFTDADIYEAIPILLGMSKTSRPVPESVE
FT                   SMLYAYGAQMVIQPHSLYTEGGIVRRMIFTVPHLPAHGYFIANTEYSRYMNIAVPNDPR
FT                   TAKDYIIGVGTGLLQIILAYQAAFSCAGPIALHWHDNDAISNGMDTVAAAYLEGRYFTI
FT                   PMAVNVATNIAQYTTRVRADPQYKHTLDRILPRIFGPSTDTVFNFIESAITSSWVSINA
FT                   TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR
FT                   NCPILRTLKAAEAEETVTFMCTGKIGSIFAVDGTMRTLKRYQTVDLADLGWTSHGKVMK
FT                   PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLLSKAVRCGPIIPSVRHHF
FT                   NIRHIITVKRNGNEYVFIPGYGWVLQDDYLVNSVKMTGVDQLPPNQLPYGDDLLFIYAE
FT                   ILLYNYISLFPKFRYKNPDLLNQETELQLFPLKTDSAARNKANFYARSLWNEAKTDKTA
FT                   FRPGTYNDTVAGLLMWQQCALMWSLPRSVINRTISGVCDALTERTSLALLKRISDWLKQ
FT                   LGLACSPIHRLFIELPTLLGRGAIPGDNVKDMKHRLKFDPSITVDVPKDELHALIYRLL
FT                   SRNLNITKVDSFEHHLEERLLWSKSGSHYYPDDRINQLLPKQPTRKEFLDVVTVDYIKE
FT                   CKPHVFIRQSRKLEHGKERFIYNCDTISYVYFDFILKLFEAGWQDSEAILSPGDYTSER
FT                   LHTRISNYKYKAMLDYTDFNSQHTIQSMRLIFETMKELLPPETTFALDWCIASFDNMYT
FT                   SDGHKWVSTLPSGHRATTFINTVLNWCYTQMVGLKFNSFMCAGDDVILLSQEPISLAPI
FT                   LTSHFKFNPSKQSTGTRGEFLRKHYTTEGVFAYPTRAIASLVSGNWLSQSLRENTPILV
FT                   PIQNGVDRLRSRAGLLGVPWILGLSELTEREAIPRDVSMALLNSHAAGPGLITRNYSSF
FT                   TVTPKPPKLTSTLEYTATRFGVQDLSKHVPWEQLTLEERNKLGKQIKKMSHRHCSQAKI
FT                   TYTCVHDFYKPSGLPTVLSGASQPSLSMAWWQAMLKEAMQDNFTKKLDAQMFASNACTD
FT                   CVSGDAFLQASAKTAGVLFTSSILSSS"
FT   CDS             323..2359
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="capsid protein"
FT                   /db_xref="UniProtKB/TrEMBL:F5B2V3"
FT                   /protein_id="AED99819.1"
FT                   /translation="MEASANGLSHDDNANKSQNVGPSTLPGSDKQGGENHENSFNSFSN
FT                   DFFFNFLRTSTSTHISDSPGVSFVLKDGTPYTSATIQSAVGRLTHNVVASAVQLNITAN
FT                   NTLEVDYGFGQDVSRATGTITIPVFDGEKYKEVARALSLIFSKKGTAIDTTSQTIQDTL
FT                   KNSDLTIATVAAGYYTALAARHELTKAESTAAHRIPFATALSDTFSAAGDALRSSHVIS
FT                   SCLRCPASNNAQRQVTVGTNMWTNVSVENIAVQGLIIPNPNDVSFFIPNKSLPPSWWCA
FT                   IWLLNAFLHSFIAQTRIHIFITPGETYNLAPFTDADIYEAIPILLGMSKTSRPVPESVE
FT                   SMLYAYGAQMVIQPHSLYTEGGIVRRMIFTVPHLPAHGYFIANTEYSRYMNIAVPNDPR
FT                   TAKDYIIGVGTGLLQIILAYQAAFSCAGPIALHWHDNDAISNGMDTVAAAYLEGRYFTI
FT                   PMAVNVATNIAQYTTRVRADPQYKHTLDRILPRIFGPSTDTVFNFIESAITSSWVSINA
FT                   TKRNGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVEYLTNGGDLR
FT                   NCPILRTLKAAEAEETVTFMCTGKIGSIFAVDGTMRTLKRYQTVDLADLGWTSHGKVMK
FT                   PYAFRAPVIQGITVCKTAYTSTAIDIVTTVFGPLRLRVGTLFE"
XX
SQ   Sequence 4680 BP; 1377 A; 1198 C; 928 G; 1177 T; 0 other;
     gcaaaaagga gggagtagtc cgctcttctc ctttttgcac tcaacatttt tactccatca        60
     tgacgaatcc atgacatgga catgtaacaa gcgttttgtc ctcgatgatt gccatcctcg       120
     tgtgaactcc gggcaccgct tgcactgatg atacctctta caaagctgga gagacacccg       180
     tcttgaagag ccgtaatgta tcctctgcgc ctgggaccta atggtgattt tgctgtaggt       240
     actttttaag ggaggaatta gggttgaaca tactagttcg ctagtatgcc ttttttctac       300
     tttattaaga aattgaatac ccatggaggc ttctgctaat gggttatcac atgatgataa       360
     tgcgaataaa tcgcaaaatg ttggaccttc tactcttccg gggtcagata aacaaggagg       420
     agaaaaccac gaaaattctt ttaattcttt ttcaaatgat ttctttttta attttttacg       480
     cacatctacg agtactcaca tttcagacag tccaggagtt tcttttgttt tgaaggatgg       540
     aacaccatat acatccgcta ccatccaatc cgctgtcggt cgtcttacac acaatgtcgt       600
     cgcatcagca gtccaactca atattacagc aaacaatacg ttagaggtgg actacggttt       660
     cggtcaggat gtttcaagag ctacaggaac catcacaatc ccagtcttcg atggcgaaaa       720
     gtacaaagag gtagctcgcg ctttatcatt aattttcagt aagaaaggta cggcgattga       780
     cactacgtct caaactattc aagacaccct caaaaactcc gatctcacta ttgctaccgt       840
     cgctgctgga tactacacag ccttagctgc tcgccatgaa cttaccaaag cagaaagcac       900
     tgcagctcat cgcattccat tcgctacagc tttatcagat acattctcag cagccggcga       960
     cgcgctgcgt tcaagccacg tcatctcttc ttgcttacgc tgccctgcct caaacaacgc      1020
     acaacgacag gttacagtcg gaaccaacat gtggacgaac gtctccgtcg aaaacatcgc      1080
     agtacaaggc ttgataattc caaatccaaa cgacgtatcg ttcttcattc cgaacaaatc      1140
     tcttccacct tcctggtggt gcgcaatctg gcttctcaac gctttcctcc acagcttcat      1200
     cgcacaaacc cgcatccaca tcttcatcac gccaggtgaa acttacaatc ttgcgccatt      1260
     cacagatgcc gatatctacg aggctattcc tatcttactt ggaatgtcga aaacatcacg      1320
     cccagttcca gaaagcgtcg aaagcatgct ctacgcatac ggcgcgcaga tggttatcca      1380
     gccacactcg ctttacacag aaggcggtat agtcagaaga atgatcttta ccgtcccaca      1440
     tcttccagca catggctact tcatagcaaa cacagaatac tcgagataca tgaacatcgc      1500
     tgttccaaac gacccacgta cagctaagga ttacataatt ggtgtcggaa ctggcctctt      1560
     acagatcata ctcgcctacc aagctgcttt tagctgcgct ggtccaatcg ctctccattg      1620
     gcacgacaat gacgctatct caaatggcat ggatacggtt gcagctgctt acctcgaagg      1680
     acggtacttc actatcccaa tggccgtcaa tgttgccaca aacatcgccc aatacactac      1740
     aagagtcagg gctgacccac agtacaaaca cacactcgat cgaatcttac cacgcatctt      1800
     cggtccatcg actgacacag tcttcaactt catcgaatcc gcaatcacat catcttgggt      1860
     ctcgatcaac gcgacgaagc gtaacggtcg ggccagaaag ttcaggaccg ccttcatcaa      1920
     tcgttttcat gatccagaat tcgcctacat gtttggcatc accggcaacg gtatcgagcg      1980
     gatggaaggc aaggttacat ccaacatcgc acaagaagtc gaatacctca ccaacggcgg      2040
     tgaccttcgc aactgcccta tccttcgcac cttaaaagct gcggaggcag aagagaccgt      2100
     cacttttatg tgtacgggaa agatcggttc catcttcgcc gtcgatggta caatgcgcac      2160
     gctcaagcgg taccaaacag tcgacctcgc cgacctcgga tggacatcgc atggcaaggt      2220
     catgaaaccg tacgccttca gggccccagt catccaagga atcaccgtct gcaagacagc      2280
     ttacacatcc acagctatcg acatcgttac aacagtcttc ggccccttac gcctccgcgt      2340
     aggcaccctt tttgagtaag gctgtacgtt gtggccctat aataccatcc gtcaggcatc      2400
     actttaacat aagacacatc ataacagtta aacgcaatgg taacgaatac gtatttatcc      2460
     caggttacgg atgggtatta caggatgatt atttggtgaa ttccgtcaag atgactggtg      2520
     tagatcaact acctcccaac cagttaccct atggcgatga tcttttattt atatatgcag      2580
     aaattttact ttataactac atatctcttt ttcctaaatt cagatacaaa aatccagact      2640
     tattaaatca agaaacagaa ttacagctct tcccacttaa aaccgactca gctgccagaa      2700
     ataaagccaa cttttatgct agatcactat ggaacgaagc aaaaacagac aaaacagctt      2760
     ttagaccggg aacatacaat gacacagtag ctggtctatt gatgtggcaa caatgtgctc      2820
     tcatgtggtc actgcctcgc tcagttatca acagaacaat tagcggcgtt tgtgatgcat      2880
     taaccgaaag gacttcactc gcgctattaa aacgtatctc cgattggttg aaacaactcg      2940
     ggctggcttg ctcaccaatc catcgcttgt tcatagagct cccaacatta ctaggacgcg      3000
     gagcaatccc aggcgataac gtgaaagaca tgaagcacag actcaaattc gatccatcta      3060
     tcacagtaga cgttccaaaa gacgagttac atgccctaat ctacagacta ttatcaagaa      3120
     atctcaacat aactaaagtt gacagcttcg aacaccacct agaagagcgt ttgctttggt      3180
     ccaaatccgg cagtcattat tatccggacg acagaatcaa tcagctactt ccaaaacaac      3240
     ccacaaggaa agaattctta gatgttgtaa ccgtagatta catcaaggaa tgcaagcctc      3300
     acgtcttcat aaggcaatca cgtaagctag aacacggtaa ggaacgtttc atttataatt      3360
     gcgatacgat ctcctatgtc tattttgatt ttatcctgaa gctcttcgaa gcgggatggc      3420
     aagatagcga agcaatacta tcgccaggcg actatactag tgaacgctta catacaagaa      3480
     tttcaaacta caaatataag gctatgctag attatacgga tttcaattca cagcatacaa      3540
     tccaaagcat gagactgata ttcgaaacga tgaaggagtt actaccaccc gaaactactt      3600
     tcgcactcga ttggtgtatt gcctcattcg ataacatgta cacatccgat ggtcataaat      3660
     gggtctcgac tctcccaagc ggacatcgag ccacaacctt catcaacaca gtcctaaatt      3720
     ggtgctacac acagatggta ggtcttaagt ttaacagttt tatgtgcgct ggtgatgatg      3780
     tcattttatt gtctcaagag ccaatatcac tagccccaat tcttacatca cacttcaagt      3840
     tcaatcccag caaacaaagt acaggtacaa gaggtgaatt cttacgcaag cattacacta      3900
     cagaaggcgt gtttgcatac ccaacacgag caattgcaag cttagtaagc ggaaattggt      3960
     taagtcaatc tttaagagag aacaccccaa tcttggtccc aatacaaaac ggagtcgaca      4020
     gacttcgaag cagagcaggt ctactcggag tcccatggat tttgggcctc tcggagctca      4080
     cggagcgaga ggccattcct agggatgtca gcatggctct gttgaattca cacgcagcag      4140
     gacccggttt gatcactcgg aactacagtt ctttcacagt taccccgaaa ccacctaagc      4200
     taactagcac actagagtac acagcaaccc gttttggtgt ccaggacctg tccaaacacg      4260
     taccatggga acaacttaca ctggaagaac gtaataagtt agggaaacaa attaagaaaa      4320
     tgagtcacag gcattgtagc caggcaaaga taacatacac ttgtgttcac gatttttaca      4380
     aaccaagtgg cctccccacg gtgttatctg gtgccagcca gccatcgttg tcgatggcgt      4440
     ggtggcaggc gatgcttaaa gaagcaatgc aggacaattt tactaagaag ttagatgcac      4500
     aaatgttcgc ttcgaatgca tgtacagact gcgtcagcgg tgatgcattc ttgcaagcga      4560
     gcgccaaaac tgctggtgta ttattcacta gctcgattct atcttcttca taacgtacag      4620
     caaaaaagtc tctatagttg ctcaggacat atatgagcca gatggccccg ctataccttc      4680
//