ID HQ607525; SV 1; linear; genomic RNA; STD; VRL; 4842 BP. XX AC HQ607525; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 3 strain TVV3-OC5, complete genome. XX KW . XX OS Trichomonas vaginalis virus 3 OC Viruses; Riboviria; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4842 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical isolates of Trichomonas vaginalis concurrently infected by RT strains of up to four Trichomonasvirus species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4842 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the INSDC. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX DR MD5; 7fd7ce53aba8f9b1758831cb75495744. DR EuropePMC; PMC3126235; 21345965. XX FH Key Location/Qualifiers FH FT source 1..4842 FT /organism="Trichomonas vaginalis virus 3" FT /host="Trichomonas vaginalis" FT /strain="TVV3-OC5" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Jan-2010" FT /db_xref="taxon:170965" FT gene 360..4690 FT /gene="pol" FT CDS join(360..2445,2445..4690) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:F5B2V6" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:F5B2V6" FT /protein_id="AED99804.1" FT /translation="MSAPEPLNTEVRSPDGVSEATETQNLAITQSRVSNEKITDTQSDL FT QTLKKQSQPVSRSTDFETLYNYFYGLDVSPSTDRIGNAITRNTPVTDTNEVVSFPLTAS FT VSHTFSNSPVPAHIQPLQISIADDCINYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEIMHSSSVRPIQTPQLIAYLYGVLLAVKDRINIHRNQPTNLWRSLCAAGRAAQAKP FT FFDEIPNNKFRPGALVAPPLPEAGFGPFPAEGLNQNSKLDFKAKAYVFYKQRTYNPDDM FT NRAFWFIWAIYNRMPNDFQNSYPLNITFCTSELPVQSPMPTADGISAEQCDKALLLLDK FT IVLEFFNNDRKLAYYYVFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT EQYANMISCASHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSNAEPVGMYYTEIIQRKTEHNLFTETFMDIYGSTASVICANIETSLFT FT SGTEVINQRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT FT IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY FT QGPSLAGSNRGRPDLSDVPATGSLSNLISLSKASRLPYRKRQNGVRVSDYTVARELACA FT FRNSRLTRQMDHVTDIAYLNFLRWVLLPYNGQTVRPHPTEWGQTPYPEHVNLKFLSKEM FT ELELFPLKKAPQADLKVNCYARNILASTELTDDLLKRCLPVGLNNDSVCGIVIVLELLL FT IAGVPSKLLPVIGQAIANKDPFIKELSDFNKMIGATSSRIANILTECNTLIGRGVKSSD FT PSADLYHRVAPEGNRHEAKISRHILIEAINKIYKNEMTDMPPPGDFMLHLITSPLWCKA FT GSHHHPHFAKYGSRLEFVMDVPADKIAAEPPAVYITQAEKLEHGKTRYIYNCDTIAYLF FT FDYILHYVECVWSNESVLLNPAAMSVERFSVLDYPEYCMIDYTDFNSQHSLESQKLVFE FT CLRPYLPSEMHPILDWCITSMDHMEINGQHWLSTLPSGHRATTFINSVLNKAYLIPYIG FT DTVSFHCGDDVLLCGEYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRRGGDVIGYPSR FT AVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRYADKMS FT EPFDVSSDYYVMPGCPCYSDAATTIVPNVPKLEHSDVPFSQAQKLFDTMRDSCPEFTTV FT NDIIDKVRARRSSSAASNITYNVGSPVAPQVCVVVNPNHYQFLLRKRYYPREHIAPPGF FT DASNDSKLVFTTYDLAPSIAMKSCAVLAPAKIICGHGLRSG" FT CDS 360..2486 FT /codon_start=1 FT /gene="pol" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:F5B2V7" FT /protein_id="AED99803.1" FT /translation="MSAPEPLNTEVRSPDGVSEATETQNLAITQSRVSNEKITDTQSDL FT QTLKKQSQPVSRSTDFETLYNYFYGLDVSPSTDRIGNAITRNTPVTDTNEVVSFPLTAS FT VSHTFSNSPVPAHIQPLQISIADDCINYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEIMHSSSVRPIQTPQLIAYLYGVLLAVKDRINIHRNQPTNLWRSLCAAGRAAQAKP FT FFDEIPNNKFRPGALVAPPLPEAGFGPFPAEGLNQNSKLDFKAKAYVFYKQRTYNPDDM FT NRAFWFIWAIYNRMPNDFQNSYPLNITFCTSELPVQSPMPTADGISAEQCDKALLLLDK FT IVLEFFNNDRKLAYYYVFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT EQYANMISCASHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSNAEPVGMYYTEIIQRKTEHNLFTETFMDIYGSTASVICANIETSLFT FT SGTEVINQRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT FT IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY FT QGPRLQVATEAAQI" XX SQ Sequence 4842 BP; 1296 A; 1312 C; 997 G; 1237 T; 0 other; gcttaaaagc ttagtccact tttaagccgg tcatacttca accgtgatac cggggcaaaa 60 ttaatcaaca ccctcctgga atcgccgggg tgttgcgagc cataagagac tggttctaaa 120 ggactgacat agcgccgcga gggtaggcgg tcgatagccc gtttgaggga atagcaatat 180 tcctgattct ggtgtagcat cgactggggc cccctagcgt gagctcagca cgttgggaaa 240 acgaaaaact gcatgcgcac agccttgcag tagcgtgagc tcaaggcacc ctaaaaagtg 300 cctcgtttca tgacgaactt tatgtcgtta tgaaatacta gtgtattgcg tgcaacggta 360 tgtcagctcc cgagccctta aatactgaag tacgctcacc tgatggtgtt agtgaagcca 420 cagaaactca aaacttggct atcactcaaa gccgtgtgtc aaacgaaaaa ataactgaca 480 cacaaagtga tctgcaaaca cttaaaaaac agtcacaacc ggtcagcaga tccacagatt 540 ttgaaactct ttataattat ttttatggtt tagatgtctc tccttcaaca gatcgcattg 600 gtaatgcaat tacccgcaat accccagtca ctgatacgaa tgaggttgtt agttttccac 660 tcactgcatc tgtttcacac acattttcga attcgccagt tccagctcac atacagcctc 720 tccaaatttc tattgctgat gactgcatca actacgagtt agatgagagc ggaacgttat 780 gcccagcgct tgatagttct gttcacgtcc agagagccac ttctcttgct agcgctctca 840 aggtcaagtt aacaggcgaa attatgcatt cttcatcagt tagaccaatt caaactcctc 900 aattaattgc ttatttatac ggtgttctcc ttgctgtcaa agaccgcatt aacattcatc 960 gtaatcagcc tacgaattta tggcgtagct tatgtgcagc aggtcgcgca gcccaagcaa 1020 agccgttctt cgatgaaatt cccaacaaca agttcaggcc cggtgccctc gtcgcacccc 1080 ctcttcctga agcaggattc ggtcctttcc cagctgaggg ccttaaccaa aattctaagc 1140 tcgatttcaa agcaaaagca tacgtcttct acaagcaacg cacctacaat ccagatgaca 1200 tgaatcgcgc attctggttt atctgggcaa tttacaaccg tatgcccaat gacttccaaa 1260 attcgtaccc actcaacatc actttctgca cttccgagct accagtccaa agcccgatgc 1320 caacagctga tggaatttcc gccgaacaat gcgataaagc gctccttcta cttgacaaaa 1380 tcgttctcga attcttcaat aacgaccgca aactcgctta ctactatgtg ttcaaaggaa 1440 gccagttcgt tatgcgtcct tgttcatgtt atcaagaagg aggcttgatc cgcaaggcct 1500 cacgcaatgt cgctcttcgc gcttttactg gcatctacta tctcgccgga ttcgccgaac 1560 aatacgctaa catgatttca tgcgcctccc atccaggaat catcggcgcc cttttccaat 1620 acgtcgacac tatggtctta caggccgttt tctctctttc cggccccaag cttgttcgct 1680 tcgcggctcc acctgaatat cagggtcgtc acgcttgtcc attttccttc gtcgccgatg 1740 aaaactattg gggcattgct cccggctcaa atgccgaacc agtcggtatg tattacacgg 1800 aaattatcca acgcaaaacc gagcacaatc tgttcaccga aacattcatg gatatctacg 1860 gttcgactgc ctccgtcata tgcgcaaata tcgaaacaag cttgttcaca tccggcactg 1920 aagttataaa ccagcgcatg caaaacgatt tcgctcgcga cacgccaaag cctggaaccc 1980 ttcgccacca gcatgccatc atcaatcgct tccacgaacc cgaatatgct taccgccttg 2040 gcatcctcgc tgatggcatt attccgctta gcggctcttt cgaagtcgac atcctcaaag 2100 aagctgaacg cctcatcaca ggcgaagaca tccgcaatct cccaggttta cgttgcttat 2160 gctctcgcgg tctcgacgcc atcctcggtc tccgtccaat ccaacagaaa cgcaagaaga 2220 tgtgttactt ccgcacactc gatggcaact tccatgaagt aacaatcaga tcggagactc 2280 gcgatctaca ggtctggcgt gatcatggct acctcgctcg cccatacgcg tgccacatcg 2340 ttgattcaga tggcatcgaa ttctacgaca aatccaacgg tctctataag ggacgcgtca 2400 acgttctcat ttccggattt gccattccag gtcgcgcata tcagggccct cgcttgcagg 2460 tagcaacaga ggccgcccag atctaagcga cgtcccggcg acaggaagtt tgtccaacct 2520 catcagcctt tctaaagcaa gtcggctacc ataccgtaag cggcagaatg gcgtgagagt 2580 gtcagactac accgtcgccc gcgagttagc ttgcgctttt cgcaattctc gcctaactcg 2640 ccaaatggat cacgtcacag atatagctta tctcaatttc cttagatggg tgttgttacc 2700 ttacaacggt caaaccgtac gaccacaccc caccgagtgg ggtcaaacac cctaccccga 2760 acacgtcaat ttgaagttct taagcaagga aatggagctc gaacttttcc cactgaagaa 2820 ggccccacaa gccgatctta aagtgaattg ttacgcgcga aacatccttg cttccacaga 2880 gctaacagat gatctcctca aacggtgtct gccagtcgga ctcaacaatg attcagtttg 2940 cggaattgtc atcgttttag agctgcttct cattgcaggt gtcccaagta agttattacc 3000 agtcattggc caagccatcg ccaacaaaga tccatttatt aaagaattgt ccgatttcaa 3060 taagatgata ggagcgacct cctcacgtat cgccaatatt ctcacagaat gtaacacatt 3120 gataggtcgc ggagtcaagt catctgaccc aagtgctgat ttgtatcacc gggtagcgcc 3180 tgagggcaat aggcacgaag cgaagatttc tcgacacatc ctcatcgaag ccatcaacaa 3240 aatctacaaa aacgaaatga cagacatgcc tccaccaggt gatttcatgc tccacttaat 3300 aacgagccct ctatggtgta aggctggctc tcaccaccat ccacactttg caaagtacgg 3360 ttcacgctta gaattcgtca tggacgttcc agcagacaaa atcgctgctg agccgcccgc 3420 tgtttacatt actcaagcgg agaaactaga acacggtaag actaggtaca tttacaactg 3480 cgatacaatt gcatacctat tcttcgatta catcttgcac tatgtcgagt gtgtatggtc 3540 aaacgagtca gttttactca acccagctgc tatgagtgtt gagcgattca gtgtcttaga 3600 ttacccggag tactgcatga tcgattacac agacttcaac tctcaacaca gcttagaatc 3660 acagaagcta gtttttgagt gtttgagacc atacttacca agtgaaatgc acccaatcct 3720 cgattggtgt atcaccagca tggaccatat ggaaattaac ggccagcatt ggttaagtac 3780 gctaccctca ggacatagag ctacgacatt tatcaactcg gtcctgaata aagcttactt 3840 aatcccttac ataggtgaca ccgtttcctt ccattgtggt gacgacgtgt tactatgtgg 3900 tgagtacgat taccaaaccc tcattgatac cctgccctat gaattaaaca agagcaaaca 3960 gagctttgga cctaatgccg agttcttgcg cttgcatagg cgcggtggtg acgttatagg 4020 ttatccatca agagctgttt cgagtcttgt atctggaaat tggttaagca aaacgtcatg 4080 ggagtggcaa ccaagcctca tttcggtcac taatcaatgc aatgttatta tctcgcgttc 4140 acaattgaat atcagattta tccccgccat gcaacaagaa ctacgtaacc gctacgcgga 4200 caagatgagc gaaccattcg atgttagttc ggattactac gtcatgccag gttgtccctg 4260 ctatagtgac gccgcgacga caatcgtgcc gaatgtcccc aaattggaac attcagacgt 4320 accgttttcg caggcacaaa aactttttga tactatgcgc gactcctgtc ctgagttcac 4380 aactgttaac gacatcatcg acaaagttag agctcgccgg tcttccagtg ctgccagtaa 4440 catcacgtac aacgtcggct cacctgtcgc acctcaagtt tgcgtagtcg taaatccaaa 4500 tcattaccag ttccttttgc gcaagagata ctacccacga gagcatattg ctccaccagg 4560 cttcgacgca tccaacgact caaaactcgt tttcacgact tacgatctcg ctccttcaat 4620 cgctatgaaa tcgtgcgctg ttttggcccc ggcaaagata atatgcggcc acggactacg 4680 cagtggttga gtagttctgt cgtaccaagc cacacttggt accggatagg ccacgaacgg 4740 tctcctgtct tcggaccctt cgcctatagg ttaataggaa tacagtgtta ctgttgtgtg 4800 tatcgcttta ggcacacgaa cgtactaccc cacgtttagt tc 4842 //