ID HQ607519; SV 1; linear; genomic RNA; STD; VRL; 4846 BP. XX AC HQ607519; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 3 strain TVV3-OC3, complete genome. XX KW . XX OS Trichomonas vaginalis virus 3 OC Viruses; Riboviria; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4846 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical isolates of Trichomonas vaginalis concurrently infected by RT strains of up to four Trichomonasvirus species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4846 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the INSDC. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX DR MD5; c3c5c4f11127892ff6cd5141ef0b4b96. DR EuropePMC; PMC3126235; 21345965. XX FH Key Location/Qualifiers FH FT source 1..4846 FT /organism="Trichomonas vaginalis virus 3" FT /host="Trichomonas vaginalis" FT /strain="TVV3-OC3" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Nov-2009" FT /db_xref="taxon:170965" FT gene 364..4694 FT /gene="pol" FT CDS join(364..2449,2449..4694) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:F5B2U4" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:F5B2U4" FT /protein_id="AED99802.1" FT /translation="MSAPEPLNTEVRSPNGVSEATETQNLAVTQSSVSNEKTIDTQSDL FT QTLKKQLQPVTRSTDFETLYNYFYALNVSPSTDRIGNAITRNTPVNDTNEVVSFPLTAS FT VSHTFSNTPVPAHIQPLQISIADDCVNYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRLNIHRNQHTNLWRSLCAAGRAAQAKP FT FFDEIPNNKFRTGALLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM FT NRAFWFIWAIYNRMPEDFQNSYPLNITFCTSELPVQSPMPAADGISAEQCDRALVLLDK FT VILEFFNNDRKLAYYYVFKGCQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT EQYANMISCATHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSDAEPVGMYYMDIIQRKAEHDLFTETFMDIYGSTASIICANIETSLFT FT SGTNVINKRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLGGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGIRPVQQKRKKMSYFRTLDGTFHEVT FT IRSDTHDLQVWNDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY FT RGPSLAGSNRGRPSLSDIPATGSLSNLIDLSKASRLPYRKLKEGLRASDYTVARELASA FT FRNSRLTRQMDHVTDIAYLNFLRWVLLPYNGQTLRPHPSKWSPTPYPEHVNLKFLTKEI FT ELELFPLKKAPQADLKVNCYARNILASTELTDDLLKQCLPVGLNNDSVCGIVIVLELLL FT IAGVPSKLLPIIGQAIANKDPFIKELSDFNKMIGATTSRIANILTECNTLIGRGVKSSD FT PSADLYHRVAPEGNRHEAKISRHILIEAIDKIYKNEMTSMPPPGDFMLHLITSPLWCKA FT GSHHHPHFAKYDSRLEFVMDVPADKIAAEPPSAYITQAEKLEHGKTRYIYNCDTVSYLF FT FDYILHYVECVWSNESVLLNPAAMSVERFSVLDYPEYCMIDYTDFNSQHSLESQKLVFE FT CLRPYLPREMHSVLDWCIASMDHMEINGQHWLSTLPSGHRATTFINSVLNKAYLIPYIG FT DTVSFHCGDDVLLCGEYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRRGGDVIGYPSR FT AVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRYADKMS FT EPFDVSSNYYVMPGCPCYSDAATTIVPNVPQLEHSDVPFSQAQKLFDTMRDYCPEFTTV FT NDVIDKVKARRSSSAVSNIMYNVCSPVAPQVCVVVNPNNYQFLLRKRYYPREHIAPSGF FT DESSDSKLVFTTYDLAPSIAMKSCAVLTPAKIICGHGLRSG" FT gene 364..2490 FT /gene="cap" FT CDS 364..2490 FT /codon_start=1 FT /gene="cap" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:F5B2U5" FT /protein_id="AED99801.1" FT /translation="MSAPEPLNTEVRSPNGVSEATETQNLAVTQSSVSNEKTIDTQSDL FT QTLKKQLQPVTRSTDFETLYNYFYALNVSPSTDRIGNAITRNTPVNDTNEVVSFPLTAS FT VSHTFSNTPVPAHIQPLQISIADDCVNYELDESGTLCPALDSSVHVQRATSLASALKVK FT LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRLNIHRNQHTNLWRSLCAAGRAAQAKP FT FFDEIPNNKFRTGALLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM FT NRAFWFIWAIYNRMPEDFQNSYPLNITFCTSELPVQSPMPAADGISAEQCDRALVLLDK FT VILEFFNNDRKLAYYYVFKGCQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA FT EQYANMISCATHPGIIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGSDAEPVGMYYMDIIQRKAEHDLFTETFMDIYGSTASIICANIETSLFT FT SGTNVINKRMQNDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLGGSFEV FT DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGIRPVQQKRKKMSYFRTLDGTFHEVT FT IRSDTHDLQVWNDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLISGFAIPGRAY FT RGPRLPVATEAAQV" XX SQ Sequence 4846 BP; 1274 A; 1334 C; 1008 G; 1230 T; 0 other; gcttaaaagg tctagtccac tttttaagcc ggctatactt cgccgtagac acttgggcaa 60 aattaatcaa caccctcctg caatcaccgg ggtgttgcga gccataagag actggttcta 120 aaggactgat ataccgccgt gcgggtaggt ggtcgatagc ccgtttgaag gagtagtaat 180 acttccgatt ctggtgtagc atcgactggg gccccctagc gtgagctcag cacgttgaga 240 aaacgaaaaa ctgcatgtgc acagccttcg cagtagcgtg agctcgaggc accctaaaaa 300 gtgccttttc cttgtgacaa cctacgtgtt atcacaaggt tctagagtat tacgtgcaac 360 ggtatgtcag ctcccgagcc cttaaatact gaagtacgtt cacctaatgg tgttagtgaa 420 gccactgaaa ctcaaaactt ggctgtcact caaagcagtg tgtcaaacga aaaaacaatc 480 gacacacaaa gtgatctgca aacactcaaa aaacagttac aaccggtcac cagatccaca 540 gattttgaaa ctctttataa ttatttttat gctttaaatg tctctccgtc aacagatcgt 600 attggtaatg ctatcacacg caatactcca gttaatgata caaacgaagt ggttagtttt 660 ccgcttactg cttctgtctc gcacacattc tccaatacac cagtacctgc ccacattcag 720 cctctccaaa tttccattgc cgacgattgc gttaactatg aactggatga gagcggaaca 780 ttatgccctg ccctagatag ctctgtccac gttcaaaggg ccacttctct cgctagcgct 840 ctcaaggtca aattaacagg cgaggtcatg cactcagctt cagtcagacc aatccaaact 900 ccacagttaa ttgcttatct atatggtgtt cttctcgccg tccaagatcg tcttaatatt 960 catcgtaatc aacacacaaa cttatggcgt agcttatgtg ccgcaggtcg tgcagctcaa 1020 gcgaaaccat tcttcgatga aatcccaaat aacaagttca gaaccggcgc gctcttggca 1080 ccccctctcc cagacgccgg ctttggtccc ttcccagctg agggcctcaa ccagaattcc 1140 aagctcgatt tcaaatcgaa gggatacatt ttctacaagc agcgcactta caatcccgat 1200 gatatgaatc gcgctttttg gttcatttgg gcgatctaca atcgtatgcc cgaagacttc 1260 cagaattcat atcctctgaa cattactttc tgcacttccg aattaccagt ccaaagcccg 1320 atgccagcgg ctgacggaat ttccgcagag cagtgtgata gggcactcgt tcttctcgac 1380 aaggtcattc tcgaattctt caacaacgat cgcaagcttg cttattacta cgtattcaag 1440 ggatgccagt tcgtcatgcg tccttgttcc tgttatcaag aaggtggctt aatccgcaag 1500 gcctcacgca acgttgctct tcgcgccttt actggcattt actacctcgc cggcttcgct 1560 gagcaatacg ctaacatgat ttcatgtgcc acacacccag gaattatcgg tgctctcttc 1620 caatacgtcg acactatggt cttacaggct gttttctctc tctctggccc taagctggtc 1680 cgatttgccg ccccacccga atatcaaggt cgtcacgctt gtccgttctc cttcgtagca 1740 gacgaaaatt attggggtat cgctccaggt tcagacgctg agcctgttgg catgtactat 1800 atggatatca tccaacgcaa agccgaacac gacttattca ccgaaacatt catggatatc 1860 tacggttcaa cagcttccat catttgcgca aacatcgaaa caagtttgtt cacttctggc 1920 acaaacgtca tcaacaaacg catgcagaat gatttcgcac gcgacactcc aaagcctgga 1980 actcttcgcc accaacatgc catcatcaac cgcttccacg aacccgaata tgcttaccgt 2040 ctcggtatcc tcgctgatgg catcattccg ctcggcggct ccttcgaagt cgacatcctc 2100 aaagaagctg agcgcctcat cacaggtgaa gacatccgca acctcccagg actacgttgc 2160 ctgtgctctc gcggtctcga cgcgattctc ggcatacgcc cagtccaaca aaaacgcaag 2220 aagatgagtt acttccgcac tctcgatggc acattccacg aagtaacgat caggtcagat 2280 actcacgatt tacaggtctg gaatgaccac ggctaccttg cccgcccata cgcatgtcac 2340 atcgtcgact cagacggcat cgagttctac gacaaatcca acggtctcta caagggacgc 2400 gtcaatgtcc tcatctctgg attcgccatc ccaggtcgcg cataccgggg ccctcgcttg 2460 ccggtagcaa cagaggccgc ccaagtctga gcgacattcc ggcgacagga agtctgtcca 2520 acctcatcga cctttcgaaa gcaagtcggc taccataccg taagcttaaa gaaggcttga 2580 gagcgtcaga ctacaccgtc gcccgcgagt tagctagcgc ttttcgcaat tctcgcctaa 2640 ctcgccaaat ggatcatgtt acagatatag cttaccttaa tttcctcaga tgggtgttgc 2700 taccttacaa cggtcaaaca ctacgaccac acccctccaa gtggagtcca acaccctacc 2760 ccgaacacgt caacctaaag ttcctaacca aggaaatcga gctcgaactt ttcccactga 2820 agaaggcccc acaagccgat cttaaagtga attgttacgc gcgaaatatc cttgcttcga 2880 cagagctaac tgacgatctc ctcaaacagt gtctgccagt cggactcaac aacgattcag 2940 tttgcggaat tgttatcgtc ttagagctgc ttttgattgc aggtgttcca agtaagctgc 3000 taccaattat tggccaagcc atcgcgaaca aagatccatt catcaaagaa ttgtccgatt 3060 tcaataagat gataggagcg accacttcac gcattgctaa cattctcaca gagtgcaaca 3120 cattaattgg tcgcggtgtt aagtcatctg acccaagtgc tgatttgtat caccgggtag 3180 cgcctgaggg caataggcac gaagcgaaga tttcccgaca catcctcatc gaagccatcg 3240 acaaaattta caaaaacgaa atgacaagca tgcctccacc gggcgacttc atgctccact 3300 taataacaag tcctctatgg tgtaaggctg gctctcatca ccatcctcac ttcgccaaat 3360 acgattcccg cttggaattc gtcatggatg ttccagcaga caaaatcgct gctgaaccac 3420 cctctgcata cattactcaa gcggagaaat tggaacacgg taagactagg tacatctata 3480 actgcgatac agtatcatac ctattcttcg attacatctt acattacgtc gagtgtgtgt 3540 ggtcaaatga gtcagtctta ctcaacccag ctgctatgag tgtcgagcgt tttagtgtct 3600 tggactaccc ggagtactgc atgatcgatt atacagattt caactctcaa cacagtctag 3660 aatcccagaa gctagtcttc gagtgtttga gaccgtactt gccacgcgaa atgcattcag 3720 tcttggattg gtgtatcgcg agcatggacc atatggaaat taacggccaa cattggttaa 3780 gcacgttgcc gtcaggacac agagcgacaa ctttcataaa ctcggttctg aacaaagcct 3840 acttgatccc ctacataggc gacaccgttt ctttccattg tggcgacgac gtgttattat 3900 gtggcgagta cgactatcaa acactcatcg ataccctgcc ctatgagcta aacaagagca 3960 aacagagctt tggacctaat gccgagttct tgcgcttgca taggcgtggt ggtgacgtca 4020 taggataccc atccagagct gtgtcgagtt tggtatctgg caattggtta agcaaaacat 4080 cttgggagtg gcaaccaagt ctcatttcgg tcacaaatca atgcaacgta atcatctcgc 4140 gttcacaatt gaatatcagg tttattcccg ctatgcaaca ggaactgcgc aaccgctacg 4200 cggacaagat gagtgaacct ttcgatgtca gctcgaatta ctacgtcatg ccaggttgtc 4260 cgtgctatag tgacgccgcg actacgatcg taccgaacgt tccccaactg gaacattcgg 4320 atgtaccgtt ctcgcaagca caaaaacttt ttgatactat gcgcgactac tgtcctgagt 4380 tcactaccgt caacgacgtc atcgacaagg ttaaagcccg tcgttcctcg agtgctgtca 4440 gcaatatcat gtacaatgta tgctcacctg tcgcacctca agtttgcgta gtcgtaaatc 4500 ccaacaacta ccagttcctt ttgcgcaagc ggtactaccc acgcgaacac attgccccat 4560 ccggctttga tgaatctagc gactccaagc tcgtttttac tacttacgat ctcgctcctt 4620 caatcgctat gaaatcgtgc gctgttttga ccccggcaaa gataatatgt ggtcacgggc 4680 tacgcagtgg ttgaataatc tgcctgtacc aggctatgat tggtaccgat tcagccacga 4740 acggcctctt gtcttcggac cctccgccta taggttaata ggagtacagt gttactgttg 4800 tgtgtatcgc tctaggcaca cgaacgtact accccacgtt tagttc 4846 //