ID   HQ607515; SV 1; linear; genomic RNA; STD; VRL; 4845 BP.
XX
AC   HQ607515;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 3 strain TVV3-UR1, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 3
OC   Viruses; Riboviria; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4845
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical isolates of Trichomonas vaginalis concurrently infected by
RT   strains of up to four Trichomonasvirus species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4845
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the INSDC.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
DR   MD5; 677379ec0693b3e509e8a6fba4ae2710.
DR   EuropePMC; PMC3126235; 21345965.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4845
FT                   /organism="Trichomonas vaginalis virus 3"
FT                   /host="Trichomonas vaginalis"
FT                   /strain="TVV3-UR1"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Jun-1999"
FT                   /db_xref="taxon:170965"
FT   gene            363..4693
FT                   /gene="pol"
FT   CDS             join(363..2448,2448..4693)
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /db_xref="GOA:F5B2T6"
FT                   /db_xref="InterPro:IPR001795"
FT                   /db_xref="UniProtKB/TrEMBL:F5B2T6"
FT                   /protein_id="AED99800.1"
FT                   /translation="MSAPEPLNTEVRSPNGVSEAKETQNLAITQSGVSNEKITDTQSDL
FT                   QTLKKQLQPVSRSTDFETLYNYFYGLQVPASTDRIGNAIQRNIPVNDTNEVVSFPLTAS
FT                   VSHTFSNTPVPAHIQPLQISVADDCXNYELDESGTLCPALDSSVHVQRATSLASALKVK
FT                   LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRINIHRNQPTNLWRSLCAPGRAAQAKP
FT                   FFDEFANNKFRAGPLLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM
FT                   NRAFWFLWAIYNRMPGEFQQYYQLNITFCTSELPVQNPIPNADGISNEQCEKALLLLEK
FT                   IILELFNNDRKLAYYYIFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA
FT                   DQYANMISCAAHPGVIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV
FT                   ADENYWGIAPGSNAEPVGMYYMDIIQRKSEHNLFVDTFMDIYGSTASIICANIETSLFT
FT                   SGTNVLNERMQKDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV
FT                   DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT
FT                   IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLVSGFAIPGRAY
FT                   QGPSLAGSNRGRPDLSDVPATGSLSNLIYLSKASRLPYRKLKEGVRAADYTVARELACA
FT                   FRSSRLTRQMDHVTDIAYLNFLRWVLLPYNGQTLRPHPTMWRQTPYPEHVNLKFLSKET
FT                   ELELFPLKKAPQADLKVNCYARNILASTELTDDILKQSLPIGLNNDSVCGIVIVLELLL
FT                   IAGVPSKLLPVIGQAIANKDPFIKELSDFNKMIGATTSRIANILTECNTLIGRGVKSSD
FT                   PSADLYHRVAPEGNRHEAKISRHILIEAINKIYKNEMTDMPPPGDFMLHLITSPLWCKA
FT                   GSHHHPHFAKYDSRLEFVMDVPADKIAVEAPSVYITQAEKLEHGKTRYIYNCDTVSYLF
FT                   FDYILHYVECVWSNESVLLNPAAMSVERFSILDYPQYCMIDYTDFNSQHSLESQKLVFE
FT                   CLRPYLPSEMHPVLDWCIASFEHMEINGQHWLSTLPSGHRATTFINSVLNKAYLIPYIG
FT                   DAVSFHCGDDVLLCGEYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRCGGDVIGYPSR
FT                   AVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRYADKMS
FT                   EPFDVSSNYYVMPGCPCYSDAATTIVPNVPQLERSDAPFSQAQKVFDAMRDFCPEFTTV
FT                   GDVIDKVRARRSSSAVKNIMYDVCAPVAPRISIVVNPAHYQFLLRKKYYPREHIAPTGS
FT                   DNTDRTKLVFATYDLAPSIAMKSCAVLTPAKIISGHGLRSG"
FT   gene            363..2489
FT                   /gene="cap"
FT   CDS             363..2489
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /db_xref="UniProtKB/TrEMBL:F5B2T7"
FT                   /protein_id="AED99799.1"
FT                   /translation="MSAPEPLNTEVRSPNGVSEAKETQNLAITQSGVSNEKITDTQSDL
FT                   QTLKKQLQPVSRSTDFETLYNYFYGLQVPASTDRIGNAIQRNIPVNDTNEVVSFPLTAS
FT                   VSHTFSNTPVPAHIQPLQISVADDCXNYELDESGTLCPALDSSVHVQRATSLASALKVK
FT                   LTGEVMHSASVRPIQTPQLIAYLYGVLLAVQDRINIHRNQPTNLWRSLCAPGRAAQAKP
FT                   FFDEFANNKFRAGPLLAPPLPDAGFGPFPAEGLNQNSKLDFKSKGYIFYKQRTYNPDDM
FT                   NRAFWFLWAIYNRMPGEFQQYYQLNITFCTSELPVQNPIPNADGISNEQCEKALLLLEK
FT                   IILELFNNDRKLAYYYIFKGSQFVMRPCSCYQEGGLIRKASRNVALRAFTGIYYLAGFA
FT                   DQYANMISCAAHPGVIGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV
FT                   ADENYWGIAPGSNAEPVGMYYMDIIQRKSEHNLFVDTFMDIYGSTASIICANIETSLFT
FT                   SGTNVLNERMQKDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILADGIIPLSGSFEV
FT                   DILKEAERLITGEDIRNLPGLRCLCSRGLDAILGLRPIQQKRKKMCYFRTLDGNFHEVT
FT                   IRSETRDLQVWRDHGYLARPYACHIVDSDGIEFYDKSNGLYKGRVNVLVSGFAIPGRAY
FT                   QGPRSQVATEAAQI"
XX
SQ   Sequence 4845 BP; 1296 A; 1281 C; 1016 G; 1251 T; 1 other;
     gcttaaaaag cgaagtccac tttttaagcc ggtttaactt caaccgtgaa taccagggca        60
     aaattaatca acaccctcct ggaatcgccg gggtgttgcg agccataaga gactggttct       120
     aaaggactga catagcgccg cgagggtagg cggtcgatag cccgtttgag ggggtagtaa       180
     tactcctgat tctggtgaag catcgactgg ggccccctag cgtgagctca gcacgttgga       240
     aaaacgaaaa actgcatgtg cacagctttg cagtagcgtg agctcagggc accctaaaaa       300
     gtgctccgtt tcacaacaac ctatgcgttg ttgtgagact ctagtgtatt gcgtgcaacg       360
     gtatgtcagc tcccgagccc ttaaatactg aagtacgctc acctaatggt gttagtgaag       420
     ccaaagaaac tcaaaacttg gctatcactc aaagcggtgt gtcgaacgaa aaaataaccg       480
     acacacaaag tgatctgcaa acactcaaaa aacagttaca accggtcagc agatccacag       540
     atttcgaaac tctttataat tatttttatg gtttacaagt tcctgcttca acagatcgta       600
     ttggcaatgc tattcagcgt aacatcccag tcaatgatac gaacgaagtc gttagctttc       660
     cgcttacagc atcggtttca cacacatttt ccaatacgcc ggtacctgcc catatacagc       720
     ctctccaaat ctcagttgcc gacgactgcr tcaactacga gctagacgag agcggaacat       780
     tatgcccagc gttagatagc tctgttcacg tccaaagagc tacctccctt gctagcgctc       840
     tcaaagtcaa attaacaggc gaagttatgc attctgcctc agtcaggcca atccagacac       900
     cacagttgat cgcttatttg tatggcgtcc tccttgccgt ccaagatcgt atcaacatcc       960
     atcgcaacca acctactaac ttatggcgca gcttatgtgc acctggtcgc gctgctcaag      1020
     caaagccttt cttcgatgaa ttcgcaaaca acaaattcag ggcaggtccc ctcttggcac      1080
     ctcccctccc tgatgctggt ttcggtccat tcccggcaga aggcctcaat cagaattcca      1140
     agctcgactt caaatccaaa ggatacatct tctacaaaca gcgcacttac aacccagatg      1200
     atatgaatcg tgctttctgg ttcctttggg cgatctacaa ccgcatgcct ggagaattcc      1260
     aacaatacta tcagttgaac atcactttct gcacttccga gttaccagta caaaatccga      1320
     taccaaatgc cgatggcatc tcaaatgaac aatgtgaaaa agcacttctc ctcctcgaaa      1380
     aaattatcct cgaacttttc aataacgatc gcaaacttgc ttactactac atcttcaagg      1440
     gaagccaatt cgttatgcgt ccttgttcct gctaccaaga aggaggctta attcgcaagg      1500
     cttcacgtaa cgttgctctc cgtgctttta ctggcatcta ctacttagcc ggattcgcag      1560
     atcaatacgc taacatgatt tcatgtgctg cccatccagg tgtcattggt gctcttttcc      1620
     aatatgttga cacaatggtc ttacaagccg tcttctcgct ttctggtcct aagcttgttc      1680
     gtttcgctgc cccacccgaa tatcaaggtc gccacgcttg cccgttctca tttgtggctg      1740
     acgaaaacta ctggggtatt gctccaggct caaacgccga gccagttggc atgtactaca      1800
     tggatatcat tcaacgcaaa tccgaacata atttgtttgt cgacacattc atggacatct      1860
     atggttctac agcttcaatc atttgcgcta atatcgaaac tagcttattt acttctggca      1920
     ctaatgtttt gaacgaacgc atgcagaagg atttcgctcg tgatacaccc aaacctggaa      1980
     ctcttcgtca ccaacatgct atcatcaacc gcttccacga accagaatac gcttaccgcc      2040
     tcggcatcct tgcagatggc atcattccgc tcagtggctc attcgaagtc gatatcctca      2100
     aagaagctga gcgcctcatt actggtgagg atatccgcaa cctcccaggt ttacgttgct      2160
     tatgctctcg tggcctcgac gctatcctcg gtctccgtcc aattcaacag aagcgcaaga      2220
     agatgtgtta cttccgcacc ctcgacggca atttccacga agtaacaatc agatcggaga      2280
     ctcgcgatct acaggtctgg cgtgaccacg gctacctcgc ccgcccatac gcgtgccaca      2340
     ttgtcgactc agatggcatc gaattctacg acaagtccaa tggtctctac aagggacgcg      2400
     tcaacgtcct cgtttccgga tttgccattc caggacgcgc atatcagggc cctcgctcgc      2460
     aggtagcaac agaggccgcc cagatctaag cgacgtcccg gcgacaggaa gtctgtccaa      2520
     cctcatctac ctttctaagg caagtcggct accataccgt aagctgaagg aaggcgtgag      2580
     agcggcagac tacaccgtcg cccgcgagtt agcttgcgct tttcgcagtt ctcgcctaac      2640
     tcgccaaatg gatcatgtca cagatatagc ttaccttaat ttcttgagat gggtgttgtt      2700
     accttacaac ggtcaaactt tacgaccaca ccccaccatg tggcgtcaaa caccctaccc      2760
     cgaacatgtc aatttgaagt tcctaagtaa ggagacggag ctcgaacttt tcccactgaa      2820
     gaaggcccca caagccgatc ttaaagtgaa ttgttacgcg cgaaatatcc ttgcttctac      2880
     agagcttact gacgatatac tcaaacagag tttgcccatt ggtctcaata atgactcggt      2940
     ttgcggaatc gttattgttt tagagctact tctaattgca ggtgttccga gtaagttact      3000
     accagttatt ggtcaagcaa tcgccaataa agatccattt attaaggaac tgtccgactt      3060
     caacaagatg ataggagcga ccacttcccg tatcgctaac attcttacag agtgtaatac      3120
     attaataggt cgtggtgtta agtcatctga cccaagtgct gatttgtatc accgggtagc      3180
     gcccgagggc aataggcacg aggcgaagat ttctcgacac atcctcatcg aagccatcaa      3240
     caaaatttac aaaaacgaaa tgacagacat gcctccaccg ggtgacttca tgctccactt      3300
     gataacgagc cctctatggt gtaaggctgg ctctcaccat catccacact tcgccaagta      3360
     tgattcgcgc ttggaattcg ttatggatgt tccagcagac aaaatcgctg ttgaagcacc      3420
     ctctgtatac attactcaag ccgagaaatt agaacatggt aaaactagat acatttataa      3480
     ctgtgataca gtttcatact tgttctttga ttacatctta cactatgtcg aatgtgtgtg      3540
     gtcaaatgag tcagttctac tcaacccagc tgctatgagt gtcgagcgct ttagtatctt      3600
     ggattacccg caatattgca tgatcgatta cacagatttc aactctcaac acagtctcga      3660
     atcacagaag ctagtgttcg agtgtttgag accatactta ccaagcgaaa tgcatccagt      3720
     cttggattgg tgtattgcca gctttgagca catggaaatc aacggacaac attggttaag      3780
     cacgttgcct tcaggacata gggccacaac attcatcaac tcggtcctca ataaagcata      3840
     cctgatccca tacataggcg acgcggtttc cttccattgt ggtgacgacg tgttactatg      3900
     tggtgagtat gattaccaaa cactcattga taccctaccc tatgaattaa acaagagcaa      3960
     acagagcttc ggacctaatg ccgagttctt gcgcttgcat aggtgtggtg gtgacgttat      4020
     aggctatcca tccagagctg tttcgagtct tgtatctgga aattggttaa gcaagacatc      4080
     atgggagtgg cagccaagtc tcatttcggt tacaaatcaa tgcaatgtga tcatctcgcg      4140
     ttcacaattg aacatcaggt tcatccccgc aatgcaacaa gaactacgca atcgttacgc      4200
     agacaagatg agtgaacctt tcgatgtcag ctccaactac tacgtcatgc caggatgtcc      4260
     atgctatagc gacgccgcga cgacaatagt accgaatgtt ccccaactgg aacgttcgga      4320
     cgcaccgttt tcgcaggcac aaaaagtttt tgatgctatg cgcgacttct gtcctgagtt      4380
     cactactgtt ggcgatgtca tcgataaggt tagagctcgc cgatcttcaa gtgcagtcaa      4440
     gaacatcatg tacgacgtat gcgcgcctgt tgcaccacgt atcagtatcg tagtgaaccc      4500
     ggcacactat cagttcctct tacgcaagaa gtactaccca cgtgaacaca ttgcgcctac      4560
     tggctccgat aatacagatc gaaccaaact cgttttcgca acatacgatc tcgctccttc      4620
     aatcgccatg aagtcgtgcg ctgttttgac cccggctaag ataataagtg gtcacggact      4680
     acgcagtggt tgaataatct gccagtacca ggcaacgatt ggtaccggct tggccacgca      4740
     cggtctgctg tcttcggacc ctccgcctat aggttaatag gaacacagtg ttactgttgt      4800
     gtgtatcgct ctaggcacac gaacgtacta ccccacgttt agttc                      4845
//