ID HQ607524; SV 1; linear; genomic RNA; STD; VRL; 4671 BP. XX AC HQ607524; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 2 strain TVV2-OC5, complete genome. XX KW . XX OS Trichomonas vaginalis virus 2 OC Viruses; Riboviria; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4671 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical isolates of Trichomonas vaginalis concurrently infected by RT strains of up to four Trichomonasvirus species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4671 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the INSDC. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX DR MD5; f81af96cbab0817b379f8bacd92e253a. DR EuropePMC; PMC3126235; 21345965. XX FH Key Location/Qualifiers FH FT source 1..4671 FT /organism="Trichomonas vaginalis virus 2" FT /host="Trichomonas vaginalis" FT /strain="TVV2-OC5" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Jan-2010" FT /db_xref="taxon:674954" FT gene 296..4605 FT /gene="pol" FT CDS join(296..2378,2378..4605) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:F5B2V4" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:F5B2V4" FT /protein_id="AED99810.1" FT /translation="MASTLISSDNSATLGKDSEVINDTDTSPPDNPPSDHSNPQLTKIL FT DEMSKKPCANINEIRKMIRNFQPQFIQLRNGNRPNAQPRTVDSFEWVVRIQSTVETQLL FT GATNTVPQQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGS FT LQDAAQLNSPQYPQLLAYLFGQLIAIKDRLDLFKPSNPLSLADAIFGFTLAQHAHPRYD FT DHRHAKACTGPLVIPAATNSDCGPCGFVQINANQALTLPLGACLFVNPETVNDQSFQDF FT LWLVFATHHRMPNQMQNNWPFALNIVSTCAAPGRQAPHAGELTDERVRLALDTGHRILL FT AMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLNGLYYYNGATSYI FT VSPIHTDAHPGITAAIESFVDIMVLQAVFSFTGPKVVAAKVNANQLDPAMIFGPAIADG FT DGFVYDPLRPAPPLSAFYSEFIHRPAEQRIFQMAMSQIYGSHAPLVIANVINSIHNCKT FT KIVNSKLRATFVRRPAGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGSHEYNIL FT DELDYLFNGGDIRNCFGLNALNTRGLGQIVHIRPKREPGKRPRRGFYTTLDGQVHPVTQ FT DAPLDEIYQWRDHGNLTRPYSCHILDSQGLEFADVSNGRSRGKILVVVNSPLKTCAAYQ FT GPQLRANAGQRHVERINQCGGSVIYPRLSTARARSSSAIDTMTLTLANQLCYLYKSSDL FT HRQLDIMIPQSYLTFLEWLLRISPNREINSIRHFPSQDNHEIITCTLRNLSKEQEIMLF FT PIKDIIQANRRVNAYARNLLDASPLPDFALQQMLLPNTANDVVCAILLLGEVLWMLRCP FT ISIIVNISRAICRNDSFLKDLSDFNKMLGLTKIPIANCLTELNTLQGRGVTSSDAKRDL FT THRIADVNPHEAKISRENLKEAINQIYREEITKKEIPDTFRQHVFTSPLWVKKGAHHHP FT HFKSYDNRLEFVENVDLDEVLQSRPAVYITQAPKLEHGKTRFIYNCDTVSYIYFDYILN FT YVEGVWSNKHVLLNPDYMNPVIFSTLNYDEYCMLDYTDFNSQHSIESMKQVFSSLLPFL FT PTSMHRILQWCVTSFDNMYINNTHWNSTLPSGHRATTFINSVLNRAYLLPFLQVSNAFH FT TGDDVLLCGKADYGTLINTVPYELNKTKQSFGPSAEFLRLHKHNDQVSGYPARAISSLV FT SGNWLSFANPLWQPSLLSIMQQLYTISARSGLLPYIPVTMKLEVQRRYDLRSRITNGLF FT SGDIVPSGCPCYKSNAALLSAVVPDTVVKGSPNFYDLRTLDTLKQTSPWINSASKYMNL FT LZRRHMESDNKNVLYSIQYLPSKMLPIIDVDPADALPLQKRYHPRSHIAHPLPRDAHLK FT ELRFATCRVGPATAIRLGSLWPANRINLIKPVYV" FT CDS 296..2425 FT /codon_start=1 FT /gene="pol" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:F5B2V5" FT /protein_id="AED99809.1" FT /translation="MASTLISSDNSATLGKDSEVINDTDTSPPDNPPSDHSNPQLTKIL FT DEMSKKPCANINEIRKMIRNFQPQFIQLRNGNRPNAQPRTVDSFEWVVRIQSTVETQLL FT GATNTVPQQTLNLDISFTDDSTTITPASIPGSISMLDNSRHIPAIQSMIQNFKARYLGS FT LQDAAQLNSPQYPQLLAYLFGQLIAIKDRLDLFKPSNPLSLADAIFGFTLAQHAHPRYD FT DHRHAKACTGPLVIPAATNSDCGPCGFVQINANQALTLPLGACLFVNPETVNDQSFQDF FT LWLVFATHHRMPNQMQNNWPFALNIVSTCAAPGRQAPHAGELTDERVRLALDTGHRILL FT AMFNDDEETLRYYQRKGIETMFRPCCFYTEGGLLRKATRYVSMVPLNGLYYYNGATSYI FT VSPIHTDAHPGITAAIESFVDIMVLQAVFSFTGPKVVAAKVNANQLDPAMIFGPAIADG FT DGFVYDPLRPAPPLSAFYSEFIHRPAEQRIFQMAMSQIYGSHAPLVIANVINSIHNCKT FT KIVNSKLRATFVRRPAGAPHLKADTAIINRFHDPELAYALGILADGIAPLDGSHEYNIL FT DELDYLFNGGDIRNCFGLNALNTRGLGQIVHIRPKREPGKRPRRGFYTTLDGQVHPVTQ FT DAPLDEIYQWRDHGNLTRPYSCHILDSQGLEFADVSNGRSRGKILVVVNSPLKTCAAYQ FT GPSFAPMPGSAMWNE" XX SQ Sequence 4671 BP; 1338 A; 1249 C; 858 G; 1225 T; 1 other; gctttgaagg agtgacgacc ttctaagccc aggcctcgac agcctggtca gacactcctg 60 gggtccatca ggagacgggt cgctaacgcg aactggatgt cagtgtcaaa agccttgtgc 120 gccatggata cttggtacac ttctacggga gtaggtaacc cgcgaacctt gaacagtcat 180 gaggatttaa aactgttctc cgaactgcta gccttatgca gttgacgtgt attgagaggg 240 cttaattgtt gctttcatac taatattgta tgatcgtcag cttttaatta cgataatggc 300 ttcgacgcta atatcgtctg ataattctgc cacgttaggc aaagatagtg aagttattaa 360 cgacacagat acttcaccac ccgacaatcc acctagtgat cattcaaatc ctcagttaac 420 aaagattcta gatgagatgt ccaaaaaacc atgtgcaaat attaatgaaa taagaaaaat 480 gattagaaat ttccaacctc agtttattca attacgtaac ggcaaccgtc caaatgctca 540 gccccggaca gtagattctt tcgaatgggt agttcgtatt caaagcactg tcgagaccca 600 attacttggt gccacgaata ctgtccccca acaaactctt aacctcgata tctcctttac 660 tgatgattct accactatta ccccagcttc cattccgggc tcaatttcaa tgcttgataa 720 ttctcgtcac atcccagcga tccagagtat gatccaaaac ttcaaggctc gttatttagg 780 ttcattacaa gatgccgccc agctcaattc cccacagtat ccacaactcc tcgcttactt 840 attcggccag ctaatcgcta tcaaggaccg cctcgatctc ttcaagccat caaacccact 900 ttctctcgct gatgccatat tcggtttcac gttagctcaa cacgctcacc cacgttacga 960 tgaccacaga cacgccaaag cttgtacagg accactcgtt attccagcag cgaccaacag 1020 cgactgtggc ccttgcggtt tcgtacagat taatgcaaat caagccctca ctcttcctct 1080 tggtgcttgt cttttcgtca acccagagac ggttaatgat caatctttcc aagattttct 1140 ctggctcgtc ttcgcaacac accaccgcat gccaaatcaa atgcaaaaca attggccatt 1200 tgctctcaac atcgtctcaa catgcgcggc cccaggtcgt caagctcctc acgcaggcga 1260 actcactgat gagagggtcc ggctcgccct cgatacaggc catcgcattc ttctcgcaat 1320 gttcaacgac gatgaagaaa ctctccgcta ctaccagcgc aaaggaatcg aaacaatgtt 1380 cagaccatgc tgtttctaca ctgaaggcgg tttactcaga aaagctacca gatacgtttc 1440 tatggtccca ctcaacggct tatattacta caacggtgca acctcatata tcgtctcccc 1500 gatccacact gatgctcatc ctggcattac tgcagcaatc gaatcattcg ttgacattat 1560 ggtcttacaa gcagtattct ctttcacagg tcctaaagta gttgctgcta aagttaatgc 1620 caaccaactc gatccagcca tgatcttcgg ccctgcaatc gccgacggag atggtttcgt 1680 ttacgaccct ctccgcccag cgcctccact ttccgcgttc tactccgaat tcatccacag 1740 accagccgaa caacgcatct tccagatggc gatgagccaa atctacggtt cacatgctcc 1800 tcttgtcatc gccaacgtca tcaactccat ccacaattgc aagacaaaga ttgtcaacag 1860 caaattacgc gctaccttcg ttcgtcgtcc agccggcgct cctcatctca aggccgacac 1920 agctatcatc aaccgcttcc atgatccaga actcgcttat gctctcggaa ttctcgccga 1980 cggcatcgcc cctcttgatg gctcacatga atacaacatt ctcgatgaac tcgattactt 2040 gttcaacggt ggcgacatcc gcaattgctt cggcctcaac gccctcaaca ctcgtggttt 2100 gggccaaatc gtccacatcc ggccaaaacg cgaaccagga aagagacccc gccgcggttt 2160 ctacaccaca ctcgatggac aggttcaccc tgtcacacaa gatgctccac tcgatgagat 2220 ttaccaatgg cgcgaccatg gaaatctcac acgcccatat tcgtgccaca tcctcgacag 2280 tcaaggactc gaattcgccg atgtttccaa cggacggtca cgtggaaaga tcctcgtggt 2340 cgtcaactca ccactcaaaa catgcgctgc ctaccagggc cccagcttcg cgccaatgcc 2400 gggcagcgcc atgtggaacg aataaaccaa tgcgggggca gcgtcatcta tccgcgcttg 2460 agcactgctc gcgcccgcag ctcgtctgcc attgatacga tgacgttgac cctcgcaaac 2520 caactgtgct acctatataa gtcctcggac ttacaccgac agttggatat catgatacca 2580 caatcctact taacatttct tgaatggcta ctcagaatca gtcccaacag agaaataaat 2640 tcaatccgtc atttcccaag tcaagacaat cacgaaatta taacatgcac tctaagaaat 2700 ctttccaagg aacaagagat catgctattt ccgatcaaag acataattca agccaaccgt 2760 cgtgtaaatg cttacgcacg aaatctcctt gatgcctcac cgcttcccga ttttgcccta 2820 caacagatgc ttctaccaaa tacagcaaat gatgtagtgt gtgcaatctt actactcggt 2880 gaagtactat ggatgcttcg gtgcccaatc tcaatcatcg taaacatttc acgcgcaatt 2940 tgtcgtaacg atagcttttt gaaagattta tccgatttta ataagatgtt aggcttaact 3000 aagattccta tagctaattg tttgacggaa ctgaacactt tacaaggtcg aggagtaact 3060 tcaagcgatg ccaaaagaga tttgacccac cgaattgccg atgtcaatcc tcatgaggcg 3120 aaaataagta gagagaatct caaagaagca atcaatcaaa tatacagaga agaaatcact 3180 aagaaagaaa tccctgatac atttagacaa catgtattta catctccatt atgggttaag 3240 aaaggagcac accaccaccc acacttcaag tcttacgaca atcgattgga gtttgttgag 3300 aatgtagatt tggacgaggt cttacaatca cgccccgctg tctacatcac acaagccccg 3360 aaacttgaac atggtaagac tagattcatt tacaattgtg acacagtcag ctacatttac 3420 tttgattaca ttttgaatta cgttgagggt gtttggtcca ataaacatgt actcttaaac 3480 cctgattaca tgaatccagt tatctttagt actcttaatt acgacgaata ctgcatgtta 3540 gattacactg atttcaactc acagcattcc atcgaaagta tgaagcaagt cttttcaagt 3600 cttttacctt ttctgccaac gtccatgcac cgtatattac agtggtgcgt cacatcgttt 3660 gacaacatgt atatcaacaa tactcactgg aattccacac tgccatcagg acatagagca 3720 actacattta taaactctgt cttgaacagg gcatacttgt taccattcct acaagtctcc 3780 aatgcgtttc acacaggcga tgatgtactt ttatgcggaa aagcagacta tggcacgctt 3840 atcaataccg taccttatga actcaacaaa actaagcaat cattcggacc ttcagctgaa 3900 ttcctgcgtc ttcataaaca caatgatcaa gtttccggtt atccagcacg tgcaattagc 3960 agtctcgtaa gtggcaattg gttgtcattc gcaaatccac tatggcaacc ttcgctactg 4020 tcaattatgc aacaattgta caccatatca gcaagatcag gtctattacc atatattcct 4080 gtaacaatga agttagaagt gcagcggcgt tatgatttgc gatcacgaat taccaatgga 4140 ttgttctctg gtgacatagt cccaagcggt tgtccttgtt ataagtcaaa tgctgcttta 4200 ctaagtgcgg tagttcccga cactgtagta aagggttccc ctaactttta tgacttacgt 4260 acattggaca cgttaaaaca aacatcacct tggatcaatt ctgcatccaa atacatgaac 4320 ctcttasaac ggcgccatat ggaatctgat aacaaaaatg ttttatatag tatccaatat 4380 ttaccatcta aaatgttacc aataattgat gttgaccctg cagatgctct tccattacaa 4440 aaacggtatc acccacgttc tcacatcgca cacccactcc cacgagatgc tcatcttaag 4500 gaattaagat ttgcaacgtg tcgagtgggc ccggctactg cgataagatt aggatcgctt 4560 tggcctgcga acagaatcaa cctaatcaag ccagtctacg tctaagtacg actgacacaa 4620 tcttacataa ctactcggct agagtaggag agtaatatca actcttacgt c 4671 //