ID HQ607517; SV 1; linear; genomic RNA; STD; VRL; 4684 BP. XX AC HQ607517; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 1 strain TVV1-OC3, complete genome. XX KW . XX OS Trichomonas vaginalis virus 1 OC Viruses; Riboviria; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4684 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical isolates of Trichomonas vaginalis concurrently infected by RT strains of up to four Trichomonasvirus species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4684 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the INSDC. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX DR MD5; 64df45ae05036f6a1b593e0687ed660f. DR EuropePMC; PMC3126235; 21345965. XX FH Key Location/Qualifiers FH FT source 1..4684 FT /organism="Trichomonas vaginalis virus 1" FT /host="Trichomonas vaginalis" FT /strain="TVV1-OC3" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Nov-2009" FT /db_xref="taxon:674953" FT gene 327..4617 FT /gene="pol" FT CDS join(327..2353,2355..4617) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:F5B2U0" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:F5B2U0" FT /protein_id="AED99816.1" FT /translation="MEASANGLSHDDNATRSQNVGPSTLPGSDKQGGEKHENSFNSFSY FT DFFFNFLRMSAHTHISDSPGVSFVGKDGTPYSTATIQSAVGRLTHNVIASAVQLNVTAD FT NVLEVDYGFGQDVSRSAGTITIPIFDGEKYKETARALAAIFSKKGMAVDVTSQTVQETL FT KNSDLTIATVAAGYYTALAARHELTKEVSMAAHTIPFVTALSDTFSAAPDAQRSSHVIS FT SCLRCPHSNNIQHDIGIGTDIWNNVSVESLSPQNMAVPNPNDVSFFIPNKALPSSWWCA FT IWLLNAFIHSFVAPTRFHIFIAPGETYHLAPFTDADIYEAIPIMLAMSKAARPVPESVE FT SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR FT SAKDFIIGAGTGLLQITLAYQAAFSCAGPIALHWHANDAISQGMDTIAETYLQGRYFTV FT PMAVAVATNVAQYTTLVRADPQYRHTLDRILPRIFGPSTDTVFNFIESAISSSWVSIDA FT RRRNGRTRKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLNGGDLR FT NCPVLRTLKAAERDETITFMCKEKAGTLIAMDGTVRFFKRFETIDLTQLGWTSHGKVMK FT PYAFRAPLINGITICNTAYTTTAIDIVTTVFGPLRQRVGSLLSKAVRCGPVIPAVKHHF FT NFKNVIVATRNNSEYTFIPGYGWVLQDDYLLNAVKMTGEGDIPPDQLPYDDDLLLSYAK FT ILLYDYITHFPKYRYNNPKILTQTTELQLFPLKDDSAARNKVNFYARLLWNEATSDKKA FT FKPGTYNDTVAGLLMWQQCALMWSVPQSIINRVISGVCDALTDRTSLALLKRISDWLKQ FT LGLAYSPIHRLFIELPTLLGRGAIPGNAILDIKHRLTFDPSITVDVPTDRLHRLIYRLL FT SRNLHITTASSFEDHLEERLLWSKTGSHYYPDDEVNKLLPHRPTRKEFLDIVTVDYIKR FT CKPQVFIRQSRKLEHGKERFIYNCDTISYVYFDYILKLFETGWQDGEAILSPGDYTNDR FT LHAKISSYKYKAMLDYTDFNSQHTIQSMRLIFETMKELLPPEMSFALDWCIASFDNMKT FT SDGHKWVATLPSGHRATTFINTVLNWCYTQMVGLKFDSFMCAGDDVILMSQEPISLAPI FT LTSNFKFNPSKQSTGTRGEFLRKHYTEEGVFAYPCRAIASLVSGNWLSDTLRDNTPMVV FT PIQNGVDRLRSRAGLLGVPWILGLSELIEREDLPKEVGMALLNSHAAGPGLITRDYSSF FT TVTPKPPTITSSLEYTATRHGVQDLSKHVPWKQLTTQECNRLGQQIKKMSHRHCSQAKI FT TYKCVYEVFKPNRLPTVLSDVSQPSLSMAWWQAMLKEAMQDYTVKKIDAQMFASNACTN FT SVSGDAFLRATPKMAGVLITSLISSSS" FT gene 327..2363 FT /gene="cap" FT CDS 327..2363 FT /codon_start=1 FT /gene="cap" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:F5B2U1" FT /protein_id="AED99815.1" FT /translation="MEASANGLSHDDNATRSQNVGPSTLPGSDKQGGEKHENSFNSFSY FT DFFFNFLRMSAHTHISDSPGVSFVGKDGTPYSTATIQSAVGRLTHNVIASAVQLNVTAD FT NVLEVDYGFGQDVSRSAGTITIPIFDGEKYKETARALAAIFSKKGMAVDVTSQTVQETL FT KNSDLTIATVAAGYYTALAARHELTKEVSMAAHTIPFVTALSDTFSAAPDAQRSSHVIS FT SCLRCPHSNNIQHDIGIGTDIWNNVSVESLSPQNMAVPNPNDVSFFIPNKALPSSWWCA FT IWLLNAFIHSFVAPTRFHIFIAPGETYHLAPFTDADIYEAIPIMLAMSKAARPVPESVE FT SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR FT SAKDFIIGAGTGLLQITLAYQAAFSCAGPIALHWHANDAISQGMDTIAETYLQGRYFTV FT PMAVAVATNVAQYTTLVRADPQYRHTLDRILPRIFGPSTDTVFNFIESAISSSWVSIDA FT RRRNGRTRKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLNGGDLR FT NCPVLRTLKAAERDETITFMCKEKAGTLIAMDGTVRFFKRFETIDLTQLGWTSHGKVMK FT PYAFRAPLINGITICNTAYTTTAIDIVTTVFGPLRQRVGSLFE" XX SQ Sequence 4684 BP; 1365 A; 1190 C; 939 G; 1190 T; 0 other; gcaaaaagag ggggtcatcc acttccctct ttttgcactc aacattttca cctcatcatg 60 acgaatccgt gacgcggaca tgataacaag cgtactgtcc tcgacgattg ccatcctcgt 120 gtgaattccg ggctccgctt gcactgatgg tacctcttac gaaacttgga gagacttcgg 180 cctcaaagag cggtaatgtg ccctctgcgc ctgggaccta atggtgtttt ctgctgtagg 240 tacttcagta gtaggaaggt gaagggttaa acatcctggt tcgctaggtt tgtccttgcc 300 ttatctctgc tgataattga atacccatgg aggcttctgc taatgggtta tcacatgatg 360 ataatgcgac aagatcgcaa aatgttggac cttctactct tccggggtca gataaacaag 420 gaggagaaaa acacgaaaat tcttttaatt ctttttctta tgatttcttt tttaactttt 480 tacgtatgtc agcacacact cacatttcag acagtccagg tgtttctttc gtaggtaaag 540 acggcacacc ttattcgaca gctacaatcc aatctgctgt aggccgtctc acacataacg 600 taatcgcatc agccgtccag ctcaacgtta cagccgacaa tgtcttggaa gtagattatg 660 gttttggcca agacgtctca agatctgcag gaaccatcac catcccaatt tttgatggtg 720 agaagtacaa agaaacagcg cgcgctttag ctgcgatctt cagcaagaaa ggtatggcag 780 tcgatgtcac gtcacagaca gtccaagaaa ccctcaagaa ttccgatctc acaattgcta 840 cggtagccgc aggatattac actgctttag ctgcccgtca tgaactcacg aaggaagtaa 900 gcatggcggc tcacactatc ccatttgtta ccgcattgtc cgacacgttc tcagctgcac 960 cagatgccca acgttcaagc catgttattt cctcttgctt gcgttgccct cattcgaaca 1020 atatccaaca cgacatcgga atcggtacag acatctggaa caacgtctct gtcgaaagtc 1080 tctcaccgca aaatatggca gttccgaatc ccaacgacgt atcattcttc attccgaaca 1140 aagctctccc atcctcttgg tggtgtgcca tctggcttct taatgccttc atccacagct 1200 tcgtcgcgcc gacacgcttc catatcttca tcgcaccagg cgaaacatac catcttgcac 1260 cattcacaga tgccgatatt tacgaggcta tcccaattat gctcgcaatg tcgaaggcag 1320 ctcgcccagt tccagaaagt gtcgaaagca tgctttacgc atatggcact cagatgatta 1380 tccagccaca ctcgctctac acagaaggtg gactcatcag aaaaatgata ttcacagttc 1440 cacaccttcc agcccacggc tattttgtta caaattccga atactcgaga tacatgaaca 1500 tcgcagttcc taacgatcct cgctctgcaa aggacttcat catcggtgca ggaacaggtc 1560 tcttacagat cacactcgct taccaggctg ctttcagctg cgctggccct attgcacttc 1620 attggcacgc aaatgacgcc atctcccaag gcatggatac gatcgcggaa acatacctcc 1680 aaggaaggta tttcacagtt cctatggcag tcgcagttgc tacaaacgtt gctcaataca 1740 cgacgctggt cagagccgat ccccaataca gacacacact cgaccggatc ttaccacgca 1800 tattcggacc gtcaacagat acagtcttca atttcatcga gtccgcaatc tcatcatctt 1860 gggtatcaat agacgcccgc cgacgcaacg gccgcacaag aaagttcaga acagctttca 1920 tcaaccgctt ccacgatcca gaattcgctt acatgttcgg tatcacaggc aacggtatcg 1980 aaagaatgga aggcaaagtc acctccacga tcagccaaga ggtcgattac ctcttaaacg 2040 gcggtgacct ccgcaattgc ccagtcctcc gcacactcaa ggcagcagaa agagacgaaa 2100 caatcacgtt catgtgcaaa gaaaaagccg gtacactcat cgccatggac ggaacagtcc 2160 gctttttcaa gcggttcgag acgatcgatc tcactcagct cggatggaca tcccacggta 2220 aggtcatgaa accatacgca ttcagagctc cacttatcaa cggaatcacg atctgcaaca 2280 cagcctacac aacgacagcc atcgacatcg ttactacagt ctttggtcct ttacgtcaga 2340 gggtaggttc cctttttgag taaggctgta cgttgtggcc ctgtaatacc agccgtcaag 2400 catcatttca acttcaagaa cgttatagta gcaacacgaa ataattccga atacacgttc 2460 attcccggtt acggttgggt attacaggat gactatttat tgaatgccgt aaagatgact 2520 ggcgaaggtg atatacctcc tgatcagtta ccttacgatg atgatctttt actttcatac 2580 gcaaaaattt tactttacga ttacataact cattttccta aatacagata caacaatcca 2640 aaaatattga cacagacaac agaactacaa ctttttccac tcaaagacga ctcagctgct 2700 agaaataaag tcaacttcta cgctagatta ttatggaacg aagcaacctc agacaagaaa 2760 gctttcaaac caggaactta caatgatact gtagcaggtt tactgatgtg gcaacaatgt 2820 gctctcatgt ggtccgtacc tcagtccatt atcaacagag taattagcgg tgtttgtgat 2880 gcattaaccg acaggacgtc actcgcgcta ttgaaacgta tctcagattg gctgaagcaa 2940 cttggactag cctactcacc gatacatcgc cttttcatag agctcccaac actactagga 3000 cgtggagcta tcccaggcaa tgcaattctg gatattaagc acagactcac attcgaccca 3060 tcaattacag tagacgtccc gacggaccgg ctacatagat tgatttacag acttctatct 3120 cgcaatctcc atatcacgac ggccagtagt ttcgaggatc acttagaaga aagactactc 3180 tggtctaaaa caggaagcca ctattatccc gacgacgaag tcaataagtt actccctcat 3240 cgtcctacaa gaaaagagtt cctagatata gttacagtag actacattaa gcgatgcaaa 3300 ccccaagttt ttatcagaca atcacgcaag ttggaacacg gcaaggaacg attcatctac 3360 aattgcgaca cgatttcata cgtctatttt gattacatcc taaagctctt cgagacagga 3420 tggcaagatg gcgaggcaat actatctcca ggtgattata ccaatgatcg tctccacgcc 3480 aaaatatcta gttacaaata caaagcaatg ttagattaca cagatttcaa ttcacaacac 3540 acgatccaaa gcatgcgctt aatttttgaa acaatgaaag agctactccc accagaaatg 3600 tcctttgcac tagactggtg tatagcatcc tttgataaca tgaaaacgtc cgacggtcac 3660 aaatgggttg caacccttcc tagcggacat cgtgctacaa cattcattaa cacagtatta 3720 aattggtgtt acacacagat ggtcggtctt aagttcgata gttttatgtg cgctggtgat 3780 gacgtcatct tgatgtccca agaaccaatt tcactagccc caatacttac ctctaatttc 3840 aaatttaatc ccagcaaaca aagcacaggt actagaggtg agtttctacg taaacattat 3900 acggaagaag gtgtttttgc atatccatgt cgagcaattg ccagtttagt aagtggaaat 3960 tggttgagcg atacactaag agataacacc ccaatggtgg tccctataca gaatggagtc 4020 gatagattac gcagtagagc aggtttactc ggagtccctt ggattttagg cctctcagag 4080 ctcattgaga gagaggactt acccaaggag gtcggcatgg ctttactaaa ttcacacgca 4140 gcgggaccag gtctcatcac acgcgattac agttccttca cagttacgcc gaaaccacct 4200 acgataacta gttcacttga atacactgca actcgtcatg gtgtccagga cttgtcaaaa 4260 cacgtaccat ggaaacagct tacaacacaa gaatgcaata ggttaggtca acaaattaag 4320 aaaatgagtc acaggcattg tagccaggct aagataactt acaaatgtgt ctatgaggtt 4380 ttcaaaccca ataggctccc cacggtgtta tctgacgtca gccagccatc gttgtcgatg 4440 gcgtggtggc aggcaatgct taaggaagca atgcaagatt acactgtcaa gaagatagat 4500 gctcaaatgt tcgcgagtaa cgcatgtaca aactccgtta gcggggatgc gtttttacga 4560 gcgacaccca agatggctgg cgtcttaatc actagcctca tctcttcttc ttcataacgt 4620 acagcaaaaa gtctctgtag ttgctcaaga cttataatga gccagttggt ctcagtatac 4680 cttc 4684 //