ID HQ607521; SV 1; linear; genomic RNA; STD; VRL; 4680 BP. XX AC HQ607521; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 1 strain TVV1-OC4, complete genome. XX KW . XX OS Trichomonas vaginalis virus 1 OC Viruses; Riboviria; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4680 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical isolates of Trichomonas vaginalis concurrently infected by RT strains of up to four Trichomonasvirus species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4680 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the INSDC. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX DR MD5; a56c7ef62cc4c94fa170f0db09f660b2. DR EuropePMC; PMC3126235; 21345965. XX FH Key Location/Qualifiers FH FT source 1..4680 FT /organism="Trichomonas vaginalis virus 1" FT /host="Trichomonas vaginalis" FT /strain="TVV1-OC4" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Dec-2009" FT /db_xref="taxon:674953" FT gene 325..4615 FT /gene="pol" FT CDS join(325..2351,2353..4615) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:F5B2U8" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:F5B2U8" FT /protein_id="AED99818.1" FT /translation="MEASANGLSHDENATRSQNVGPSTLPGSDKQGGEKHENSFNSFSN FT DFFFNFLRMSAQTHISDNPGVSFIGKDGTPYSSATIPSAVGRLTHNVVASAVQLNVTAD FT NVLEVDYGFGQDVSRSTGTITIPIFDGEKYKETARALAAIFNKKGMAVDVTSQTVQETL FT KNSDLTIATVAAGYYTALAARHELTKQVSVAAHSIPFVTAISDTLAAAQDAQRSSHVIS FT SCLRCPHCNNAQHDIGIGTNMWNNVSVESLSPQNMAVPNPNDISFFIPNKALPPPWWCA FT IWLLNAFIHSFVAPTHIHIFITPGETYHLAPFTDADIYEAIPIMLAMSKAARPVPESVE FT SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR FT SAKDFIIGAGTGLLQIILAYQAAFSCAGPIALHWHANDAISQGMDTIAGTYLEGRYFTI FT PMAVAVATNVAQYTTLIRTDPQYRHTLERILPRIFGPSTDTVYNFIESAISSSWVSIDA FT RRRNGRTRKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLNGGDLR FT NCPVLRTLKAAERDETITFMCKEKAGTLMAMDGTIRWFKRFETIDLTHLGWTSRGKVMK FT PYAFRAPIIQGITICNTAYTTTAIDIVTTVFGPLRQRVGSLLSKAVRCGPIIPTVKHHF FT NFKHVITTKRNDNEYIFIPGYGWVLQDDYLLNAVKMTGEGDLPPDQLPYDDDLLLSYAK FT ILLYDYITHFPKHRYNNPKILTQETELQLFPLKDDSAARTKVNFYARLLWNEATSDKTA FT FKPGTYNDTVAGLLMWQQCALMWSVPQSIINRVISGVCDALTDRTSLALLKRISDWLKQ FT LGLAYSPIHRLFIELPTLLGRGAIPGDAIHDIKHRLKFDPSITVDVPTDQLHRLIYRLL FT SRNLKVTTLDSFEDHLEERLLWSKSGSHYYPDDEVNKLLPHRPTRKEFLDIVTVDYIKR FT CKPQVFIRQSRKLEHGKERFIYNCDTISYVYFDYILKLFETGWQDSEAILSPGDYTNER FT LHAKISSYKYKAMLDYTDFNSQHTIESMRLIFETMKELLPSETAFALDWCIASFDNMRT FT SSGHKWVATLPSGHRATTFINTVLNWCYTQMVGLKFDSFMCAGDDVILMSQEPISLAPI FT LTSHFKFNPSKQSTGTRGEFLRKHYTAEGVFAYPCRAIASLVSGNWLSETLRDNTPMVV FT PIQNGIDRLRSRAGLLGVPWSLGLSELIEREGIPKEVGMALLNSHAAGPGLITRDYSSF FT TVTPKPPTITSTLEYTATRYGVQDLSKHVPWKQLTTEESRKLGQQIKKMSHRHCSQAKI FT TYKCIYEVFKPSGLPTVLSGVSQPSLSMVWWQAMLKEAMQNYSVKKIDAQMFASNACTS FT SVSGDAFLQATPKMAGVLMTSLIYSSS" FT gene 325..2361 FT /gene="cap" FT CDS 325..2361 FT /codon_start=1 FT /gene="cap" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:F5B2U9" FT /protein_id="AED99817.1" FT /translation="MEASANGLSHDENATRSQNVGPSTLPGSDKQGGEKHENSFNSFSN FT DFFFNFLRMSAQTHISDNPGVSFIGKDGTPYSSATIPSAVGRLTHNVVASAVQLNVTAD FT NVLEVDYGFGQDVSRSTGTITIPIFDGEKYKETARALAAIFNKKGMAVDVTSQTVQETL FT KNSDLTIATVAAGYYTALAARHELTKQVSVAAHSIPFVTAISDTLAAAQDAQRSSHVIS FT SCLRCPHCNNAQHDIGIGTNMWNNVSVESLSPQNMAVPNPNDISFFIPNKALPPPWWCA FT IWLLNAFIHSFVAPTHIHIFITPGETYHLAPFTDADIYEAIPIMLAMSKAARPVPESVE FT SMLYAYGTQMIIQPHSLYTEGGLIRKMIFTVPHLPAHGYFVTNSEYSRYMNIAVPNDPR FT SAKDFIIGAGTGLLQIILAYQAAFSCAGPIALHWHANDAISQGMDTIAGTYLEGRYFTI FT PMAVAVATNVAQYTTLIRTDPQYRHTLERILPRIFGPSTDTVYNFIESAISSSWVSIDA FT RRRNGRTRKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSTISQEVDYLLNGGDLR FT NCPVLRTLKAAERDETITFMCKEKAGTLMAMDGTIRWFKRFETIDLTHLGWTSRGKVMK FT PYAFRAPIIQGITICNTAYTTTAIDIVTTVFGPLRQRVGSLFE" XX SQ Sequence 4680 BP; 1385 A; 1175 C; 938 G; 1182 T; 0 other; gcaaaaagag ggagtgatcc actttcctct ttttgcaccc aacattgtta catcatcatg 60 acgaatccat aacgcggaca cataacaagc gtagtgtcct cgacgattgc catcctcgtg 120 tgaattccgg gctccgcttg cactgatggt acctcttacg aaacttggat agacttcggc 180 cttgaagagc ggtaatgtgc cctctgcgcc tgggacctaa tggcgtttat gctgtaggta 240 atttcagtag taggaaggag aggggtaaac atcctggttc gctaggtttg tcctcgcctt 300 aatctctact gatagcgaat acccatggag gcttctgcta atgggttatc acatgatgaa 360 aatgcgacaa gatcgcaaaa tgttggacct tctactcttc cggggtcaga taaacaagga 420 ggagaaaaac atgaaaattc ttttaattct ttttcaaatg atttcttttt taatttttta 480 cgtatgtctg cacaaacaca catttccgac aatccaggtg tttctttcat aggtaaagac 540 ggcacacctt attcatcagc cacaattcct tcagctgtag gccgtcttac acacaacgta 600 gttgcatcag ccgtccagct caacgtcaca gccgataatg ttttagaagt cgattacggc 660 ttcggtcaag acgtttcaag gtctacagga acaattacaa tcccaatttt cgacggcgag 720 aaatacaaag aaactgctcg tgctttagcc gcaattttca acaagaaagg catggcagtt 780 gacgtcacat cacagacagt ccaagaaact ctcaagaatt cggatcttac aattgctaca 840 gttgcagccg gatattacac agccttagct gctcgccacg aacttaccaa acaagtaagc 900 gttgcagctc attccatacc attcgttaca gcgatatcag atacactcgc agccgcacaa 960 gatgcgcaac gctcaagcca tgttatctct tcttgcttgc gttgccctca ttgcaataac 1020 gcacagcacg acatcggaat tggtacaaac atgtggaata acgtttccgt cgaaagtctc 1080 tcaccacaga atatggcagt tccaaatccc aacgacatat ccttcttcat tccgaataag 1140 gctctcccac ccccttggtg gtgcgctatt tggcttctca acgcgttcat ccatagcttc 1200 gtcgcgccga cacatatcca catcttcatc actccaggtg agacatatca tcttgcaccg 1260 ttcacggatg ctgatatcta cgaagccatc cctatcatgc tcgcaatgtc aaaggccgca 1320 cgtccagttc cagaaagcgt cgaaagtatg ctttatgcat acggcacaca gatgattatc 1380 cagccacact cgctctatac agaaggtgga ttaatcagaa agatgatatt cacagttcca 1440 catctcccag cacacggcta tttcgtcacg aactccgaat actcaaggta catgaacatc 1500 gcggttccaa acgaccctcg ttccgcaaaa gacttcatta tcggtgcagg aacgggtctc 1560 ttacagatca tactcgctta ccaagctgct ttcagttgcg ctggccctat cgcgcttcat 1620 tggcacgcaa acgacgctat ctcccaaggt atggatacaa tcgcaggcac ttaccttgaa 1680 ggaaggtact tcacaatccc tatggcagtc gccgtcgcta cgaatgtagc tcaatacaca 1740 acactgatca gaacagatcc tcaatacagg cacacactcg aacggatctt accacgcata 1800 ttcggtccat cgacagatac ggtctacaac ttcatcgaat ccgctatctc gtcatcctgg 1860 gtatcaatcg atgctcgcag acgcaacggt cgcacaagaa agttcagaac agccttcatc 1920 aatcgtttcc atgatccaga attcgcatac atgttcggca tcaccggcaa cggtatcgag 1980 cgaatggaag gtaaagtcac ttccaccatc agccaagagg tcgattacct cttaaacggc 2040 ggcgaccttc gcaattgccc agtcctccgt actctcaaag cggcagaaag agacgaaaca 2100 atcacgttca tgtgcaagga gaaagctggt acacttatgg ccatggacgg aacgattcgc 2160 tggttcaagc ggttcgagac aattgatctc acccatctcg gatggacatc acgtggtaag 2220 gtcatgaaac catacgcatt cagagctcca atcatccaag gaatcacaat ctgcaacaca 2280 gcatacacaa caacagccat cgacatcgtt actacagtct ttggcccatt acgtcagagg 2340 gtaggttccc tttttgagta aggctgtacg ttgtggccct ataataccaa ccgtcaagca 2400 tcatttcaat ttcaaacatg ttataacaac taaacgaaat gataacgaat atattttcat 2460 tcccggttac ggttgggtat tacaggatga ttatttgctg aatgccgtaa agatgactgg 2520 tgaaggcgat ttaccccctg accagttacc ttacgatgat gatcttttac tttcatacgc 2580 aaaaatttta ctttatgatt acataactca ttttcctaaa cacagataca ataatccaaa 2640 aatattgaca caagaaacag aactacaact tttcccactc aaagacgact cagctgctag 2700 aacaaaagtc aacttctacg ctaggttact atggaacgaa gcaacctcag acaaaacagc 2760 tttcaaacca ggaacttaca acgatacagt agcaggctta ttgatgtggc aacaatgtgc 2820 tctcatgtgg tccgtacccc agtctattat caacagagta attagcggtg tttgtgatgc 2880 attaaccgat aggacttcac tcgcgctatt gaaacgtatc tcagactggt tgaaacaact 2940 cggactagct tactcaccga tacatcgcct tttcatagag ctccccacat tattaggacg 3000 tggagccatc ccaggcgacg caattcacga tatcaagcac agactcaagt ttgacccatc 3060 aattacagtc gacgtaccaa cagaccagtt acacaggcta atctacagac tcttgtctcg 3120 aaacctcaag gtcactacgc tagacagttt tgaagatcac ttagaggaac gtctactttg 3180 gtccaaatca ggaagtcact attatcctga cgacgaagtc aataagttac ttcctcaccg 3240 cccaacaaga aaagaattcc tagatatagt aacagtggac tatatcaaac gatgcaagcc 3300 tcaagttttt atccgacaat cacgcaagct ggagcatggc aaggaacgct tcatctacaa 3360 ttgtgacacg atctcttatg tctattttga ttacatcctg aagctcttcg agacaggatg 3420 gcaagatagt gaagcaatat tgtcaccagg tgattacact aacgaacgcc tccacgccaa 3480 gatctctagc tacaagtata aagcaatgtt agattataca gatttcaatt cccaacatac 3540 gatcgaaagc atgcgtttga ttttcgaaac catgaaggaa ctactcccgt cagagacagc 3600 ttttgcactc gactggtgta tagcatcttt cgataacatg agaacatcca gtggtcacaa 3660 atgggttgca acccttccta gcggacatcg tgctaccacc ttcatcaata cagtattgaa 3720 ttggtgttac acgcagatgg tcggtctcaa gtttgacagt tttatgtgcg ctggtgatga 3780 cgtcatctta atgtctcagg aaccaatatc actagcccca attcttacat cacattttaa 3840 attcaatcct agcaaacaaa gtacaggcac tagaggtgag ttcttacgta agcattacac 3900 tgcagaaggc gtgtttgcat acccatgtag agcgatcgcg agtttagtaa gtggaaattg 3960 gttgagcgaa acactaagag ataacacccc aatggtggtc ccaatacaga acggaatcga 4020 taggctacgc agtagagcgg gtttactcgg agttccttgg agtttaggcc tctcagagct 4080 cattgagaga gagggcatac ccaaggaagt cggcatggct ttactcaatt cacacgcagc 4140 gggaccaggt ctcatcacac gtgattacag ttcattcaca gttacaccca aaccacctac 4200 gataactagt acacttgaat acacagcgac tcgttatggc gtccaagacc tgtccaaaca 4260 cgtaccttgg aaacaactca caacagagga aagtcgcaaa ttaggtcaac agattaagaa 4320 aatgagtcac aggcattgta gccaggctaa gataacttac aaatgcatct acgaggtttt 4380 taaacctagt ggactcccta cggtgttatc tggagtcagc caaccatcgt tgtcgatggt 4440 gtggtggcag gcaatgctta aggaagcaat gcagaactat tctgtcaaga agatagatgc 4500 gcaaatgttc gcgagtaacg catgtacaag ctccgttagc ggggatgcgt ttttacaagc 4560 gacacccaag atggctggcg tcctaatgac tagcctcatc tattcttctt cataacgtac 4620 agcaaagtct ctatagttgc tcaagactta taatgagcca gttggtctca ctataccttc 4680 //