ID HQ607522; SV 1; linear; genomic RNA; STD; VRL; 4943 BP. XX AC HQ607522; XX DT 08-MAY-2011 (Rel. 108, Created) DT 08-MAY-2011 (Rel. 108, Last updated, Version 1) XX DE Trichomonas vaginalis virus 4 strain TVV4-1, complete genome. XX KW . XX OS Trichomonas vaginalis virus 4 OC Viruses; Riboviria; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4943 RX DOI; 10.1128/JVI.00220-11. RX PUBMED; 21345965. RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W., RA Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H., RA Singh B.N., Fichorova R.N., Nibert M.L.; RT "Clinical isolates of Trichomonas vaginalis concurrently infected by RT strains of up to four Trichomonasvirus species (Family Totiviridae)"; RL J. Virol. 85(9):4258-4270(2011). XX RN [2] RP 1-4943 RA Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A., RA Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.; RT ; RL Submitted (12-NOV-2010) to the INSDC. RL Department of Microbiology and Molecular Genetics, Harvard Medical School, RL Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA XX DR MD5; 372d9bef55daa8fa609720bbeb13e41b. DR EuropePMC; PMC3126235; 21345965. DR EuropePMC; PMC4300635; 25378500. XX FH Key Location/Qualifiers FH FT source 1..4943 FT /organism="Trichomonas vaginalis virus 4" FT /host="Trichomonas vaginalis" FT /strain="TVV4-1" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Dec-2009" FT /db_xref="taxon:1008292" FT gene 338..4782 FT /gene="pol" FT CDS join(338..2534,2534..4782) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:F5B2V0" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:F5B2V0" FT /protein_id="AED99796.1" FT /translation="MSAITATISSANLNDLSRSANAQQNNGVPALAPQQNIAKPNTGPP FT DPGDGTRQQTIPSSPKTDDHTKEPVSAPTTQQNVTTSDKELPDSAEGPRTHSTDFQTLY FT EYFYSYPVPASQTRTGGAIARNGPVNDNNEVVSFTTETQLVTSLTPRHIDANIQPLQIS FT IADDCVNYSCEYSGQTCPVFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQVI FT AYLYGALLAFGDRLNLHYGNKVNLWNALLGHNLQRATPVNGDAFNHHLLIDGALAPPIL FT PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ FT AYSLNVDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDRQIMYYYAFKG FT GQLFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPDARPLHAASHPGIIAALF FT QYVDTMVLQAVLSYSGPKLVHFGAAPEFATKGSTPYDFIDPDNYWGIRAGVNAHPVGYY FT YLDILMRQKEHQLLDETLSDIYGHVGSLAMSNIMASVASSGTEVLNQKMQKSFVRRGNQ FT VRALRHSHAIINRFHEPEYAYRLGILADGIMPLAGTHKCDIIDEATRLLQGEDIRNLPG FT LRCLRGRGLDAIVGIRPINKKRRAGFYTLDGNFHVVTNQCTSDVLQVWNDHGYIARPYA FT CHIVESINVEIYDKSNGAYNGWIQALVGGFGVPERCYMGPSSAGSRRRPLCPLKGSNRA FT VALHVDGQLDRASRVPYRKLAPCHLNCSKRCARQLAVIYRYQTLSRQLPEVSDEDYLAF FT LRWVLLPYTGATNRPHPKRWPKPFYPREVNLKFLDKETELQLFPLKKVPQADLKVNCFA FT RNLLYSSPLSDRVLKACIPVGTNNDTVCGLLVLLELLFEAGVPLDLLPTISVAIAKNDP FT FVKALSDFNKMTGATTSNIANLLTECTTLLGRGVTASAPSADLYHRVAPEGNRHEAKVS FT DDVLYSAIRTIYKQEIKDCPKPGDFGLHLLTSPFWCKSGSHHHPEFPSYRNRLEFVMNT FT DPDSIAAVKPSVYITQAQKLEHGKTRYIYNCDTVSYLYFDYVLNYIESIWANSHVLLNP FT DALNAEKFATLEYPEYCMIDYTDFNSQHALNSMKAVFKVLKEFLPSEMFPVLDWCISSF FT DNMTIKNTKWRSTLPSGHRATTFINSVLNRAYLLPYIGTIVSYHCGDDVLLCGDYDYQN FT LITRLPFELNPSKQSFGPHAEFLRLHRHGEKVVGYPTRAISSLVSGNWLSTTSWNWQPS FT LLSITNQINAIICRSQLSIHTIRSLAQELRFRYCPLLDDYIDPATTSFVAAGCPSYRPT FT ATMIVPDVPHLDAEEVEFTQLRRLAEYAINTYPWLNSTESVNQLVRNRTRKPAAKAIRY FT NILGSAVPLVCYHHHCDSMIVPLARRYYPRDHLAPPVTPQVLPPQPVFCDRNLSPIIAL FT KLAPAGVAVKVTADRPIASA" FT CDS 338..2578 FT /codon_start=1 FT /gene="pol" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:F5B2V1" FT /protein_id="AED99795.1" FT /translation="MSAITATISSANLNDLSRSANAQQNNGVPALAPQQNIAKPNTGPP FT DPGDGTRQQTIPSSPKTDDHTKEPVSAPTTQQNVTTSDKELPDSAEGPRTHSTDFQTLY FT EYFYSYPVPASQTRTGGAIARNGPVNDNNEVVSFTTETQLVTSLTPRHIDANIQPLQIS FT IADDCVNYSCEYSGQTCPVFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQVI FT AYLYGALLAFGDRLNLHYGNKVNLWNALLGHNLQRATPVNGDAFNHHLLIDGALAPPIL FT PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ FT AYSLNVDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDRQIMYYYAFKG FT GQLFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPDARPLHAASHPGIIAALF FT QYVDTMVLQAVLSYSGPKLVHFGAAPEFATKGSTPYDFIDPDNYWGIRAGVNAHPVGYY FT YLDILMRQKEHQLLDETLSDIYGHVGSLAMSNIMASVASSGTEVLNQKMQKSFVRRGNQ FT VRALRHSHAIINRFHEPEYAYRLGILADGIMPLAGTHKCDIIDEATRLLQGEDIRNLPG FT LRCLRGRGLDAIVGIRPINKKRRAGFYTLDGNFHVVTNQCTSDVLQVWNDHGYIARPYA FT CHIVESINVEIYDKSNGAYNGWIQALVGGFGVPERCYMGPRLQVAGGAPSAL" XX SQ Sequence 4943 BP; 1286 A; 1428 C; 1008 G; 1221 T; 0 other; gcttaaagtc ccagtgagct ttaagcacca gaagtcgcag catagcacca caatatttca 60 cattagctgc gaggtggtta gaattaaaca cgcgttttgg aagtcgtcag acgcgtggtg 120 gtcactttgg tgatcaagtt atactgacat ggcgccgggt ggttgacgac cctggccgcc 180 tagttgtagc ctttggctct ttgccccaag cgtggtttga acctgccttt agtggaatct 240 gtcgagttag atctcccgat agaggtgtca aaccgactta gatgtaatag tctaagttta 300 tggacagtga tgtccacttg tgcttccgtg ctacgtcatg tcagctataa cagccactat 360 ttcttctgct aatcttaatg atttatcacg ttctgcaaac gcccagcaaa ataacggcgt 420 ccctgctctt gcgcctcagc aaaatattgc taagccaaac actggtcctc ctgacccggg 480 agatggaaca agacaacaaa ctataccctc ttcacctaaa acagacgatc atacaaaaga 540 acctgtttcc gctcctacaa ctcaacaaaa cgttactacc tcagataaag aactccccga 600 ttccgccgaa ggaccaagaa cacactctac agattttcaa actctttacg aatattttta 660 cagttatcca gttccagcat cacagaccag gaccggcggt gctattgcac gcaatggtcc 720 agttaacgac aacaatgaag ttgtctcatt cactactgaa acacaattag ttacatcact 780 tacaccaaga catatcgatg caaatatcca gcctcttcag atctcgatcg ccgatgattg 840 cgtcaactat tcatgcgaat acagcggtca aacctgccca gttttcgatg gttcacagca 900 cgtccaaagc gccacagccc ttgctagttc catgaaggcg cgcctcatgt gcgaagttac 960 acaatcatta tctgcacgcc ctgtccagca accacaagtc attgcttatc tctatggtgc 1020 cttacttgca ttcggagacc gccttaacct tcattatggt aacaaagtca atctctggaa 1080 cgctttactc ggccacaatt tacaaagagc aacaccagtc aacggtgatg ccttcaatca 1140 ccacttgctc atcgatggcg ctctcgctcc tccaatactt ccggcagctg gattaggtcc 1200 attcccatca acaacattag gacccaatac caccgttaca ttcaaagctc gcgcatctat 1260 tttcgtccgt ccacaaactt acgactacgc gctcgtcgat gccgctttct ggcttatata 1320 cgccatgtac tctcgcatgc cagttgcttt ccgccaagcg tactcactca atgttgattt 1380 ctttactgtc cagccaatgg ccgcttgcgt atttccagga cacgatggtt tcacaacacc 1440 agttatcgac caagcgctcg gtgtcctcga atcaatgttg gtcgaaatgt tcaacggaga 1500 ccgccaaatc atgtactact acgctttcaa gggcggtcaa ctcttcatgc gcccttgttc 1560 ttgctaccaa gaaggaggcc tcatccgcaa agcctcacgc aatgtctcgc tcgcttcatt 1620 tacaggcatt tactcgctca tcggctactg cgcgccagac gccagaccac tccatgcagc 1680 cagtcaccca ggtataatcg ctgccctctt ccagtacgtc gatacaatgg tcttacaagc 1740 cgtcctctcg tattccggcc ccaagctcgt ccacttcggc gccgcacccg aattcgctac 1800 aaaaggctcc acaccatacg attttatcga ccctgataac tattggggaa tcagggccgg 1860 cgtcaacgca catccagtcg gttactacta cctcgacatc cttatgcggc aaaaagaaca 1920 ccaactcctc gacgaaacgc tttccgatat ttacggccac gtaggttcgc tcgcaatgtc 1980 aaacataatg gcgagcgtcg catcttctgg caccgaagtt ctcaatcaga agatgcagaa 2040 atccttcgtc cgacgcggca accaagtacg cgcattacgc cactcccacg ccatcatcaa 2100 ccgcttccac gaacccgaat acgcttaccg cctcggaatc ctcgcagatg gcatcatgcc 2160 cttagcgggt acgcacaagt gcgatatcat cgacgaggcc acacgcttac tccagggaga 2220 agacatccgc aatctcccag gcctccgttg cttacgcggt cgcggactcg atgccatcgt 2280 cggcatccgc cctatcaata agaagcggcg cgcaggcttc tacactctcg acggcaattt 2340 ccacgttgtt acaaaccagt gcacaagcga cgtccttcag gtttggaacg atcacggcta 2400 catcgcgcgc ccttacgctt gccacatcgt cgaatccatc aacgtcgaaa tctacgataa 2460 gtcaaatggt gcttacaacg gatggattca ggcactcgtc ggcggcttcg gtgttccgga 2520 gcgctgctac atgggccctc gtctgcaggt agcaggcggc gccccctctg ccctttaaag 2580 ggcagtaatc gcgcagtagc actgcatgta gatggacaac tagatagagc aagtcgggta 2640 ccataccgta agctcgcccc atgtcatctt aattgctcga agcgctgcgc acgacagttg 2700 gcggtaatat acagatacca gacactgagc cgccaattgc ccgaggtcag tgatgaagat 2760 tatctcgctt tcctccgttg ggtcctgtta ccttacacag gtgctacaaa tagaccacac 2820 cccaagcggt ggcctaaacc attttaccca agggaggtga atctcaagtt tttggataag 2880 gagacagaat tgcagttgtt ccctctcaag aaggtcccac aagccgactt gaaagtcaat 2940 tgcttcgcaa gaaacctcct ttattcgtcg cctctttccg atcgtgttct caaggcttgc 3000 attccagtag ggacgaataa tgatacggtt tgtggtctcc ttgtcttact tgagctcctc 3060 ttcgaagcgg gagtcccact agacctcctt cctactatca gtgtcgctat cgcaaagaat 3120 gatccattcg ttaaagccct ttccgacttt aacaagatga cgggtgcaac cacctcaaac 3180 atcgcaaacc tcttgactga gtgcacgacc ttacttggtc gtggcgttac tgcatctgcg 3240 ccaagtgccg atttgtatca ccgggtagct cctgagggca atcgacacga agctaaggtt 3300 agtgacgatg tgttgtattc cgccataaga accatttaca aacaagaaat caaagattgt 3360 ccaaaaccag gtgacttcgg cttacacctc cttacaagtc cattttggtg caagtctggg 3420 tcacaccacc atccagaatt cccttcatac cgaaaccgac tcgaattcgt aatgaacacc 3480 gatcccgact ctattgccgc cgttaaaccg tccgtgtaca ttactcaagc acagaaactc 3540 gaacatggta aaactcgata catatataac tgcgatacgg tctcgtacct atacttcgac 3600 tacgtactga actacataga gagcatatgg gccaactctc atgtgctact caatccagac 3660 gctctcaatg cggaaaagtt cgccacactc gagtatccgg agtattgtat gattgactac 3720 accgacttca attcgcaaca cgctctcaat tctatgaaag cagttttcaa agtactgaaa 3780 gagttcctgc cttcagaaat gttcccggtt ctagattggt gtatcagtag tttcgataac 3840 atgacaatca aaaacacaaa gtggagaagc accttaccct caggccacag agcgaccaca 3900 tttattaatt ctgtgttaaa cagggcctat ttgttgcctt acatcggtac tatcgtcagc 3960 tatcattgcg gtgatgacgt actcttgtgt ggtgactacg actaccaaaa cctaataaca 4020 cgcctgcctt ttgagttgaa tccaagcaag cagagtttcg gaccacatgc cgaattctta 4080 cgtttgcaca ggcatggtga gaaagtcgtt ggttacccta cgcgcgcaat ttcatcactg 4140 gtctccggca actggcttag tacgacgagt tggaattggc aaccatcctt actatctata 4200 accaatcaaa taaacgctat catctgccgc tcacagctct ccatacatac gatacgttca 4260 ctcgcccaag aattacgctt ccgttactgt ccactactcg acgactacat tgacccagct 4320 acaaccagct ttgtagccgc gggatgcccc tcatatcgac caaccgcaac aatgatcgtc 4380 cccgacgtac cccacctgga cgccgaagag gtcgaattta cccaactccg taggttggcc 4440 gaatatgcta tcaatactta cccgtggctt aactcaactg agtccgttaa ccagctcgtc 4500 agaaatagaa cgcgtaaacc tgcagctaaa gccatccgtt acaacatact cggttcagct 4560 gtcccacttg tttgttacca tcatcactgc gactcaatga ttgttccatt agcgaggagg 4620 tattatccac gagaccactt agcgcctccg gttacacctc aagttttacc tcctcaacca 4680 gttttttgcg atagaaactt gtccccaata atagcactca agctagcacc cgccggtgtg 4740 gcagtaaagg ttactgcaga ccggccgata gccagtgctt aaataaatgg ccctctgcag 4800 tgggcctatg actgcagatt cagttggcac ttaacaccag ttctttgcca actcgcctac 4860 gtaagtagga atagccgtat ggcaaacaaa ccatacgtct cagcaccagc ttagtgctag 4920 acctatgact cccggtcttc ctc 4943 //