ID   HQ607520; SV 1; linear; genomic RNA; STD; VRL; 4944 BP.
XX
AC   HQ607520;
XX
DT   08-MAY-2011 (Rel. 108, Created)
DT   08-MAY-2011 (Rel. 108, Last updated, Version 1)
XX
DE   Trichomonas vaginalis virus 4 strain TVV4-OC3, complete genome.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 4
OC   Viruses; Riboviria; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4944
RX   DOI; 10.1128/JVI.00220-11.
RX   PUBMED; 21345965.
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Talkington M.W.,
RA   Tang-Fernandez V., Suciu O., Demidenko A.A., Ghabrial S.A., Beach D.H.,
RA   Singh B.N., Fichorova R.N., Nibert M.L.;
RT   "Clinical isolates of Trichomonas vaginalis concurrently infected by
RT   strains of up to four Trichomonasvirus species (Family Totiviridae)";
RL   J. Virol. 85(9):4258-4270(2011).
XX
RN   [2]
RP   1-4944
RA   Goodman R.P., Freret T.S., Kula T., Geller A.M., Demidenko A.A.,
RA   Talkington M.W.T., Ghabrial S.A., Singh B.N., Fichorova R.N., Nibert M.L.;
RT   ;
RL   Submitted (12-NOV-2010) to the INSDC.
RL   Department of Microbiology and Molecular Genetics, Harvard Medical School,
RL   Armenise Building, Room 523, 200 Longwood Avenue, Boston, MA 02115, USA
XX
DR   MD5; c7cc323b1398c424713d3d6cf4f4bbb2.
DR   EuropePMC; PMC3126235; 21345965.
XX FH Key Location/Qualifiers FH FT source 1..4944 FT /organism="Trichomonas vaginalis virus 4" FT /host="Trichomonas vaginalis" FT /strain="TVV4-OC3" FT /mol_type="genomic RNA" FT /country="USA" FT /collection_date="Nov-2009" FT /db_xref="taxon:1008292" FT gene 339..4783 FT /gene="pol" FT CDS join(339..2535,2535..4783) FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:F5B2U6" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:F5B2U6" FT /protein_id="AED99794.1" FT /translation="MSAIAATISSANLNDLSRLAGAQPKEGVPAPVLQQNIATPKTGPP FT DPGEGTRKQTTDSPHSANPSTKEHTPAPTIQPDTPTPITDHSDSEEGPRTHSTDFQTLY FT EYFYSYPVPASQTRTGGAIARAGPVNDNNEVVSFTTETALVTSLTPKHIDANIQPLQIS FT IADDCVNYSCQYSGQTCPIFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQLI FT AYLYGALLAFGDRLNIHYGNKVNLWNALLGHNLQRGTPINGDNFNHHLLIDGPLAPPIL FT PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ FT AHSLNIDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDREIMYYYAFKG FT GQIFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPEARPLHAANHPGIIAALF FT QYVDTMVLQAVLSYSGPKLVHFGAAPEFCSKGSTPYDFIDPDNYWGIRAGVNAHPIGYY FT YLDILMRPKEHQLLDETLSDIYGHVGSLAMANIMASIASSGTEVLNQKMQKSFVRRGNQ FT VRALRHSHAIINRFHEPEYAYRLGILADGIIPLAGTHKCDIIDEATRLLQGEDIRNLPG FT LRCLRGRGLDAIIGIRPINKKRRAGFYTLDGNFHVVTNQCTSDVLQVWNDHGYIARPYA FT CHIVESINVEIYDRSNGAYNGWIQALVSGFGVPERCYMGPSSAGSRRRPLCPLKGSNCA FT VALHVDGQLTRASRVPYRKLTPSHLNCSKRCARQLAVIYRYQTLSPQLTEVSDSDYLAF FT LRWVLLPYTGATNRPHPKRWPKPFYPAEVSLKFLDKKTELQLFPLKKAPQADLKVNCFA FT RNLLYSSPLSDRILKQCIPVGTNNDTVCGLVILLELLFEAGVPLDLLPTISVAIAKNDP FT FVKALSDFNKMTGATTSHIANLLTECTTLLGRGVTASAPNADLYHRVAPEGNRHEAKIS FT DDVLRSAIRTIYKQEIKDCPKPGDFGLHLLTSPFWCKSGSHHHPQFPRYRNRLEFVMNT FT DPSAIMAVKPSVYITQAQKLEHGKTRYIYNCDTVSYLYFDYILNYVESIWANSHVLLNP FT DALNAEKFATLEYSEYCMIDYTDFNSQHTLTSMKAVFEVLKEFLPSEMFPVLDWCISSF FT DNMTIKDMKWRSTLPSGHRATTFINSVLNRAYLLPYIGTIVSYHCGDDVLLCGEHDYQH FT LITRLPYELNPSKQSFGPHAEFLRLHRHGEKVIGYPTRAVSSLVSGNWLSTTSWNWQPS FT 
LLSITNQINAIICRSQLSISRIRSLAQELRFRYCPLLDNYIDPATTSFVAAGCPSYQPT FT ATMITPDVPHLDAEEVEFTQLHQLAEYAINTYPWLNSVESVNQLVRSRMRKPAARDIHY FT SVLGPAIPLVSYHHHCDPMVVPLTRRYYPRDHLAPPITPQVLPPQPVFCDRDLSPIMAL FT KIAPAGVAVKVTADRPIASA" FT gene 339..2579 FT /gene="cap" FT CDS 339..2579 FT /codon_start=1 FT /gene="cap" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:F5B2U7" FT /protein_id="AED99793.1" FT /translation="MSAIAATISSANLNDLSRLAGAQPKEGVPAPVLQQNIATPKTGPP FT DPGEGTRKQTTDSPHSANPSTKEHTPAPTIQPDTPTPITDHSDSEEGPRTHSTDFQTLY FT EYFYSYPVPASQTRTGGAIARAGPVNDNNEVVSFTTETALVTSLTPKHIDANIQPLQIS FT IADDCVNYSCQYSGQTCPIFDGSQHVQSATALASSMKARLMCEVTQSLSARPVQQPQLI FT AYLYGALLAFGDRLNIHYGNKVNLWNALLGHNLQRGTPINGDNFNHHLLIDGPLAPPIL FT PAAGLGPFPSTTLGPNTTVTFKARASIFVRPQTYDYALVDAAFWLIYAMYSRMPVAFRQ FT AHSLNIDFFTVQPMAACVFPGHDGFTTPVIDQALGVLESMLVEMFNGDREIMYYYAFKG FT GQIFMRPCSCYQEGGLIRKASRNVSLASFTGIYSLIGYCAPEARPLHAANHPGIIAALF FT QYVDTMVLQAVLSYSGPKLVHFGAAPEFCSKGSTPYDFIDPDNYWGIRAGVNAHPIGYY FT YLDILMRPKEHQLLDETLSDIYGHVGSLAMANIMASIASSGTEVLNQKMQKSFVRRGNQ FT VRALRHSHAIINRFHEPEYAYRLGILADGIIPLAGTHKCDIIDEATRLLQGEDIRNLPG FT LRCLRGRGLDAIIGIRPINKKRRAGFYTLDGNFHVVTNQCTSDVLQVWNDHGYIARPYA FT CHIVESINVEIYDRSNGAYNGWIQALVSGFGVPERCYMGPRLQVAGGAPSAL" XX SQ Sequence 4944 BP; 1278 A; 1431 C; 1008 G; 1227 T; 0 other; gcttaaagtc ccagtgagct ttaagcacca gaagtcgcag caacaaacag ttgtacttta 60 catgagctgc gaggtggtta gaattaaaca cgcgttttgg aagtcgtcag acgcgtggtg 120 gtcacttcgg tgatcaagtt atactgacat ggcgccgggt ggttgacgac ccgggccgcc 180 tagttgtagc cctttggctc tttgccccaa gcgtggtttg aacctgcctt tagtggattc 240 tgttgagtgt aatatctcaa tggaggtgtc agaccggcct agatgtaata gtctaggttt 300 atggacagtg atgtctactt gtgcttccgt gctacgtcat gtcagctata gcagccacta 360 tttcttctgc taatcttaat gatttatcac gtttagcagg cgcccagcca aaagaaggcg 420 tccctgctcc tgtgcttcag caaaatattg ctacgccaaa aacaggtcct cctgaccctg 480 gagaaggaac aagaaaacaa acaacagatt caccacattc agcaaacccc tcgacaaagg 540 aacatactcc tgctcccaca atccagccag atactccgac tccaattaca gaccactccg 600 attccgaaga aggaccaaga acacattcta 
ctgattttca aactctttat gaatattttt 660 acagttaccc agttccagcc tcacagacca ggaccggcgg tgctatcgca cgcgctggcc 720 cagtcaacga caacaatgaa gtcgtctcct tcacaacaga aacagcattg gttacatcac 780 ttacaccaaa acatattgat gcaaatatcc aacctctcca gatctcaatc gcggatgact 840 gcgtcaatta ttcgtgccaa tacagcggtc aaacctgccc gatattcgat ggttcacagc 900 acgtccagag tgccacggct ctcgccagct ccatgaaggc gcgcctcatg tgtgaagtca 960 cacaatcttt atccgcacgc cctgttcaac aacctcaact cattgcttac ctttacggcg 1020 cgttactcgc attcggcgat cgcctcaaca ttcattacgg taacaaagtc aacctctgga 1080 acgccttact tggccacaat ttgcaaagag gtacgccgat caacggcgac aacttcaacc 1140 atcacttact catcgatggt cctctcgctc ctccaatact cccagctgct ggattaggtc 1200 cattcccatc gacgacattg ggacctaaca ccaccgtcac ctttaaggct cgcgcatcca 1260 ttttcgtccg tccacagact tacgattacg ctcttgtcga tgccgccttc tggcttatct 1320 acgccatgta ctctcgcatg ccagttgctt tccgccaagc acattctctc aatatcgact 1380 tcttcaccgt ccagccaatg gccgcctgtg tatttccagg acacgatggc ttcaccaccc 1440 cagttatcga tcaagctctc ggtgttcttg aatcaatgtt ggttgagatg ttcaacggtg 1500 atcgcgaaat catgtactac tacgctttca agggtggcca gatcttcatg cgtccttgct 1560 cctgctacca ggaaggaggc ctcatccgca aagcctcacg caatgtctca ctcgcttcgt 1620 ttacaggcat ctactcgctc attggttact gtgcaccaga agccagacca ctccatgcag 1680 ccaatcaccc aggcattatt gcagctcttt tccagtacgt cgacacaatg gtcttacagg 1740 ctgttctttc ctactctggt cccaaactcg tccacttcgg agccgcacca gagttctgct 1800 ccaaaggctc cacaccatac gatttcatcg accctgataa ctattgggga atcagggctg 1860 gtgtcaatgc acatccaatc ggttattact atctcgacat cctcatgcga ccaaaagaac 1920 accaactcct cgacgaaaca ctctccgata tctacggaca cgtcggttca ctcgccatgg 1980 caaacataat ggcaagcatc gcttcttctg gcaccgaagt tctcaatcag aagatgcaga 2040 aatccttcgt cagacgcggc aaccaagtcc gcgcgttacg tcactcccac gccattatca 2100 atcgcttcca cgaaccagaa tacgcctacc gcctcggaat ccttgcagat ggtatcattc 2160 ctttagccgg tacgcacaag tgcgatatta tcgacgaagc cactcgctta cttcagggag 2220 aagatatccg caatctccct ggtcttcgct gtttacgcgg ccgcggcctc gacgccatca 2280 tcggtattcg tccgatcaac aagaagcggc gtgcaggctt 
ctacacactc gacggcaatt 2340 tccacgtcgt cacgaaccag tgcacaagtg acgttctaca ggtctggaac gatcacggct 2400 acatcgcgcg tccttacgct tgccacatcg tcgaatccat caacgtcgaa atctacgaca 2460 gatccaatgg cgcttacaac ggatggattc aggcgctcgt cagcggcttc ggtgttccgg 2520 agcgctgcta catgggccct cgtctgcagg tagcaggagg cgccccctct gccctttaaa 2580 gggcagcaat tgcgcagtag cactgcatgt agatggacaa ctaactagag caagtcgggt 2640 accataccgt aagctcacac caagtcatct taattgctcg aagcgctgcg cacgtcaact 2700 ggcggttata tatagatacc agacactgag cccccagttg accgaggtta gtgatagtga 2760 ttatctcgct ttcctccgtt gggtcctgtt accttataca ggtgctacaa atagaccaca 2820 ccccaagcgg tggcctaaac cattttaccc agcggaggtg agtctcaagt ttttagataa 2880 aaagaccgaa ttacagttgt ttcctctcaa gaaggcccca caagccgacc tgaaagtcaa 2940 ctgcttcgca agaaacctac tttattcgtc acctctttcc gaccgtattc tcaaacaatg 3000 catcccagtt ggaacgaata acgatacggt ttgtggtctc gttatcttac ttgagctcct 3060 cttcgaagcg ggagtcccac tagacctcct acctactatc agtgtcgcta tcgcaaagaa 3120 cgatccattc gttaaagccc tttcggactt taacaagatg acgggcgcaa ccacctcaca 3180 cattgcaaac ctcttgacag agtgcacgac cttacttggc cgtggcgtca ccgcatccgc 3240 accgaatgcc gatttgtatc accgggtagc tcctgagggc aatcgacatg aggccaagat 3300 cagtgacgat gtgttgcgtt ccgccataag aaccatttac aaacaagaaa tcaaagattg 3360 tccaaagccg ggtgactttg gcttacacct cctcacaagt cccttttggt gcaagtccgg 3420 gtcacaccac catccacaat tccctcgata ccgcaaccga cttgaattcg taatgaacac 3480 tgatcccagc gctattatgg ctgttaagcc atccgtgtac attacccagg cacagaaact 3540 cgaacatggg aaaactcgat acatatataa ttgtgacaca gtctcgtatt tgtatttcga 3600 ctatatactg aactacgtag agagcatatg ggccaattct catgtgctac tcaatccaga 3660 cgctcttaac gcggaaaagt tcgccacact tgaatattcg gaatattgta tgattgacta 3720 cactgacttc aactcacaac acacccttac atctatgaaa gcagtcttcg aagtacttaa 3780 agagtttcta ccttcagaaa tgtttccagt tctcgattgg tgtatcagca gtttcgacaa 3840 catgacaatc aaagatatga aatggagaag cacattaccc tcaggccaca gagcgacaac 3900 attcatcaat tctgtactca atagagccta tttgttacct tacatcggta ctatcgtcag 3960 ttaccactgc ggtgatgatg tactcttatg cggtgagcac gattaccagc 
acttaataac 4020 gcgcctgcct tatgagttga acccgagcaa acaaagtttc ggaccacatg ccgaattcct 4080 acgtttgcat aggcatggtg aaaaagttat tggctaccct acacgtgcag tttcatcact 4140 agtctccggc aattggctta gtacgacaag ttggaactgg caaccatccc tgctatctat 4200 aacgaatcaa ataaacgcta tcatctgtcg ctcacagctc tccataagta gaatacgttc 4260 acttgcccaa gaattacgct tccgttactg tccattgctt gacaactaca tcgacccggc 4320 tacaaccagc tttgtagctg cgggctgccc ctcatatcag ccaactgcaa caatgatcac 4380 tcccgacgta ccccacctgg acgccgaaga agtcgaattc actcagctcc accagttggc 4440 cgaatatgct atcaatactt acccgtggct taactctgtc gagtccgtca accagcttgt 4500 cagaagcaga atgcgtaagc ctgcagcacg agacatccat tacagcgtac ttggtccggc 4560 tatcccactt gtttcttacc accaccactg tgatccaatg gtcgttcccc ttacgaggag 4620 gtattatcct cgagaccact tagcgcctcc gattacacct caagttttac ctcctcaacc 4680 agttttttgc gatagagatt tgtccccaat aatggcactc aaaatagctc ccgccggtgt 4740 ggcagtaaag gttactgcag accggccgat agccagtgct taaataggtg gccctctgca 4800 gtgggcctat aactgcagac tcagttggca ttaaacacca gttttttgcc agcacgccta 4860 cgtaagtagg aacagccgta tggctaacaa cccatacgtc tcagcaccag ctttgtgcta 4920 gacctatgac tcccggtctt cctc 4944 //