ID AF325840; SV 1; linear; genomic RNA; STD; VRL; 4844 BP. XX AC AF325840; XX DT 05-DEC-2001 (Rel. 70, Created) DT 04-MAR-2011 (Rel. 108, Last updated, Version 4) XX DE Trichomonas vaginalis virus 3 capsid protein (cap) and RNA-dependent RNA DE polymerase (pol) genes, complete cds. XX KW . XX OS Trichomonas vaginalis virus 3 OC Viruses; Riboviria; Totiviridae; Trichomonasvirus. XX RN [1] RP 1-4844 RX DOI; 10.1007/s00705-010-0858-y. RX PUBMED; 21110050. RA Bessarab I.N., Nakajima R., Liu H.W., Tai J.H.; RT "Identification and characterization of a type III Trichomonas vaginalis RT virus in the protozoan pathogen Trichomonas vaginalis"; RL Arch. Virol. 156(2):285-294(2011). XX RN [2] RP 1-4844 RA Bessarab I.N., Tai J.H.; RT "The complete DNA sequence of type III Trichomonas vaginalis virus"; RL Unpublished. XX RN [3] RP 1-4844 RA Bessarab I.N., Tai J.H.; RT ; RL Submitted (04-DEC-2000) to the INSDC. RL Division of Infectious Diseases, Institute of Biomedical Sciences, Academia RL Sinica, Taipei, Taiwan 11529, Republic of China XX DR MD5; aa89013195e15a41f9e6e1aa6b93c4fd. DR EuropePMC; PMC2526308; 17570970. DR EuropePMC; PMC3126235; 21345965. XX FH Key Location/Qualifiers FH FT source 1..4844 FT /organism="Trichomonas vaginalis virus 3" FT /mol_type="genomic RNA" FT /db_xref="taxon:170965" FT gene 360..2486 FT /gene="cap" FT CDS 360..2486 FT /codon_start=1 FT /gene="cap" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:Q8V616" FT /protein_id="AAL37369.1" FT /translation="MSAPEPLNTEVRSPNGVSEAIETQNMAVTQSSVSNEIKNDTQSDL FT QTLKKQLQPLYRSTDFNTLYNFFYGLDVPASTDRVGHAIQRNTSVNDTNEVVSFPLTAT FT VSHTFSNTPVPANIQPLQISVADDSVNYELDESGTLCPTLDSSVHVQRATSLASALKVK FT LTGEIMHSDSVRPVQTPQLIAYLFGVLLGVKDRVNIHRNQPTNLWRSLCSPGRAAQAKP FT FFDEFPNNKFRTGALLAPPLPDAGFGPFPAEGLNQNSKLDFRSKGYVFYKQRTYNPDEM FT NRAFWFLWAIYNRMPEDFQLSYPLNITFCTSELPVQNPMPGADAISNEQCEKALLLLEK FT VVLEFFNNDRKLAYYYVFKGCQFVMRPCSCYQEGGLIRKASRNVALRGFTGIYYLAGFT FT DQYANMISCAVHPGVVGALFQYVDTMVLQAVFSLSGPKLVRFAAPPEYQGRHACPFSFV FT ADENYWGIAPGIEAEPVGMYYMDIIQRKAEHDLFIETFMDIYGSTASIICANIETSLFT FT SGTNVLNHRMQKDFARDTPKPGTLRHQHAIINRFHEPEYAYRLGILANGVIPLSGSFEV FT DILKEAELLVTGEDIRNLPGLRCLCTRGLDAILGLRPVQQKRKKMCYFRTLDGNFHEVT FT IRSETRDLQVWRDHGYLARPYACHIIDSEGIEFYDKSNGLYKGRVNVLVSGFAIPGRAY FT QGPRLQVATMAAQI" FT gene 2645..4690 FT /gene="pol" FT CDS 2645..4690 FT /codon_start=1 FT /gene="pol" FT /product="RNA-dependent RNA polymerase" FT /note="translated via ribosomal frameshift" FT /db_xref="GOA:Q8V615" FT /db_xref="InterPro:IPR001795" FT /db_xref="UniProtKB/TrEMBL:Q8V615" FT /protein_id="AAL37370.1" FT /translation="MDHVSDIAYLNFLRWVLLPYNGQTLRPHPSVWRQTPYPEHVNLKF FT LNKEMELELFPLKKAPQADLKVNCYARNVLASTELTDDLLKQSLPIGLNNDSVCGIVIV FT LELLRIAGVPSKLLPIIGQAIANKDPFIKELSDFNKMIGATTSRIANILTECNTLIGRG FT VKSSDPSADLYHRVAPEGNRHEAKIPRHILIEAINKIYKNEMTDMPPPGDFKLHLITSP FT LWCKAGSHHHPHFAKYSSRLDFVMDVPADKIAAVPPSVFITQAEKLEHGKTRYIYNCDT FT XSYLFFDYILHYVECVWSNESVLLNPAAMSVERFSVLDYPEYCMIDYTDFNSQHSLESQ FT KLVFECLRPYLPSEMHPILDWCIASMDHMEIGGQHWLSTLPSGHRATTFINSVLNKAYL FT IPYIGDTTSFHCGDDVLLCGKYDYQTLIDTLPYELNKSKQSFGPNAEFLRLHRRAGDVI FT GYPSRAVSSLVSGNWLSKTSWEWQPSLISVTNQCNVIISRSQLNIRFIPAMQQELRNRY FT IDKMSEPFDVGSDYYVMPGCPCYSDAATTIVPNVPQLECSDVPFSQAQKVFDTMRDICP FT EFTTVNDVIDRVLARRTSNAVKNITYNVCAPVAPQVCIAVNPAHYQFLLRKKYYPREHI FT APPGFDDSTNSKLVFSTYDLAPSIAMKSCAVLTPAKIICGHGLRSG" XX SQ Sequence 4844 BP; 1270 A; 1287 C; 1025 G; 1258 T; 4 other; cttaaaaagc ctagtccact ttttaagccg gttagacttc accgtagata cttgggcaaa 60 attaatcaac accctcctgg aatcgccggg gtgttgcgag ccgtaagaga ccggttctaa 120 aggactgaca tagcgccgcg agggtaggtg gtcgatagcc cgtttgacgg agtagcgata 180 ttcctgattc tggtgtagca tcgacggggg ccccctagcg tgagctcagc acgttgtgaa 240 aacgaaaaac tgcatgtctg aggcttcgca gtagcgtgag ctcggagcac cctaaaaagt 300 gctctgtttt gtaacagcca acgattgtta cgaaactcta gtgtattgcg tgcaacggta 360 tgtcagctcc cgagccctta aatactgaag tacgctcacc taatggtgtt agtgaagcca 420 tagaaactca aaacatggct gtcactcaaa gcagtgtgtc aaacgaaatt aaaaatgaca 480 cacaaagtga tctgcaaaca cttaaaaaac agttacaacc gctctacaga tccacagatt 540 ttaatactct ttataatttc ttttatggtt tagatgtccc agcttcaaca gatcgcgtcg 600 gtcacgctat ccagcgtaac acctcagtca atgatacaaa tgaagttgtt agtttccctc 660 ttacggcgac ggtatcacac acgttttcaa acacgccagt tccagccaac atacagcctc 720 ttcaaatctc agtagccgac gattcagtca actatgagct cgatgaaagt ggaacattat 780 gtccaacgtt agatagctcc gtccatgtcc aaagagccac ttccttggcc agcgctctca 840 aggtcaaatt aacgggcgaa atcatgcact ctgactcagt cagaccagtc caaactccac 900 aattgattgc ttatttattc ggcgtcctcc tcggtgtcaa agatcgcgtc aacattcacc 960 gcaatcaacc cactaactta tggagaagtt tatgttcgcc tggtcgtgct gctcaggcaa 1020 agcctttctt cgatgaattt ccaaacaaca agttcaggac cggtgctctt ttggcacctc 1080 ctctccccga tgccggcttc ggtcctttcc cagctgaagg ccttaaccaa aattccaagc 1140 ttgattttag atcaaaggga tacgtcttct acaaacaacg cacttacaat ccagacgaaa 1200 tgaaccgtgc gttctggttc ctttgggcga tctacaatcg tatgcctgaa gatttccaac 1260 tatcataccc attaaacatt accttctgca cttccgaatt gccagtccaa aacccgatgc 1320 caggggccga cgcaatctca aatgagcaat gtgaaaaagc gctccttctc ctcgaaaaag 1380 tcgtcctcga attcttcaac aatgatcgca aactcgctta ttattacgtt ttcaaaggat 1440 gccagttcgt tatgcgccct tgctcctgct accaagaagg aggcttgatc cgcaaagcct 1500 cacgcaacgt tgccctccgt ggcttcaccg gtatctatta tttggccggc ttcacagatc 1560 aatacgccaa catgatttca tgtgctgttc acccaggtgt tgttggcgct ctcttccaat 1620 acgtcgatac gatggtttta caagcggttt tctccctctc tggcccaaag ttggtccgct 1680 tcgccgctcc gcctgaatat caaggtcgcc acgcttgtcc tttctcattt gtcgccgacg 1740 aaaattactg gggcattgct ccgggtattg aagccgaacc cgtcggcatg tattacatgg 1800 acatcatcca acgcaaggcc gagcatgatt tgttcatcga aaccttcatg gatatctacg 1860 gttcaacagc ttccatcatt tgcgccaata tcgaaacgag cttattcacc tctggcacta 1920 acgtcttgaa ccatcgcatg caaaaagact tcgctcgcga cactcctaag ccaggaaccc 1980 tccgtcatca acacgccatc atcaaccgct tccacgaacc tgaatacgcc taccgccttg 2040 gcatccttgc taatggtgtt atccctctca gcggctcatt cgaggtcgat atcctcaaag 2100 aagctgagct tctcgtcaca ggtgaagata tccgcaatct ccctggttta cgttgcttgt 2160 gcacacgcgg ccttgatgca atcctcggtc tccgcccagt ccagcagaag cgcaagaaga 2220 tgtgttactt ccgcactctc gatggcaact tccatgaagt aacaatcaga tcagagacgc 2280 gcgatttaca ggtctggcgt gatcacggct accttgctcg tccatacgcg tgccatatta 2340 tcgattcaga aggcatagaa ttctacgaca aatccaacgg tctctataag ggccgtgtca 2400 atgtccttgt ttctggattt gccatcccag gtcgcgcgta tcagggccct cgcttgcagg 2460 tagcaacaat ggccgcccag atctaagcga cgtcccggcg acaggaagtc tgtccaacct 2520 cattaccctt tccaaagcaa gtcggctacc ataccgtaag ctgagggaag gcgtgagagc 2580 gtcagactac accgtcgccc gcgagttagc tagcgccttt cgcaattctc gcctaactcg 2640 ccaaatggat catgtctcag atatagctta tcttaatttc ctgagatggg tgttgttacc 2700 ttacaacggt caaacattac gaccacaccc cagcgtgtgg cgtcagacac cctaccccga 2760 acatgtcaat ttgaagttcc tgaacaagga aatggagctc gaactcttcc cacttaagaa 2820 ggccccacaa gccgatctta aagtgaactg ttacgcgcga aacgtccttg cttccacaga 2880 gctaaccgac gacttactca agcagagttt acccataggt ctcaacaatg actcagtttg 2940 tggaatcgtc atcgttttag agctgcttcg gattgcaggt gttccaagta agttactacc 3000 aattattggc caagctattg ccaataaaga tccattcatt aaggaattgt ccgatttcaa 3060 caagatgata ggagcgacga cttcccgtat tgctaacatt ctcacagagt gcaatacatt 3120 gataggtcgc ggtgttaagt catctgatcc aagtgccgat ttgtatcacc gggtagcgcc 3180 tgagggcaat aggcacgagg cgaagattcc tcgacacatc cttatcgaag ccataaacaa 3240 aatctacaaa aacgaaatga cagacatgcc tccaccagga gatttcaagc tccacttaat 3300 aacgagcccc ctatggtgta aggctggttc tcatcatcat ccacacttcg ccaagtatag 3360 ttcacgcttg gatttcgtta tggatgttcc agcagacaaa attgctgctg taccaccctc 3420 ggttttcatc actcaagcgg agaaattgga acacggtaag actaggtaca tttataactg 3480 tgacacantt tcttacctgt tcttcgatta catcctacat tatgtcgaat gtgtgtggtc 3540 aaacgagtca gtcttgctca acccagctgc tatgagtgtt gagcggttta gcgtcctaga 3600 ctatccggag tattgcatga tcgattacac agatttcaat tctcaacaca gccttgaatc 3660 ccagaagctt gtattcgagt gtttgagacc gtacttgcca agcgaaatgc atccaatctt 3720 agattggtgt atcgctagca tggaccacat ggaaattggc gggcaacatt ggttaagcac 3780 gttgccttcg ggacatagag ctactacgtt tatcaactca gtcctgaata aagcatacct 3840 gattccttac ataggcgaca ccacctcttt ccattgcggt gacgacgtat tactatgtgg 3900 yaaatacgac tatcagacac tcattgatac cctaccctat gaattaaaca agagcaaaca 3960 gagctttgga cctaatgccg agttcttgcg cttgcatagg cgtgcgggtg acgtgatagg 4020 ctacccatca agggctgttt cgagtctcgt atctggaaat tggttaagta agacttcatg 4080 ggaatggcag ccaagtctca tttcggtcac gaatcaatgc aatgtgatca tctcacgctc 4140 acagttaaac atcaggttca tccccgctat gcagcaggaa ctacgcaatc gttacataga 4200 taaaatgagc gagcccttcg atgtcggctc ggattactac gtcatgccag gttgtccatg 4260 ttatagtgat gccgcgacaa caatcgttcc gaatgttccc caactggaat gttccgacgt 4320 accgttttcg caggcacaaa aggtttttga tactatgcgc gacatctgtc ctgagttcac 4380 cacagtcaac gacgtcatcg acagagttct agctcgtcgg acttccaatg ctgtcaaaaa 4440 catcacgtac aatgtttgtg ctcctgtagc acctcaagtt tgcatagccg taaacccagc 4500 acattaccaa tttctgttac gtaagaagta ctacccacgt gaacacattg cgccccccgg 4560 ctttgatgat tcaacgaact ctaagcttgt tttytcgact tacgatctag ctccttcaat 4620 tgctatgaaa tcgtgcgctg ttttgacccc ggcaaagata atatgcggtc acggactacg 4680 cagtggttga ttyagcacgc tagtaccgcg cgacagtcgg taccgtctag gccacgtaca 4740 gtctattgtc ctcggaccct ctgcctatag gttaatagga atacagtgtt actgttgtgt 4800 gtatcgctct aggcacacga acgtgctacc ccacgtttag ttca 4844 //