ID   U57898; SV 1; linear; genomic RNA; STD; VRL; 4648 BP.
XX
AC   U57898;
XX
DT   14-SEP-1996 (Rel. 49, Created)
DT   22-SEP-2009 (Rel. 102, Last updated, Version 3)
XX
DE   Trichomonas vaginalis virus 1 strain TVV-T5 capsid protein (cap) gene,
DE   complete cds; and RNA-dependent RNA polymerase (pol) gene, partial cds.
XX
KW   .
XX
OS   Trichomonas vaginalis virus 1
OC   Viruses; Riboviria; Totiviridae; Trichomonasvirus.
XX
RN   [1]
RP   1-4648
RX   DOI; 10.1006/viro.1996.0446.
RX   PUBMED; 8806533.
RA   Su H.M., Tai J.H.;
RT   "Genomic organization and sequence conservation in type I Trichomonas
RT   vaginalis viruses";
RL   Virology 222(2):470-473(1996).
XX
RN   [2]
RP   1-4648
RA   Tai J.-H.;
RT   ;
RL   Submitted (10-MAY-1996) to the INSDC.
RL   Infectious Diseases, Institute of Biomedical Sciences, Academia Sinica,
RL   IBMS, Academia Sinica, Taipei, Taiwan, 11529, ROC
XX
DR   MD5; 4c19a5e5c04a53c2be1f2de90262bc46.
DR   EuropePMC; PMC2526308; 17570970.
DR   EuropePMC; PMC6356713; 30585188.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4648
FT                   /organism="Trichomonas vaginalis virus 1"
FT                   /host="Trichomonas vaginalis"
FT                   /lab_host="Trichomonas vaginalis"
FT                   /strain="TVV-T5"
FT                   /mol_type="genomic RNA"
FT                   /proviral
FT                   /db_xref="taxon:674953"
FT   5'UTR           1..285
FT   gene            286..2322
FT                   /gene="cap"
FT   CDS             286..2322
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /db_xref="UniProtKB/TrEMBL:Q98752"
FT                   /protein_id="AAC55468.1"
FT                   /translation="MEASANGLSHDDKANNSQNVGPSTLPGSDKQGGEKHENSFNSFSN
FT                   DFFFNFLRMSMNTHISDSPGVSFIGKDGAPYSSVTIQSAVGRLTHNVVASAVQLNVTAD
FT                   NVLEVDYGFGQDVSRATGTIPIPIFDGEKYKETARALAMIFSKKGMSVDVTSQTVQETL
FT                   KNSDLTIATVAAGYYTALAARHELTKAVSVAAHTIPFATALSDTFTAAPNAQRSSHVIS
FT                   SCLRCPASGNIQHDIGIGSTIWTNVIVESLSPQNMAVPNPDDISFFIPNKALPSSWWCA
FT                   IWLLNAFLHSFVAPTRIHIFITQGETYHLAPFTDSDVYEAVRFLLAMSKSSRPMPESVE
FT                   SMLYAYSTQMIIQPHSLYTEGGLIRRMIFTVPHLPAHGYFVTNSEFSRYMNIAVPDDPR
FT                   SAKVFVIGAGTGLLQIVLAYQAAFSCAGPIALHWHANDAISQGMDTVASTYLQGRYFTI
FT                   PMAVNVATNVARYTTTVRADPQYKRTLDRILPRIFGPSTDTIFEFIESAISSSWVSIDA
FT                   VRRSGRARKFRTAFINRFHDPEFAYMFGITGNGIERMEGKVTSNIAQEVDYLMNGGDLR
FT                   NCPVLRTLKAAERDETITFMCKEKVGSIFAIDGTVRVLKQYQTIDLSQLGWTSHGKVMK
FT                   PYAFRAPVIQGITICNTAYTTTAIDIVTTVFGPLRQRVGTLFE"
FT   gene            <2306..4576
FT                   /gene="pol"
FT   CDS             <2306..4576
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /note="translated via ribosomal frameshift"
FT                   /db_xref="GOA:Q98753"
FT                   /db_xref="InterPro:IPR001795"
FT                   /db_xref="UniProtKB/TrEMBL:Q98753"
FT                   /protein_id="AAC55469.2"
FT                   /translation="VPFLSKAVRCGPVIPFVIHHFNFRRVTTTKRRRNKYVLVPGYGWV
FT                   LQDDYLVNSVKMTGENDLPPNQLPHDDDLLFTYAKILLYDYISYFPKFRHNNPDLLDHK
FT                   TELELFPLKADSAARNKANFYARTLWNDTITDKSAFKPGTYNDTVAGLLLWQQCALMWS
FT                   LPKSVINRTISGVCDALTNRTSLTLLKRISDWLKQLGLACSPIHRLFIELPTLLGRGAI
FT                   PGDADKDIKHRLAFDPSITVDVPKEQLHLLIYRLLSRNLNITKVNSFEHHLEERLLWSK
FT                   SGSHYYPDDKINELLPPQPTRKEFLDVVTTEYIKECKPQVFIRQSRKLEHGKERFIYNC
FT                   DTVSYVYFDFILKLFETGWQDSEAILSPGDYTSERLHAKISSYKYKAMLDYTDFNSQHT
FT                   IQSMRLIFETMKELLPPEATFALDWCIASFDNMQTSDGLKWMATLPSGHRATTFINTVL
FT                   NWCYTQMVGLKFDSFMCAGDDVILMSQQPISLAPILTSHFKFNPSKQSTGTRGEFLRKH
FT                   YSEAGVFAYPCRAIASLVSGNWLSQSLRENTPILVPIQNGIDRLRSRAGLLGVPWKLGL
FT                   SELIEREAIPKEVGMALLNSHAAGPGLITRDYSSFTVTPKPPKLSSTLEYTATRYGLQD
FT                   LSKHVPWKQLTTVESDKLSRQIKKISYRHCSQAKITYNCTYEVFKPRGLPTVLSGSSQP
FT                   SLSMLWWQAMLKQAIQDDSTKKIDARMFAANACTSSVSGDAFLRANASMAGVLITSLIT
FT                   SSS"
FT   3'UTR           4577..4648
XX
SQ   Sequence 4648 BP; 1351 A; 1194 C; 898 G; 1205 T; 0 other;
     actcaacatt ttcactccgt catgacgaac tcataacgcg gacatataac aagcgtaatg        60
     tcctcgacga ttgccatcct cgtgtgaact ccgggctccg cttcactgat gtacctctta       120
     ctaagctgga gagacttttt agtcttgaag agccgtaatg tgccctctgc gcctgggacc       180
     taatggcgct tttgctgtag gtactttata gaagaagaat gagggttctc aacatactag       240
     ttcgctggta tgtcctattc ctacgctata aagaaataaa tacccatgga ggcttctgct       300
     aatgggttat cacatgacga taaagcgaat aattcgcaaa atgttggacc ttctactctt       360
     ccggggtcag ataaacaagg aggagaaaaa catgaaaatt cttttaattc tttttctaat       420
     gatttctttt ttaacttttt acgtatgtcc atgaacactc acatttcaga cagtccaggc       480
     gtttctttca tcggaaaaga cggtgcacct tactcatcag taacaattca atcagccgta       540
     ggccgtctta cacataacgt agttgcatca gccgttcaac tcaatgtaac agcagacaac       600
     gttttagaag tcgattacgg tttcggtcag gatgtttcaa gagctaccgg aacaatccca       660
     attccaattt tcgacggcga gaaatacaag gaaactgctc gtgccttagc tatgatcttc       720
     agtaagaaag gcatgtcagt tgatgttaca tcccaaacag tacaagaaac acttaagaac       780
     tccgatctca ctatcgcgac agttgcagcc ggatattaca cagctttagc tgcacgccac       840
     gaactcacga aagctgttag tgttgcagcc cacacaattc ctttcgccac cgccttgtcc       900
     gacacattca cggcagctcc aaatgcacag cgttcaagcc acgttatttc ttcttgctta       960
     cgctgtccag cttcgggcaa tatccaacac gacatcggaa tcggttctac catctggact      1020
     aatgtcatcg tcgaaagtct ttcaccacag aatatggcag ttccaaatcc agacgacata      1080
     tcattcttca ttccgaacaa agccctccca tcttcttggt ggtgtgcgat ttggctcctc      1140
     aacgcatttc ttcactcctt tgttgcgcca actcgtatcc acatcttcat tacacaagga      1200
     gaaacatacc acctcgctcc tttcaccgat tcggatgtct acgaggccgt tcgtttcttg      1260
     ctcgcaatgt caaagtcatc acgcccaatg ccagagagcg tcgagagtat gttatatgca      1320
     tacagcacac agatgatcat ccaaccacat tcgctctaca cagagggagg cttgatcaga      1380
     agaatgatct ttacagttcc acaccttcca gctcatggtt acttcgtcac gaattccgaa      1440
     ttctcgagat acatgaatat cgctgttcca gacgacccgc gttctgcaaa agtcttcgtt      1500
     atcggtgcag gaacaggtct cttacaaatc gtactggctt accaagctgc tttcagctgt      1560
     gctggcccta ttgcacttca ctggcacgca aacgatgcca tctcacaagg catggataca      1620
     gttgcgagta cataccttca gggaagatac ttcaccattc ctatggctgt caacgtcgcc      1680
     acaaacgtcg ctcgatacac tacgacagtt agagcagacc ctcaatacaa gcgtacactc      1740
     gatcggatct taccacgcat cttcggccca tcaactgaca caatattcga gttcatcgaa      1800
     tcggctatct cgtcatcttg ggtctccatc gacgctgtca gacgcagcgg tcgcgctcga      1860
     aagttcagaa cagctttcat caatcgcttt catgatccag aattcgctta catgttcggt      1920
     atcactggca acggcatcga gagaatggaa ggtaaggtta cttcaaacat cgcccaagaa      1980
     gtcgattacc tcatgaatgg tggcgacctc cgcaactgcc ctgttctccg cacacttaag      2040
     gcagcagaga gagatgaaac tatcaccttc atgtgcaagg agaaagtcgg ttccattttc      2100
     gcgatcgatg gtactgtccg cgtactcaaa cagtatcaga ctatcgatct ctcccaactc      2160
     ggttggactt cccacggcaa ggtgatgaaa ccttacgctt tcagagctcc agtcatccaa      2220
     ggaattacca tctgcaacac agcttacaca accacggcca tcgacattgt cacaacagtc      2280
     tttggtccct tacgccaacg tgtaggtacc ctttttgagt aaagctgtac gttgtggccc      2340
     tgtaatacca ttcgtcatac accatttcaa cttcagacgt gttacaacta ctaaacgacg      2400
     acgcaataaa tacgtacttg tccccggata tggatgggta ttacaggatg actatttggt      2460
     taattccgtc aaaatgactg gtgaaaacga tttaccccca aaccagttac ctcatgacga      2520
     tgatctttta tttacatacg caaaaatttt actttacgac tacatatctt attttcctaa      2580
     attcagacac aataatccag acttactaga tcacaaaaca gaactagaac ttttcccact      2640
     caaagctgat tcagctgcta gaaataaagc aaacttctac gcaagaactt tatggaatga      2700
     tactatcaca gataaaagcg ctttcaaacc aggaacttat aatgatacag ttgcaggtct      2760
     gttattatgg caacagtgtg ctctcatgtg gtcattaccc aagtcagtga tcaacagaac      2820
     aattagcggt gtttgtgatg cactaaccaa caggacttca ctcacgctat taaaacgtat      2880
     ctcagattgg ctaaaacaac ttggactggc ctgctcaccg atacatcgcc tattcatcga      2940
     actccctaca cttctaggac gcggtgcgat cccaggcgat gctgacaaag atataaagca      3000
     cagactcgct ttcgacccat caataacagt cgatgtccca aaagaacagt tacatctact      3060
     gatctacaga ctcttatcca gaaatctcaa tatcactaaa gtcaatagtt ttgaacacca      3120
     cctggaagag cgcttacttt ggtccaaatc aggaagtcac tactaccccg acgacaagat      3180
     caacgagtta cttcctccgc aacctactag aaaggaattc ttggatgttg tcacgacaga      3240
     atacattaag gagtgcaagc ctcaagtctt catcagacag tctcgtaaac tcgaacacgg      3300
     taaggaacga ttcatctaca attgcgacac agtctcatac gtctattttg attttatctt      3360
     gaagctcttt gagacaggat ggcaagatag cgaagcaata ttgtcgccag gcgactacac      3420
     tagtgaacgt ctccacgcta agatttccag ttataagtat aaagccatgt tagactacac      3480
     agacttcaac tcacaacata caatccaaag catgcggttg atcttcgaaa ccatgaaaga      3540
     gttactccct ccagaagcga cttttgctct cgattggtgt atcgcctcat ttgacaacat      3600
     gcaaacatca gacggtctca aatggatggc tactctccct agtggacacc gtgccactac      3660
     attcattaat actgtcctaa attggtgtta cactcagatg gtcggtctca aattcgatag      3720
     tttcatgtgc gctggtgatg atgttatcct aatgtcccaa caacccatat cactagcacc      3780
     aattcttaca tcacatttta agttcaatcc aagcaagcaa agcacgggta ctagaggtga      3840
     attcttacgc aagcactata gcgaagcagg tgtcttcgca tacccatgtc gagcgatcgc      3900
     tagcttagtg agcggaaatt ggctaagtca atcactaaga gagaacaccc caatcctggt      3960
     ccctatacaa aacggaatcg atagattacg cagtagagca ggtctactcg gagttccttg      4020
     gaaactaggt ctctcagagc tcattgagag agaggccatt cctaaggaag tcggcatggc      4080
     tctattgaat tcacacgcag cagggcccgg tctgattact cgagactaca gttctttcac      4140
     agttacgccc aaacccccca agttaagcag cacactcgaa tacaccgcaa cccgttacgg      4200
     tcttcaagat ttatccaaac acgtcccatg gaaacaactc acaacagttg aatctgataa      4260
     gttaagtcga caaattaaga aaataagtta caggcattgc agccaggcga agataactta      4320
     caattgtacc tacgaagttt ttaaaccacg tgggctccct acagtgttat ccggttccag      4380
     ccaaccatcg ttgtcgatgc tatggtggca agcaatgctc aagcaagcaa tacaagatga      4440
     ctctacgaag aagatagatg cacgaatgtt tgctgcgaac gcatgtacta gctccgttag      4500
     cggagatgcg ttcttgcgag caaacgccag tatggctggt gtcctaatca ctagcctaat      4560
     cacttcttca tcataacgta cagctacgaa aaaagtctct atagttgctc aagactacaa      4620
     tgagccagat ggccccgcta taccttcg                                         4648
//