ID AM087225; SV 1; linear; genomic DNA; STD; VRL; 2744 BP. XX AC AM087225; XX DT 16-DEC-2005 (Rel. 86, Created) DT 16-DEC-2005 (Rel. 86, Last updated, Version 1) XX DE Rice tungro bacilliform virus for partial polyprotein, genomic DNA XX KW coat protein; IR region; movement protein; P194; polyprotein; protease. XX OS Rice tungro bacilliform virus OC Viruses; Ortervirales; Caulimoviridae; Tungrovirus. XX RN [1] RP 1-2744 RA Tandon V.; RT ; RL Submitted (21-SEP-2005) to the INSDC. RL Tandon V., Department of Plant Molecular Biology, Delhi University, South RL Campus, University of Delhi, Benito Juarez Road, New Delhi, 110021, INDIA. XX RN [2] RA Tandon V.; RT "Analysis of the complete nucleotide sequence of a representative Indian RT isolate of rice tungro bacilliform virus"; RL Unpublished. XX DR MD5; ad802b05859d94a21ff46e067fc3d352. XX FH Key Location/Qualifiers FH FT source 1..2744 FT /organism="Rice tungro bacilliform virus" FT /host="Oryza sativa" FT /mol_type="genomic DNA" FT /country="India:Orissa" FT /db_xref="taxon:10654" FT CDS <1..>2744 FT /codon_start=1 FT /product="polyprotein" FT /note="P194 (194KDa) polyprotein" FT /note="ORFIII" FT /db_xref="GOA:Q2UVX3" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR034728" FT /db_xref="UniProtKB/TrEMBL:Q2UVX3" FT /protein_id="CAJ32141.1" FT /translation="SIAIKTIGRLTTNIQARYKMNVKDIVEQISSQGITMVAPMEIDSS FT HLDGNEWNLSKFMIQEGTSRVPSKALIYQNLHGGESLRFSNYTQTKMHDPTEINSDEDE FT DLKILGEQLNAKMATFQEETLEQKLERIKEEKKQLLAKLEAKKKEIESKSLKMAVIEDD FT FNPNNEYLDDSYSELEDLEFEQLGLTGWEDLDQESLETEEITEWENPNQVLTREIKAFK FT SVSEQIEDIFGELLKEHGNYDMALKNLEERYDLKNLEGAKNLEEIAKASTSKMMDVKPV FT KRPKEEQTAYEDDMRDDWRRKELTANPEVSSKDRNFERIGGSYKKNFYPSKSEILNLDH FT VPPQFYYDQIITWEGIVKNEWEARKKDGMDMWSWMDGRITGMVLYLIQDWISKNQAAYN FT DIKSRGDRPENFVKMVKDRFLIEDPTDERRTALQRLALRELEALNCEDPVKIQPFMAEY FT LKKAAEAKKGFDVIYVERLFDRLPEAVGKLIKKEFLDAGNSYEAGIGVAVSYISTWMRL FT KCIKETEAKTQKKASLAFCRSIYTIGDYKKRKTLKRVTNYNRNKRKNYVRKPNIKRKCR FT CYICQDENHLANRCPRRYVNQARASMIEGLEEDIVSIASDDEDYENFFEIIELEEFLSK FT SGQQDHEHTWETGGKKEKTCDICDYYTDFNKTIVCKTCEIQYCTTCANQLGIEVEKTYK FT KSREEELYEELRKTVVNLDLRLTIVEHKLEMNKLQEQFDSLQLSKEPSSSETIKALAMQ FT AKESNFIKTNINRTAGCYVEVKLTFLNNSKVTTALIDSGSTHNIICPLLVPEIWIKSLN FT LDIVMTTIDNSKYSLNRGLHDEVKIQFKEVDESFGIKYNLGQTYVAPKPTGTFIIGHRF FT MTSEHGSITIHKDYVTIQKTTGIYPTARHELKSEFARKHGGQRP" FT mat_peptide <1..306 FT /product="movement protein" FT misc_feature 307..921 FT /note="IR region" FT mat_peptide 922..1449 FT /product="coat protein" FT /note="37 kDa protein" FT misc_feature 1500..2163 FT /note="IR region" FT mat_peptide 2164..>2744 FT /product="protease" XX SQ Sequence 2744 BP; 1199 A; 393 C; 490 G; 662 T; 0 other; tccatagcca taaaaactat aggaagatta acaacaaata tccaggccag atataaaatg 60 aatgttaaag acatcgtaga acaaatatca tcccaaggaa taactatggt agcacccatg 120 gaaatagatt catcacattt agatgggaat gaatggaatt taagtaaatt catgattcaa 180 gaaggaacaa gtagagttcc tagtaaagcc ctaatatatc aaaatctgca tggaggagaa 240 tcacttagat tttcaaacta tacacaaacc aaaatgcatg atccaacaga aataaattct 300 gatgaagacg aagatttaaa aattttagga gaacaattaa atgccaaaat ggcaaccttt 360 caagaagaaa ccctagaaca aaaattagaa cgcataaaag aagaaaagaa acaacttcta 420 gcaaaactag aagctaaaaa gaaggaaatt gaatcaaaat ccttaaaaat ggcggtcata 480 gaagatgact ttaacccaaa caacgaatat ttagacgatt catattctga attagaagat 540 ctagaattcg aacaattagg attaactggt tgggaagatc tagatcaaga aagtctagaa 600 acagaagaaa ttaccgaatg ggaaaaccca aaccaagtcc taactagaga aataaaagcc 660 tttaaatcag tatctgaaca aatagaagat atatttggag aattattaaa agaacatgga 720 aattatgaca tggcccttaa aaatttagaa gaaagatatg atctaaaaaa cttagaagga 780 gccaaaaacc tagaagaaat agctaaagca tccacatcaa aaatgatgga tgtaaaacca 840 gtaaaaagac caaaagaaga acaaacggca tatgaagatg atatgagaga tgattggaga 900 agaaaagaat taacagccaa tccagaagta tcctcaaaag atagaaactt tgaaagaata 960 ggaggatcat ataagaaaaa tttttaccct agcaaaagtg aaatcctaaa cctagaccat 1020 gtaccccccc agttctacta tgatcaaatc ataacttggg aaggaatagt aaaaaatgaa 1080 tgggaagcaa gaaaaaagga tggtatggac atgtggtctt ggatggatgg aagaataaca 1140 ggaatggttt tatatctaat acaagattgg atatctaaaa accaagccgc ctacaatgat 1200 ataaaatctc gaggagatag acccgaaaat ttcgtaaaaa tggtaaaaga taggttctta 1260 atagaagatc ctacagatga aaggagaaca gccttacaaa gattagctct aagagaatta 1320 gaagctttaa actgtgagga tccagttaaa attcagccat ttatggcaga ataccttaag 1380 aaagctgctg aagctaaaaa aggatttgat gtcatatatg tcgaaagact atttgacaga 1440 cttcctgagg cagtaggaaa attaataaaa aaggaatttc tagatgcagg aaattcatat 1500 gaagcaggca taggagtagc tgtttcatat atatccacat ggatgagact aaaatgcata 1560 aaagaaacag aagctaaaac acaaaagaaa gcatcattag cattctgtcg atctatatat 1620 actataggtg attataagaa aagaaagaca ctaaaacgtg ttacgaatta caatagaaat 1680 aaaagaaaaa attatgttag aaaacctaac ataaaaagaa agtgtagatg ttatatctgc 1740 caggatgaaa accacctagc aaatagatgt cctagaagat atgtcaatca agctagagct 1800 agcatgattg aaggactaga agaagatata gtgtccatag cttcagatga tgaagactat 1860 gagaacttct ttgaaataat tgaattagaa gagtttttga gtaaatcagg acaacaagat 1920 cacgagcaca cctgggaaac aggaggtaag aaagaaaaaa cttgtgatat ttgtgattat 1980 tatactgatt tcaataaaac catagtatgc aaaacatgtg aaatacaata ttgtactact 2040 tgtgcaaatc aacttggtat agaagtagaa aaaacatata agaaatctag agaagaagaa 2100 ttatatgagg aattaagaaa aacagttgtc aacctagatc taagattgac aatagttgaa 2160 cataaactag aaatgaataa attacaagaa caatttgatt ccttgcaatt atctaaagaa 2220 cctagttctt cagagaccat aaaagcctta gccatgcaag caaaagaatc aaatttcata 2280 aaaaccaata ttaatagaac agcaggatgt tatgtagaag taaagttaac atttttaaat 2340 aactctaaag taaccactgc attaatagat tctggttcca cacataatat catatgtcct 2400 ttattagtac cagaaatatg gattaaaagt ttgaatttag atattgttat gaccacaata 2460 gataatagta aatatagcct caacagaggt ctacatgatg aagtaaaaat acaatttaaa 2520 gaagtagatg aaagttttgg gataaaatac aacttaggac aaacttatgt tgctcctaaa 2580 cctacaggaa cttttataat tggacatagg tttatgacca gtgaacatgg gagtattaca 2640 atccataaag actatgttac aatacaaaaa accacgggaa tttaccccac agctcgtcat 2700 gaactcaaat cagagtttgc gcgaaagcat ggtggacaaa gacc 2744 //