ID FJ025999; SV 1; linear; genomic RNA; STD; VRL; 3587 BP. XX AC FJ025999; XX DT 15-MAY-2009 (Rel. 100, Created) DT 18-JUN-2009 (Rel. 101, Last updated, Version 2) XX DE Cassava brown streak virus isolate LWR2 polyprotein (pol) gene, partial DE cds. XX KW . XX OS Cassava brown streak virus OC Viruses; Riboviria; Potyviridae; Ipomovirus. XX RN [1] RP 1-3587 RX DOI; 10.1128/JVI.00537-09. RX PUBMED; 19386713. RA Mbanzibwa D.R., Tian Y., Mukasa S.B., Valkonen J.P.; RT "Cassava brown streak virus (Potyviridae) encodes a putative Maf/HAM1 RT pyrophosphatase implicated in reduction of mutations and a P1 proteinase RT that suppresses RNA silencing but contains no HC-Pro"; RL J. Virol. 83(13):6934-6940(2009). XX RN [2] RP 1-3587 RA Mbanzibwa D.R., Tian Y., Tugume A.K., Valkonen J.P.T.; RT ; RL Submitted (14-AUG-2008) to the INSDC. RL Applied Biology, University of Helsinki, Latokartnonkaari 7, Helsinki RL 00790, Finland XX DR MD5; 68dba6b3cb9ba0b6b3d3f2e8866291cf. XX FH Key Location/Qualifiers FH FT source 1..3587 FT /organism="Cassava brown streak virus" FT /host="cassava" FT /isolate="LWR2" FT /mol_type="genomic RNA" FT /country="Uganda" FT /collection_date="2007" FT /db_xref="taxon:137758" FT 5'UTR <1..130 FT gene 131..>3587 FT /gene="pol" FT CDS 131..>2254 FT /codon_start=1 FT /gene="pol" FT /product="polyprotein" FT /note="coding region disrupted by sequencing gap" FT /db_xref="InterPro:IPR025910" FT /db_xref="UniProtKB/TrEMBL:C4N219" FT /protein_id="ACM48168.1" FT /translation="MSTIQLFKTIRFGSFEPVKLDEGNNVIEKVPIDLLAGNDGSGPEE FT QSEQKYHRKESGESWRKVTDLYSVIGNSVYCRSYVAMKNFLNDTKWGGLFKNKKGQELK FT AAARLRRATSYGFMYDPVACAFECPVCRTKATALEAFTSDCDHCFEVKHIDDDRIVQTE FT TKFYPINPIELDVEDNLVEAASLEWLKGEVEERVVDRVLLLEDKEIRVTKKALVKRRAE FT TKLVANVADLTKKLTEICCESGIPIIDIDNSKRKAIPMVKLKHIFGKIECDDMFGEDRY FT FLEHSNAGKIFRSCERITYRMIRPGWSGAIILRDNVQGDDHDKFDFINDICVVQGKNLI FT SNKIENAMRVKTETEMDLIDLYSFNLSWAKSKDKFIKHFESDTTQLIRSCCTPSTLWLY FT ARKARFYKFVDHMILKGSPIIDILVKMEYVGKHLDLFNSVEDVCTEYSHFMKELISETV FT NDKSDPDVLRVRNLIRAHFESVLEYNKYELIDRIIEKKTQLEAQEIMSRELIRHQYGEL FT FSWRERLYLKLGIGSSNLYTYWVEREEKQSEKSSAISCIISKPGIEMLINWVSEVCKSK FT YHSLVNCVDSGILFLWSRIVHLARVSVYGYWNLWFRQAMCVLFIFLVSNFSCKIVNYLK FT TLIVNEKKLAIKNEEGFVEVQGRKEESFVLKWCAAATLFLSFLNYDWAVGCVSAIGKMK FT TIFSALGPDFIEKQ" FT mat_peptide 131..1216 FT /gene="pol" FT /product="P1 protein" FT mat_peptide 1217..2098 FT /gene="pol" FT /product="P3 protein" FT mat_peptide 2099..2254 FT /gene="pol" FT /product="6K1 protein" FT gap 2255..2354 FT /estimated_length=unknown FT CDS <2355..>3587 FT /codon_start=1 FT /gene="pol" FT /product="polyprotein" FT /note="coding region disrupted by sequencing gap" FT /db_xref="GOA:C4N220" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR002637" FT /db_xref="InterPro:IPR027502" FT /db_xref="InterPro:IPR029001" FT /db_xref="UniProtKB/TrEMBL:C4N220" FT /protein_id="ACM48169.1" FT /translation="IILMIVVEYCKAVMESEGREMQFKYMCNGDDLILNVPDDEVSIIQ FT SRFRELFSECGLDYNFDDVHKSIETIEYMSHSFMLKDDVYIPKLKKERIVAILEWERGD FT EIMRTRSALNAAYIESYGYDDLMVEIERYAVFWATEKGCEYPLLDRKRVEGLYKDDHTD FT INEEWLRGILPPSFEHCYVDTQTKDLRREEQPELRTESYDRTTQMQMKFPVTFVTGNFG FT KLAEVKSILGIANDVIARNIDLPEVQGTPDEIVRKKAQLAVKMTNSPVLVEDTCLCFNA FT FNGLPGPYIKWFLKELGLEGVVKMLSAFGDKSAYALCTFAYVHNESSDPIVFKGVVNGE FT IVPPRGNNGFGWDPIFKPDECSCTFAEMSSSMKNDFSHRRRALEKVKSFLDNLVVKQEE FT KKARVALTIDVQ" FT mat_peptide <2355..2909 FT /gene="pol" FT /product="NIb protein" FT mat_peptide 2910..3587 FT /gene="pol" FT /product="HAM1-like protein" XX SQ Sequence 3587 BP; 1126 A; 492 C; 841 G; 1028 T; 100 other; ataaacagga cataagaata cataaagata catcgcatac aaacactacg cattcaaatt 60 cgtgagaaag caagctgtta cgaaagttta cgatacatca ttatatattc atagcactga 120 atacctacag atgtcgacta tccagctatt caagactatc agatttggtt cctttgagcc 180 tgtcaagctt gatgaaggaa acaatgttat agagaaggtc ccaattgatc ttctggcagg 240 aaatgatggc tctggtcctg aggagcaatc ggaacagaaa taccatagga aagaaagcgg 300 tgaaagttgg cggaaagtaa ccgacttata ttctgtaatt ggaaattctg tttactgccg 360 cagttacgta gcaatgaaaa actttctcaa tgatacaaag tggggaggtc ttttcaaaaa 420 taagaagggg caagaactaa aggctgctgc aaggcttaga agggctacaa gctatggatt 480 catgtatgat ccagttgctt gtgcctttga atgccctgtt tgcagaacaa aggcaacagc 540 attggaagca tttacaagcg attgtgatca ttgtttcgaa gtcaagcaca ttgatgatga 600 taggatcgtt cagactgaaa caaagtttta tccaataaat cctattgaac tggatgttga 660 ggataactta gtggaagctg catcgctcga gtggttgaaa ggtgaagtgg aagagagagt 720 tgtagacagg gttttactac ttgaggacaa ggagattcgt gttacaaaga aagctctagt 780 taaaaggaga gctgagacaa aactagtggc taatgttgct gatcttacca agaaattgac 840 tgaaatttgc tgtgagtctg gtatccctat cattgatatt gataacagca agaggaaagc 900 cattccgatg gtcaagctca agcatatctt tgggaaaatt gagtgtgatg acatgtttgg 960 ggaagatcgg tatttcctgg agcatagcaa tgcagggaaa attttccgat catgtgagag 1020 aattacttat aggatgattc gtcctggatg gagtggagca atcattctaa gagacaatgt 1080 tcagggagat gatcatgata agttcgactt tatcaatgat atttgtgttg ttcaagggaa 1140 gaatctaatt agcaacaaaa ttgagaacgc catgcgcgtc aagactgaaa cagaaatgga 1200 cttaattgat ttgtattctt tcaatcttag ttgggcgaaa tcaaaagata agttcattaa 1260 gcactttgaa agtgatacaa ctcaattaat tagatcgtgt tgcactccta gtacattgtg 1320 gttgtacgca aggaaagcca ggttttacaa atttgtggat cacatgatct taaagggtag 1380 tccaataatt gatatcctag ttaaaatgga gtatgttggc aagcatttgg acttgtttaa 1440 tagtgtcgaa gatgtgtgca ctgagtattc tcattttatg aaggagttaa ttagtgaaac 1500 tgtcaacgat aaatcagatc cagatgtgtt gcgcgttaga aatttgataa gagcgcattt 1560 tgaatcagtc ttagaatata ataagtatga gttgatagat aggataattg agaaaaaaac 1620 tcagttagaa gcacaagaaa ttatgtctcg cgagctcatc agacatcagt acggagagtt 1680 attttcttgg cgcgaacgtt tgtatttgaa actaggtata ggctcatcga atctctatac 1740 atactgggtg gaacgcgaag agaaacaatc agagaaatcg agtgcaatct cttgcatcat 1800 ttccaaaccg ggaatagaga tgttgataaa ttgggtctca gaagtgtgta agagtaaata 1860 tcatagtctt gtaaattgtg tcgatagtgg tattttgttc ttgtggtcta gaatagttca 1920 tttagcgaga gtctcagttt atggttattg gaatctttgg tttagacaag ctatgtgtgt 1980 gttgttcatt ttcttagttt caaatttctc ttgtaaaata gttaattatc ttaagacatt 2040 aattgtgaat gagaagaagc tagctataaa gaatgaagaa ggtttcgttg aagtacaagg 2100 ccgaaaggaa gagtcttttg ttttaaagtg gtgtgctgct gcaactctgt ttcttagttt 2160 tctaaactat gattgggcag ttggatgtgt atcagccatt ggaaagatga aaacaatatt 2220 tagtgcacta ggtccagatt tcatagaaaa gcaannnnnn nnnnnnnnnn nnnnnnnnnn 2280 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2340 nnnnnnnnnn nnnnattata ctaatgattg ttgtggaata ttgcaaggct gtcatggaga 2400 gtgagggaag agaaatgcaa ttcaagtaca tgtgtaatgg agatgacttg attctcaatg 2460 ttcctgatga tgaagtgagt ataatccaga gtaggttcag agagttgttt tcagaatgtg 2520 gtttggatta taattttgat gatgttcaca agtcaataga gactattgag tatatgagcc 2580 attcattcat gctgaaagat gatgtgtaca ttccaaagtt gaagaaagaa agaattgttg 2640 caattcttga atgggagaga ggtgatgaga ttatgcgaac acggagtgct cttaatgctg 2700 cttatatcga aagttatgga tatgatgatc ttatggttga aattgagcgg tacgcggttt 2760 tctgggctac cgaaaaaggt tgtgagtatc cattactaga taggaagcgt gtggaaggac 2820 tttacaaaga tgatcacaca gatattaatg aagaatggct gagaggtatt ttaccaccat 2880 cattcgagca ttgttatgtt gacacgcaaa ctaaggattt gagaagggaa gagcagcctg 2940 agttgagaac tgaaagttat gacagaacaa cccaaatgca gatgaagttc ccagtcacat 3000 ttgtgacagg aaattttgga aagttagctg aagtgaagtc tattcttggc attgcaaatg 3060 atgttatagc taggaacatt gatttaccag aagtgcaagg aaccccagat gagattgtga 3120 gaaagaaggc tcaattagca gtgaagatga ctaatagtcc tgttttagtc gaagacacgt 3180 gtctttgttt caatgctttt aatggactgc ctgggccata catcaaatgg tttctaaaag 3240 aattgggtct cgaaggtgtt gtcaaaatgc tgtcagcatt tggagataag tcagcatacg 3300 cactatgtac atttgcatat gtgcacaacg agtcgtctga tccaattgtg tttaaaggag 3360 ttgtgaatgg tgaaattgtg ccaccgcgag gtaacaatgg ttttggttgg gatcctatat 3420 tcaaacctga tgaatgtagt tgcacgtttg cagaaatgtc aagcagcatg aagaatgatt 3480 tttcccatag aagaagagct ttagagaaag ttaaatcgtt tcttgataac ttggtggtga 3540 agcaagagga gaagaaggca agggtggctc taacgattga tgttcag 3587 //