ID FJ026000; SV 1; linear; genomic RNA; STD; VRL; 3418 BP. XX AC FJ026000; XX DT 15-MAY-2009 (Rel. 100, Created) DT 18-JUN-2009 (Rel. 101, Last updated, Version 2) XX DE Cassava brown streak virus isolate IGA8 polyprotein (pol) gene, partial DE cds. XX KW . XX OS Cassava brown streak virus OC Viruses; Riboviria; Potyviridae; Ipomovirus. XX RN [1] RP 1-3418 RX DOI; 10.1128/JVI.00537-09. RX PUBMED; 19386713. RA Mbanzibwa D.R., Tian Y., Mukasa S.B., Valkonen J.P.; RT "Cassava brown streak virus (Potyviridae) encodes a putative Maf/HAM1 RT pyrophosphatase implicated in reduction of mutations and a P1 proteinase RT that suppresses RNA silencing but contains no HC-Pro"; RL J. Virol. 83(13):6934-6940(2009). XX RN [2] RP 1-3418 RA Mbanzibwa D.R., Tian Y., Tugume A.K., Valkonen J.P.T.; RT ; RL Submitted (14-AUG-2008) to the INSDC. RL Applied Biology, University of Helsinki, Latokartnonkaari 7, Helsinki RL 00790, Finland XX DR MD5; ff3ef1d057bfb78f6086a38e5d301a89. XX FH Key Location/Qualifiers FH FT source 1..3418 FT /organism="Cassava brown streak virus" FT /host="cassava" FT /isolate="IGA8" FT /mol_type="genomic RNA" FT /country="Uganda" FT /collection_date="2007" FT /db_xref="taxon:137758" FT 5'UTR <1..81 FT gene 82..>3418 FT /gene="pol" FT CDS 82..>2205 FT /codon_start=1 FT /gene="pol" FT /product="polyprotein" FT /note="coding region disrupted by sequencing gap" FT /db_xref="InterPro:IPR025910" FT /db_xref="UniProtKB/TrEMBL:C4N221" FT /protein_id="ACM48170.1" FT /translation="MSTIQLFKTIRFGSFEPVKLDEGNNVIEKVPIDLLAGNDGSGPEE FT QSEQKYHRKESGESWRKVTDLYSVIGNSVYCRSYVAMKNFLNDTKWGGLFKNKKGQELK FT AAARLRRATSYGFMYDPVACAFECPVCRTKATALEAFTSDCDHCFEVKHIDDDRIVQTE FT TKFYPINPIELDVEDNLVEAASLEWLKGEVEERVVDRVLLLEDKEIRVTKKALVKRRAE FT TKLVANVADLTKKLTEICCESGIPIIDIDNSKRKAIPMVKLKHIFGKIECDDMFGEDRY FT FLEHSNAGKIFRSCERITYRMIRPGWSGAIILRDNVQGEDHDKFDFINDICVVQGKNLI FT SNKIENAMRVKTETEMDLIDLYSFNLSWAKSKDKFIKHFESDTTQLIRSCCTPSTLWLY FT ARKARFYKFVDHMILKGSPIIDILVKMEYVGKHLDLFNSVEDVCTEYSHFMKELISETV FT NDKSDPDVLRVRNLIRAHFESVLEYNKYELIDRIIEKKTQLEAQEIMSRELIRHQYGEL FT FSWRERLYLKLGIGSSNLYTYWVEREEKQSEKSSAISCIISKPGIEMLINWVSEVCKSK FT YHSLVNCVDSGILFLWSRIVHLARVSVYGYWNLWFRQAMCVLFIFLVSNFSCKIVNYLK FT TLIVNEKKLAIKNEEGFVEVQGRKEESFVLKWCAAATLFLSFLNYDWAVGCVSAIGKMK FT TIFSALGPDFIEKQ" FT mat_peptide 82..1167 FT /gene="pol" FT /product="P1 protein" FT mat_peptide 1168..2049 FT /gene="pol" FT /product="P3 protein" FT mat_peptide 2050..2205 FT /gene="pol" FT /product="6K1 protein" FT gap 2206..2305 FT /estimated_length=unknown FT CDS <2306..>3418 FT /codon_start=1 FT /gene="pol" FT /product="polyprotein" FT /note="coding region disrupted by sequencing gap" FT /db_xref="GOA:C4N222" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR002637" FT /db_xref="InterPro:IPR027502" FT /db_xref="InterPro:IPR029001" FT /db_xref="UniProtKB/TrEMBL:C4N222" FT /protein_id="ACM48171.1" FT /translation="VSIIQSRFRELFSECGLDYNFDDVHKSIETIEYMSHSFMLKDDVY FT IPKLKKERIVAILEWERGDEIMRTRSALNAAYIESYGYDDLMVEIERYAVFWATEKGCE FT YPLLDRKRVEGLYKDDHTDINEEWLRGILPPSFEHCYVDTQTKDLRREEQRELRPESYD FT RTTQMQMKFPVTFVTGNLGKLAEVKSILGIANDVIARNIDLPEVQGTPDEIVRKKAQLA FT VKMTNSPVLVEDTCLCFNAFNGLPGPYIKWFLKELGLEGVVKMLSAFGDKSAYALCTFA FT YVHNESSDPIVFKGVVNGEIVPPRGNNGFGWDPIFKPDECSCTFAEMSSGMKNDFSHRR FT RALEKVKSFLDNLVVKQEEKKARVALTIDVQ" FT mat_peptide <2306..2740 FT /gene="pol" FT /product="NIb protein" FT mat_peptide 2741..3418 FT /gene="pol" FT /product="HAM1-like protein" XX SQ Sequence 3418 BP; 1061 A; 469 C; 804 G; 984 T; 100 other; gcattcaaat tcgtgagaaa gcaagctgtt acgaaagttt acgatacatc cttatatatt 60 catagcactg aatacctgca gatgtcgact atccagctat tcaagactat cagatttggt 120 tcctttgagc ctgtcaagct tgatgaagga aacaatgtta tagagaaggt cccaattgat 180 cttctggcag gaaatgatgg ctctggtcct gaggagcaat cggaacagaa ataccatagg 240 aaagaaagcg gtgaaagttg gcggaaagta accgacttat attctgtaat tggaaactct 300 gtttactgcc gcagttacgt agcaatgaaa aactttctca atgatacaaa gtggggaggt 360 cttttcaaaa ataagaaggg gcaagaacta aaggctgctg caaggctcag aagggctaca 420 agctatggat tcatgtatga tccagttgct tgtgcctttg aatgccctgt ttgcagaaca 480 aaagctacag cattggaagc atttacaagc gattgtgatc attgtttcga agtcaagcac 540 attgatgatg ataggatcgt tcaaactgaa acaaagtttt atccaataaa tcctattgaa 600 ctggatgttg aggataactt agtggaagct gcatcgctcg agtggttgaa aggtgaagtg 660 gaagagagag ttgtagacag ggttttacta cttgaggaca aggagattcg tgttacaaag 720 aaagctctag tcaaaaggag agctgagaca aaactagtgg ctaacgttgc tgatcttacc 780 aagaaattga ctgaaatttg ctgtgagtct ggtatcccta tcattgacat tgataacagc 840 aagaggaaag ccattccgat ggtcaaactc aagcatatct ttgggaaaat tgagtgtgat 900 gacatgtttg gggaagatcg gtatttcctg gagcatagca atgcagggaa aattttccga 960 tcatgtgaga gaattactta taggatgatt cgtcctggat ggagtggagc aatcattcta 1020 agagacaatg ttcagggaga ggatcatgat aagtttgact ttatcaatga tatttgtgtt 1080 gttcaaggga agaatctaat aagcaacaaa attgagaacg ccatgcgcgt caagactgaa 1140 acagaaatgg acttaattga tttgtattct ttcaatctta gttgggcgaa atcaaaagat 1200 aagttcatta agcactttga aagtgataca actcaattaa ttagatcgtg ttgcactcct 1260 agtacattgt ggttgtacgc aaggaaagcc aggttttaca aatttgtgga tcacatgatc 1320 ttaaagggta gtccaataat tgatattcta gttaaaatgg agtatgttgg caaacatttg 1380 gatttgttta atagtgtcga agatgtgtgc actgagtatt ctcattttat gaaggagtta 1440 attagtgaaa ctgtcaacga taaatcagat ccagatgtgt tgcgcgttag aaatttgata 1500 agagcgcatt ttgagtcagt tttagaatac aataagtatg agttgataga taggataatt 1560 gagaaaaaaa ctcagttaga agcacaagaa attatgtctc gcgagctcat cagacatcag 1620 tacggagagt tattttcttg gcgcgaacgt ttgtatttga aactaggtat aggctcatcg 1680 aatctctata catactgggt ggaacgcgaa gagaaacaat cagagaaatc gagtgcaatc 1740 tcttgcatca tttccaaacc gggaatagag atgttgatca attgggtctc agaagtgtgt 1800 aagagtaaat atcatagtct tgtaaattgt gtcgatagtg gtattttgtt cttgtggtct 1860 agaatagttc atttagcgag agtttcagtt tatggttatt ggaatctttg gtttagacaa 1920 gctatgtgtg tgttgttcat tttcttagtt tcaaatttct cttgtaaaat agttaattat 1980 cttaagacat taattgtgaa tgagaagaag ctagctataa agaatgaaga aggtttcgtt 2040 gaagtacaag gccgaaaaga agagtctttt gttttaaagt ggtgtgctgc tgcaactctg 2100 tttcttagtt ttctaaacta tgattgggca gttggatgtg tatcagccat tggaaagatg 2160 aaaacaatat tcagtgcact aggtccagat ttcatagaaa agcaannnnn nnnnnnnnnn 2220 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2280 nnnnnnnnnn nnnnnnnnnn nnnnngtgag tataatccag agtaggttca gagagttgtt 2340 ttcagaatgt ggtttggatt ataattttga tgatgttcac aagtcaatag agactattga 2400 gtatatgagc cattcattca tgctgaaaga tgatgtgtac attccaaagt tgaagaaaga 2460 aagaattgtt gcaattcttg aatgggaaag aggtgatgag attatgcgaa cacggagtgc 2520 tcttaatgct gcttatatcg aaagttatgg atatgatgat cttatggttg aaattgagcg 2580 gtacgcggtt ttctgggcta ccgaaaaagg ttgtgagtat ccattattag ataggaagcg 2640 tgtggaagga ctttacaaag atgatcacac agatattaat gaagaatggc tgagaggtat 2700 tttaccacca tcattcgaac attgttatgt tgacacgcaa actaaggatt tgagaaggga 2760 agagcagcgt gagttgagac ctgaaagtta tgacagaaca acccaaatgc agatgaagtt 2820 cccagttaca tttgtgacag gaaatttggg gaagttagct gaagtgaagt ctattcttgg 2880 cattgcaaat gatgttatag ccaggaacat tgatttacca gaagtgcaag gaaccccaga 2940 tgagattgtg agaaagaagg ctcaattagc agtgaagatg actaatagtc ctgttttagt 3000 cgaagacacg tgtctttgtt tcaatgcttt taatggactg cctgggccat acatcaaatg 3060 gtttttaaaa gaattgggtc ttgaaggtgt tgtcaaaatg ctgtcagcat ttggagataa 3120 gtcagcgtac gcactatgta catttgcata tgtgcacaat gagtcgtctg atccaattgt 3180 gtttaaagga gttgtgaatg gtgaaattgt gccaccgcga ggtaataatg gttttggttg 3240 ggatcctata ttcaaacctg atgaatgtag ttgtacgttt gcagaaatgt caagcggcat 3300 gaagaatgat ttttcccata gaagaagagc tttagagaaa gtcaaatcat ttcttgataa 3360 cttggtggtg aagcaagagg agaagaaggc aagggtggct ctaacgattg atgttcag 3418 //