ID FJ025998; SV 1; linear; genomic RNA; STD; VRL; 3629 BP. XX AC FJ025998; XX DT 15-MAY-2009 (Rel. 100, Created) DT 18-JUN-2009 (Rel. 101, Last updated, Version 2) XX DE Cassava brown streak virus isolate MLB9 polyprotein (pol) gene, partial DE cds. XX KW . XX OS Cassava brown streak virus OC Viruses; Riboviria; Potyviridae; Ipomovirus. XX RN [1] RP 1-3629 RX DOI; 10.1128/JVI.00537-09. RX PUBMED; 19386713. RA Mbanzibwa D.R., Tian Y., Mukasa S.B., Valkonen J.P.; RT "Cassava brown streak virus (Potyviridae) encodes a putative Maf/HAM1 RT pyrophosphatase implicated in reduction of mutations and a P1 proteinase RT that suppresses RNA silencing but contains no HC-Pro"; RL J. Virol. 83(13):6934-6940(2009). XX RN [2] RP 1-3629 RA Mbanzibwa D.R., Tian Y., Tugume A.K., Valkonen J.P.T.; RT ; RL Submitted (14-AUG-2008) to the INSDC. RL Applied Biology, University of Helsinki, Latokartnonkaari 7, Helsinki RL 00790, Finland XX DR MD5; 989237d068248c8e83c10a380e4db385. XX FH Key Location/Qualifiers FH FT source 1..3629 FT /organism="Cassava brown streak virus" FT /host="cassava" FT /isolate="MLB9" FT /mol_type="genomic RNA" FT /country="Tanzania" FT /collection_date="2007" FT /db_xref="taxon:137758" FT 5'UTR <1..130 FT gene 131..>3629 FT /gene="pol" FT CDS 131..>2254 FT /codon_start=1 FT /gene="pol" FT /product="polyprotein" FT /note="coding region disrupted by sequencing gap" FT /db_xref="InterPro:IPR025910" FT /db_xref="UniProtKB/TrEMBL:C4N217" FT /protein_id="ACM48166.1" FT /translation="MSTIQLFKTITFGSFEPVKLDEENNIIEKIPTDLMTGDDCFGPEE FT QSEQKYHRKESGESRQKVTDLYSVIGNSVYCRSYVAMKNFLNDTKWGGLFKNKKGQELK FT AAARLRRATSYGFMYDPVACAFECPICRMKATALEAFTSDCDHCFEIKHIEDDRIVQTD FT TKFYPINPIELEVDDGLVEAASLEWMKSDVEEKVVDRMLLLEDREIRVTKKALIKRRSE FT MKLVANVSDLTRKLTEICCESGIPIIDIDNCKRKAIPMVKLKHIFGKIECDDMFEDDRH FT FLEHKNASKIFRSCERITYKMIRPGWSGAIILKDNVQREDHEKFDFIDDICVVQGKNLI FT NNRIENAMRVKTETEMDLIDLYSFNLSWAKSKDKFIKHFEGDATQLIKSCCTPSTLWLY FT ARKARFYKFVDHMVLKGSPIIDILVKMEYVGKHLEMFNSIDDVCIEYAHFMKELISETV FT NDKSDPEVLRVRNIIRAHFESVLEYNKYELIDRVIEKKTQLEAQEIMSRELIKHQYGEL FT FSWRERICIKLGIGSSNLYTYWEEREEKHSEKSRAISFIISKPGIEMLINWASEVCKNK FT YHSLVNCVDSGVLFLWSRIVHLMKASVYGYWNLWFRQAMCVLFVFLISNFSGKLVSYLK FT KLIVNERKLAIKNEEGFVEVQGRKEESFVLKWCAVQTLFLSFLNYDWAVGCVSAIGKMK FT TIFGALGPDFIEKQ" FT mat_peptide 131..1216 FT /gene="pol" FT /product="P1 protein" FT mat_peptide 1217..2098 FT /gene="pol" FT /product="P3 protein" FT mat_peptide 2099..2254 FT /gene="pol" FT /product="6K1 protein" FT gap 2255..2354 FT /estimated_length=unknown FT CDS <2355..>3629 FT /codon_start=1 FT /gene="pol" FT /product="polyprotein" FT /note="coding region disrupted by sequencing gap" FT /db_xref="GOA:C4N218" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR002637" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR027502" FT /db_xref="InterPro:IPR029001" FT /db_xref="UniProtKB/TrEMBL:C4N218" FT /protein_id="ACM48167.1" FT /translation="GNNSGQPSTVVDNTIILMIVVEYCKAVMESEGKIMQFKYMCNGDD FT LILNVPDDEVSIIQSRFRNLFSECGLDYNFDDVHKSIETIEYMSHSFMLKDDIYIPKLK FT KERIVTILEWERGDEIMRTRSALNAAYIESYGYEDLMMEIERYAVFWAAEKGCEYPLLD FT RKRVEGLYKDDHTDINEEWLIGILPPSFEHCYVDTQTKDLREKENPELRIESYDGTSQM FT QMKFPVTFVTGNLGKLAEVRSILGIANDVVAKNIDLPEVQGTPDEVVRKKAQLAVKMTN FT SPVLVEDTCLCFNAFNGLPGPYIKWFLKELGLEGVVKMLSAFEDKTAYALCTFAYVHSE FT LSDPIVFKGVVNGVIVPPRGNNGFGWDPIFKPDGCGCTFAEMPSGIKNEFSHRRRALEK FT VKLFLDNQVVRQEKKRASVALTIDVQ" FT mat_peptide <2355..2951 FT /gene="pol" FT /product="NIb protein" FT mat_peptide 2952..3629 FT /gene="pol" FT /product="HAM1-like protein" XX SQ Sequence 3629 BP; 1166 A; 482 C; 845 G; 1036 T; 100 other; ataaacatga cataagaata cataaagata catcgcatac aaacactacg cattcaaact 60 cgtgagaaag caagctgttc caaaagttta cgatacatcc ttatatattc atagcactga 120 gtacctgaag atgtcaacca tccagctatt caagactatc acctttggtt cctttgagcc 180 tgtcaagctt gatgaagaga acaacattat agagaaaatt ccaactgatc ttatgacagg 240 agatgattgc ttcggtcctg aggagcaatc agaacagaaa taccatagga aggaaagcgg 300 tgaaagtcgg cagaaagtaa ccgatttata ttctgtgatt ggaaattctg tctactgtcg 360 gagttacgta gcaatgaaaa acttccttaa cgacacaaag tggggaggtc ttttcaaaaa 420 taagaaagga caagaactga aagctgcagc aaggcttagg agagcaacaa gctatggatt 480 tatgtatgat ccggtcgctt gtgcttttga atgccctatt tgtagaatga aagctacagc 540 attagaagca ttcacaagtg attgtgacca ttgctttgaa attaaacata ttgaggatga 600 taggattgtt caaactgata caaagtttta tccaataaat cctatcgagt tagaagtcga 660 tgatgggtta gttgaggcag cgtcgcttga atggatgaaa agtgacgtgg aggaaaaagt 720 tgttgatagg atgctattgc ttgaggatag agaaatccgt gtgaccaaga aagctttgat 780 aaagaggaga agtgaaatga aacttgttgc aaatgtttct gatctcacaa ggaaactaac 840 tgagatttgc tgcgagtcag gaatcccaat tatcgacata gataattgca agagaaaagc 900 cattccaatg gtgaaactta agcatatctt tgggaaaatt gagtgtgatg acatgtttga 960 agatgatcga catttcttag agcataaaaa tgcaagtaag attttccgat cgtgtgagag 1020 gatcacttat aagatgattc gtcctggatg gagtggggca attattctta aagacaatgt 1080 gcagagagaa gatcatgaaa agtttgattt cattgatgac atttgtgttg ttcaaggaaa 1140 gaatttgatt aacaatagaa ttgagaacgc catgcgtgtt aaaactgaaa cagagatgga 1200 tttgattgat ttatactctt ttaatctgag ttgggcaaag tcaaaggaca agttcattaa 1260 gcattttgaa ggtgacgcaa cccaactaat aaagtcatgt tgcactccta gtacgctatg 1320 gttatatgca aggaaggcta gattctataa gtttgtggat catatggtct tgaagggtag 1380 tccaataatt gatattcttg ttaagatgga gtatgtaggc aaacatctag aaatgtttaa 1440 tagcattgat gatgtgtgta ttgaatatgc ccattttatg aaggagttga ttagcgagac 1500 tgttaatgat aaatcagatc cagaagtgtt gcgtgttaga aacataatta gagcacattt 1560 tgagtcagtt ttagagtata ataagtatga gttgattgat agagtcatag agaaaaaaac 1620 acaattagaa gcacaagaaa ttatgtctcg agagctcatc aaacatcaat acggagaatt 1680 attttcttgg cgagaacgta tatgtataaa attaggtata ggatcatcaa atctatatac 1740 atattgggag gagcgagagg agaaacattc agagaaatcg agagcaatct ccttcatcat 1800 ttccaaaccg ggaatagaga tgttgatcaa ctgggcctca gaagtgtgta aaaataaata 1860 tcatagcctt gtaaattgtg ttgatagtgg tgttttgttt ttgtggtcta gaatagttca 1920 tttgatgaaa gcttcagttt atgggtattg gaatctttgg tttaggcaag caatgtgcgt 1980 gttatttgtt ttcttaattt caaacttttc tggtaaacta gttagttatc ttaagaaatt 2040 gattgttaac gagaggaagc tagcaattaa gaatgaagaa ggttttgttg aagtgcaagg 2100 tcggaaagaa gagtcatttg tcttgaaatg gtgtgctgtg caaactctat tcctcagttt 2160 cttgaactat gattgggcag tagggtgtgt gtcagctatt ggtaagatga agactatctt 2220 tggtgcactt ggcccagatt ttatagagaa acaannnnnn nnnnnnnnnn nnnnnnnnnn 2280 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2340 nnnnnnnnnn nnnnggtaat aacagtggtc agccatcaac ggttgtggac aacactataa 2400 ttctaatgat tgtcgtggaa tattgcaagg ctgtaatgga aagtgaaggg aaaatcatgc 2460 aattcaagta catgtgcaac ggagacgatt taatactcaa tgttcctgat gatgaggtga 2520 gcataatcca aagcaggttt agaaatttat tctcagagtg tggtttggat tataattttg 2580 atgatgtgca taagtctata gaaacaattg agtatatgag tcattcattc atgttgaaag 2640 atgatatata tattccaaag ctgaagaagg agaggatagt tacaatctta gaatgggaga 2700 gaggagatga gatcatgcga actcgtagtg ctctcaatgc tgcttatatt gaaagttatg 2760 gttacgaaga tcttatgatg gaaattgagc ggtatgcggt gttctgggct gccgagaaag 2820 gttgtgagta tccattgctg gatagaaagc gtgtggaagg actttacaag gatgatcata 2880 cagatatcaa tgaagaatgg ttgataggta ttttgccacc atcattcgaa cattgttatg 2940 ttgacacgca aactaaagat ttgagagaaa aagagaatcc tgagttgagg attgagagtt 3000 atgatggaac atctcaaatg caaatgaagt ttccagttac atttgtgaca ggaaatttgg 3060 gaaagttagc agaagtgaga tctattcttg gcattgcaaa tgatgttgta gctaagaaca 3120 ttgatttacc agaggtgcaa gggactccag atgaagttgt gagaaagaag gcacaattag 3180 cagtgaagat gactaatagt cctgttttag tcgaagatac gtgtctctgc tttaatgctt 3240 ttaatgggct tcctgggcca tacatcaaat ggttcttaaa agaattgggg cttgaaggtg 3300 ttgttaaaat gctatcagca tttgaagata aaacagcgta cgcgttatgt acatttgcat 3360 atgtacatag cgagttgtct gatccaattg tgtttaaagg agttgtgaat ggtgtgattg 3420 tgccaccgcg aggcaacaat ggctttggct gggatcctat atttaaacct gacgggtgtg 3480 gttgcacttt tgcagaaatg ccaagtggca ttaagaatga attttcccat agaagaagag 3540 ccctagagaa agttaaattg ttccttgata accaggtggt gaggcaagag aagaagaggg 3600 caagcgtggc tttaacgatt gatgttcag 3629 //