ID   FJ026001; SV 1; linear; genomic RNA; STD; VRL; 3467 BP.
XX
AC   FJ026001;
XX
DT   15-MAY-2009 (Rel. 100, Created)
DT   18-JUN-2009 (Rel. 101, Last updated, Version 2)
XX
DE   Cassava brown streak virus isolate BSA4 polyprotein (pol) gene, partial
DE   cds.
XX
KW   .
XX
OS   Cassava brown streak virus
OC   Viruses; Riboviria; Potyviridae; Ipomovirus.
XX
RN   [1]
RP   1-3467
RX   DOI; 10.1128/JVI.00537-09.
RX   PUBMED; 19386713.
RA   Mbanzibwa D.R., Tian Y., Mukasa S.B., Valkonen J.P.;
RT   "Cassava brown streak virus (Potyviridae) encodes a putative Maf/HAM1
RT   pyrophosphatase implicated in reduction of mutations and a P1 proteinase
RT   that suppresses RNA silencing but contains no HC-Pro";
RL   J. Virol. 83(13):6934-6940(2009).
XX
RN   [2]
RP   1-3467
RA   Mbanzibwa D.R., Tian Y., Tugume A.K., Valkonen J.P.T.;
RT   ;
RL   Submitted (14-AUG-2008) to the INSDC.
RL   Applied Biology, University of Helsinki, Latokartnonkaari 7, Helsinki
RL   00790, Finland
XX
DR   MD5; ba38b5c3c57ca60900ed5839f88a1b3e.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3467
FT                   /organism="Cassava brown streak virus"
FT                   /host="cassava"
FT                   /isolate="BSA4"
FT                   /mol_type="genomic RNA"
FT                   /country="Uganda"
FT                   /collection_date="2007"
FT                   /db_xref="taxon:137758"
FT   5'UTR           <1..130
FT   gene            131..>3467
FT                   /gene="pol"
FT   CDS             131..>2254
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="polyprotein"
FT                   /note="coding region disrupted by sequencing gap"
FT                   /db_xref="InterPro:IPR025910"
FT                   /db_xref="UniProtKB/TrEMBL:C4N223"
FT                   /protein_id="ACM48172.1"
FT                   /translation="MSTIQLFKTIRFGSFEPVKLDEGNNVVEKVPVDLLAGNDGSGPEG
FT                   QSEQKYHRKESGESWRKVTDLYSVIGNSVYCRSYVAMKNFLNDTKWGGLFKNKKGQELK
FT                   AAARLRRATSYGFMYDPVACAFECPVCRTKATALEAFTSDCDHCFEIKHIDDDRIVQTE
FT                   TRFYPINPIELDVEDNLVEAASLEWLGSDVEERVVDRVLLLEDKEIRVTKKALIKRRGE
FT                   TRLVANVSDLTRKLTEICCESGIPIIDIDNSKRKAIPMVKLKHIFGKIECDDMFEEDRY
FT                   FLEHKNAGKIFRSCEKITYRMIRPGWSGAIILRDNVQGEDRDKFDFINDICVVQGKNLI
FT                   SNKIENAMRVKTETEIDLIDLYSFNLSWAKSKDKFIKHFESDITQLVRSCCTPSTLWLY
FT                   ARKARFYKFVDYMILKGSPIIDILVKMEYVGKHLDMFNSVEDVCTEYSHFMKEIISETV
FT                   NDKSDPDVLRVRNLIRAHFESVLEYNKYELIDRIIEKKTQLEAQEIMSRELIKHQYGEL
FT                   FSWRERLCLKLGIGSSNLYTYWVEREEKQSEKSSAISCIISKPGIEMLINWISEVCKSK
FT                   YHSLVNCVDSGILFLWSRIVHLARVSVYGYWNLWFRQAMCVLFIFLVSNFSSKIVGYLK
FT                   KLIVSEKKLAIKNEAGFVEVQGRKEESFVLKWCAAATLFLSFLNYDWAVGCVSAIGKMK
FT                   TIFSALGPDFIEKQ"
FT   mat_peptide     131..1216
FT                   /gene="pol"
FT                   /product="P1 protein"
FT   mat_peptide     1217..2098
FT                   /gene="pol"
FT                   /product="P3 protein"
FT   mat_peptide     2099..2254
FT                   /gene="pol"
FT                   /product="6K1 protein"
FT   gap             2255..2354
FT                   /estimated_length=unknown
FT   CDS             <2355..>3467
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="polyprotein"
FT                   /note="coding region disrupted by sequencing gap"
FT                   /db_xref="GOA:C4N224"
FT                   /db_xref="InterPro:IPR001205"
FT                   /db_xref="InterPro:IPR002637"
FT                   /db_xref="InterPro:IPR027502"
FT                   /db_xref="InterPro:IPR029001"
FT                   /db_xref="UniProtKB/TrEMBL:C4N224"
FT                   /protein_id="ACM48173.1"
FT                   /translation="VSIIQSRFRELFSECGLDYNFDDVHKSIETIEYMSHSFMLKDDVY
FT                   IPKLKKERIVAILEWERGDEIMRTRSALNAAYIESYGYDDLMVEIERYAVFWATEKGCE
FT                   YPLLDRKRVEGLYKDDYTDINEEWLVGILPPSFEHCYVDTQTEDLRGSEKLELRTESRD
FT                   RISQMQMKFPITFVTGNLGKLAEVKSILGISSDVMAKNIDLPEVQGTPDEIVIKKAQLA
FT                   VKMTNSPVLVEDTCLCFNAFNGLPGPYIKWFLKELGLEGVVKMLSAFGDKSAYALCTFA
FT                   YVHNELSDPVVFKGVVNGEIVPPRGNNGFGWDPIFKPDECSCTFAEMPSSFKNDFSHRR
FT                   RALEKVKLFLDDLVVKQEKKEARVALTIDVQ"
FT   mat_peptide     <2355..2789
FT                   /gene="pol"
FT                   /product="NIb protein"
FT   mat_peptide     2790..3467
FT                   /gene="pol"
FT                   /product="HAM1-like protein"
XX
SQ   Sequence 3467 BP; 1086 A; 464 C; 809 G; 1008 T; 100 other;
     ataaacatga cataagaata cataaagata catcgcatac aaacactgtg cattcaaatt        60
     tgtgagaaag caagctgtta caaaagttta cgatacatcc ttatatattt atagcactga       120
     atacctgcag atgtcgacta tccagctatt caagactatc agatttggtt cctttgaacc       180
     tgtcaaactc gatgaaggaa acaatgttgt agagaaggtc ccagttgatc ttctggcagg       240
     aaatgatggc tctggtcctg aggggcaatc agaacagaaa taccatagga aagaaagcgg       300
     tgaaagttgg cggaaagtaa ctgacttata ttctgtaatt ggaaattctg tttactgccg       360
     cagttacgta gcaatgaaaa acttccttaa tgatacaaaa tggggaggtc ttttcaagaa       420
     taaaaagggg caagaattaa aagctgctgc aaggctcaga agggccacaa gctatggatt       480
     catgtatgat ccagttgctt gtgcctttga atgtcctgtt tgtaggacaa aagcaacagc       540
     attggaagcg ttcacaagcg attgtgacca ttgttttgaa atcaagcaca ttgatgatga       600
     taggatcgtt cagactgaaa caaggtttta tccaataaat cctattgaac tggatgttga       660
     ggataactta gtggaagctg catcgcttga gtggttggga agtgatgtgg aagagagagt       720
     cgtagacagg gttttactac ttgaggataa ggagattcgt gttacgaaga aagccctaat       780
     caaaagaaga ggtgagacaa gactcgtggc aaatgtttct gatcttacca ggaaattgac       840
     tgaaatttgt tgtgagtctg ggattcccat cattgatatt gataatagca agaggaaagc       900
     cattccaatg gttaaactca agcatatctt tgggaaaatt gagtgtgatg acatgtttga       960
     agaagatcgg tatttcttgg agcataaaaa tgcagggaaa attttccgat catgcgagaa      1020
     aattacttat aggatgattc gtcctggatg gagcggagcg atcattttga gagacaatgt      1080
     tcagggagag gatcgtgata agtttgattt tatcaatgac atttgtgttg ttcaagggaa      1140
     gaatctgatt agcaataaaa ttgaaaatgc catgcgtgtc aagaccgaaa cagagataga      1200
     tttaattgat ttatattctt tcaatcttag ttgggcaaaa tcgaaagata agttcattaa      1260
     gcactttgaa agtgatataa ctcaattagt tagatcatgt tgcactccta gtacattgtg      1320
     gttatacgca aggaaagcca ggttttacaa atttgtggat tacatgatct tgaagggtag      1380
     tccaataatt gatattttag ttaaaatgga gtatgttggc aagcatttgg acatgttcaa      1440
     tagtgttgaa gatgtgtgca ctgagtattc tcattttatg aaggagataa ttagtgaaac      1500
     tgtcaacgat aaatcggatc cagatgtgtt gcgcgttaga aatttgatta gagcgcattt      1560
     tgagtcagtc ttagaatata ataagtatga gttaatagat aggataattg agaaaaaaac      1620
     acagttagaa gcacaagaaa ttatgtctcg cgagctcatc aaacatcagt acggagagtt      1680
     attttcttgg cgcgaacgtt tgtgtttgaa actaggtata ggctcatcaa atctctatac      1740
     gtactgggtg gaacgcgagg agaaacaatc agagaaatcg agtgcaatct cttgcatcat      1800
     ttccaaaccg ggaatagaga tgttgatcaa ttggatctca gaagtgtgta aaagtaaata      1860
     ccatagtctt gtaaattgtg tcgatagtgg tattttattc ttgtggtcta gaatagttca      1920
     tttggctaga gtctcagttt atggttattg gaatctttgg tttagacaag cgatgtgtgt      1980
     gttattcatt tttttagttt cgaatttctc tagtaaaata gttggttatc ttaagaaatt      2040
     aattgtgagt gagaagaagc tagcaataaa gaatgaggca ggtttcgttg aagtgcaagg      2100
     ccgaaaggaa gaatcttttg tcttaaaatg gtgtgctgct gcgactctgt ttcttagttt      2160
     cctaaactat gactgggcag ttggatgtgt atcagccatt ggaaagatga aaacaatatt      2220
     tagcgcacta ggtccagatt tcatagaaaa gcaannnnnn nnnnnnnnnn nnnnnnnnnn      2280
     nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn      2340
     nnnnnnnnnn nnnngtgagc ataattcaga gtaggttcag agagttgttt tcagaatgtg      2400
     gtttagacta taattttgat gatgttcaca agtcaataga aacaattgag tatatgagcc      2460
     attcattcat gctgaaagat gatgtgtata ttccaaagtt gaagaaagaa agaattgttg      2520
     caattcttga atgggagaga ggtgatgaga ttatgcgaac gcggagtgct cttaatgctg      2580
     cttatattga gagttatgga tatgatgatc ttatggttga aattgagcgg tatgcggttt      2640
     tctgggctac cgagaaaggt tgtgagtatc cattactgga tagaaagcgt gtagaaggac      2700
     tttacaaaga tgattacaca gatatcaatg aagaatggct ggtaggcatt ttaccaccat      2760
     cattcgaaca ttgttatgtt gacacgcaga ctgaggattt gagaggaagt gagaagcttg      2820
     agttgagaac tgagagccgt gatagaatat cccaaatgca aatgaagttt ccaattacat      2880
     ttgtgacagg aaatttggga aaattagcag aagtgaagtc tattcttggc atttcaagtg      2940
     atgttatggc taagaacatt gatttaccag aagtgcaagg cactccggat gaaattgtga      3000
     taaagaaagc gcaattggca gtgaagatga ctaatagtcc tgttttagtc gaagatacat      3060
     gtctttgctt taatgctttc aatggacttc ctgggccata cataaaatgg tttttaaaag      3120
     aattgggtct tgaaggtgtt gttaaaatgc tgtcagcatt tggagataaa tcagcgtacg      3180
     cactatgcac attcgcatat gtgcacaatg aattgtctga tccagttgtg tttaaaggtg      3240
     ttgtaaatgg tgaaattgtg ccgccacgag gtaacaatgg ttttggttgg gatcctatat      3300
     ttaagcctga cgaatgtagt tgtacgttcg cagaaatgcc aagcagcttt aagaatgatt      3360
     tttcccatag aagaagagct ttagagaaag tcaaattgtt tcttgatgac ttagtggtga      3420
     agcaagagaa gaaggaggca agagtggctc taacgattga tgttcag                    3467
//