ID FJ026002; SV 1; linear; genomic RNA; STD; VRL; 3467 BP. XX AC FJ026002; XX DT 15-MAY-2009 (Rel. 100, Created) DT 18-JUN-2009 (Rel. 101, Last updated, Version 2) XX DE Cassava brown streak virus isolate NTG10 polyprotein (pol) gene, partial DE cds. XX KW . XX OS Cassava brown streak virus OC Viruses; Riboviria; Potyviridae; Ipomovirus. XX RN [1] RP 1-3467 RX DOI; 10.1128/JVI.00537-09. RX PUBMED; 19386713. RA Mbanzibwa D.R., Tian Y., Mukasa S.B., Valkonen J.P.; RT "Cassava brown streak virus (Potyviridae) encodes a putative Maf/HAM1 RT pyrophosphatase implicated in reduction of mutations and a P1 proteinase RT that suppresses RNA silencing but contains no HC-Pro"; RL J. Virol. 83(13):6934-6940(2009). XX RN [2] RP 1-3467 RA Mbanzibwa D.R., Tian Y., Tugume A.K., Valkonen J.P.T.; RT ; RL Submitted (14-AUG-2008) to the INSDC. RL Applied Biology, University of Helsinki, Latokartnonkaari 7, Helsinki RL 00790, Finland XX DR MD5; a59e513798087a2786b2c6fd9b015060. XX FH Key Location/Qualifiers FH FT source 1..3467 FT /organism="Cassava brown streak virus" FT /host="cassava" FT /isolate="NTG10" FT /mol_type="genomic RNA" FT /country="Uganda" FT /collection_date="2007" FT /db_xref="taxon:137758" FT 5'UTR <1..130 FT gene 131..>3467 FT /gene="pol" FT CDS 131..>2254 FT /codon_start=1 FT /gene="pol" FT /product="polyprotein" FT /note="coding region disrupted by sequencing gap" FT /db_xref="InterPro:IPR025910" FT /db_xref="UniProtKB/TrEMBL:C4N225" FT /protein_id="ACM48174.1" FT /translation="MSTIQLFKTIRFGSFEPVKLDEGNNVIEKVPIDLLAGNDGSGPEE FT QSEQKYHRKESGESWRKVTDLYSVIGNSVYCRSYVAMKNFLNDTKWGGLFKNKKGQELK FT AAARLRRATSYGFMYDPVACAFECPVCRTKATALEAFTSDCDHCFEVKHIDDDRIVQTE FT TKFYPINPIELDVEDNLVEAASLEWLKGEVEERVVDRVLLLEDKEIRVTKKALVKRRAE FT TKLVANVADLTKKLTEICCESGIPIIDIDNSKRKAIPMVKLKHIFGKIECDDMFGEDRY FT FLEHSNAGKIFRSCERITYRMIRPGWSGAIILRDNVQGEDHDKFDFINDICVVQGKNLI FT SNKIENAMRVKTETEMDLIDLYSFNLSWAKSKDKFIKHFESDTTQLIRSCCTPSTLWLY FT ARKARFYKFVDHMILKGSPIIDILVKMEYVGKHLDLFNSVEDVCTEFFSFMKELISETV FT NDKSDPDVLRVRNLIRAHFESVLEYNKYELIDRIIEKKTQLEAQEIMSRELIRHQYGEL FT FSWRERLYLKLGIGSSNLYTYWVEREEKQSEKSSAISCIISKPGIEMLINWVSEVCKSK FT YHSLVNCVDSGILFLWSRIVHLARVSVYGYWNLWFRQAMCVLFIFLVSNFSCKIVNYLK FT TLIVNEKKLAIKNEEGFVEVQGRKEESFVLKWCAAATLFLSFLNYDWAVGCVSAIGKMK FT TIFSALGPDFIEKQ" FT mat_peptide 131..1216 FT /gene="pol" FT /product="P1 protein" FT mat_peptide 1217..2098 FT /gene="pol" FT /product="P3 protein" FT mat_peptide 2099..2254 FT /gene="pol" FT /product="6K1 protein" FT gap 2255..2354 FT /estimated_length=unknown FT CDS <2355..>3467 FT /codon_start=1 FT /gene="pol" FT /product="polyprotein" FT /note="coding region disrupted by sequencing gap" FT /db_xref="GOA:C4N226" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR002637" FT /db_xref="InterPro:IPR027502" FT /db_xref="InterPro:IPR029001" FT /db_xref="UniProtKB/TrEMBL:C4N226" FT /protein_id="ACM48175.1" FT /translation="VSIIQSRFRELFSECGLDYNFDDVHKSIETIEYMSHSFMLKDDVY FT IPKLKKERIVAILEWERGDEIMRTRSALNAAYIESYGYDDLMVEIERYAVFWATEKGCE FT YPLLDRKRVEGLYKDDHTDINEEWLRGILPPSFEHCYVDTQTKDLRREEQPELRPESYD FT RTTQMQMKFPVTFVTGNLGKLAEVKSILGIANDVIARNIDLPEVQGTPDEIVRKKAQLA FT VKMTNSPVLVEDTCLCFNAFNGLPGPYIKWFLKELGLEGVVKMLSAFGDKSAYALCTFA FT YVHNESSDPIVFKGVVNGEIVPPRGNNGFGWDPIFKPDECSCTFAEMSSSMKNDFSHRR FT RALEKVKSFLDNLVVKQEEKKARVALTIDVQ" FT mat_peptide <2355..2789 FT /gene="pol" FT /product="NIb protein" FT mat_peptide 2790..3467 FT /gene="pol" FT /product="HAM1-like protein" XX SQ Sequence 3467 BP; 1088 A; 477 C; 806 G; 996 T; 100 other; ataaacatga cataagaata cataaagata catcgcatac aaacactgtg cattcaaatt 60 cgtgagaaag caagctgtta cgaaagttta caatacatcc ttatatattc atagcactga 120 atatctgcag atgtcgacta tccagctatt caagactatc agatttggtt cctttgagcc 180 tgtcaagctt gatgaaggaa acaatgttat agagaaggtc ccaattgatc ttctggcagg 240 aaatgatggc tctggtcctg aggagcaatc ggaacagaaa taccatagga aagaaagcgg 300 tgaaagttgg cggaaagtaa ccgacttata ttctgtaatt ggaaattctg tttactgccg 360 cagttacgta gcaatgaaaa actttctcaa tgatacaaag tggggaggtc ttttcaaaaa 420 taagaagggg caagaactaa aggctgctgc aaggctcaga agggctacaa gctatggatt 480 catgtatgat ccagttgctt gtgcctttga atgccctgtt tgcagaacaa aagctacagc 540 attggaagca tttacaagcg attgtgatca ttgtttcgaa gtcaagcaca ttgatgatga 600 taggatcgtt caaactgaaa caaagtttta tccaataaat cctattgaac tggatgttga 660 ggataactta gtggaagctg catcgctcga gtggttgaaa ggtgaagtgg aagagagagt 720 tgtagacagg gttttactac ttgaggacaa ggagattcgt gttacaaaga aagctctagt 780 caaaaggaga gctgagacaa aactagtggc taacgttgct gatcttacca agaaattgac 840 tgaaatttgc tgtgagtctg gtatccctat cattgacatt gataacagca agaggaaagc 900 cattccgatg gtcaaactca agcatatctt tgggaaaatt gagtgtgatg acatgtttgg 960 ggaagatcgg tatttcctgg agcatagcaa tgcagggaaa attttccgat catgtgagag 1020 aattacttat aggatgattc gtcctggatg gagtggagca atcattctaa gagacaatgt 1080 tcagggagag gatcatgata agtttgactt tatcaatgat atttgtgttg ttcaagggaa 1140 gaatctaata agcaacaaaa ttgagaacgc catgcgcgtc aagactgaaa cagaaatgga 1200 cttaattgat ttgtattctt tcaatcttag ttgggcgaaa tcaaaagata agttcattaa 1260 gcactttgaa agtgatacaa ctcaattaat tagatcgtgt tgcactccta gtacattgtg 1320 gttgtacgca aggaaagcca ggttttacaa atttgtggat cacatgatct taaagggtag 1380 tccaataatt gatattctag ttaaaatgga gtatgttggc aaacatttgg atttgtttaa 1440 tagtgttgaa gatgtgtgca ctgagttttt ctcatttatg aaggagttaa ttagtgaaac 1500 tgtcaacgat aaatcagatc cagatgtgtt gcgcgttaga aatttgataa gagcgcattt 1560 tgagtcagtc ttagaataca ataagtatga gttgatagat aggataattg agaaaaaaac 1620 tcagttagaa gcacaagaaa ttatgtctcg cgagctcatc agacatcagt acggagaatt 1680 attttcttgg cgcgaacgtt tgtatttgaa actaggtata ggctcatcga atctctatac 1740 atactgggtg gaacgcgaag agaaacaatc agagaaatcg agtgcaatct cttgcatcat 1800 ttccaaaccg ggaatagaga tgttgatcaa ttgggtctca gaagtgtgta agagtaaata 1860 tcatagtctt gtaaattgtg tcgatagtgg tattttgttc ttgtggtcta gaatagttca 1920 tttagcgaga gtttcagttt atggttattg gaatctttgg tttagacaag ctatgtgtgt 1980 gttgttcatt ttcttagttt caaatttctc ttgtaaaata gttaattatc ttaagacatt 2040 aattgtgaat gagaagaagc tagcaataaa gaatgaagaa ggtttcgttg aagtacaagg 2100 ccgaaaagaa gagtcttttg ttttaaagtg gtgtgctgct gcaactctgt ttcttagttt 2160 tctaaactat gattgggcag ttggatgtgt atcagccatt ggaaagatga aaacaatatt 2220 tagtgcacta ggtccagatt tcatagaaaa gcaannnnnn nnnnnnnnnn nnnnnnnnnn 2280 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2340 nnnnnnnnnn nnnngtgagt ataatccaga gtaggttcag agagttgttt tcagaatgtg 2400 gtttggatta taattttgat gatgttcaca agtcaataga gactattgag tatatgagcc 2460 attcattcat gctgaaagat gatgtgtaca ttccaaagtt gaagaaagaa agaattgttg 2520 caattcttga atgggagaga ggtgatgaga ttatgcgaac acggagtgct cttaatgctg 2580 cttatatcga aagttatgga tatgatgatc ttatggttga aattgagcgg tacgcggttt 2640 tctgggctac cgaaaaaggt tgtgagtatc cattattaga taggaagcgt gtggaaggac 2700 tttacaaaga tgatcacaca gatattaatg aagaatggct gagaggtatt ttaccaccat 2760 cattcgaaca ttgttatgtt gacacgcaaa ctaaggattt gagaagggaa gagcagcctg 2820 agttgagacc tgaaagttat gacagaacaa cccaaatgca gatgaagttc ccagttacat 2880 ttgtgacagg aaatttgggg aagttagctg aagtgaagtc tattcttggc attgcaaatg 2940 atgttatagc caggaacatt gatttaccag aagtgcaagg aaccccagat gagattgtga 3000 gaaagaaggc tcaattagca gtgaagatga ctaatagtcc tgttttagtc gaagacacgt 3060 gtctttgttt caatgctttt aatggactgc ctgggccata catcaaatgg tttttaaaag 3120 aattgggtct tgaaggtgtt gtcaaaatgc tgtcagcatt tggagataag tcagcgtacg 3180 cactatgtac atttgcatat gtgcacaatg agtcgtctga tccaattgtg tttaaaggag 3240 ttgtgaatgg tgaaattgtg ccaccgcgag gtaacaatgg ttttggttgg gatcctatat 3300 tcaaacctga tgaatgtagt tgtacgtttg cagaaatgtc aagcagcatg aagaatgatt 3360 tttcccatag aagaagagct ttagagaaag tcaaatcatt tcttgataac ttggtggtga 3420 agcaagagga gaagaaggca agggtggctc taacgattga tgttcag 3467 //