Dbfetch

ID   FJ026001; SV 1; linear; genomic RNA; STD; VRL; 3467 BP.
XX
AC   FJ026001;
XX
DT   15-MAY-2009 (Rel. 100, Created)
DT   18-JUN-2009 (Rel. 101, Last updated, Version 2)
XX
DE   Cassava brown streak virus isolate BSA4 polyprotein (pol) gene, partial
DE   cds.
XX
KW   .
XX
OS   Cassava brown streak virus
OC   Viruses; Riboviria; Potyviridae; Ipomovirus.
XX
RN   [1]
RP   1-3467
RX   DOI; 10.1128/JVI.00537-09.
RX   PUBMED; 19386713.
RA   Mbanzibwa D.R., Tian Y., Mukasa S.B., Valkonen J.P.;
RT   "Cassava brown streak virus (Potyviridae) encodes a putative Maf/HAM1
RT   pyrophosphatase implicated in reduction of mutations and a P1 proteinase
RT   that suppresses RNA silencing but contains no HC-Pro";
RL   J. Virol. 83(13):6934-6940(2009).
XX
RN   [2]
RP   1-3467
RA   Mbanzibwa D.R., Tian Y., Tugume A.K., Valkonen J.P.T.;
RT   ;
RL   Submitted (14-AUG-2008) to the INSDC.
RL   Applied Biology, University of Helsinki, Latokartnonkaari 7, Helsinki
RL   00790, Finland
XX
DR   MD5; ba38b5c3c57ca60900ed5839f88a1b3e.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3467
FT                   /organism="Cassava brown streak virus"
FT                   /host="cassava"
FT                   /isolate="BSA4"
FT                   /mol_type="genomic RNA"
FT                   /country="Uganda"
FT                   /collection_date="2007"
FT                   /db_xref="taxon:137758"
FT   5'UTR           <1..130
FT   gene            131..>3467
FT                   /gene="pol"
FT   CDS             131..>2254
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="polyprotein"
FT                   /note="coding region disrupted by sequencing gap"
FT                   /db_xref="InterPro:IPR025910"
FT                   /db_xref="UniProtKB/TrEMBL:C4N223"
FT                   /protein_id="ACM48172.1"
FT                   /translation="MSTIQLFKTIRFGSFEPVKLDEGNNVVEKVPVDLLAGNDGSGPEG
FT                   QSEQKYHRKESGESWRKVTDLYSVIGNSVYCRSYVAMKNFLNDTKWGGLFKNKKGQELK
FT                   AAARLRRATSYGFMYDPVACAFECPVCRTKATALEAFTSDCDHCFEIKHIDDDRIVQTE
FT                   TRFYPINPIELDVEDNLVEAASLEWLGSDVEERVVDRVLLLEDKEIRVTKKALIKRRGE
FT                   TRLVANVSDLTRKLTEICCESGIPIIDIDNSKRKAIPMVKLKHIFGKIECDDMFEEDRY
FT                   FLEHKNAGKIFRSCEKITYRMIRPGWSGAIILRDNVQGEDRDKFDFINDICVVQGKNLI
FT                   SNKIENAMRVKTETEIDLIDLYSFNLSWAKSKDKFIKHFESDITQLVRSCCTPSTLWLY
FT                   ARKARFYKFVDYMILKGSPIIDILVKMEYVGKHLDMFNSVEDVCTEYSHFMKEIISETV
FT                   NDKSDPDVLRVRNLIRAHFESVLEYNKYELIDRIIEKKTQLEAQEIMSRELIKHQYGEL
FT                   FSWRERLCLKLGIGSSNLYTYWVEREEKQSEKSSAISCIISKPGIEMLINWISEVCKSK
FT                   YHSLVNCVDSGILFLWSRIVHLARVSVYGYWNLWFRQAMCVLFIFLVSNFSSKIVGYLK
FT                   KLIVSEKKLAIKNEAGFVEVQGRKEESFVLKWCAAATLFLSFLNYDWAVGCVSAIGKMK
FT                   TIFSALGPDFIEKQ"
FT   mat_peptide     131..1216
FT                   /gene="pol"
FT                   /product="P1 protein"
FT   mat_peptide     1217..2098
FT                   /gene="pol"
FT                   /product="P3 protein"
FT   mat_peptide     2099..2254
FT                   /gene="pol"
FT                   /product="6K1 protein"
FT   gap             2255..2354
FT                   /estimated_length=unknown
FT   CDS             <2355..>3467
FT                   /codon_start=1
FT                   /gene="pol"
FT                   /product="polyprotein"
FT                   /note="coding region disrupted by sequencing gap"
FT                   /db_xref="GOA:C4N224"
FT                   /db_xref="InterPro:IPR001205"
FT                   /db_xref="InterPro:IPR002637"
FT                   /db_xref="InterPro:IPR027502"
FT                   /db_xref="InterPro:IPR029001"
FT                   /db_xref="UniProtKB/TrEMBL:C4N224"
FT                   /protein_id="ACM48173.1"
FT                   /translation="VSIIQSRFRELFSECGLDYNFDDVHKSIETIEYMSHSFMLKDDVY
FT                   IPKLKKERIVAILEWERGDEIMRTRSALNAAYIESYGYDDLMVEIERYAVFWATEKGCE
FT                   YPLLDRKRVEGLYKDDYTDINEEWLVGILPPSFEHCYVDTQTEDLRGSEKLELRTESRD
FT                   RISQMQMKFPITFVTGNLGKLAEVKSILGISSDVMAKNIDLPEVQGTPDEIVIKKAQLA
FT                   VKMTNSPVLVEDTCLCFNAFNGLPGPYIKWFLKELGLEGVVKMLSAFGDKSAYALCTFA
FT                   YVHNELSDPVVFKGVVNGEIVPPRGNNGFGWDPIFKPDECSCTFAEMPSSFKNDFSHRR
FT                   RALEKVKLFLDDLVVKQEKKEARVALTIDVQ"
FT   mat_peptide     <2355..2789
FT                   /gene="pol"
FT                   /product="NIb protein"
FT   mat_peptide     2790..3467
FT                   /gene="pol"
FT                   /product="HAM1-like protein"
XX
SQ   Sequence 3467 BP; 1086 A; 464 C; 809 G; 1008 T; 100 other;
     ataaacatga cataagaata cataaagata catcgcatac aaacactgtg cattcaaatt        60
     tgtgagaaag caagctgtta caaaagttta cgatacatcc ttatatattt atagcactga       120
     atacctgcag atgtcgacta tccagctatt caagactatc agatttggtt cctttgaacc       180
     tgtcaaactc gatgaaggaa acaatgttgt agagaaggtc ccagttgatc ttctggcagg       240
     aaatgatggc tctggtcctg aggggcaatc agaacagaaa taccatagga aagaaagcgg       300
     tgaaagttgg cggaaagtaa ctgacttata ttctgtaatt ggaaattctg tttactgccg       360
     cagttacgta gcaatgaaaa acttccttaa tgatacaaaa tggggaggtc ttttcaagaa       420
     taaaaagggg caagaattaa aagctgctgc aaggctcaga agggccacaa gctatggatt       480
     catgtatgat ccagttgctt gtgcctttga atgtcctgtt tgtaggacaa aagcaacagc       540
     attggaagcg ttcacaagcg attgtgacca ttgttttgaa atcaagcaca ttgatgatga       600
     taggatcgtt cagactgaaa caaggtttta tccaataaat cctattgaac tggatgttga       660
     ggataactta gtggaagctg catcgcttga gtggttggga agtgatgtgg aagagagagt       720
     cgtagacagg gttttactac ttgaggataa ggagattcgt gttacgaaga aagccctaat       780
     caaaagaaga ggtgagacaa gactcgtggc aaatgtttct gatcttacca ggaaattgac       840
     tgaaatttgt tgtgagtctg ggattcccat cattgatatt gataatagca agaggaaagc       900
     cattccaatg gttaaactca agcatatctt tgggaaaatt gagtgtgatg acatgtttga       960
     agaagatcgg tatttcttgg agcataaaaa tgcagggaaa attttccgat catgcgagaa      1020
     aattacttat aggatgattc gtcctggatg gagcggagcg atcattttga gagacaatgt      1080
     tcagggagag gatcgtgata agtttgattt tatcaatgac atttgtgttg ttcaagggaa      1140
     gaatctgatt agcaataaaa ttgaaaatgc catgcgtgtc aagaccgaaa cagagataga      1200
     tttaattgat ttatattctt tcaatcttag ttgggcaaaa tcgaaagata agttcattaa      1260
     gcactttgaa agtgatataa ctcaattagt tagatcatgt tgcactccta gtacattgtg      1320
     gttatacgca aggaaagcca ggttttacaa atttgtggat tacatgatct tgaagggtag      1380
     tccaataatt gatattttag ttaaaatgga gtatgttggc aagcatttgg acatgttcaa      1440
     tagtgttgaa gatgtgtgca ctgagtattc tcattttatg aaggagataa ttagtgaaac      1500
     tgtcaacgat aaatcggatc cagatgtgtt gcgcgttaga aatttgatta gagcgcattt      1560
     tgagtcagtc ttagaatata ataagtatga gttaatagat aggataattg agaaaaaaac      1620
     acagttagaa gcacaagaaa ttatgtctcg cgagctcatc aaacatcagt acggagagtt      1680
     attttcttgg cgcgaacgtt tgtgtttgaa actaggtata ggctcatcaa atctctatac      1740
     gtactgggtg gaacgcgagg agaaacaatc agagaaatcg agtgcaatct cttgcatcat      1800
     ttccaaaccg ggaatagaga tgttgatcaa ttggatctca gaagtgtgta aaagtaaata      1860
     ccatagtctt gtaaattgtg tcgatagtgg tattttattc ttgtggtcta gaatagttca      1920
     tttggctaga gtctcagttt atggttattg gaatctttgg tttagacaag cgatgtgtgt      1980
     gttattcatt tttttagttt cgaatttctc tagtaaaata gttggttatc ttaagaaatt      2040
     aattgtgagt gagaagaagc tagcaataaa gaatgaggca ggtttcgttg aagtgcaagg      2100
     ccgaaaggaa gaatcttttg tcttaaaatg gtgtgctgct gcgactctgt ttcttagttt      2160
     cctaaactat gactgggcag ttggatgtgt atcagccatt ggaaagatga aaacaatatt      2220
     tagcgcacta ggtccagatt tcatagaaaa gcaannnnnn nnnnnnnnnn nnnnnnnnnn      2280
     nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn      2340
     nnnnnnnnnn nnnngtgagc ataattcaga gtaggttcag agagttgttt tcagaatgtg      2400
     gtttagacta taattttgat gatgttcaca agtcaataga aacaattgag tatatgagcc      2460
     attcattcat gctgaaagat gatgtgtata ttccaaagtt gaagaaagaa agaattgttg      2520
     caattcttga atgggagaga ggtgatgaga ttatgcgaac gcggagtgct cttaatgctg      2580
     cttatattga gagttatgga tatgatgatc ttatggttga aattgagcgg tatgcggttt      2640
     tctgggctac cgagaaaggt tgtgagtatc cattactgga tagaaagcgt gtagaaggac      2700
     tttacaaaga tgattacaca gatatcaatg aagaatggct ggtaggcatt ttaccaccat      2760
     cattcgaaca ttgttatgtt gacacgcaga ctgaggattt gagaggaagt gagaagcttg      2820
     agttgagaac tgagagccgt gatagaatat cccaaatgca aatgaagttt ccaattacat      2880
     ttgtgacagg aaatttggga aaattagcag aagtgaagtc tattcttggc atttcaagtg      2940
     atgttatggc taagaacatt gatttaccag aagtgcaagg cactccggat gaaattgtga      3000
     taaagaaagc gcaattggca gtgaagatga ctaatagtcc tgttttagtc gaagatacat      3060
     gtctttgctt taatgctttc aatggacttc ctgggccata cataaaatgg tttttaaaag      3120
     aattgggtct tgaaggtgtt gttaaaatgc tgtcagcatt tggagataaa tcagcgtacg      3180
     cactatgcac attcgcatat gtgcacaatg aattgtctga tccagttgtg tttaaaggtg      3240
     ttgtaaatgg tgaaattgtg ccgccacgag gtaacaatgg ttttggttgg gatcctatat      3300
     ttaagcctga cgaatgtagt tgtacgttcg cagaaatgcc aagcagcttt aagaatgatt      3360
     tttcccatag aagaagagct ttagagaaag tcaaattgtt tcttgatgac ttagtggtga      3420
     agcaagagaa gaaggaggca agagtggctc taacgattga tgttcag                    3467
//