ID X96665; SV 1; linear; mRNA; STD; VRL; 2603 BP. XX AC X96665; S73815; S77089; XX DT 20-MAR-1996 (Rel. 47, Created) DT 14-NOV-2006 (Rel. 89, Last updated, Version 20) XX DE Soybean mosiac virus mRNA for NIb and coat protein XX KW coat protein; NIb protein. XX OS Soybean mosaic virus OC Viruses; Riboviria; Potyviridae; Potyvirus. XX RN [2] RC Revised by [5] RA Liu J.; RT ; RL Submitted (15-MAR-1996) to the INSDC. RL J. Liu, Institute of Microbiology, Chinese Academy of Sciencs, Beijing RL 100080, Beijing 100080, PROC XX RN [3] RP 1552-2603 RX PUBMED; 8049345. RA Liu J., Peng X., Li L., Mang K.; RT "Cloning of coat protein gene of soybean mosaic virus and its expression in RT Escherichia coli"; RL Chin. J. Biotechnol. 9(3):143-149(1993). XX RN [4] RP 1-2603 RX PUBMED; 7755870. RA Liu J., Peng X., Mang K.; RT "cDNA cloning and sequence analysis of NIb gene of soybean mosaic virus"; RL Sci. China, Ser. B, Chem. Life Sci. Earth Sci. 38(2):160-168(1995). XX RN [5] RP 1-2603 RA Liu J.; RT ; RL Submitted (21-OCT-1996) to the INSDC. RL J. Liu, Institute of Microbiology, Chinese Academy of Sciencs, Beijing RL 100080, Beijing 100080, PROC XX DR MD5; 13790124301b8c4b948c06632edbf3f3. XX FH Key Location/Qualifiers FH FT source 1..2603 FT /organism="Soybean mosaic virus" FT /strain="BJ" FT /mol_type="mRNA" FT /macronuclear FT /clone="pBluescript KS" FT /db_xref="taxon:12222" FT CDS <1..2349 FT /codon_start=1 FT /product="NIb protein" FT /product="coat protein" FT /db_xref="GOA:Q98704" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001592" FT /db_xref="InterPro:IPR007094" FT /db_xref="UniProtKB/TrEMBL:Q98704" FT /protein_id="CAA65445.1" FT /translation="GKKERWVLDAMEGNLVACGQADSALVTKHVVKGKCPYFAQYLSVN FT QEAKFFFEPLMGAYQPSRLNKDAFKRDFFKYNKPVVLNEVDFQAFEKAVAGVKLMMMEF FT DFKECVYVTDPDEIYDSLNMKAAVGAQYKGKKQDYFSGMDSFDKERLLYLSCERLFYGE FT KGVWNGSLKAELRPIEKVQANKTRTFTAAPIDTFLGAKVCVDDFNNQFYSLNLTCPWTV FT GMTKFYRGWDKLMRSLPDGWVYCHADGSQFDSSLTPLLLNAVLDVRGFFMEDWWVGREM FT LENLYAEIVYTPILAPDGTIFKKFRGNNSGQPSTVVDNTLMVVIAMYYSCCKQGWSEED FT IQERLVFFAIGDDIILAVSEKDTWLYDTLSTSFAELGLNYNFEEQTKKREELWFMSHQA FT MLVDGVYIPKLEPERIVSILEWDRSKELMHRTEAICAAMIEAWGYTELLQEIRKFYLWL FT LNKDEFKELASSGKAPYIAETALRKLYTDVNAQTSELQRYLEVLDFNHADDCCESVSLQ FT SGKEKEGDMDAGKDPKKNTSSSKGAGTSSKDVNVGSKGKVVPRLQKITRKMNLPMVEGK FT IILSLDHLLEYKPNQVDLFNTRATRTQFEAWYNAVKDEYELDDEQMGVVMNGFMVWCID FT NGTSPDANGVWVMMDGEEQIEYPLKPIVENAKPTLRQIMHHFSDAAEAYIEMRNSESPY FT MPRYGLLRNLRDRELARYAFDFYEVTSKTPNRAREAIAQMKAAALSGVNNKLFGLDGNI FT STNSENTERHTARDVNQNMHSLLGMGPQQ" FT mat_peptide <1..1551 FT /product="NIb protein" FT mat_peptide 1552..2346 FT /product="coat protein" FT misc_difference 1552 FT /replace="g" FT /note="conflict" FT /citation=[3] FT misc_difference 2473^2474 FT /replace="t" FT /note="strain SMV-N" FT /note="conflict" FT /citation=[3] XX SQ Sequence 2603 BP; 808 A; 438 C; 654 G; 703 T; 0 other; gggaagaagg agagatgggt cttggatgca atggagggca acctagtggc ttgtgggcaa 60 gctgacagtg cattggtaac aaagcatgtt gttaaaggaa agtgccccta ttttgcacaa 120 tatctttcag tgaatcaaga ggcaaagttc ttctttgaac cactcatggg tgcgtatcaa 180 ccaagccgct taaacaaaga tgcattcaaa cgagacttct tcaaatataa caaaccagtt 240 gttttgaatg aagttgattt tcaagctttt gagaaggcag tggctggagt gaaattgatg 300 atgatggaat ttgatttcaa ggagtgtgtg tatgtgactg atcctgatga aatatacgac 360 tccttgaata tgaaagctgc agttggtgca caatacaaag ggaagaagca agattatttc 420 tctggaatgg acagtttcga caaggaacgc ttgctctatc tcagctgtga aaggttattc 480 tatggggaaa aaggagtgtg gaatggatcc ctgaaagcag agttgaggcc aattgaaaaa 540 gtgcaagcaa acaaaaccag gacattcaca gcagcaccaa tcgacacatt ccttggagca 600 aaggtttgtg ttgatgattt caacaaccaa ttttacagtc tcaatcttac atgcccatgg 660 acagttggga tgacaaaatt ttatagaggt tgggacaagt tgatgagaag tttacccgat 720 ggatgggtgt attgtcatgc agatggctca cagtttgata gttccctgac accattacta 780 ttgaatgcag ttttggatgt taggggcttt ttcatggagg actggtgggt tgggagagaa 840 atgctagaaa acctctatgc tgagatagtc tacacaccaa ttttagcacc tgatggtaca 900 atttttaaga agttcagagg aaacaacagc gggcagccat ctacagttgt ggacaatacc 960 ttgatggtgg tcattgccat gtactattct tgttgtaagc aaggttggtc agaggaggat 1020 attcaggaaa gattagtgtt tttcgccatt ggtgatgaca tcatcctggc ggttagtgag 1080 aaggacacat ggctgtatga cactctaagc acttcatttg ctgaacttgg gctcaattac 1140 aactttgagg aacagacaaa gaaaagggag gaattgtggt tcatgtcaca ccaagccatg 1200 ttagttgatg gagtttatat tccaaaactt gaacctgaga gaattgtttc tatcctagag 1260 tgggatagga gcaaagaact tatgcatcgc actgaggcga tatgcgcagc aatgattgag 1320 gcatggggat acactgaatt gctgcaagag atccgcaaat tttatttgtg gctcctaaac 1380 aaggatgaat ttaaagagct tgcttcgtct ggaaaagcac catatattgc agagacagct 1440 ttgagaaagc tgtacacaga tgttaatgct cagacaagtg agctacaaag atatcttgag 1500 gtgcttgatt tcaatcatgc tgatgactgc tgtgaatcag tgtccttaca atcaggcaag 1560 gagaaggaag gagacatgga tgcaggtaag gatccaaaga agaacaccag cagtagcaaa 1620 ggggctggta caagcagcaa agatgtaaat gttggatcaa aaggaaaggt ggttccgcgt 1680 ttgcagaaga ttacaagaaa gatgaatctt ccaatggttg aaggaaagat tattcttagc 1740 ttagaccact tgcttgagta caaacctaat caggttgatt tattcaacac tcgagcaaca 1800 agaacacagt ttgaagcgtg gtacaatgca gttaaggatg aatatgagct tgatgatgaa 1860 caaatgggtg tggttatgaa tggttttatg gtttggtgta tagacaatgg cacatctcca 1920 gatgccaatg gcgtgtgggt gatgatggat ggagaggaac agattgaata tccgctgaaa 1980 cccattgttg aaaatgcaaa accaactttg agacaaatca tgcatcattt ttcagatgca 2040 gcagaagctt acattgagat gagaaattct gaaagtccat atatgcctag atatggacta 2100 ctgaggaatt tgagagatag ggagttagcc cgttatgcct tcgatttcta tgaggtcacc 2160 tccaaaacac cgaatagggc aagagaggca atagcacaaa tgaaggctgc agctctctcg 2220 ggagttaaca acaagctgtt tgggcttgat ggaaacatct cgaccaactc cgaaaatact 2280 gaaaggcaca ctgcaagaga tgtgaatcaa aacatgcatt ctcttttggg catgggccca 2340 cagcagtaaa ggctaggtaa atcggccaca gttatcattt cgggtcgctt tatagtttac 2400 tatagtatag tagttgcact tcctttaagt atagtgtgat tgcatcacct aataatactt 2460 ttgtttagag tggtctaacc accttagtgt gctttatatt atagtttatg aatagcaggg 2520 agaaccattg caatgccgga gcccttttca agagtgattt tatcatgcat agtggccgag 2580 gtgcggcaat gtttgttgtc gac 2603 //