ID   AK164293; SV 1; linear; mRNA; HTC; MUS; 2331 BP.
XX
AC   AK164293;
XX
DT   09-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 10)
XX
DE   Mus musculus 9 days embryo whole body cDNA, RIKEN full-length enriched
DE   library, clone:D030074E24 product:calpain 5, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2331
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (14-APR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; f7487c086fb8cf8e778682042f133ec8.
DR   Ensembl-Gn; ENSMUSG00000035547; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0032534; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0032508; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0032444; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0032519; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0032234; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0033017; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0031561; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0032201; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0032353; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0032309; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0032442; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0032342; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0033039; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0031279; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0031678; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000040971; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000107112; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0085081; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0085148; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0085099; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0085120; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0084689; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0085626; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0085209; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0084626; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0084816; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0084673; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0084852; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0084720; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0085901; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0084631; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0083742; mus_musculus_wsbeij.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group in the RIKEN
CC   Genomic Sciences Center and the Genome Science Laboratory in RIKEN.
CC   The Division of Experimental Animal Research in RIKEN contributed to
CC   preparing the mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=D030074E24
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2331
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="9 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="D030074E24"
FT                   /tissue_type="whole body"
FT                   /db_xref="taxon:10090"
FT   CDS             172..2094
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="calpain 5 (MGD|MGI:1100859 GB|Y10656, evidence:
FT                   BLASTN, 99%, match=2329)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q3TPL4"
FT                   /db_xref="InterPro:IPR000008"
FT                   /db_xref="InterPro:IPR000169"
FT                   /db_xref="InterPro:IPR001300"
FT                   /db_xref="InterPro:IPR022682"
FT                   /db_xref="InterPro:IPR022683"
FT                   /db_xref="InterPro:IPR022684"
FT                   /db_xref="InterPro:IPR033883"
FT                   /db_xref="InterPro:IPR033884"
FT                   /db_xref="MGI:MGI:1100859"
FT                   /db_xref="UniProtKB/TrEMBL:Q3TPL4"
FT                   /protein_id="BAE37722.1"
FT                   /translation="MFSCAKAYEDQNYSALKRACLRKKVLFEDPLFPATDDSLYYKGTP
FT                   GPTVRWKRPKDICDDPRLFVDGISSHDLHQGQVGNCWFVAACSSLASRESLWQKVIPDW
FT                   KEQEWNPEKPDSYAGIFHFNFWRFGEWVDVIVDDRLPTVNNQLIYCHSNSKNEFWCALV
FT                   EKAYAKLAGCYQALDGGNTADALVDFTGGVSEPIDLTEGDLATDEAKRNQLFERVLKVH
FT                   SRGGLISASIKAVTAADMEARLACGLVKGHAYAVTDVRKVRLGHGLLAFFKSEKLDMIR
FT                   LRNPWGEREWTGPWSDTSEEWQKVSKSEREKMGVTVQDDGEFWMTFEDMCRYFTDIIKC
FT                   RLINTSYLSIHKTWEEARLHGAWTRHEDPQQNRSGGCINHKDTFFQNPQYVFEVKKPED
FT                   EVLISIQQRPKRSTRREGKGENLAIGFDIYKVEENRQYRMHSLQHKAASSIYINSRSVF
FT                   LRTELPEGRYVIIPTTFEPGHTGEFLLRVFTDVPSNCRELRLDEPPRTCWSSLCGYPQQ
FT                   VAQVHVLGAAGLKDSPTGANSYVIIKCEGEKVRSAVQRGTSTPEYNVKGIFYRKKLAQP
FT                   ITVQVWNHRVLKDEFLGQVHLKTAPDDLQDLHTLHLQDRSSRQPSDLPGIVAVRVLCSA
FT                   SLTAV"
XX
SQ   Sequence 2331 BP; 495 A; 696 C; 670 G; 470 T; 0 other;
     gaacccccgc ctgcgggctg ccggggtatc atctccccgc agagtcccag gctgtggcgc        60
     gggctggtct agcctccgct ccagtgcccg cactgtgctc tgcatcccgg gagtccagct       120
     ccagctgcgg cgacgcggca ggtgcctccc cttcttgggg acgtggtcac catgttctcc       180
     tgcgcgaagg cctatgagga ccagaactac tcggcgctga agcgggcctg cctgcgcaag       240
     aaggtgctgt tcgaggatcc cctcttccct gccaccgacg actcccttta ctataagggc       300
     accccagggc ccacagtcag gtggaagcgg cctaaggata tctgcgacga tccccggctc       360
     ttcgtagatg gcatcagctc ccatgacctg caccagggcc aggtgggcaa ctgctggttt       420
     gtggctgcct gctcatcact ggcctcccga gagtcactct ggcagaaggt catcccagac       480
     tggaaggagc aggaatggaa ccccgagaag cctgacagct atgctggcat cttccacttc       540
     aacttctggc gctttgggga gtgggtggac gtaatcgtcg atgaccggct gcccacagtc       600
     aacaaccagc tcatttactg ccattccaac tccaaaaatg agttctggtg tgccctggtg       660
     gagaaggcct atgccaagct ggccggctgt taccaggccc tggacggagg caacacggcc       720
     gatgcattgg tggatttcac aggtggtgtt tctgaaccca ttgacctgac cgagggggac       780
     ttggccactg acgaggctaa gaggaatcag ctctttgagc gagtgctgaa ggtgcacagc       840
     agaggcgggc tcatcagtgc ctccatcaag gctgtgacag cagctgacat ggaggcccgc       900
     ctggcatgtg gcctggtgaa gggccatgca tacgctgtca ccgatgtgcg caaggtgcgc       960
     ctgggccatg gcctgctggc cttcttcaag tcagagaagc ttgatatgat ccgtctgagg      1020
     aacccctggg gcgagcggga gtggacgggg ccctggagtg acacgtcaga ggaatggcag      1080
     aaagtgagca agagtgagag ggagaagatg ggcgtgaccg tgcaggatga tggggaattc      1140
     tggatgacct ttgaggacat gtgccggtac tttactgaca tcattaaatg ccgcctgatt      1200
     aacacgtcct acctgagcat ccataagaca tgggaggagg cccggctgca tggtgcctgg      1260
     acgagacatg aggacccaca gcagaaccgc agtggaggct gcatcaacca caaggacact      1320
     ttcttccaga acccacagta cgtatttgaa gtcaagaagc cagaagatga agtgttgatc      1380
     agtatccagc agcggccgaa gcgctcaact cgccgggagg gcaaaggcga gaatctggcc      1440
     attggcttcg acatctataa ggtggaagag aaccgccaat accgtatgca cagcctacag      1500
     cataaggccg ccagctccat ctacatcaat tcccgcagcg tttttttgag gacagagctg      1560
     cccgagggcc gctacgttat catccctacc acctttgagc caggccacac tggcgagttc      1620
     ctgctccgag tcttcacaga tgtcccctcc aactgccggg aactacgcct ggatgagccc      1680
     cctcggacct gttggagttc cctctgtggc taccctcagc aggtggccca ggtacatgtc      1740
     ctgggggctg ctggcctcaa ggactcccca acaggagcaa actcatatgt gatcatcaag      1800
     tgtgagggcg aaaaggttcg ctcagctgtg cagagaggga cctcgacacc agagtacaat      1860
     gtaaaaggca tcttctatcg caagaagctg gctcagccta tcaccgtgca ggtttggaat      1920
     caccgagtcc tgaaggatga attcctgggc caggtgcacc tgaagactgc cccggatgac      1980
     ctgcaggacc tccacaccct ccatctccag gaccgcagta gccggcagcc cagtgacctg      2040
     ccaggcattg tagctgtgcg agtcctctgc agtgcctctc tcacggctgt ctgaccccag      2100
     cctgcctgtc ctgccccact agtcctcacc actactcgca tgtccccacc ttgcctggga      2160
     ccagcctggg aaccagacac tggggccctt tcctcactct tccactgacc cactgtgtga      2220
     cctgaagaga gccctgccct ctctgagcct cagtgtttgg agggccccaa agaattcccg      2280
     tcttgtgggg gagttttctt gcctaagatt taatgcagtt ctctctaccc t               2331
//