ID   AK147543; SV 1; linear; mRNA; HTC; MUS; 3154 BP.
XX
AC   AK147543;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus cDNA, RIKEN full-length enriched library, clone:M5C1069M13
DE   product:amyloid beta (A4) precursor protein, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3154
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 9e14dcf3f0b05766fc4c2f2487410551.
DR   Ensembl-Gn; ENSMUSG00000022892; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0022971; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0022939; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0022909; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0022941; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0022703; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0023388; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0022225; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0022672; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0022806; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0022781; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0022875; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0022800; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0023406; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0021967; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0022273; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000005406; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0044934; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0044916; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0044886; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0044892; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0044631; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0045372; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0044753; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0044543; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0044696; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0044646; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0044770; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0044625; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0045429; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0044330; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0044000; mus_musculus_wsbeij.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group at the
CC   RIKEN Genomic Sciences Center and the Genome Science Laboratory at RIKEN.
CC   The Division of Experimental Animal Research at RIKEN contributed by
CC   preparing mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   Clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=M5C1069M13
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3154
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="M5C1069M13"
FT                   /note="pooled tissues: (dev_stage=14 days pregnant
FT                   adult,tissue_type=placenta,sex=female),(dev_stage=adult,
FT                   tissue_type=brain,sex=male,cell_type=undefined_cell_type,
FT                   cell_line=UNDEFINED_CELL_LINE)"
FT                   /db_xref="taxon:10090"
FT   CDS             186..2273
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="amyloid beta (A4) precursor protein (MGD|MGI:88059
FT                   GB|X59379, evidence: BLASTN, 99%, match=3037)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q6GR78"
FT                   /db_xref="InterPro:IPR008154"
FT                   /db_xref="InterPro:IPR008155"
FT                   /db_xref="InterPro:IPR011178"
FT                   /db_xref="InterPro:IPR011993"
FT                   /db_xref="InterPro:IPR013803"
FT                   /db_xref="InterPro:IPR015849"
FT                   /db_xref="InterPro:IPR019543"
FT                   /db_xref="InterPro:IPR019744"
FT                   /db_xref="InterPro:IPR019745"
FT                   /db_xref="InterPro:IPR024329"
FT                   /db_xref="InterPro:IPR028866"
FT                   /db_xref="InterPro:IPR036176"
FT                   /db_xref="InterPro:IPR036454"
FT                   /db_xref="InterPro:IPR036669"
FT                   /db_xref="InterPro:IPR037071"
FT                   /db_xref="MGI:MGI:88059"
FT                   /db_xref="UniProtKB/TrEMBL:Q6GR78"
FT                   /protein_id="BAE27986.1"
FT                   /translation="MLPSLALLLLAAWTVRALEVPTDGNAGLLAEPQIAMFCGKLNMHM
FT                   NVQNGKWESDPSGTKTCIGTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGRKQ
FT                   CKTHTHIVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSEKST
FT                   NLHDYGMLLPCGIDKFRGVEFVCCPLAEESDSVDSADAEEDDSDVWWGGADTDYADGGE
FT                   DKVVEVAEEEEVADVEEEEADDDEDVEDGDEVEEEAEEPYEEATERTTSTATTTTTTTE
FT                   SVEEVVRVPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQVMREWEE
FT                   AERQAKNLPKADKKAVIQHFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRLALE
FT                   NYITALQAVPPRPHHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQVMTH
FT                   LRVIYERMNQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISEPRISYGNDA
FT                   LMPSLTETKTTVELLPVNGEFSLDDLQPWHPFGVDSVPANTENEVEPVDARPAADRGLT
FT                   TRPGSGLTNIKTEEISEVKMDAEFGHDSGFEVRHQKLVFFAEDVGSNKGAIIGLMVGGV
FT                   VIATVIVITLVMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGYENPTYKFFEQMQ
FT                   N"
FT   regulatory      3138..3143
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      3154
FT                   /note="putative"
XX
SQ   Sequence 3154 BP; 837 A; 787 C; 832 G; 698 T; 0 other;
     ggtcggctgc gagccccgcc gcctcgctcc agctctgtca gtttcctcgg cggcgggagg        60
     cgagagcacc gggagcagag cgagcgcgga gccaccggag acggcggcgg cggcgcggac       120
     acagccaggg cgcggcggat cttccactcg cacacggagc actcggtggc ccacgcagga       180
     tcacgatgct gcccagcttg gcactgctcc tgctggccgc ctggacggtt cgggctctgg       240
     aggtacccac tgatggcaac gccgggctgc tggcagaacc ccagatcgcc atgttctgtg       300
     gtaaactcaa catgcacatg aatgtgcaga atggaaagtg ggagtcagac ccgtcaggga       360
     ccaaaacctg cattggcacc aaggagggca tcttgcagta ctgccaagag gtctaccctg       420
     aactgcagat cacaaacgtg gtggaagcca accagccagt gaccatccag aactggtgca       480
     agcggggccg caagcagtgc aagacacaca cccacatcgt gattccttac cgttgcctag       540
     ttggtgagtt tgtgagcgac gcccttctcg tgcccgacaa gtgcaagttc ctacaccagg       600
     agcggatgga tgtttgtgag acccatcttc actggcacac cgtcgccaaa gagacatgca       660
     gcgagaagag cactaacttg cacgactatg gcatgctgct gccctgcggc atcgacaagt       720
     tccgaggggt agagtttgta tgctgcccgt tggccgagga aagcgacagc gtggattctg       780
     cggatgcaga ggaggatgac tctgatgtct ggtggggtgg agcggacaca gactacgctg       840
     atggcggtga agacaaagta gtagaagtcg ccgaagagga ggaagtggct gatgttgagg       900
     aagaggaagc tgatgatgat gaggatgtgg aggatgggga cgaggtggag gaggaggccg       960
     aggagcccta cgaagaggcc accgagagaa caaccagcac tgccaccacc accacaacca      1020
     ccactgagtc cgtggaggag gtggtccgag ttcccacgac agcagccagc acccccgacg      1080
     ccgtcgacaa gtacctggag acacccgggg acgagaacga gcatgcccat ttccagaaag      1140
     ccaaagagag gctggaagcc aagcaccgag agagaatgtc ccaggtcatg agagaatggg      1200
     aagaggcaga gcgtcaagcc aagaacttgc ccaaagctga caagaaggcc gttatccagc      1260
     atttccagga gaaagtggaa tctctggaac aggaagcagc caatgagaga cagcagcttg      1320
     tagagacaca catggccaga gttgaagcca tgctcaatga ccgccgccgc ctggccctcg      1380
     agaattacat cactgcactg caggcggtgc ccccaaggcc tcatcatgtg ttcaacatgc      1440
     tgaagaagta cgtccgtgcg gagcagaaag acagacagca caccctaaag cattttgaac      1500
     atgtgcgcat ggtggacccc aagaaagctg ctcagatccg gtcccaggtt atgacacacc      1560
     tccgtgtgat ctacgagcgc atgaaccagt ctctgtccct gctctacaat gtccctgcgg      1620
     tggctgagga gattcaagat gaagtcgatg agctgcttca gaaggagcag aactactccg      1680
     acgatgtctt ggccaacatg atcagtgagc ccagaatcag ctacggaaac gacgctctca      1740
     tgccttcgct gacggaaacc aagaccaccg tggagctcct tcccgtgaat ggggaattca      1800
     gcctggatga cctccagccg tggcaccctt ttggggtgga ctctgtgcca gccaataccg      1860
     aaaatgaagt cgagcctgtt gacgcccgcc ccgctgctga ccgaggactg accactcgac      1920
     caggttctgg gctgacaaac atcaagacgg aagagatctc ggaagtgaag atggatgcag      1980
     aattcggaca tgattcagga tttgaagtcc gccatcaaaa actggtgttc tttgctgaag      2040
     atgtgggttc gaacaaaggc gccatcatcg gactcatggt gggcggcgtt gtcatagcaa      2100
     ccgtgattgt catcaccctg gtgatgttga agaagaaaca gtacacatcc atccatcatg      2160
     gcgtggtgga ggtcgacgcc gccgtgaccc cagaggagcg ccatctctcc aagatgcagc      2220
     agaacggata tgagaatcca acttacaagt tctttgagca aatgcagaac taagccccac      2280
     ccgcgccaca gcagcggcct ctgaacttgg acagcgaaac cattgcttca ctacccatcg      2340
     gtgttcattt ataaaataac gtggaaagaa acaaaccctt ccgtttattt actcaccctc      2400
     ggcttttgac agctgtgctg taacacaagt agatgcctga acttgaatta atatacaaat      2460
     cagtaatgta ttctcgcttt ctctctttac attctggtct ctacattaca tgattcatgg      2520
     gttttgtgta ctgtaaaaaa aaaaattagc tgtatcaaac tagtgcatga atagattctc      2580
     tcctaattat ttatcacata catagcccct tagcccgttg tatattattc ttgtggtttg      2640
     tggcccggaa aaaactccta cttgaaatat gctttaaaaa tcgatggggg atgcttcttg      2700
     tgaacgtggg cgtctagctg cttctcctac gtattctttt cctgatcact atgcattttg      2760
     aacatttttt taagtattcc aaatgactta gaaaattctt tttccatgac tgcatcttac      2820
     tgtacagatt gctgcttctg ctctctttgt gatataggaa taagaggata cacattgatt      2880
     tctttgtgcc tgttttatgt gcacacatta ggcattgaga atttgaacat tttttttgtc      2940
     catgtatctt tggatctttg ataaaaaaaa attaaaaaaa aattatccct gttcatcata      3000
     agcactttta cgggtggggg gagggagtgt tctgctggtc tccaattacc aagaattctc      3060
     caaaaattaa ttttctgcag gatgattgta cagaatcatt gcttatgcca tgatagcttt      3120
     ctacactgta ttacataaat aaattaaata aaat                                  3154
//