Dbfetch

ID   AK160117; SV 1; linear; mRNA; HTC; MUS; 1826 BP.
XX
AC   AK160117;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus osteoclast-like cell cDNA, RIKEN full-length enriched library,
DE   clone:I420049N22 product:aldehyde dehydrogenase family 7, member A1, full
DE   insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1826
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; f6e97c43f3782e20a8368ba73708e73b.
DR   Ensembl-Gn; ENSMUSG00000053644; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000066208; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000174518; mus_musculus.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Tissues were provided by Takashi Ishikawa  ( Department of Surgery
CC   2 Yokohama City University 3-9 Fukuura,Kanazawa-ku,Yokohama
CC   236-0004 Japan ) whose assistance we gratefully acknowledge.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=I420049N22
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1826
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I420049N22"
FT                   /cell_type="osteoclast-like cell"
FT                   /db_xref="taxon:10090"
FT   CDS             86..1621
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="aldehyde dehydrogenase family 7, member A1
FT                   (MGD|MGI:108186 GB|BC012407, evidence: BLASTN, 99%,
FT                   match=1824)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q9DBF1"
FT                   /db_xref="InterPro:IPR015590"
FT                   /db_xref="InterPro:IPR016161"
FT                   /db_xref="InterPro:IPR016162"
FT                   /db_xref="InterPro:IPR016163"
FT                   /db_xref="InterPro:IPR029510"
FT                   /db_xref="MGI:MGI:108186"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9DBF1"
FT                   /protein_id="BAE35641.1"
FT                   /translation="MSTLLIHHPQYAWLQDLGLREDNEGVYNGSWGGRGEVITTYCPAN
FT                   NEPIARVRQASLKDYEETIGKAKKAWNIWADIPAPKRGEIVRKIGDAFREKIQLLGRLV
FT                   SLEMGKILVEGIGEVQEYVDVCDYAAGLSRMIGGPTLPSERPGHALIEMWNPLGLVGII
FT                   TAFNFPVAVFGWNNAIALITGNVCLWKGAPTTSLVSVAVTKIIAQVLEDNLLPGAICSL
FT                   VCGGADIGTTMARDERVNLLSFTGSTQVGKEVALMVQERFGKSLLELGGNNAIIAFEDA
FT                   DLSLVVPSVLFAAVGTAGQRCTTVRRLFLHESIHNEVVDRLRSAYSQIRVGNPWDPNIL
FT                   YGPLHTKQAVSMFVRAVEEAKKQGGTVVYGGKVMDHPGNYVEPTIVTGLAHDAPIVHQE
FT                   TFAPILYVFKFQDEEEVFEWNNEVKQGLSSSIFTKDLGRVFRWLGPKGSDCGIVNVNIP
FT                   TSGAGIGGAFGGEKHTGGGRESGSDAWKQYMRRSTCTINYSTSLPLAQGIKFQ"
XX
SQ   Sequence 1826 BP; 442 A; 441 C; 528 G; 415 T; 0 other;
     ggtgtggcgt gtgccccgca ggctctgtgt gcagtctgtg aagaccagca agctttccgg        60
     accttggagc aggcccgccg cccacatgtc tactctgctg atccatcatc cccagtatgc       120
     ctggctgcaa gacctggggc tccgcgagga taacgagggc gtgtataatg gaagctgggg       180
     cggccgggga gaggtaatta cgacctattg tcctgctaac aatgagccaa tagcaagagt       240
     ccgacaggcc agcctgaagg actatgaaga aaccatcggg aaagccaaga aagcctggaa       300
     catctgggca gatattcctg ccccaaagcg aggagaaata gtcagaaaga ttggcgatgc       360
     cttccgggag aagattcaac tactgggaag actggtgtct ttggagatgg ggaaaatcct       420
     cgtggaagga ataggcgagg ttcaggagta cgtggacgtc tgtgactatg ctgctggctt       480
     gtcgaggatg atcgggggac ccaccttgcc ttctgaaaga cccggccatg ctctcatcga       540
     aatgtggaat cccttaggct tggtgggaat catcactgcc ttcaatttcc ccgtggctgt       600
     gtttggctgg aacaatgcca tagccctgat cacagggaat gtctgccttt ggaaaggagc       660
     accgactacg tccctcgtta gtgtggctgt cacaaagatc atagcccagg ttttggagga       720
     caacctgctg cccggtgcca tttgttccct ggtttgtggt ggagcagata tcggcacaac       780
     gatggccaga gatgagcgtg tgaacctgct gtccttcact gggagcactc aggtggggaa       840
     ggaggtggcc ctcatggtgc aggagaggtt tgggaaaagc ttgttggagc ttggaggaaa       900
     caacgccatt attgctttcg aggacgcgga cctcagcttg gttgttccat cagttctgtt       960
     tgccgccgtg ggaacagctg ggcaaaggtg taccactgtg aggcgactgt ttttgcacga      1020
     aagcatccat aatgaagttg tggacagact gagaagtgcc tactcacaga tccgtgtcgg      1080
     gaacccctgg gaccccaata tcctctatgg accgctccat accaaacagg cagtgagcat      1140
     gtttgtgaga gccgtggaag aagccaagaa acaagggggc acagtggtct atgggggcaa      1200
     ggtcatggac caccctggca attacgtgga acccaccatt gtgaccggtc ttgcccatga      1260
     tgcgcccatt gttcaccagg agacttttgc cccaatcctc tatgtcttca aattccagga      1320
     tgaagaagag gtctttgaat ggaacaatga agtaaaacag ggactttcaa gtagtatctt      1380
     taccaaagat ttgggcagag tcttccgctg gcttggacct aaaggttccg actgtggcat      1440
     cgtgaacgtc aatattccta ccagcggggc tgggattggt ggtgcgtttg gcggagagaa      1500
     gcacactggc ggtggccggg agtctggcag cgacgcctgg aagcagtaca tgaggagatc      1560
     cacatgtacc atcaactaca gcacgtccct ccctctggct cagggaatca agtttcagtg      1620
     atactgcgtg ggcttggcgt cccttagcca ttcctgtggc tgctcggaag aagcctgaaa      1680
     ggaccctgac ttgccccgga taaatgatgg ctttgagtac ggacagcagt gtctaatctc      1740
     cagtgattcc cagacccctg tctaaatcaa gatactcatt tatcaaaagt tcaaaattaa      1800
     atattcacaa cctagctgca tttagc                                           1826
//