Dbfetch

ID   AK083534; SV 1; linear; mRNA; HTC; MUS; 3673 BP.
XX
AC   AK083534;
XX
DT   19-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 16)
XX
DE   Mus musculus 9 days embryo whole body cDNA, RIKEN full-length enriched
DE   library, clone:D030041M02 product:a disintegrin-like and metalloprotease
DE   (reprolysin type) with thrombospondin type 1 motif, 4, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3673
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-APR-2002) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; cafb17acb66bd7d53fe78aadbc2cdb5b.
DR   Ensembl-Gn; ENSMUSG00000006403; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000111315; mus_musculus.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=D030041M02
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3673
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="9 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="D030041M02"
FT                   /tissue_type="whole body"
FT                   /db_xref="taxon:10090"
FT   CDS             392..2929
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="a disintegrin-like and metalloprotease (reprolysin
FT                   type) with thrombospondin type 1 motif, 4 (MGD|MGI:1339949
FT                   GB|AA041973, evidence: BLASTN, 99%, match=405)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q8BNJ2"
FT                   /db_xref="InterPro:IPR000884"
FT                   /db_xref="InterPro:IPR001590"
FT                   /db_xref="InterPro:IPR002870"
FT                   /db_xref="InterPro:IPR006586"
FT                   /db_xref="InterPro:IPR010294"
FT                   /db_xref="InterPro:IPR013273"
FT                   /db_xref="InterPro:IPR024079"
FT                   /db_xref="MGI:MGI:1339949"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q8BNJ2"
FT                   /protein_id="BAC38944.1"
FT                   /translation="MASIHPSCSPGTMSQMGLHPRRGLTGHWLQRFQPCLPLHTVQWRR
FT                   LLLLAFLLSLAWPASPLPREEEIVFPEKLNGSSILPGSGVPARLLYRLPAFGEMLLLEL
FT                   EQDPGVQVEGLTVQYLGQAPEMLGGAEPGTYLTGTINGDPESVASLHWDGGALLGVLQY
FT                   RGAELHLQPLEGGALNSAGGPGAHILRRKSPASSQGPMCTVKAPSGSPSPISRRTKRFA
FT                   SLSRFVETLVVADDKMAAFHGTGLKRYLLTVMAAAAKAFKHPSIRNPVNLVVTRLVILG
FT                   SGQEGPQVGPSAAQTLRSFCTWQRGLNTPNDSDPDHFDTAILFTRQDLCGVSTCDTLGM
FT                   ADVGTVCDPARSCAIVEDDGLQSAFTAAHELGHVFNMLHDNSKPCTNLNGQGGSSRHVM
FT                   APVMAHVDPEEPWSPCSARFITDFLDNGYGHCLLDKPEAPLHLPATFPGKDYDADRQCQ
FT                   LTFGPDSSHCPQLPPPCAALWCSGHLNGHAMCQTKHSPWADGTPCGSSQACMGGRCLHV
FT                   DQLKDFNVPQAGGWGPWGPWGDCSRTCGGGVQFSSRDCTRPVPRNGGKYCEGRRTRFRS
FT                   CNTENCPHGSALTFREEQCAAYNHRTDLFKSFPGPMDWVPRYTGVAPRDQCKLTCQARA
FT                   LGYYYVLEPRVADGTPCSPDTSSVCVQGRCIHAGCDRIIGSKKKFDKCMVCGGDGSRCS
FT                   KQSGSFKKFRYGYSDVVTIPAGATHILVRQQGGSGLKSIYLALKLSDGSYALNGEYTLM
FT                   PSPTDVVLPGAVSLRYSGATAASETLSGHGPLAQPLTLQVLVAGNPQNARLRYSFFVPR
FT                   PVPSTPRPPPQDWLQRRAEILKILRKRPWAGRK"
FT   regulatory      3648..3653
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      3673
FT                   /note="putative"
XX
SQ   Sequence 3673 BP; 780 A; 1023 C; 1058 G; 812 T; 0 other;
     ggggagaacc cggggaagac ccacagatac acagaaacga gagagacaga agaggagaga        60
     gacagagaca aagagacagc agcaggagga ggaggaggag gaggcaaaga cagggcgagc       120
     actgaggcaa agccagacag ctgagcagag ggagaagcca aggagtcaca gacaaagacc       180
     tgggacagaa aggcaaagga ggaagttttg gagcagaaag aacccgtggg cacttttcct       240
     gagcccaggg gctgttttct ttttcatctt ctctgaaagt ccctagccca tttaaaaact       300
     ttgcctcaga ttttggcaca agccccaaag cccttctggc tttaacgagg agccttccac       360
     agttagggtg tggagcattt tggtgccgca gatggcctca atccatccca gctgcagccc       420
     gggtaccatg tcccagatgg gcttgcatcc caggaggggc ttgactgggc actggctgca       480
     aagattccaa ccctgcttgc cgcttcacac tgtgcagtgg cggaggctgc tgctgctggc       540
     cttcctcctg tccttagcgt ggcccgccag ccccctcccc cgggaggagg agatcgtgtt       600
     tccagagaag ctcaatggca gtagcatcct acctggatca ggcgttcctg ccaggctgct       660
     gtaccgattg ccagcctttg gggagatgtt gctactagaa ctagaacagg accctggggt       720
     gcaggtagag ggtttgactg tacagtacct gggccaggca cctgagatgc tgggtggggc       780
     agagccaggt acctacctga ctggcaccat caatggagat ccggagtcgg tggcatctct       840
     gcactgggac gggggagccc tattaggggt actgcagtac cgtggggccg aactccacct       900
     ccagcctctg gaaggaggcg cccttaactc tgctggggga ccgggggctc acatcctacg       960
     ccggaagagt cctgccagca gccaaggtcc catgtgcacc gtcaaggctc cttctgggag      1020
     cccgagtccc atttcccgca gaaccaagcg cttcgcttct ctgagtagat tcgtggagac      1080
     actggtggta gcagatgaca agatggcagc attccatggt acagggttaa agcgctacct      1140
     gctgacggtt atggcagctg ccgctaaagc ctttaaacac ccaagcatcc gaaaccctgt      1200
     caacttggtg gtgacgcgcc tggtgatcct ggggtccggc caggaagggc cccaagtggg      1260
     gccaagtgcc gcccagaccc tacgcagctt ctgcacctgg cagcggggcc tcaacacccc      1320
     taacgactca gatcctgacc actttgacac agccattctg ttcacccggc aggacctgtg      1380
     tggggtctcc acttgtgaca ccctgggtat ggctgatgtg ggcacagtgt gtgatccagc      1440
     taggagctgt gctattgtgg aagatgatgg gctccagtca gccttcactg ctgctcatga      1500
     actgggccat gtcttcaaca tgctccatga taactccaag ccatgcacta acttgaatgg      1560
     gcaggggggt tcctctcgcc atgtcatggc tcctgtcatg gcccatgtgg accctgaaga      1620
     gccctggtcg ccctgcagtg cccgattcat cactgacttc ctggacaatg gttatgggca      1680
     ctgcctctta gacaaaccgg aggctcccct ccatctacca gcgacttttc ctggcaagga      1740
     ctatgacgct gaccgccaat gccaactgac cttcggtcct gactcaagcc attgtccaca      1800
     gctgccaccg ccctgtgctg ccctctggtg ctctggccac ctcaatggcc atgccatgtg      1860
     ccagacgaag cactcacctt gggctgatgg cactccctgc gggtcttcac aggcctgcat      1920
     gggtggccgc tgtctgcacg tggaccagct caaggacttc aatgttcctc aggctggagg      1980
     ctggggcccc tggggaccat ggggtgactg ctccaggact tgtgggggtg gtgtccagtt      2040
     ctcctcccgg gattgcacga ggcccgtccc ccggaacggt ggcaagtatt gtgagggccg      2100
     ccggactcgc ttccgctcct gcaacacgga gaactgccca cacggctcag cattgacctt      2160
     ccgtgaagag cagtgtgctg cctacaacca ccgaaccgac ctcttcaaga gctttccagg      2220
     gcccatggac tgggttccgc gctacacagg tgtggcccct cgagaccaat gcaaactcac      2280
     ctgccaggcc cgggcactgg gctactacta cgtattggag ccccgggtgg cagatgggac      2340
     tccctgctcc ccagacacct cctctgtctg tgtccagggc cgctgtatcc atgctggctg      2400
     tgaccggatc attggctcca aaaagaaatt tgacaagtgc atggtgtgcg gcggggatgg      2460
     ctctcgctgc agcaagcagt cgggctcctt caaaaaattc aggtatggat acagcgatgt      2520
     ggtcacgatc cctgcggggg ccacccatat ccttgtacgg cagcaggggg ggtctggtct      2580
     caagagcatc tacctggccc tgaagctttc tgacggttct tacgccctca atggtgaata      2640
     cacgctgatg ccctccccaa cagatgtggt tcttcctggg gcagtcagct tgcgctacag      2700
     cggagccaca gcagcctcag agacactgtc tggacatggg ccgctggccc agcccttgac      2760
     gctgcaagtc ctggtggctg gcaacccaca gaatgcacgt ctgcggtaca gtttctttgt      2820
     cccgcggcca gtcccttcaa caccacgccc tcctccccaa gactggctgc aacgcagggc      2880
     agagatactg aagatccttc ggaagcgtcc ctgggcaggc cggaaataac ctcactgtcc      2940
     cggctgccct ttttgggcgc cggggcctcg gactcatctg ggagaatgag caggcttctg      3000
     caactgcctc ctgctaaaac acagtaggga ggtgtagagg gtgagatctg cctgcctcac      3060
     tgccccaaac cgcaggctgg ccctgccctg gcttcctgcc ctgggaggca gtgatgtctt      3120
     ggtgaatgga aaggggctag gtgacagtac cctatctact aaactgcccc ctctaccctg      3180
     caggtcacag gaggaatggg gggaagacag ggtgggtcct gggccctagt tgtatttatt      3240
     tggtatttat tcatttttat ttagcaccag gaaaggggac tagggtcttg gggaaactca      3300
     cctattatag ccctaaccta gctatgaaat ccagggtgtt ggtgacaaat atgagtggtg      3360
     tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tttatgtatg aggtacaacc tgccctgctt      3420
     tcctctccct aatttttttt tttttctggg aaaaggaaag tcaaaggtag gactgccttc      3480
     agggagtaag ggatgattgt gtttttaaat tgaagtttgc tatttatatg ctctttttgg      3540
     agtcagacaa atgtgggtta tattctggcc ccgcatcttt gagcattagt tttctcatgt      3600
     gccaataata atcccttaga aattggttgt aaggattaaa tgatgttaat aaagaactag      3660
     catagagcct ctc                                                         3673
//