ID   AK029102; SV 1; linear; mRNA; HTC; MUS; 3202 BP.
XX
AC   AK029102;
XX
DT   18-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 14)
XX
DE   Mus musculus 10 days neonate skin cDNA, RIKEN full-length enriched library,
DE   clone:4732491M05 product:SIALIN homolog [Homo sapiens], full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3202
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-JUL-2001) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; e738620c4eb333253e1dde4c610deb04.
DR   Ensembl-Gn; ENSMUSG00000049624; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0035072; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0035053; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0034981; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0035041; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0034753; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0035564; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0034071; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0034726; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0034884; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0034827; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0034965; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0034869; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0035585; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0033774; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0034188; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000052441; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000117645; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0093350; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0093431; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0093374; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0093346; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0092926; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0093870; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0093527; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0092854; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0093059; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0092911; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0093070; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0092905; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0094182; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0092940; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0091939; mus_musculus_wsbeij.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group in the
CC   RIKEN Genomic Sciences Center and the Genome Science Laboratory in RIKEN.
CC   The Division of Experimental Animal Research in RIKEN contributed by
CC   preparing mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   Clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=4732491M05
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3202
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="10 days neonate"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="4732491M05"
FT                   /tissue_type="skin"
FT                   /db_xref="taxon:10090"
FT   CDS             62..1549
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="SIALIN homolog [Homo sapiens] (SPTR|Q9UGH0,
FT                   evidence: FASTY, 86.3%ID, 100%length, match=1485)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q8BN82"
FT                   /db_xref="InterPro:IPR011701"
FT                   /db_xref="InterPro:IPR020846"
FT                   /db_xref="InterPro:IPR036259"
FT                   /db_xref="MGI:MGI:1924105"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q8BN82"
FT                   /protein_id="BAC26298.1"
FT                   /translation="MRPLLRGPAGNDDEESSDSTPLLPGARQTEAAPVCCSARYNLAIL
FT                   AFCGFFVLYALRVNLSVALVDMVDSNTTLTDNRTSKECAEHSAPIKVHHNHTGKKYKWD
FT                   AETQGWILGSFFYGYIVTQIPGGYIASRVGGKLLLGLGILGTSVFTLFTPLAADLGVVT
FT                   LVVLRALEGLGEGVTFPAMHAMWSSWAPPLERSKLLTISYAGAQLGTVISLPLSGIICY
FT                   YMNWTYVFYLFGIVGIVWFILWMWIVSDTPETHKTISHYEKEYIVSSLKNQLSSQKVVP
FT                   WGSILKSLPLWAIVVAHFSYNWSFYTLLTLLPTYMKEILRFNVQENGFLSALPYFGCWL
FT                   CMILCGQAADYLRVKWNFSTISVRRIFSLVGMVGPAVFLVAAGFIGCDYSLAVAFLTIS
FT                   TTLGGFASSGFSINHLDIAPSYAGILLGITNTFATIPGMTGPIIAKSLTPDNTIREWQT
FT                   VFCIAAAINVFGAIFFTLFAKGEVQSWALSDHHGHRN"
XX
SQ   Sequence 3202 BP; 832 A; 722 C; 747 G; 901 T; 0 other;
     ggtcctgccg agagctaggt tggccaagca acgacgccgg cacatctgct ctccaggcgt        60
     tatgaggccc ctgcttcggg gtccggcggg aaacgacgat gaggagagct cggacagcac       120
     cccgctcctg ccgggcgccc ggcagaccga agcggctcca gtgtgctgct ctgctcggta       180
     caacttagcg attttggcgt tctgtggttt cttcgttctc tatgccttac gggtgaacct       240
     gagtgttgcg ttagtggaca tggtagattc aaatacaact ctgactgata atagaacgtc       300
     taaggagtgt gcggaacatt ctgcccccat aaaagttcac cacaatcaca caggtaaaaa       360
     gtacaagtgg gatgcagaaa ctcaagggtg gattctcggc tctttttttt acggctacat       420
     cgtcacccag attcccggtg ggtacattgc cagcagggtc ggagggaagc tgctgctggg       480
     cctgggcatc ttaggcacct ccgtcttcac cctgttcaca ccgctggccg cagacttagg       540
     cgtggtgact ctcgttgtgc ttagagcgct ggaaggactg ggagagggtg ttacgtttcc       600
     agctatgcac gccatgtggt cttcctgggc tccccctctg gaaagaagca agcttcttac       660
     catttcctat gcgggagcac agcttgggac agtgatctca cttcctcttt ccggaataat       720
     atgctactat atgaactgga cttacgtctt ctatcttttt ggtatagttg gaattgtctg       780
     gtttatttta tggatgtgga tagtcagtga tacaccagaa actcacaaga caatctccca       840
     ttatgaaaaa gaatacattg tttcatcatt aaaaaatcag ctttcttcgc agaaggtggt       900
     gccgtggggg tccatcttga agtcactgcc actttgggca attgtggtag cacatttctc       960
     ctacaactgg tctttttaca ccttattgac gctactgcca acttatatga aggaaatcct      1020
     aaggttcaat gttcaagaga acgggttttt atctgcattg ccttattttg gctgttggtt      1080
     atgcatgatc ctctgtggtc aagccgctga ctatttaagg gtcaagtgga acttttcaac      1140
     tataagcgtt cggagaattt ttagcctcgt aggaatggtt ggccctgcgg ttttcctagt      1200
     cgcggctgga tttataggct gtgactattc cttggccgtt gcgttcctaa ccatatccac      1260
     gacgctggga ggcttcgcct cttctggatt tagcatcaac catctggata tcgctccttc      1320
     gtatgctggc atcctcttgg gcatcacaaa cacgtttgcc actattcccg ggatgacggg      1380
     acccatcatt gctaaaagtc tgacccctga taacactatt agggaatggc agactgtctt      1440
     ctgtatcgct gctgctatta acgtgtttgg ggccattttc ttcacgctgt ttgccaaagg      1500
     ggaagtgcag agctgggctc tcagtgacca ccacggacac agaaactgaa gacaacaaac      1560
     aataattaat gtgtttttat ttatcatgga aactgaaagt gcctttggta tttgaatgtg      1620
     caagcattct atgtgagaga aaaaaagtac ttatatttgt atggtaatca tgaaatgtta      1680
     ctagttccaa gatcatataa aatgagctgt tgggctggac agatggctca gtggttacga      1740
     gcactgattg ctcttggaga ggtcctgagt tcaattccca gcaaccacat ggtggctcat      1800
     aaccatctgt aatgggatca gatgccctct tctggtgtgt ctgaagacag agacagtgta      1860
     ctcacacaca ttaaataaat aaataaatct ttaaaaaaaa agaagaagaa gaggctggtc      1920
     ctcttttaaa aaaataaata aaaaataaaa tgagctgttt ttaattatta ctagtatatc      1980
     tgccaggccg tatactgggt tcacatatca ggctgcagga aaggcagtat gatataaggt      2040
     aagggcatag gaaatgaagc tgaaatggac acacacacac acacacacac tgataaactt      2100
     ggttaattaa atcagataaa tcatttcaga tcttattaaa atacctgctt ttgtctactt      2160
     cccttataaa aatgcttgcc aactctccct gacacctaga cctcaataac gtctccaaag      2220
     aactgccagc cacagtaaga gtcggcagcc tggcagcggt actgggggat ccgtgcccag      2280
     gcagctgtca agcattccct ccctggctgc agggcaggaa tgcccagcac ttaccaggga      2340
     tggtggcttc caagtgcagg gccaatgaca agtgtttggc tgacactctt tcccccatgg      2400
     ctgttaatgt gcagataaag ctctaagtgg actggagcag tgggagccat agctactgtc      2460
     ttgacactgc cagcctgcct cggcagcacc cgggcccgac tcctggcagg tgatctacaa      2520
     agcagaagcg tggtacccag tcagtcacag agaatagcat ccttgattca catcactggc      2580
     ccatgaattg gagaacacat tttcacattc aggtttttct agctacatta gaactttttt      2640
     aaatgtctta catttcttcc cttcagtctg tcttgattta tgcttctggt tacagtttaa      2700
     agggactttc agggccaggg gtttggctca gtggtagagc atttacatag gattgtgctc      2760
     cattcaacac caaaatagag gtaaaaggca gggcacccaa agcttgtcag tcccctttac      2820
     cctgcttcct gaccagtgtc tcccagtggt taggcacact gtaagaggtc cctagtctgt      2880
     gtggtggtac ctttaatgcc agcacttggg aggcagaagc aggcagatct ttgcgacttc      2940
     gaggccagtc tggtctgcaa aggggcttcc aggacagccg gggctataac tcaaaaacaa      3000
     aaacaaaaag attatccaag gattgaaagc ctgccttatg ttttgttgta tctgttcaac      3060
     acttatctat aacattttaa ctaatgattg gttaatgtac tgttgtgtaa atacagttat      3120
     ataaatttgt ttttaatttg taaatataaa ttaagttgcc atgtgacatt tcttttataa      3180
     atcatcaaaa tccttttgca tt                                               3202
//