Dbfetch

ID   AK076345; SV 1; linear; mRNA; HTC; MUS; 3012 BP.
XX
AC   AK076345;
XX
DT   18-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 14)
XX
DE   Mus musculus 10 days neonate skin cDNA, RIKEN full-length enriched library,
DE   clone:4732447O10 product:desmoglein 2, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3012
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-APR-2002) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 09b6f8737ffc50008cbbbd465b81aafe.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=4732447O10
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3012
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="10 days neonate"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="4732447O10"
FT                   /tissue_type="skin"
FT                   /db_xref="taxon:10090"
FT   CDS             <1..1379
FT                   /codon_start=3
FT                   /transl_table=1
FT                   /note="desmoglein 2 (SWISSPROT|Q14126, evidence: FASTY,
FT                   70.1%ID, 41.1%length, match=1374)"
FT                   /note="putative"
FT                   /note="start codon is not identified"
FT                   /db_xref="GOA:Q8BK55"
FT                   /db_xref="InterPro:IPR009122"
FT                   /db_xref="InterPro:IPR009123"
FT                   /db_xref="InterPro:IPR027397"
FT                   /db_xref="UniProtKB/TrEMBL:Q8BK55"
FT                   /protein_id="BAC36305.1"
FT                   /translation="NEGAPPEDKVVPSLLVADHAESSAVRGGVGGAMLKEGMKGSSSAS
FT                   VTKGQHELSEVDGRWEEHRSLLTAGATHHVRTAGTIAANEAVRTRATGSSRDMSGARGA
FT                   VAVNEEFLRSYFTEKAASYNGEDDLHMAKDCLLVYSQEDTASLRGSVGCCSFIEGELDD
FT                   LFLDDLGLKFKTLAEVCLGRKIDLDVDIEQRQKPVREASVSAASGSHYEQAVTSSESAY
FT                   SSNTGFPAPKPLHEVHTEKVTQEIVTESSVSSRQSQKVVPPPDPVASGNIIVTETSYAK
FT                   GSAVPPSTVLLAPRQPQSLIVTERVYAPTSTLVDQHYANEEKVLVTERVIQPNGGIPKP
FT                   LEVTQHLKDAQYVMVRERESILAPSSGVQPTLAMPSVAAGGQNVTVTERILTPASTLQS
FT                   SYQIPSETSITARNTVLSSVGSIGPLPNLDLEESDRPNSTITTSSTRVTKHSTMQHSYS
FT                   "
XX
SQ   Sequence 3012 BP; 892 A; 741 C; 672 G; 707 T; 0 other;
     ttaatgaagg ggcacctcct gaggacaagg tggtgccatc gcttctggtg gccgatcatg        60
     cagagagctc ggcagtgaga ggcggcgtag gaggtgcgat gctcaaggaa ggcatgaaag       120
     gcagcagctc agcttccgtt accaaagggc agcatgagct gtctgaggtt gacggaagat       180
     gggaagaaca cagaagcctc ctcaccgctg gggccactca ccatgtaagg acagcaggaa       240
     ccatcgctgc caacgaagcc gtaaggacaa gagccacggg gtcttccaga gacatgagtg       300
     gggctcgagg agccgttgcc gtgaatgagg aattcttaag aagttacttc acagagaaag       360
     cggcctccta caatggggaa gacgaccttc acatggccaa agactgcctt ctcgtttact       420
     ctcaggaaga cacggcctcc ctccgaggct cggtcgggtg ctgcagtttc atcgagggag       480
     aactcgatga cctgttcctg gatgatcttg gccttaaatt caagacccta gctgaagttt       540
     gcctaggtcg aaagatcgat ctggatgtgg acattgaaca gaggcagaag ccggtcagag       600
     aagcgagcgt gagtgcagct tctggctcgc actatgagca agcggtaacc agctcagaga       660
     gcgcgtactc ctctaacacc ggcttccccg cccccaaacc tctgcacgaa gtgcacacag       720
     agaaagtcac acaggaaatc gtcactgaga gctctgtatc ttccaggcag agtcagaagg       780
     tagtaccgcc acctgatcct gtggcttctg gtaatattat agtgacggaa acttcctatg       840
     ccaaaggctc agcagtgcca cccagcactg tgctcctggc tcccagacag ccacagagcc       900
     tgatcgtgac agagagggtg tatgctccaa cctccacctt ggtggatcag cattatgcca       960
     atgaagaaaa agtccttgtt accgaacgag tgatccagcc taatgggggc atccctaagc      1020
     cccttgaggt cacccagcat ctgaaagatg cacagtatgt aatggtgagg gaaagagaga      1080
     gcatccttgc tcccagctca ggcgtgcagc ccactctggc aatgcccagc gtggcagcag      1140
     gaggacagaa tgtcaccgtg acagaaagaa tactaactcc tgcttccact ctgcagtcca      1200
     gctaccagat tcccagtgaa acctccatca cggctaggaa cactgtgctc tctagtgtgg      1260
     gaagcatagg tcctctgccc aatttagatc tagaggaatc tgatcgtccc aattctacta      1320
     taaccacatc ttccaccagg gtcaccaagc atagcaccat gcaacattct tactcctaaa      1380
     gagaagccag ccgcagtcgg atctggacct taactagcat ccactggatg tgtgtgtcca      1440
     tgcccctaat gttataacag ccacacaaga cttgcacatt ggaaaaactt attgacatgc      1500
     caaacaaaga caccatgtct caggtagggt tgtatttcag caatgctctg tgattattgg      1560
     gggatggagt taagatgcta gttcctcagt gccttttaga gtactgtgca agcaatactc      1620
     gcaaacagga catgtaaaat tgagttccaa gaacttacaa aactaaagca gggtatctaa      1680
     ttcatcaggt ccaggaggct gggtctctgc acaatgcatt gatattcagc agggacagtt      1740
     ccacccctta tattacagag ccacacaggt tataggtacc aaatagctca tccacccaga      1800
     gacacatctc cactctttca tctccactgg aaccctgagt acaccagagg gaagaaagaa      1860
     agaaagaaag agagagagag agagagagag agagagagag agagagaaca ctggagccct      1920
     ttttaactgt ctaaagttga cagactttag aacactgtca ggataagaat acatagtgtg      1980
     tatttctctg cacataccat gtcctgtgag ctgaaaaaaa aaatgacttc atatttttcc      2040
     tcacttacca aaggtcctaa aaatattgag catttcaaat tagttttata atctaactaa      2100
     tcatgatagc caggtgtggt ggtgcaggcc tttagtccag aactcaagaa gcaggcaaat      2160
     ctctccgggt tcgaggccaa tctggtgtat ataaagggtt ccaggacacc cagggctaca      2220
     tagaaaaacc ctgtctcaaa aaagcaagca aaacaaacaa caacaaatac tagtaatcca      2280
     ggataaaatt cctagttaac ttttctgcaa tgtgatggca gtctatgaag gaaacgtctt      2340
     catagatctg tttcctatct acaagttcat gcacttaggg gctgcacaaa tgaccgagtg      2400
     gctaagagca cattactgac tgctcttgca gagaacccta cttcagtttc tgacaactac      2460
     atactatggc tcacaaccac ctctaattcc aactcaaggg atctgatgct ctctaatgga      2520
     ctccaaagac attgcactca cagttgagtg gttagtgagc agccgagttc cactcttgca      2580
     tcctcagtgc ctctcacagc cgccatatcc ccctcttgcc attgaccgac ttccatcctc      2640
     agtgcctctc acagaacccc agccattatc tatggcacaa gataagttcc acaaaccact      2700
     cttttacaat tagtcaaata tcaagaagac tcagtagttc agtaattctg aattatgaac      2760
     tttaccatct tgatatctga cataattggc tggtcatgag ttgatatgca tgaagtaaat      2820
     actttatcat aaaaaccctt gagtaactgg cccaatttca ttcaacttcc ttgccttgat      2880
     tctttcaaga agttgtttgg attctacaga acatcaatgg aaaatattac ataacaaaaa      2940
     tttgcttcta aaaaataatg tttcaaaacc tatgtatata tcagtcataa tttatatcca      3000
     tgtatctcac cc                                                          3012
//