ID   AK004577; SV 1; linear; mRNA; HTC; MUS; 2981 BP.
XX
AC   AK004577;
XX
DT   08-FEB-2001 (Rel. 66, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 22)
XX
DE   Mus musculus adult male lung cDNA, RIKEN full-length enriched library,
DE   clone:1200003K19 product:CD97 antigen, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2981
RA   Adachi J., Aizawa K., Akahira S., Akimura T., Arai A., Aono H., Arakawa T.,
RA   Bono H., Carninci P., Fukuda S., Fukunishi Y., Furuno M., Hanagaki T.,
RA   Hara A., Hayatsu N., Hiramoto K., Hiraoka T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Izawa M., Kasukawa T., Kato H., Kawai J., Kojima Y.,
RA   Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T., Miyazaki A.,
RA   Nishi K., Nomura K., Numazaki R., Ohno M., Okazaki Y., Okido T., Owa C.,
RA   Saito H., Saito R., Sakai C., Sakai K., Sano H., Sasaki D., Shibata K.,
RA   Shibata Y., Shinagawa A., Shiraki T., Sogabe Y., Suzuki H., Tagami M.,
RA   Tagawa A., Takahashi F., Tanaka T., Tejima Y., Toya T., Yamamura T.,
RA   Yasunishi A., Yoshida K., Yoshino M., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (10-JUL-2000) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 5a0cc0ff74223a710792f401f8bc3936.
DR   Ensembl-Gn; ENSMUSG00000002885; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0033840; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0033823; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0033749; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0033816; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0033526; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0034336; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0032856; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0033504; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0033657; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0033603; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0033747; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0033645; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0034353; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0032560; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0032969; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000166939; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0089605; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0089686; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0089625; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0089623; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0089188; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0090142; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0089736; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0089135; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0089330; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0089174; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0089364; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0089195; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0090445; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0089159; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0088235; mus_musculus_wsbeij.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group at the
CC   RIKEN Genomic Sciences Center and the Genome Science Laboratory at RIKEN.
CC   The Division of Experimental Animal Research at RIKEN contributed by
CC   preparing the mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   Clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=1200003K19
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2981
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="1200003K19"
FT                   /tissue_type="lung"
FT                   /db_xref="taxon:10090"
FT   CDS             191..2359
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="CD97 antigen (MGD|MGI:1347095 GB|NM_011925,
FT                   evidence: BLASTN, 99%, match=2329)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q9DC42"
FT                   /db_xref="InterPro:IPR000152"
FT                   /db_xref="InterPro:IPR000203"
FT                   /db_xref="InterPro:IPR000742"
FT                   /db_xref="InterPro:IPR000832"
FT                   /db_xref="InterPro:IPR001881"
FT                   /db_xref="InterPro:IPR003056"
FT                   /db_xref="InterPro:IPR017981"
FT                   /db_xref="InterPro:IPR018097"
FT                   /db_xref="MGI:MGI:1347095"
FT                   /db_xref="UniProtKB/TrEMBL:Q9DC42"
FT                   /protein_id="BAB23386.1"
FT                   /translation="MRGVRCPGLLVVCILLSLSGAGTQKAENCAKWCPINSKCVSNRSC
FT                   VCKPGFSSEKELITNPAESCEDINECLLPGFSCGDFAMCKNSEGSYTCVCNLGYKLLSG
FT                   AESFVNESENTCQDVDECSSGQHQCHNSTVCKNTVGSYKCHCRPGWKPTSGSLRGPDTI
FT                   CQEPPFPTWTLLPTAHSQTLLRFSVEVQNLLRDFNPATVNYTIQKLIEAVDKLLEDPME
FT                   TETQQVAAQLLSNLEQSLRTLAQFLPKGPFTYTSPSNTELSLMVKEQDNKDVTTVHHGQ
FT                   TWMELDWAVTAGAKISENGSSVAGILSSPNMEKLLGNTPLNLEQRRASLEDFYGSPIPS
FT                   VSLKLLSNINSVFLTNTNTEKLASNVTFKFDFTSVESIEPRHELICAFWKAHNGNGYWD
FT                   TDGCSMNGTGFCHCNHLTSFAILMAQYHVQDPRLELITKVGLLLSLICLLLCILTFLLV
FT                   KPIQSSRTMVHLHLCICLFLGSIIFLVGVENEGGEVGLRCRLVAVMLHFCFLAAFCWMA
FT                   LEGVELYFLVVRVFQGQGLSTWQRCLIGYGVPLLIVAISMAVVKMDGYGHATYCWLDFR
FT                   KQGFLWSFSGPVAFIIFCNAAIFVITVWKLTKKFSEINPNMKKLRKARVLTITSIAQLL
FT                   VLGCTWGFGLFLFNPHSTWLSYIFTLLNCLQGLFLYVMLCLLNKKVREEYWKWACMVTG
FT                   SKYTEFNSSTTGTGTSQTRALRSSESGM"
FT   regulatory      2960..2965
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2981
FT                   /note="putative"
XX
SQ   Sequence 2981 BP; 683 A; 843 C; 767 G; 688 T; 0 other;
     ggcgtttgtt ccagacgcgg gcgcgcggcg cgccctgggt ctgtttcccc ggcactttcc        60
     tgttactttc cagcggggag gacagttccg gggagggccc tgctcacgcc cgctgcataa       120
     aagcccagcc cagccggccg ccacagccct ctatcaaccc cagacgctgt ccgttccgtg       180
     ccgcgccacc atgaggggcg tcagatgccc cggcctgctt gtggtgtgca ttttactgag       240
     tctctcagga gctggaaccc agaaggcaga gaactgtgcc aagtggtgcc ctatcaactc       300
     aaaatgtgtc agtaacagaa gctgtgtctg caagcccggc ttcagctcgg agaaggaatt       360
     gatcactaac cctgcagaga gctgtgaaga catcaacgag tgtttactac ctggattttc       420
     ctgtggagac tttgccatgt gtaagaactc agaggggagt tacacctgcg tctgtaacct       480
     gggatataag ctcctttctg gtgcagaatc ttttgttaac gagagtgaga atacatgtca       540
     agatgtggat gagtgcagct ctgggcagca tcaatgtcac aattccaccg tctgcaaaaa       600
     caccgtaggc tcctacaagt gccactgccg tccaggttgg aagccaactt ctgggtccct       660
     ccgtggtcca gacaccatat gccaagagcc accgttccct acttggacac tgctgcccac       720
     agcccacagt cagactctct tgagattctc tgttgaagtc cagaatctgc tccgtgactt       780
     caatccagcc acggtcaact acaccatcca gaaactcatc gaggctgtgg acaagctgct       840
     ggaagacccc atggagactg aaacccaaca agtagctgcc cagttgctct cgaacctgga       900
     acaaagcctt cggaccttgg ctcagttttt acccaaaggc cccttcacct acacgtcccc       960
     ttcaaatact gaactgtccc tgatggtgaa ggagcaagac aacaaagatg ttaccacagt      1020
     gcaccacggc caaacttgga tggagctgga ctgggctgtg acagctggag ccaaaatctc      1080
     agaaaatggc tcctcggtgg caggcatcct gtccagccca aacatggaaa agctgctggg      1140
     caacaccccg ctgaacttgg agcagagaag ggcctccttg gaggacttct atggaagccc      1200
     gattccgagt gtctcactta agcttctctc aaacatcaat tctgtcttcc tgaccaacac      1260
     gaacactgaa aagctcgcct cgaatgtcac gttcaagttc gactttactt ctgtggagtc      1320
     aattgagcca cgtcatgagc tgatatgtgc cttctggaaa gcccacaacg ggaatgggta      1380
     ctgggacacc gacggctgct caatgaatgg cactggcttc tgccactgca accacctgac      1440
     cagctttgcc atcctaatgg ctcagtacca tgtgcaggac ccaaggctgg aattgatcac      1500
     caaggtgggg ctgttgttgt ccctgatctg cctgctgctg tgcatcctga ccttcctgct      1560
     ggtgaagccc atccagagct ctcgaaccat ggtgcacctg cacctgtgca tctgcctctt      1620
     cctgggctca atcatattcc tggtcggcgt ggagaatgaa gggggtgagg tgggcctgcg      1680
     ctgccgcctg gtggccgtga tgctgcattt ctgcttcctg gccgccttct gctggatggc      1740
     actggagggc gtggagctct acttcctggt ggtgcgtgtg ttccagggcc aaggcctgag      1800
     tacatggcaa cgctgcctga ttggctacgg ggtgcccctc ctcatcgtgg ccatctcaat      1860
     ggcagttgtc aaaatggatg gctatgggca tgcaacatac tgctggctgg actttaggaa      1920
     gcagggcttt ctctggagct tttcgggacc cgtggccttc atcattttct gcaatgctgc      1980
     cattttcgtg atcactgtct ggaagctcac aaagaagttt tctgaaatca acccaaacat      2040
     gaagaagtta aggaaggcga gggtgctgac catcacctcc atcgcccagc tccttgtgct      2100
     gggctgcacc tggggcttcg gcctgttcct cttcaacccc catagcacat ggctatccta      2160
     catctttacc ctgctcaact gcctgcaggg cctattcctc tatgtgatgc tctgcctgct      2220
     caacaaaaag gtgagggaag agtactggaa atgggcctgc atggtcactg ggagcaagta      2280
     cacagagttc aactcttcca caacaggcac tggcaccagc caaacacggg ctctcaggtc      2340
     ctcagaatca gggatgtgaa ggcgagttcc atgaaggaca gcagcacata ctcacggctg      2400
     tcattctgcc ttctgttcct gccacctttc ggcctttgac cccacttcat ataaagaagg      2460
     gatgcaaacc cagcaatgag ccctgccacg gggcaagggg tcgctggtcc tgcttctggt      2520
     tgtctctgcc tcaccacagc tactgcctac cagagaaaca gaatgcagcc agcccagacc      2580
     tgcagcccac tgcctggtac acgaggcagg cagtcctagg acaaagggaa gagcccttga      2640
     gacctgggct ctttgccagg gtgctgggtt cagtttcctt aagctaagac tgtggatgcc      2700
     acgtcagcca ctgaagcccg cctgctgctg aggctcacgg tacagaggcc tcgctgctcc      2760
     acatccaggg cagaaggtct cacagctaaa gactaggttt tgtaattttt ttaacctgta      2820
     aacctttcaa tgttgacaca caaaattaaa tatcatattg ataaaaaaga tggcacgttt      2880
     gctggtgaga agtttgctgg cttgacccaa ccccatgaga tgttgaccga aagggatggg      2940
     gctggagcca tgcattcaca ataaaagttt attcttacat c                          2981
//