ID   AK151861; SV 1; linear; mRNA; HTC; MUS; 1751 BP.
XX
AC   AK151861;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus bone marrow macrophage cDNA, RIKEN full-length enriched
DE   library, clone:I830041C24 product:cathepsin C, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1751
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 3ac4620eef32bdb1d43a3cf6049329ca.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Tissues were provided by David A. Hume ( Depts. of Biochemistry
CC   and Microbiology/Parasitology Institute for Molecular Bioscience
CC   University of Queensland Brisbane,Q 4072 Australia ) whose
CC   assistance we gratefully acknowledge.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=I830041C24
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1751
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I830041C24"
FT                   /cell_type="macrophage"
FT                   /tissue_type="bone marrow"
FT                   /db_xref="taxon:10090"
FT   CDS             <1..1345
FT                   /codon_start=2
FT                   /transl_table=1
FT                   /note="cathepsin C (MGD|MGI:109553 GB|U74683, evidence:
FT                   BLASTN, 100%, match=1751)"
FT                   /note="putative"
FT                   /note="start codon is not identified"
FT                   /db_xref="GOA:Q3U9B7"
FT                   /db_xref="InterPro:IPR000169"
FT                   /db_xref="InterPro:IPR000668"
FT                   /db_xref="InterPro:IPR013128"
FT                   /db_xref="InterPro:IPR014882"
FT                   /db_xref="InterPro:IPR025660"
FT                   /db_xref="InterPro:IPR025661"
FT                   /db_xref="InterPro:IPR033161"
FT                   /db_xref="MGI:MGI:109553"
FT                   /db_xref="UniProtKB/TrEMBL:Q3U9B7"
FT                   /protein_id="BAE30750.1"
FT                   /translation="LLGVCTVRSDTPANCTYPDLLGTWVFQVGPRSSRSDINCSVMEAT
FT                   EEKVVVHLKKLDTAYDELGNSGHFTLIYNQGFEIVLNDYKWFAFFKYEVRGHTAISYCH
FT                   ETMTGWVHDVLGRNWACFVGKKVESHIEKVNMNAAHLGGLQERYSERLYTHNHNFVKAI
FT                   NTVQKSWTATAYKEYEKMSLRDLIRRSGHSQRIPRPKPAPMTDEIQQQILNLPESWDWR
FT                   NVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSPYAQG
FT                   CDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYYSSDYYYVGGFYGGCN
FT                   EALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRD
FT                   PVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPKL"
FT   regulatory      1733..1738
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      1751
FT                   /note="putative"
XX
SQ   Sequence 1751 BP; 499 A; 373 C; 394 G; 485 T; 0 other;
     gcttttggga gtctgcaccg tgcgctccga cactcctgcc aactgcacct accctgatct        60
     gctgggcacc tgggtgttcc aggtgggccc tagaagttcc cgaagcgaca ttaactgctc       120
     ggtgatggaa gcaacagaag aaaaggtagt ggtacacctt aagaagttgg atactgccta       180
     cgacgagctg ggcaattccg ggcattttac cctcatttac aaccaaggct tcgagattgt       240
     gttgaatgac tacaaatggt ttgcgttttt caagtatgaa gtcagaggcc acacagctat       300
     cagttactgc catgagacca tgactgggtg ggtccatgat gtgctgggcc ggaactgggc       360
     ttgctttgtt ggcaagaagg tggaaagtca cattgagaag gttaatatga atgcagcaca       420
     tcttggaggt ctccaggaaa gatattctga aagactctac actcacaacc acaactttgt       480
     gaaggccatc aataccgttc agaagtcttg gactgcaact gcatataagg aatatgagaa       540
     aatgagcctg cgagatctga taaggagaag tggccacagc caaaggatcc caaggcccaa       600
     acctgccccg atgactgatg aaatacagca acaaatttta aatttgccag aatcttggga       660
     ctggagaaac gtccaaggcg tcaattatgt tagccctgtt cgaaaccaag aatcttgtgg       720
     aagctgctac tcatttgcct ctatgggtat gctagaagca agaattcgta tattaaccaa       780
     caattctcag acaccaatcc tgagtcctca ggaggttgta tcttgcagcc cctatgccca       840
     aggttgtgat ggtggattcc catacctcat tgcagggaag tatgcccaag attttggggt       900
     ggtggaagaa agctgctttc cctacacagc caaagattct ccatgcaaac caagggagaa       960
     ttgcctccgt tactattctt ctgactacta ctatgtgggt ggtttctatg gtggctgcaa      1020
     tgaagccctg atgaagcttg agctggtcaa acatggaccc atggcagttg cctttgaagt      1080
     ccacgatgac ttcctacact accacagtgg aatctatcac cacactgggc tgagtgaccc      1140
     tttcaacccc ttcgagctga caaatcatgc tgttttgctt gtgggctatg gaagagatcc      1200
     agttactggg atagaatact ggattataaa gaacagctgg ggctctaact ggggggagag      1260
     tggctacttc cgtatccgca gaggaactga tgaatgtgca attgagagta tagccgtggc      1320
     ggccataccg attcctaaat tataggacat agctcccagt gttacatacg ggtctttatc      1380
     actcacagag tgatttagtc acatgctgaa gactttttca gagcaatatc agaagcttac      1440
     cactaagcat ctttaaagaa ttttgtcttt gaacttaaaa ccatccttga tttttttctt      1500
     ttaatatctt ccccatcaac tactgaacta cttttctttt taaagtactt ggttaagtaa      1560
     tacttttatg agcagtggtt cagttgtcca atattttttg caggtcatct acaatgcaac      1620
     cagatgtttc agttctaaaa atctatgtaa aagtacaagc tcgtttttaa attatgtaag      1680
     tcacatgaaa acatggcaaa aaaattagtt aaatttttta caaagagttt taaataaatg      1740
     tttatgtaat c                                                           1751
//