Dbfetch

ID   AK150761; SV 1; linear; mRNA; HTC; MUS; 1868 BP.
XX
AC   AK150761;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus bone marrow macrophage cDNA, RIKEN full-length enriched
DE   library, clone:I830015C20 product:cathepsin C, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1868
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; b41ca54b460fbd0ca739129af033d058.
DR   Ensembl-Gn; ENSMUSG00000030560; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000032779; mus_musculus.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Tissues were provided by David A. Hume ( Depts. of Biochemistry
CC   and Microbiology/Parasitology Institute for Molecular Bioscience
CC   University of Queensland Brisbane,Q 4072 Australia ) whose
CC   assistance we gratefully acknowledge.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=I830015C20
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1868
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I830015C20"
FT                   /cell_type="macrophage"
FT                   /tissue_type="bone marrow"
FT                   /db_xref="taxon:10090"
FT   CDS             71..1459
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="cathepsin C (MGD|MGI:109553 GB|NM_009982, evidence:
FT                   BLASTN, 99%, match=1866)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q3UBY5"
FT                   /db_xref="InterPro:IPR000169"
FT                   /db_xref="InterPro:IPR000668"
FT                   /db_xref="InterPro:IPR013128"
FT                   /db_xref="InterPro:IPR014882"
FT                   /db_xref="InterPro:IPR025660"
FT                   /db_xref="InterPro:IPR025661"
FT                   /db_xref="MGI:MGI:109553"
FT                   /db_xref="UniProtKB/TrEMBL:Q3UBY5"
FT                   /protein_id="BAE29829.1"
FT                   /translation="MGPWTHSLRAVLLLVLLGVCTVRSDTPANCTYPDLLGTWVFQVGP
FT                   RSSRSDINCSVMEATEEKVVVHLKKLDTAYDELGNSGHFTLIYNQGFEIVLNDYKWFAF
FT                   FKYEVRGHTAISYCHETMTGWVHDVLGRNWACFVGKKVESHIEKVNMNAAHLGGLQERY
FT                   SERLYTHNHNFVKAINTVQKSWTATAYKEYEKMSLRDLIRRSGHSQRIPRPKPAPMTDE
FT                   IQQQILNLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPI
FT                   LSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYY
FT                   SSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPF
FT                   ELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIP
FT                   IPKL"
FT   regulatory      1847..1852
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      1868
FT                   /note="putative"
XX
SQ   Sequence 1868 BP; 511 A; 412 C; 428 G; 517 T; 0 other;
     gtgactgcca tcgagtggtg ttccagttga acttgctttc tctgccatct gctccgcggg        60
     cgccgtcagc atgggtccct ggacccactc cttgcgcgcc gtcctgctgc tggtgctttt       120
     gggagtctgc accgtgcgct ccgacactcc tgccaactgc acctaccctg atctgctggg       180
     cacctgggtg ttccaggtgg gccctagaag ttcccgaagc gacattaact gctcggtgat       240
     ggaagcaaca gaagaaaagg tagtggtaca ccttaagaag ttggatactg cctacgacga       300
     gctgggcaat tccgggcatt ttaccctcat ttacaaccaa ggcttcgaga ttgtgttgaa       360
     tgactacaaa tggtttgcgt ttttcaagta tgaagtcaga ggccacacag ctatcagtta       420
     ctgccatgag accatgactg ggtgggtcca tgatgtgctg ggccggaact gggcttgctt       480
     tgttggcaag aaggtggaaa gtcacattga gaaggttaat atgaatgcag cacatcttgg       540
     aggtctccag gaaagatatt ctgaaagact ctacactcac aaccacaact ttgtgaaggc       600
     catcaatacc gttcagaagt cttggactgc aactgcatat aaggaatatg agaaaatgag       660
     cctgcgagat ctgataagga gaagtggcca cagccaaagg atcccaaggc ccaaacctgc       720
     cccgatgact gatgaaatac agcaacaaat tttaaatttg ccagaatctt gggactggag       780
     aaacgtccaa ggcgtcaatt atgttagccc tgttcgaaac caagaatctt gtggaagctg       840
     ctactcattt gcctctatgg gtatgctaga agcaagaatt cgtatattaa ccaacaattc       900
     tcagacacca atcctgagtc ctcaggaggt tgtatcttgc agcccctatg cccaaggttg       960
     tgatggtgga ttcccatacc tcattgcagg gaagtatgcc caagattttg gggtggtgga      1020
     agaaagctgc tttccctaca cagccaaaga ttctccatgc aaaccaaggg agaattgcct      1080
     ccgttactat tcttctgact actactatgt gggtggtttc tatggtggct gcaatgaagc      1140
     cctgatgaag cttgagctgg tcaaacatgg acccatggca gttgcctttg aagtccacga      1200
     tgacttccta cactaccaca gtggaatcta tcaccacact gggctgagtg accctttcaa      1260
     ccccttcgag ctgacaaatc atgctgtttt gcttgtgggc tatggaagag atccagttac      1320
     tgggatagaa tactggatta taaagaacag ctggggctct aactgggggg agagtggcta      1380
     cttccgtatc cgcagaggaa ctgatgaatg tgcaattgag agtatagccg tggcggccat      1440
     accgattcct aaattatagg acatagctcc cagtgttaca tacgggtctt tatcactcac      1500
     agagtgattt agtcacatgc tgaagacttt ttcagagcaa tatcagaagc ttaccactaa      1560
     gcatctttaa agaattttgt ctttgaactt aaaaccatcc ttgatttttt tcttttaata      1620
     tcttccccat caactactga actacttttc tttttaaagt acttggttaa gtaatacttt      1680
     tatgagcagt ggttcagttg tccaatattt tttgcaggtc atctacaatg caaccagatg      1740
     tttcagttct aaaaatctat gtaaaagtac aagctcgttt ttaaattatg taagtcacat      1800
     gaaaacatgg caaaaaaatt agttaaattt tttacaaaga gttttaaata aatgtttatg      1860
     taatcagt                                                               1868
//