Dbfetch

ID   AK146858; SV 1; linear; mRNA; HTC; MUS; 1916 BP.
XX
AC   AK146858;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus 17 days pregnant adult female amnion cDNA, RIKEN full-length
DE   enriched library, clone:I920066G11 product:cathepsin C, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1916
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; d00e63a9caefe90924be00e96cbf8f8e.
DR   Ensembl-Gn; ENSMUSG00000030560; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0032492; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0032466; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0032402; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0032477; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0032192; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0032975; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0031519; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0032159; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0032311; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0032267; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0032400; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0032301; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0032997; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0031237; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0031636; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000032779; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0084900; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0084966; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0084918; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0084939; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0084512; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0085445; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0085025; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0084449; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0084640; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0084495; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0084669; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0084540; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0085726; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0084450; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0083562; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=I920066G11
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1916
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="female"
FT                   /dev_stage="17 days pregnant adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I920066G11"
FT                   /tissue_type="amnion"
FT                   /db_xref="taxon:10090"
FT   CDS             119..1507
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="cathepsin C (MGD|MGI:109553 GB|NM_009982, evidence:
FT                   BLASTN, 99%, match=1866)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q3UBY5"
FT                   /db_xref="InterPro:IPR000169"
FT                   /db_xref="InterPro:IPR000668"
FT                   /db_xref="InterPro:IPR013128"
FT                   /db_xref="InterPro:IPR014882"
FT                   /db_xref="InterPro:IPR025660"
FT                   /db_xref="InterPro:IPR025661"
FT                   /db_xref="InterPro:IPR033161"
FT                   /db_xref="MGI:MGI:109553"
FT                   /db_xref="UniProtKB/TrEMBL:Q3UBY5"
FT                   /protein_id="BAE27487.1"
FT                   /translation="MGPWTHSLRAVLLLVLLGVCTVRSDTPANCTYPDLLGTWVFQVGP
FT                   RSSRSDINCSVMEATEEKVVVHLKKLDTAYDELGNSGHFTLIYNQGFEIVLNDYKWFAF
FT                   FKYEVRGHTAISYCHETMTGWVHDVLGRNWACFVGKKVESHIEKVNMNAAHLGGLQERY
FT                   SERLYTHNHNFVKAINTVQKSWTATAYKEYEKMSLRDLIRRSGHSQRIPRPKPAPMTDE
FT                   IQQQILNLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPI
FT                   LSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYY
FT                   SSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPF
FT                   ELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIP
FT                   IPKL"
FT   regulatory      1895..1900
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      1916
FT                   /note="putative"
XX
SQ   Sequence 1916 BP; 517 A; 423 C; 445 G; 531 T; 0 other;
     ggttttccgg gctcttgggt attttaaggc gggagccact agggtcgcct gactgccatc        60
     gagtggtgtt ccagttgaac ttgctttctc tgccatctgc tccgcgggcg ccgtcagcat       120
     gggtccctgg acccactcct tgcgcgccgt cctgctgctg gtgcttttgg gagtctgcac       180
     cgtgcgctcc gacactcctg ccaactgcac ctaccctgat ctgctgggca cctgggtgtt       240
     ccaggtgggc cctagaagtt cccgaagcga cattaactgc tcggtgatgg aagcaacaga       300
     agaaaaggta gtggtacacc ttaagaagtt ggatactgcc tacgacgagc tgggcaattc       360
     cgggcatttt accctcattt acaaccaagg cttcgagatt gtgttgaatg actacaaatg       420
     gtttgcgttt ttcaagtatg aagtcagagg ccacacagct atcagttact gccatgagac       480
     catgactggg tgggtccatg atgtgctggg ccggaactgg gcttgctttg ttggcaagaa       540
     ggtggaaagt cacattgaga aggttaatat gaatgcagca catcttggag gtctccagga       600
     aagatattct gaaagactct acactcacaa ccacaacttt gtgaaggcca tcaataccgt       660
     tcagaagtct tggactgcaa ctgcatataa ggaatatgag aaaatgagcc tgcgagatct       720
     gataaggaga agtggccaca gccaaaggat cccaaggccc aaacctgccc cgatgactga       780
     tgaaatacag caacaaattt taaatttgcc agaatcttgg gactggagaa acgtccaagg       840
     cgtcaattat gttagccctg ttcgaaacca agaatcttgt ggaagctgct actcatttgc       900
     ctctatgggt atgctagaag caagaattcg tatattaacc aacaattctc agacaccaat       960
     cctgagtcct caggaggttg tatcttgcag cccctatgcc caaggttgtg atggtggatt      1020
     cccatacctc attgcaggga agtatgccca agattttggg gtggtggaag aaagctgctt      1080
     tccctacaca gccaaagatt ctccatgcaa accaagggag aattgcctcc gttactattc      1140
     ttctgactac tactatgtgg gtggtttcta tggtggctgc aatgaagccc tgatgaagct      1200
     tgagctggtc aaacatggac ccatggcagt tgcctttgaa gtccacgatg acttcctaca      1260
     ctaccacagt ggaatctatc accacactgg gctgagtgac cctttcaacc ccttcgagct      1320
     gacaaatcat gctgttttgc ttgtgggcta tggaagagat ccagttactg ggatagaata      1380
     ctggattata aagaacagct ggggctctaa ctggggggag agtggctact tccgtatccg      1440
     cagaggaact gatgaatgtg caattgagag tatagccgtg gcggccatac cgattcctaa      1500
     attataggac atagctccca gtgttacata cgggtcttta tcactcacag agtgatttag      1560
     tcacatgctg aagacttttt cagagcaata tcagaagctt accactaagc atctttaaag      1620
     aattttgtct ttgaacttaa aaccatcctt gatttttttc ttttaatatc ttccccatca      1680
     actactgaac tacttttctt tttaaagtac ttggttaagt aatactttta tgagcagtgg      1740
     ttcagttgtc caatattttt tgcaggtcat ctacaatgca accagatgtt tcagttctaa      1800
     aaatctatgt aaaagtacaa gctcgttttt aaattatgta agtcacatga aaacatggca      1860
     aaaaaattag ttaaattttt tacaaagagt tttaaataaa tgtttatgta atcagt          1916
//