Dbfetch

ID   AK142820; SV 1; linear; mRNA; HTC; MUS; 2658 BP.
XX
AC   AK142820;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus 15 days embryo head cDNA, RIKEN full-length enriched library,
DE   clone:D930021K08 product:weakly similar to Airway trypsin-like protease 1
DE   precursor [Homo sapiens], full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2658
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 55e67a0f39c1bb5ae346bb5e360edf9f.
DR   Ensembl-Gn; ENSMUSG00000072845; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0029669; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0029637; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0029585; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0029646; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0029369; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0030100; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0028777; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0029338; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0029483; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0029444; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0029570; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0029472; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0030133; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0028501; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0028858; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000101073; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0072375; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0072458; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0072376; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0072388; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0072030; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0072851; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0072576; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0071982; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0072120; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0072022; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0072172; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0072042; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0073062; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0072051; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0071178; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=D930021K08
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2658
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="15 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="D930021K08"
FT                   /tissue_type="head"
FT                   /db_xref="taxon:10090"
FT   CDS             137..1306
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="weakly similar to Airway trypsin-like protease 1
FT                   precursor [Homo sapiens] (UniProt|Q7RTY4, evidence: FASTY,
FT                   68.5%ID, 100%length, match=1119)"
FT                   /db_xref="GOA:Q3UQ41"
FT                   /db_xref="InterPro:IPR000082"
FT                   /db_xref="InterPro:IPR001254"
FT                   /db_xref="InterPro:IPR001314"
FT                   /db_xref="InterPro:IPR009003"
FT                   /db_xref="InterPro:IPR017329"
FT                   /db_xref="InterPro:IPR018114"
FT                   /db_xref="InterPro:IPR033116"
FT                   /db_xref="MGI:MGI:2684853"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q3UQ41"
FT                   /protein_id="BAE25202.1"
FT                   /translation="MEVAGYGTHNRDLKQWMVTLLSALSLMMVVVTIGLLALFLVFDIQ
FT                   VNSNSGQKSSNQLKDLQETNENLVDEIFIDSALNNRYIKNHVVGLTPEEDDTKADIVMV
FT                   FQPPATGRRTVGKKTHHSILDQKTRNARALPADVSLVQVKDCGKRAIPLIANRIVSGNP
FT                   AAKGAWPWQVSLQRSNIHQCGGTLIGNMWVVTAAHCFRTNSNPRQWTLSFGTTINPPLM
FT                   KRDVRRIIMHERYRPPARDHDIALVQFSPRVTFSDEVRRICLPEPSASFPPNSTVYITG
FT                   FGALYYGGESQNELREARVQIISNDICKKRHVYGNEIKRGMFCAGFLEGNYDACRGDSG
FT                   GPLVIRDNKDTWYLIGIVSWGDNCGQKNKPGVYTQVTYYRHWIASKTGL"
FT   regulatory      2638..2643
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2658
FT                   /note="putative"
XX
SQ   Sequence 2658 BP; 807 A; 496 C; 574 G; 781 T; 0 other;
     gttctttggg tcctgctcct ttgaagtgtc tgtctttgag ggtagactct ccacaccaga        60
     ggatagctgg tgaggaatct ttaaggaatt caaactggat ttaagattga agggtttctg       120
     gaattttcac agtgggatgg aagtggcagg atatggcacc cacaacagag atctgaagca       180
     atggatggtt acccttctct ctgctctctc cctgatgatg gtggtagtga ccattggact       240
     tctggctctc ttcctcgtgt tcgatattca agtcaatagc aactcaggac aaaagagctc       300
     aaatcaactg aaggacttac aagagacgaa tgaaaatttg gtagatgaaa tatttataga       360
     ttcagccttg aacaatcgct acatcaaaaa ccacgtagtc ggactaacac cagaagagga       420
     tgatacaaaa gcagacattg tcatggtgtt ccagcctccc gctacagggc gaagaaccgt       480
     agggaaaaag acgcaccata gcatcttaga tcagaagaca aggaacgcaa gagctttgcc       540
     agctgatgtc tcactggttc aagttaaaga ttgtggcaag cgagctatcc cattaattgc       600
     caacagaata gtgtctggaa accctgcagc taagggtgcc tggccgtggc aagtttccct       660
     tcagcgaagc aacatccatc agtgtggggg cacgttgatt ggtaacatgt gggtcgtcac       720
     tgcagcacac tgttttagaa ccaattcaaa ccctcgccaa tggactctta gttttggaac       780
     aacaataaat cctcccttaa tgaaaagaga cgtcagaaga attattatgc atgaaaggta       840
     tcgtccccca gcaagagacc atgacattgc tctagtgcag ttttctccca gagtcacctt       900
     ttcggatgaa gtgcgccgaa tttgtttgcc agaaccctct gcatctttcc caccaaattc       960
     aactgtctac atcacaggat tcggagcact ttactatggc ggggaatccc aaaatgagct      1020
     ccgtgaagcc agagtacaaa tcataagcaa tgacatctgc aagaagcgac acgtgtatgg      1080
     caatgaaata aaacgtggga tgttctgtgc tggatttctg gaaggaaatt acgatgcctg      1140
     caggggtgat tctgggggac cattggttat aagagacaac aaagatacct ggtatctcat      1200
     tggaattgtg agctggggag acaactgtgg tcaaaagaac aagcctgggg tgtacacaca      1260
     agtgacttat taccgacact ggattgcttc gaaaacgggt ctctaactcg ctacagatag      1320
     atgttaaaga aaaccgtagc gtgtcggtta tatgcatgta agaattcaga catccatttg      1380
     gtggcattgg cacaatgaaa tgagattaaa tggctggacc tagcaatgtg aaacacatga      1440
     tttatttcag aatcgcatat ttgtgaaggt gtaggctgat ttatcattga aaaatgcagt      1500
     aaagatcctt catctcttta aagcaagggt gcatgggatg cattatgtgg ttttacagat      1560
     ggcctgggag agttttagaa tgtgtttttt ctgggtaccc ttcaagtaaa aaacgctttg      1620
     aatcttcaga gacgtcaagg tgctcataca agtccctttg atttatctag agttcataca      1680
     catctttttg tcttctttac atttcttcta gattttccag ttttttccac cacaggaaat      1740
     aagtttgtta cttctaaatg catatcaggg aatttagaag ggacgattat atcaaaattt      1800
     ggtggcaaaa taaagtgaag ttagtttaaa ctttgctggg aattttttct gttttttttt      1860
     ttatgatttt tagaaacact accatgtata caacactcat atcaccatcc cgatggcata      1920
     gagatacata agagaaatac tgattaatgg tacttgatca atatataatt tttatcctga      1980
     aaaattgaca gggatgatga ctgagctgtt ccaggatagg tgtacaagat attttgaatg      2040
     cgcacttaca tattttggtc gccaggggga gatttttgtc cagcattatc ttaaggtcaa      2100
     ataatggaaa actacaggga atttatgtcc aggaagtaaa aacaaacaga ctcatgaagg      2160
     catcaggaaa gtgaaacatt gttgcccaca atagacctaa agaaggcttt attggataat      2220
     ttcttgcctg tgagatgtct aaattttcaa gttaaggtat ataaaacagc ttaacaaagt      2280
     tatatgcaat ttgcatcctt agttgtcaaa ttcacatgcc ctctttcctt gcaaagtttg      2340
     tttatggctg ctgtatttat tgaatcggtg tacctaggac atttctcact aagtgctatt      2400
     ggatcagttt tcaaactagg aaactcttac taccactact cacttcaaag caaggttctt      2460
     tcttcatata tgtaataaat tctcacataa attttctgca ttacagtggt taactatgat      2520
     tgtgtttcta atgctaagat tcagaaagtg gtcttagagt tgtaaataat gcgctccaaa      2580
     ctgtctgctg cagcatttca ttcatggttg tatgggaaca aataatgttt taaatctaat      2640
     aaagttggtg atcagagc                                                    2658
//