ID   AK084855; SV 1; linear; mRNA; HTC; MUS; 4244 BP.
XX
AC   AK084855;
XX
DT   19-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 15)
XX
DE   Mus musculus 13 days embryo lung cDNA, RIKEN full-length enriched library,
DE   clone:D430002O20 product:signal transducer and activator of transcription
DE   1, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-4244
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-APR-2002) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 3aed507081544ded9b414cf997c75205.
DR   Ensembl-Gn; ENSMUSG00000026104; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0016006; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0015988; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0015944; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0015945; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0015777; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0016396; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0015358; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0015748; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0015849; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0015851; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0015920; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0015871; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0016444; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0015144; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0015421; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000070968; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000186857; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0019740; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0019714; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0019674; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0019667; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0019493; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0020166; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0019004; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0019448; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0019563; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0019569; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0019650; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0019540; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0020223; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0018758; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0019089; mus_musculus_wsbeij.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group at the
CC   RIKEN Genomic Sciences Center and the Genome Science Laboratory at RIKEN.
CC   The Division of Experimental Animal Research at RIKEN contributed by
CC   preparing mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=D430002O20
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4244
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="13 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="D430002O20"
FT                   /tissue_type="lung"
FT                   /db_xref="taxon:10090"
FT   CDS             285..2534
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="signal transducer and activator of transcription 1
FT                   (MGD|MGI:103063 GB|NM_009283, evidence: BLASTN, 99%,
FT                   match=2276)"
FT                   /db_xref="GOA:Q8C3V4"
FT                   /db_xref="InterPro:IPR000980"
FT                   /db_xref="InterPro:IPR001217"
FT                   /db_xref="InterPro:IPR008967"
FT                   /db_xref="InterPro:IPR012345"
FT                   /db_xref="InterPro:IPR013799"
FT                   /db_xref="InterPro:IPR013800"
FT                   /db_xref="InterPro:IPR013801"
FT                   /db_xref="InterPro:IPR015988"
FT                   /db_xref="InterPro:IPR022752"
FT                   /db_xref="MGI:MGI:103063"
FT                   /db_xref="UniProtKB/TrEMBL:Q8C3V4"
FT                   /protein_id="BAC39293.1"
FT                   /translation="MSQWFELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEH
FT                   AAYDVSFATIRFHDLLSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPVQMSMI
FT                   IYNCLKEERKILENAQRFNQAQEGNIQNTVMLDKQKELDSKVRNVKDQVMCIEQEIKTL
FT                   EELQDEYDFKCKTSQNREGEANGVAKSDQKQEQLLLHKMFLMLDNKRKEIIHKIRELLN
FT                   SIELTQNTLINDELVEWKRRQQSACIGGPPNACLDQLQSWFTIVAETLQQIRQQLKKLE
FT                   ELEQKFTYEPDPITKNKQVLSDRTFLLFQQLIQSSFVVERQPCMPTHPQRPLVLKTGVQ
FT                   FTVKLRLLVKLQELNYNLKVKVSFDKDVNEKNTVKGFRKFNILGTHTKVMNMEESTNGS
FT                   LAAEFRHLQLKEQKNAGNRTNEGPLIVTEELHSLSFETQLCQPGLVIDLETTSLPVVVI
FT                   SNVSQLPSGWASILWYNMLVTEPRNLSFFLNPPCAWWSQLSEVLSWQFSSVTKRGLNAD
FT                   QLSMLGEKLLGPNAGPDGLIPWTRFCKENINDKNFSFWPWIDTILELIKKHLLCLWNDG
FT                   CIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYT
FT                   KKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELD
FT                   DPKRTGYIKTELISVSEVHPSRLQTTDNLLPMSPEEFDEMSRIVGPEFDSMMSTV"
XX
SQ   Sequence 4244 BP; 1173 A; 970 C; 1007 G; 1094 T; 0 other;
     gagcgccgag tctgtcaaag ctccctggag acctccggga ccgcgcccct cagacccact        60
     tgggacactg ctgagcggcg cagagagatt tgcccagact cgagctcctg cgtgcagtga       120
     tcgtttcagc tctgctccat accctgagcc ggcgccacgc cgccgcgcat gcaactggca       180
     tataacttgc tgtgtgtggt gattgcttgt gttgaatccc gaacctgcac ccggagacag       240
     cccagtaagt ctacgtggga acggaagcat ttggaatctc aaggatgtca cagtggttcg       300
     agcttcagca gctggactcc aagttcctgg agcaggtcca ccagctgtac gatgacagtt       360
     tccccatgga aatcagacag tacctggccc agtggctgga aaagcaagac tgggagcacg       420
     ctgcctatga tgtctcgttt gcgaccatcc gcttccatga cctcctctca cagctggacg       480
     accagtacag ccgcttttct ctggagaata atttcttgtt gcagcacaac atacggaaaa       540
     gcaagcgtaa tctccaggat aacttccaag aagatcccgt acagatgtcc atgatcatct       600
     acaactgtct gaaggaagaa aggaagattt tggaaaatgc ccaaagattt aatcaggccc       660
     aggagggaaa tattcagaac actgtgatgt tagataaaca gaaggagctg gacagtaaag       720
     tcagaaatgt gaaggatcaa gtcatgtgca tagagcagga aatcaagacc ctagaagaat       780
     tacaagatga atatgacttt aaatgcaaaa cctctcagaa cagagaaggt gaagccaatg       840
     gtgtggcgaa gagcgaccaa aaacaggaac agctgctgct ccacaagatg tttttaatgc       900
     ttgacaataa gagaaaggag ataattcaca aaatcagaga gttgctgaat tccatcgagc       960
     tcactcagaa cactctgatt aatgacgagc tcgtggagtg gaagcgaagg cagcagagcg      1020
     cctgcatcgg gggaccgccc aacgcctgcc tggatcagct gcaaagctgg ttcaccattg      1080
     ttgcagagac cctgcagcag atccgtcagc agcttaaaaa gctggaggag ttggaacaga      1140
     aattcaccta tgagcccgac cctattacaa aaaacaagca ggtgttgtca gatcgaacct      1200
     tcctcctctt ccagcagctc attcagagct ccttcgtggt agaacgacag ccgtgcatgc      1260
     ccactcaccc gcagaggccc ctggtcttga agactggggt acagttcact gtcaagctga      1320
     gactgttggt gaaattgcaa gagctgaact ataacttgaa agtgaaagtc tcatttgaca      1380
     aagatgtgaa cgagaaaaac acagttaaag gatttcggaa gttcaacatc ttgggtacgc      1440
     acacaaaagt gatgaacatg gaagaatcca ccaacggaag tctggcagct gagttccgac      1500
     acctgcaact gaaggaacag aaaaacgctg ggaacagaac taatgagggg cctctcattg      1560
     tcaccgaaga acttcactct cttagctttg aaacccagtt gtgccagcca ggcttggtga      1620
     ttgacctgga gaccacctct cttcctgtcg tggtgatctc caacgtcagc cagctcccca      1680
     gtggctgggc gtctatcctg tggtacaaca tgctggtgac agagcccagg aatctctcct      1740
     tcttcctgaa ccccccgtgc gcgtggtggt cccagctctc agaggtgttg agttggcagt      1800
     tttcatcagt caccaagaga ggtctgaacg cagaccagct gagcatgctg ggagagaagc      1860
     tgctgggccc taatgctggc cctgatggtc ttattccatg gacaaggttt tgtaaggaaa      1920
     atattaatga taaaaatttc tccttctggc cttggattga caccatccta gagctcatta      1980
     agaagcacct gctgtgcctc tggaatgatg ggtgcattat gggcttcatc agcaaggagc      2040
     gagaacgcgc tctgctcaag gaccagcagc cagggacgtt cctgcttaga ttcagtgaga      2100
     gctcccggga aggggccatc acattcacat gggtggaacg gtcccagaac ggaggtgaac      2160
     ctgacttcca tgccgtggag ccctacacga aaaaagaact ttcagctgtt actttcccag      2220
     atattattcg caactacaaa gtcatggctg ccgagaacat accagagaat cccctgaagt      2280
     atctgtaccc caatattgac aaagaccacg cctttgggaa gtattattcc agaccaaagg      2340
     aagcaccaga accgatggag cttgacgacc ctaagcgaac tggatacatc aagactgagt      2400
     tgatttctgt gtctgaagtc cacccttcta gacttcagac cacagacaac ctgcttccca      2460
     tgtctccaga ggagtttgat gagatgtccc ggatagtggg ccccgaattt gacagtatga      2520
     tgagcacagt ataaacacga atttctctct ggcgacattt ttttcccatc tgtgattcct      2580
     tcctgctact gttccttcat atgcagtatt tctagggaaa tgcaagaaag aaagagcatc      2640
     acatttgctg agcactgctg gtagaaagtg gatatttctc taattagaaa cctgttactc      2700
     tgaaggactt catgcatctt actgaaggtg aaatggaaag tcacttaaca caaaatggat      2760
     tttgtaaaca aagaccaaga gatccaccca agcaccagga ctagagtgcg agtatttggg      2820
     gcaaggtgag gagaacggtc actttagtaa tggtctgtaa tcagtgccca agtgctgcac      2880
     atcactggaa agagacatac ttatggggga ggggccttct tgatggagga atgtttctgt      2940
     cccgggagac attggcactt cccctctcct ggatggccgg aagtcttcca ctgttttaca      3000
     tatggcacag ttcaaagtca actttagatc caatgctcta tcaaactata gtgggcatcc      3060
     ttcatgtgag tgggaagaaa acaaccgtgc tccttactgc agcttctgcc aaggcatggt      3120
     tgctctcctc agggactagc tttgttggtg gcaatggcta cacaaaacta aacaccaaca      3180
     gaagtaagac cattttcatg agtactccat caagttaaag ggtttttgtt gtctttttgg      3240
     tcatggattg aataaaattg tctttgcaca tccattaagg gggccagctt tcttaaagca      3300
     atttttcttt ttttttaact aaaattagat ataggtgaac tcatgttttt tagtgggctg      3360
     aacttatcgg ttttagctgg ttgtcttaat tagccataaa cttggagaaa gcagtgactt      3420
     cttgaatcct tagccaaata tgagtatcag ataattttat tatttttttt tcgagacagg      3480
     gtttctctgt gtagccctgg ctattctgga actcactctg tagatcaggc tggcctggaa      3540
     ctcagaaatc cgcctgtctc tgcctcccga gtgctgggat taaaggtgtg caccaccaat      3600
     gcctggtgag ataactttaa agaactccct ataaatgcat gagaaccact gttactgatg      3660
     aatgtggttt tttgacaact acattcacaa atggcctgtc ttgtgttttg tcaccgtttt      3720
     gagggatgat gttttgtggc acgtgtgtga tcacagcctg atggttctgg tcgtgggttg      3780
     gttcttctgg gccagctttc acagactgct gcgcagctgc acctacagtg ctgccccata      3840
     atactgtttc actttggtga agatcagccc accttacacc ccgagtgcag gtgtgaacca      3900
     cggtaagtgt gcacagtcct tagggaaaac agggacgcag aggcctgcct cctctctttt      3960
     ccatgccaaa atgaaatgac caagaaacaa aacatttaaa aagttgtttc taaatgctga      4020
     gacctaacca ttgcttatat actgttgtct gttgaaacag tttgttacaa tttcattctg      4080
     ttgaactagg tgagacttta agaaatgttg aaattatgtt aatttcctat tattatttaa      4140
     tataaagata tttaaaatgt ctagtgttat gagttggttt aatatatatc tcatgtatgt      4200
     atcagtccta ttttaagcgc tttttaaaaa agacttgttt aggt                       4244
//
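
The record above carries two internal consistency checks that are easy to verify by hand or in code: the base counts on the SQ line should add up to the stated sequence length, and the CDS span should be a whole number of codons. The following is a minimal sketch in Python, assuming the relevant lines have already been extracted as strings. The helper names `parse_sq_header` and `cds_length` are hypothetical and are not part of Dbfetch or any ENA tooling, and `cds_length` handles only simple `start..end` locations, not joins or complements.

```python
import re


def parse_sq_header(line):
    """Parse an EMBL 'SQ' header line into (length, per-base counts)."""
    m = re.match(r"SQ\s+Sequence\s+(\d+)\s+BP;(.*)", line)
    if m is None:
        raise ValueError("not an SQ header line")
    length = int(m.group(1))
    # Each count is written as '<number> <label>;', e.g. '1173 A;'
    counts = {}
    for number, label in re.findall(r"(\d+)\s+([A-Za-z]+);", m.group(2)):
        counts[label] = int(number)
    return length, counts


def cds_length(location):
    """Length in nucleotides of a simple, inclusive 'start..end' location."""
    m = re.fullmatch(r"(\d+)\.\.(\d+)", location)
    if m is None:
        raise ValueError("only simple 'start..end' locations are supported")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


if __name__ == "__main__":
    sq = "SQ   Sequence 4244 BP; 1173 A; 970 C; 1007 G; 1094 T; 0 other;"
    length, counts = parse_sq_header(sq)
    print(length, sum(counts.values()))   # both values should agree

    n = cds_length("285..2534")
    print(n, n // 3, n % 3)               # nucleotides, codons, remainder
```

For this entry the counts sum to 4244, matching the ID and SQ lines, and the CDS spans 2250 nucleotides, that is 750 codons, which is consistent with the 749-residue /translation plus a terminal stop codon.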