ID   AK157263; SV 1; linear; mRNA; HTC; MUS; 2530 BP.
XX
AC   AK157263;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus activated spleen cDNA, RIKEN full-length enriched library,
DE   clone:F830208J20 product:signal transducer and activator of transcription
DE   1, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2530
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; a5bdb1af0a3b50aa9deefd2ec6ecdce1.
DR   Ensembl-Gn; ENSMUSG00000026104; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0016006; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0015988; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0015944; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0015945; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0015777; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0016396; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0015358; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0015748; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0015849; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0015851; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0015920; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0015871; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0016444; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0015144; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0015421; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000186574; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000189347; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000191435; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0019740; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0019714; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0019674; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0019667; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0019493; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0020166; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0019004; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0019448; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0019563; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0019569; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0019650; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0019540; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0020223; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0018758; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0019089; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Tissues were provided by Dr. John Todd (Dept. of Medical Genetics
CC   Wellcome Trust Centre for Molecular Mechanisms in Disease Wellcome
CC   Trust/MRC building Addenbrookes Hospital Cambridge) whose
CC   assistance we gratefully acknowledge.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=F830208J20
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2530
FT                   /organism="Mus musculus"
FT                   /strain="NOD"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="F830208J20"
FT                   /tissue_type="activated spleen"
FT                   /db_xref="taxon:10090"
FT   CDS             200..2338
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="signal transducer and activator of transcription 1
FT                   (MGD|MGI:103063 GB|AK039458, evidence: BLASTN, 100%,
FT                   match=2529)"
FT                   /db_xref="GOA:Q99K94"
FT                   /db_xref="InterPro:IPR000980"
FT                   /db_xref="InterPro:IPR001217"
FT                   /db_xref="InterPro:IPR008967"
FT                   /db_xref="InterPro:IPR012345"
FT                   /db_xref="InterPro:IPR013799"
FT                   /db_xref="InterPro:IPR013800"
FT                   /db_xref="InterPro:IPR013801"
FT                   /db_xref="InterPro:IPR015988"
FT                   /db_xref="MGI:MGI:103063"
FT                   /db_xref="UniProtKB/TrEMBL:Q99K94"
FT                   /protein_id="BAE34017.1"
FT                   /translation="MSQWFELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEH
FT                   AAYDVSFATIRFHDLLSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPVQMSMI
FT                   IYNCLKEERKILENAQRFNQAQEGNIQNTVMLDKQKELDSKVRNVKDQVMCIEQEIKTL
FT                   EELQDEYDFKCKTSQNREGEANGVAKSDQKQEQLLLHKMFLMLDNKRKEIIHKIRELLN
FT                   SIELTQNTLINDELVEWKRRQQSACIGGPPNACLDQLQSWFTIVAETLQQIRQQLKKLE
FT                   ELEQKFTYEPDPITKNKQVLSDRTFLLFQQLIQSSFVVERQPCMPTHPQRPLVLKTGVQ
FT                   FTVKLRLLVKLQELNYNLKVKVSFDKDVNEKNTVKGFRKFNILGTHTKVMNMEESTNGS
FT                   LAAEFRHLQLKEQKNAGNRTNEGPLIVTEELHSLSFETQLCQPGLVIDLETTSLPVVVI
FT                   SNVSQLPSGWASILWYNMLVTEPRNLSFFLNPPCAWWSQLSEVLSWQFSSVTKRGLNAD
FT                   QLSMLGEKLLGPNAGPDGLIPWTRFCKENINDKNFSFWPWIDTILELIKKHLLCLWNDG
FT                   CIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYT
FT                   KKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELD
FT                   DPKRTGYIKTELISVSEV"
XX
SQ   Sequence 2530 BP; 704 A; 618 C; 641 G; 567 T; 0 other;
     ggagctcctg cgtgcagtga gtgagtgaga gccagtcgtt tcagctctgc tccataccct        60
     gagccggcgc cacgccgccg cgcatgcaac tggcatataa cttgctgtgt gtggtgattg       120
     cttgtgttga atcccgaacc tgcacccgga gacagcccag taagtctacg tgggaacgga       180
     agcatttgga atctcaagga tgtcacagtg gttcgagctt cagcagctgg actccaagtt       240
     cctggagcag gtccaccagc tgtacgatga cagtttcccc atggaaatca gacagtacct       300
     ggcccagtgg ctggaaaagc aagactggga gcacgctgcc tatgatgtct cgtttgcgac       360
     catccgcttc catgacctcc tctcacagct ggacgaccag tacagccgct tttctctgga       420
     gaataatttc ttgttgcagc acaacatacg gaaaagcaag cgtaatctcc aggataactt       480
     ccaagaagat cccgtacaga tgtccatgat catctacaac tgtctgaagg aagaaaggaa       540
     gattttggaa aatgcccaaa gatttaatca ggcccaggag ggaaatattc agaacactgt       600
     gatgttagat aaacagaagg agctggacag taaagtcaga aatgtgaagg atcaagtcat       660
     gtgcatagag caggaaatca agaccctaga agaattacaa gatgaatatg actttaaatg       720
     caaaacctct cagaacagag aaggtgaagc caatggtgtg gcgaagagcg accaaaaaca       780
     ggaacagctg ctgctccaca agatgttttt aatgcttgac aataagagaa aggagataat       840
     tcacaaaatc agagagttgc tgaattccat cgagctcact cagaacactc tgattaatga       900
     cgagctcgtg gagtggaagc gaaggcagca gagcgcctgc atcgggggac cgcccaacgc       960
     ctgcctggat cagctgcaaa gctggttcac cattgttgca gagaccctgc agcagatccg      1020
     tcagcagctt aaaaagctgg aggagttgga acagaaattc acctatgagc ccgaccctat      1080
     tacaaaaaac aagcaggtgt tgtcagatcg aaccttcctc ctcttccagc agctcattca      1140
     gagctccttc gtggtagaac gacagccgtg catgcccact cacccgcaga ggcccctggt      1200
     cttgaagact ggggtacagt tcactgtcaa gctgagactg ttggtgaaat tgcaagagct      1260
     gaactataac ttgaaagtga aagtctcatt tgacaaagat gtgaacgaga aaaacacagt      1320
     taaaggattt cggaagttca acatcttggg tacgcacaca aaagtgatga acatggaaga      1380
     atccaccaac ggaagtctgg cagctgagtt ccgacacctg caactgaagg aacagaaaaa      1440
     cgctgggaac agaactaatg aggggcctct cattgtcacc gaagaacttc actctcttag      1500
     ctttgaaacc cagttgtgcc agccaggctt ggtgattgac ctggagacca cctctcttcc      1560
     tgtcgtggtg atctccaacg tcagccagct ccccagtggc tgggcgtcta tcctgtggta      1620
     caacatgctg gtgacagagc ccaggaatct ctccttcttc ctgaaccccc cgtgcgcgtg      1680
     gtggtcccag ctctcagagg tgttgagttg gcagttttca tcagtcacca agagaggtct      1740
     gaacgcagac cagctgagca tgctgggaga gaagctgctg ggccctaatg ctggccctga      1800
     tggtcttatt ccatggacaa ggttttgtaa ggaaaatatt aatgataaaa atttctcctt      1860
     ctggccttgg attgacacca tcctagagct cattaagaag cacctgctgt gcctctggaa      1920
     tgatgggtgc attatgggct tcatcagcaa ggagcgagaa cgcgctctgc tcaaggacca      1980
     gcagccaggg acgttcctgc ttagattcag tgagagctcc cgggaagggg ccatcacatt      2040
     cacatgggtg gaacggtccc agaacggagg tgaacctgac ttccatgccg tggagcccta      2100
     cacgaaaaaa gaactttcag ctgttacttt cccagatatt attcgcaact acaaagtcat      2160
     ggctgccgag aacataccag agaatcccct gaagtatctg taccccaata ttgacaaaga      2220
     ccacgccttt gggaagtatt attccagacc aaaggaagca ccagaaccga tggagcttga      2280
     cgaccctaag cgaactggat acatcaagac tgagttgatt tctgtgtctg aagtgtaagt      2340
     gagcacagaa gactggcttg ttcaggaccg caggccagcc ccgtccctgg ctgcggtcca      2400
     ttgattgcac tggctcatgg ctcatggctt gctttcccaa tgtaactgat gttgccacca      2460
     cagttggaca ttctggaaag tttgtaacta gtcctggtgt caatgcttac tggcatttta      2520
     ttgccatgag                                                             2530
//