ID   AK049357; SV 1; linear; mRNA; HTC; MUS; 3114 BP.
XX
AC   AK049357;
XX
DT   18-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 16)
XX
DE   Mus musculus ES cells cDNA, RIKEN full-length enriched library,
DE   clone:C330027D01 product:src homology 2 domain-containing transforming
DE   protein C1, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3114
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-JUL-2001) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; d12301ac6c3a2436c0f49f41185abdde.
DR   Ensembl-Gn; ENSMUSG00000042626; mus_musculus.
DR   Ensembl-Gn; MGP_AJ_G0027403; mus_musculus_aj.
DR   Ensembl-Gn; MGP_BALBcJ_G0027412; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0027858; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0026600; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0027128; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0027266; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0027917; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0026334; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0026682; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000039110; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000094378; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000107417; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000191485; mus_musculus.
DR   Ensembl-Tr; MGP_AJ_T0061917; mus_musculus_aj.
DR   Ensembl-Tr; MGP_BALBcJ_T0061844; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0062320; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0061831; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0061503; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0061605; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0062511; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0061384; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0060811; mus_musculus_wsbeij.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group at the
CC   RIKEN Genomic Sciences Center and the Genome Science Laboratory at RIKEN.
CC   The Division of Experimental Animal Research at RIKEN contributed by
CC   preparing mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=C330027D01
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3114
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="C330027D01"
FT                   /cell_type="ES cells"
FT                   /db_xref="taxon:10090"
FT   CDS             136..1545
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="src homology 2 domain-containing transforming
FT                   protein C1 (MGD|MGI:98296 GB|NM_011368, evidence: BLASTN,
FT                   100%, match=1462)"
FT                   /db_xref="GOA:P98083"
FT                   /db_xref="InterPro:IPR000980"
FT                   /db_xref="InterPro:IPR006019"
FT                   /db_xref="InterPro:IPR006020"
FT                   /db_xref="InterPro:IPR011993"
FT                   /db_xref="InterPro:IPR029586"
FT                   /db_xref="MGI:MGI:98296"
FT                   /db_xref="UniProtKB/Swiss-Prot:P98083"
FT                   /protein_id="BAC33706.1"
FT                   /translation="MNKLSGGGGRRTRVEGGQLGGEEWTRHGSFVNKPTRGWLHPNDKV
FT                   MGPGVSYLVRYMGCVEVLQSMRALDFNTRTQVTREAISLVCEAVPGAKGATRRRKPCSR
FT                   PLSSILGRSNLKFAGMPITLTVSTSSLNLMAADCKQIIANHHMQSISFASGGDPDTAEY
FT                   VAYVAKDPVNQRACHILECPEGLAQDVISTIGQAFELRFKQYLRNPPKLVTPHDRMAGF
FT                   DGSAWDEEEEEPPDHQYYNDFPGKEPPLGGVVDMRLREGAARPTLPSAQMSSHLGATLP
FT                   IGQHAAGDHEVRKQMLPPPPCPGRELFDDPSYVNIQNLDKARQAGGGAGPPNPSLNGSA
FT                   PRDLFDMKPFEDALRVPPPPQSMSMAEQLQGEPWFHGKLSRREAEALLQLNGDFLVRES
FT                   TTTPGQYVLTGLQSGQPKHLLLVDPEGVVRTKDHRFESVSHLISYHMDNHLPIISAGSE
FT                   LCLQQPVDRKV"
FT   regulatory      3094..3099
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      3114
FT                   /note="putative"
XX
SQ   Sequence 3114 BP; 692 A; 808 C; 843 G; 771 T; 0 other;
     ttggagtgta ataaagtttg tctggggagg cccaggctgg ggtgaaagtt ggggcggtga        60
     cttaagcaga cagttgcgtg atccggaacc agatcggccc gcggtgcggt gcggagactc       120
     catgagaccc tggacatgaa caagctgagt ggaggcggcg ggcgcaggac tcgggtagaa       180
     gggggccagc tggggggcga ggagtggacc agacacggga gctttgtcaa taagcccaca       240
     cgaggctggc tgcatcccaa cgacaaagtc atgggacctg gggtttccta cttggttcgg       300
     tacatgggct gtgtggaggt cttacagtca atgcgagccc ttgacttcaa tacccggact       360
     caggtcacca gggaggccat cagtttggtg tgtgaagctg tgcctggtgc caaaggggcg       420
     acaaggagga gaaagccttg tagccgccca ctcagctcca tcctggggag gagtaacctg       480
     aagtttgctg gaatgccaat cactctcact gtgtctacca gcagccttaa cctcatggca       540
     gccgactgca aacagatcat tgccaaccat cacatgcaat ctatctcttt cgcgtccggt       600
     ggggatccgg acacagctga gtatgttgcc tatgttgcca aagaccctgt gaatcagaga       660
     gcctgccata tcctggagtg tcctgaaggg cttgctcagg atgtcatcag caccatcggg       720
     caggcctttg agttgcgctt caaacagtat ctcaggaatc caccgaagct ggtcaccccc       780
     catgacagga tggctggctt tgatggctca gcttgggatg aggaggaaga agagccccct       840
     gaccatcagt actacaatga ctttccaggg aaggaacccc ctcttggtgg ggtggtagat       900
     atgaggcttc gggaaggggc tgctcgaccc actctgccta gtgcccagat gtccagccac       960
     ttgggagcta cactgcctat agggcagcat gctgcaggag accatgaagt ccgtaaacag      1020
     atgttgcctc cgccgccttg cccaggcaga gaactcttcg atgacccctc ctatgtcaac      1080
     atccagaatc tagacaaggc ccggcaggct gggggtgggg ctgggccccc aaatccttct      1140
     cttaatggca gtgcaccccg agaccttttt gacatgaagc cctttgaaga tgcacttcgg      1200
     gtgccacccc caccgcagtc catgtccatg gctgagcagc tgcaagggga gccctggttc      1260
     cacgggaagc tgagccggag ggaggccgag gcgctgctgc agctcaatgg tgacttcttg      1320
     gtgcgagaga gcacgaccac gcctggccag tatgtgctca ctggcctgca gagtgggcag      1380
     cccaagcact tgctgctggt ggaccctgaa ggtgtggttc ggacaaagga tcaccgcttt      1440
     gagagtgtca gtcacctgat cagctaccac atggacaatc acttgcccat catctctgcg      1500
     ggcagcgaac tgtgcctaca gcaacccgtg gatcggaaag tgtgatcctt ctcagcttct      1560
     ccaacaggat gctctccatt tccgtctccc gtattctcta acttgtggga cctctgtttt      1620
     gtgggtctgg ccttgggtgg gaactgggag caacgaggac atgggtttag tgcccacttg      1680
     agagagagaa aaagagggtt tcagtaagga gcctggggta gcatcctgcc tctggccaaa      1740
     cttcaccaaa gtattaatgt gcagagtggt cccttgtctg ggccttgcct gtgccaacct      1800
     gatgcccttc ccccccaaag ggtgggttct tataatggaa aatgccctgt gatgataggc      1860
     ccagtggagc aactgccctt tgggggaagg gaaataatta tacctctggt ttactcctgg      1920
     gtcttcaggg taccccagat cccgcataac atatcccact ccctctgctt ccccttaaac      1980
     tttgtgcctt tgactatcat aggtctgcag atacttaatg cagagttctc ggcccttcac      2040
     gtgtggacag gggttactgc caccttggct tctggagccc tgtcctattc agcacccctt      2100
     cctgtgtcta gggagaatag ggacaggagt ggccgctatc tgctctgcct ttcggatgtg      2160
     cagcccttaa gagattgccc caagcctgaa tatggtggcg cacgccttta atcacagcac      2220
     tcaggaggca gagacaggag gaattgtgag gccatccgat ctacaacaga gtgagttcca      2280
     ggacagccag ggctatggag agagaacctg tctccaaaaa ccaaaaagag cgattacccc      2340
     agagccttct tcctgatggt agcggggagg ggcaggactg gacccatctt gctcagtgcc      2400
     tcctgacctc aatgcctttc ctccaagggg tctgtataca tttctcaagc ctgctcctcc      2460
     catgtttgca tgtgtgttat agtctacagc caaagtatag ccctcactgt aaccccatcc      2520
     tgcctccctc ctttgggata ggtgtgtgcg tctgacttgg gcctccaggg tgtgtacagt      2580
     cagtgtgggt tttgtggagg caataagact gaagcagtag acaatcccca ataccatttg      2640
     caggtctgga actgcactct cttttttaaa aaacatgtat acattttagg gctgtagatt      2700
     tattttcctg gttttgtttt tcattgctga cttttgagca cagaattatg ataatcaatt      2760
     acatttatac atcacctcga tgacttttcc aaacttttat tttttttttt aaacaactgt      2820
     tgggatttta ctccctggcc ttaactagga caggattgta ccccactcct cccccccccc      2880
     ctttttcttt ttcgccaaga caactgagca gaaatttggc tgagcagtgt tgtgggacta      2940
     tgatgtgata gttttagatc ctaccttctg ctttcgggca gctgcagcca gcacagaaac      3000
     cttgcaagct cactctgtgt gtaggctttc tggacaagga atggtcgcca aatttttggt      3060
     ttggatgtct tataccaaag ggaaatagtc ttcattaaag ttcgtatttc tttt            3114
//