Dbfetch

ID   AK002752; SV 1; linear; mRNA; HTC; MUS; 1227 BP.
XX
AC   AK002752;
XX
DT   08-FEB-2001 (Rel. 66, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 21)
XX
DE   Mus musculus adult male kidney cDNA, RIKEN full-length enriched library,
DE   clone:0610033N24 product:STIP1 homology and U-Box containing protein 1,
DE   full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1227
RA   Adachi J., Aizawa K., Akahira S., Akimura T., Arai A., Aono H., Arakawa T.,
RA   Bono H., Carninci P., Fukuda S., Fukunishi Y., Furuno M., Hanagaki T.,
RA   Hara A., Hayatsu N., Hiramoto K., Hiraoka T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Izawa M., Kasukawa T., Kato H., Kawai J., Kojima Y.,
RA   Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T., Miyazaki A.,
RA   Nishi K., Nomura K., Numazaki R., Ohno M., Okazaki Y., Okido T., Owa C.,
RA   Saito H., Saito R., Sakai C., Sakai K., Sano H., Sasaki D., Shibata K.,
RA   Shibata Y., Shinagawa A., Shiraki T., Sogabe Y., Suzuki H., Tagami M.,
RA   Tagawa A., Takahashi F., Tanaka T., Tejima Y., Toya T., Yamamura T.,
RA   Yasunishi A., Yoshida K., Yoshino M., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (10-JUL-2000) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 254253064412a2055d633d366ab91f2c.
DR   Ensembl-Gn; ENSMUSG00000039615; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0023373; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0023332; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0023298; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0023337; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0023097; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0023779; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0022599; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0023072; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0023202; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0023172; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0023279; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0023191; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0023819; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0022348; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0022662; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000044911; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0046240; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0046213; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0046164; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0046175; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0045906; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0046647; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0046011; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0045839; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0045953; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0045926; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0046063; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0045899; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0046744; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0045595; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0045281; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=0610033N24
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1227
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="0610033N24"
FT                   /tissue_type="kidney"
FT                   /db_xref="taxon:10090"
FT   CDS             66..980
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="STIP1 homology and U-Box containing protein 1
FT                   (MGD|MGI:1891731)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q9WUD1"
FT                   /db_xref="InterPro:IPR003613"
FT                   /db_xref="InterPro:IPR011990"
FT                   /db_xref="InterPro:IPR013026"
FT                   /db_xref="InterPro:IPR013083"
FT                   /db_xref="InterPro:IPR019734"
FT                   /db_xref="MGI:MGI:1891731"
FT                   /db_xref="PDB:2C2L"
FT                   /db_xref="PDB:2C2V"
FT                   /db_xref="PDB:3Q47"
FT                   /db_xref="PDB:3Q49"
FT                   /db_xref="PDB:3Q4A"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9WUD1"
FT                   /protein_id="BAB22329.1"
FT                   /translation="MKGKEEKEGGARLGTGGGGTPDKSPSAQELKEQGNRLFVGRKYPE
FT                   AAACYGRAITRNPLVAVYYTNRALCYLKMQQPEQALADCRRALELDGQSVKAHFFLGQC
FT                   QLEMESYDEAIANLQRAYSLAKEQRLNFGDDIPSALRIAKKKRWNSIEERRIHQESELH
FT                   SYLTRLIAAERERELEECQRNHEGHEDDGHIRAQQACIEAKHDKYMADMDELFSQVDEK
FT                   RKKRDIPDYLCGKISFELMREPCITPSGITYDRKDIEEHLQRVGHFDPVTRSPLTQEQL
FT                   IPNLAMKEVIDAFISENGWVEDY"
FT   regulatory      1210..1215
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      1227
FT                   /note="putative"
XX
SQ   Sequence 1227 BP; 263 A; 335 C; 390 G; 239 T; 0 other;
     ggatcgctgc gcgggctgcg agatctaggt ggccgggcgc ggacccaagc cgtgccgccg        60
     gcgccatgaa gggcaaggag gaaaaggagg gcggcgcgcg gctgggcact ggtggcggcg       120
     gcacgcctga taagagcccg agtgcgcaag agctcaagga gcagggaaac cggctcttcg       180
     tgggccgcaa gtacccggag gcggcggcct gctacggccg cgccatcact cggaacccac       240
     ttgtggcagt gtactacact aaccgggccc tgtgctatct gaagatgcag cagcctgaac       300
     aggcacttgc tgactgccgg cgagccctgg agctggacgg gcagtctgtg aaggcgcact       360
     tcttcctggg gcagtgccag ctggagatgg agagttatga tgaggccatt gccaatctgc       420
     agcgagccta tagtttggcc aaggagcagc gactcaactt tggggatgat attcctagtg       480
     cccttcgcat tgctaagaag aagcgctgga acagtatcga ggaacggcgc atccaccagg       540
     agagtgagct gcattcatat ctcaccaggc tcattgctgc tgagcgagag agggaactgg       600
     aggagtgtca gcggaaccac gagggtcatg aagatgatgg ccacatccgg gcccagcagg       660
     cctgcattga ggccaagcac gataaataca tggcagatat ggatgagctc ttctctcagg       720
     tggacgagaa aagaaagaag cgagatatcc ctgactactt gtgtggcaag attagctttg       780
     agctgatgcg ggaaccctgc attacaccca gtggtatcac ctatgaccgc aaggacattg       840
     aggagcacct gcagcgtgtg ggccactttg accctgtgac ccggagccct ctgacccagg       900
     aacagctcat ccccaatttg gccatgaagg aagtcattga cgctttcatc tctgagaacg       960
     gctgggtaga ggactattga ggccccatgt cctgcctggc accctggccc aggaggatct      1020
     ggagacggaa gctccagtcc ctgtatagtt tgtgtccctg ggcctgcccc catcggccct      1080
     gctgatgggt tctgaactgc tccccttctc agcatacccc ttgctggacc atgagcctcc      1140
     cttgtccccc ttctgggctg gagagtgggt gagggtgggc tgaggttgct gctgctgcca      1200
     ctgtcctgta ataaagtctg tgacact                                          1227
//