Dbfetch

ID   AK075635; SV 1; linear; mRNA; HTC; MUS; 1733 BP.
XX
AC   AK075635;
XX
DT   13-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 13)
XX
DE   Mus musculus 18-day embryo whole body cDNA, RIKEN full-length enriched
DE   library, clone:1110025P14 product:steroid 5 alpha-reductase 2-like, full
DE   insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1733
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-APR-2002) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; d0da7b505d8ab48d08319a833ec56f46.
DR   Ensembl-Gn; ENSMUSG00000029233; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000031143; mus_musculus.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=1110025P14
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1733
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="18-day embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="1110025P14"
FT                   /tissue_type="whole body"
FT                   /db_xref="taxon:10090"
FT   CDS             52..1044
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="steroid 5 alpha-reductase 2-like (MGD|MGI:1930252
FT                   GB|NM_020611, evidence: BLASTN, 98%, match=1732)"
FT                   /db_xref="GOA:Q9WUP4"
FT                   /db_xref="InterPro:IPR001104"
FT                   /db_xref="MGI:MGI:1930252"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9WUP4"
FT                   /protein_id="BAC35871.1"
FT                   /translation="MAGWAGFELSALNPLRTLWLALAAAFLFALLLQLAPARLLPSCAL
FT                   FQDLLRYGKTKQSGSRRPAVCRAFDVPKRYFSHFYVISVVWNGSLLWLLSQSLFLGAPF
FT                   PNWLSALLRTLGATQFQALEMESKASRMPAAELALSAFLVLVFLWVHSLRRLFECFYVS
FT                   VFSNAAIHVVQYCFGLVYYVLVGLTVLSQVPMDDKNVYVLGKNLLIQARWFHILGMVMF
FT                   FWSSAHQYKCHVILSNLRRNKKGVVIHCQHRIPFGDWFEYVSSANYLAELMIYISMAVT
FT                   FGLHNLTWWLVVTYVFSSQALSAFFNHKFYRSTFVSYPKHRKAFLPFLF"
FT   regulatory      1710..1715
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      1733
FT                   /note="putative"
XX
SQ   Sequence 1733 BP; 363 A; 470 C; 421 G; 479 T; 0 other;
     gagagaagga gggcggcgcg gcggcgcgca tctgtagccg gctcccgggc tatggctggg        60
     tgggccgggt ttgagctctc ggccctgaac cccctgcgga cgctgtggct ggcgctggcc       120
     gccgccttcc tgttcgcgct gctgctgcag ctggcgcccg ccaggctgct gccgagctgc       180
     gcgctcttcc aggacctgct ccgctacggg aagaccaagc agtccggctc gcggcgcccc       240
     gccgtctgca gggccttcga tgtccccaag aggtactttt ctcacttcta cgtcatctca       300
     gttgtgtgga atggctccct gctctggcta ctttctcagt cgttgttcct gggagcacct       360
     tttccaaact ggcttagtgc tctgctcaga actcttgggg ccacacagtt ccaagccctg       420
     gagatggagt ccaaggcttc tcggatgcca gcggctgagc tggctctgtc tgccttcttg       480
     gtcttggtgt tcctctgggt ccacagcctt cggagactct ttgagtgctt ctacgtcagt       540
     gtcttctcta atgcagccat tcacgttgtg cagtactgtt tcgggcttgt ctactatgtg       600
     cttgttggcc tgactgtact gagccaagtg cccatggatg ataagaatgt gtatgttctg       660
     gggaagaatc tactgataca agcccggtgg ttccacatcc tgggcatggt gatgttcttc       720
     tggtcatccg cccatcagta taaatgccac gtcatcctca gcaacctcag gagaaacaaa       780
     aaaggtgtgg tcatccactg ccagcaccgg atcccctttg gagactggtt cgagtacgtg       840
     tcttcagcta actacctagc agagctgatg atctacatct ccatggctgt cacctttggg       900
     ctccacaact taacctggtg gctggtggtg acgtatgtct tctccagcca agccttgtcc       960
     gcattcttca accacaagtt ctacagaagc acatttgtct cctacccaaa gcataggaaa      1020
     gcgttcctcc catttttgtt ttaaagcggg ctttatggtg aagaaagcca ggtgacagat      1080
     tccattccta gaggcactga gacagagacc aaagtacact ttctgcggga atgtttgacg      1140
     gtccttgttc tacttcagag ccagccgagc agtgttcacc gagcgaggcg ttattcctgg      1200
     agaacacatt ccagcagacc taggcttgca ggatcggctt tctgccaagc tttacagagc      1260
     taactgataa caagtaaacg ggtgtctcca aactgctttc tggccgcact aaccagtata      1320
     aacagctgtg catggataca tgcatgcttg agtcagtctc tctctctctc tctctctctc      1380
     tatctctctc tctctctctc tctctctctc tctctctagc tgtgggaaca gttaactgca      1440
     ctgagtgctg tgggagatca taggttttta ataaatgtca catgccaata aaaacaggaa      1500
     actctgaaaa taatatgaat gtacagtatc agaccggtgg ttccagggag taaagagttt      1560
     gttggagtga gttaacattc ttccttttct caccgatatc tcccgttata gccccactcc      1620
     ctcacttgcc ctgcaaccaa cctaataata agtttagaaa cttctttgat ggaattctct      1680
     ttttgatgtt tgtactgaaa acatcgagaa ataaaatatt tttatacgta tgc             1733
//