Dbfetch

ID   AK138681; SV 1; linear; mRNA; HTC; MUS; 2207 BP.
XX
AC   AK138681;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus 0 day neonate thymus cDNA, RIKEN full-length enriched library,
DE   clone:A430003M07 product:steroid 5 alpha-reductase 2-like, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2207
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 223a9d49e1ffec1683a753552cd7f389.
DR   Ensembl-Gn; ENSMUSG00000029233; mus_musculus.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0029443; mus_musculus_nodshiltj.
DR   Ensembl-Tr; ENSMUST00000031143; mus_musculus.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0071874; mus_musculus_nodshiltj.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=A430003M07
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2207
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="0 day neonate"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="A430003M07"
FT                   /tissue_type="thymus"
FT                   /db_xref="taxon:10090"
FT   CDS             48..665
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="steroid 5 alpha-reductase 2-like (MGD|MGI:1930252
FT                   GB|NM_020611, evidence: BLASTN, 100%, match=645)"
FT                   /db_xref="GOA:Q9WUP4"
FT                   /db_xref="InterPro:IPR001104"
FT                   /db_xref="MGI:MGI:1930252"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9WUP4"
FT                   /protein_id="BAE23745.1"
FT                   /translation="MAGWAGFELSALNPLRTLWLALAAAFLFALLLQLAPARLLPSCAL
FT                   FQDLLRYGKTKQSGSRRPAVCRAFDVPKRYFSHFYVISVVWNGSLLWLLSQSLFLGAPF
FT                   PNWLSALLRTLGATQFQALEMESKASRMPAAELALSAFLVLVFLWVHSLRRLFECFYVS
FT                   VFSNAAIHVVQYCFGLVYYVLVGLTVLSQVPMDDKNGKWPCQ"
XX
SQ   Sequence 2207 BP; 470 A; 556 C; 581 G; 600 T; 0 other;
     gaaggagggc ggcgcggcgg cgcgcatctg tagccggctc ccgggctatg gctgggtggg        60
     ccgggtttga gctctcggcc ctgaaccccc tgcggacgct gtggctggcg ctggccgccg       120
     ccttcctgtt cgcgctgctg ctgcagctgg cgcccgccag gctgctgccg agctgcgcgc       180
     tcttccagga cctgctccgc tacgggaaga ccaagcagtc cggctcgcgg cgccccgccg       240
     tctgcagggc cttcgatgtc cccaagaggt acttttctca cttctacgtc atctcagttg       300
     tgtggaatgg ctccctgctc tggctacttt ctcagtcgtt gttcctggga gcaccttttc       360
     caaactggct tagtgctctg ctcagaactc ttggggccac acagttccaa gccctggaga       420
     tggagtccaa ggcttctcgg atgccagcgg ctgagctggc tctgtctgcc ttcttggtct       480
     tggtgttcct ctgggtccac agccttcgga gactctttga gtgcttctac gtcagtgtct       540
     tctctaatgc agccattcac gttgtgcagt actgtttcgg gcttgtctac tatgtgcttg       600
     ttggcctgac tgtactgagc caagtgccca tggatgataa gaatggtaag tggccctgcc       660
     agtaggttct aaagtcatca taagtggccc tggcagtagg ttctaaagtc atcatgtcaa       720
     caggtagata cttaaccatc gtgaccacta aaacctagga acagcaccta ggatccgtag       780
     cagatccttc agtgacaata ggacagtaca aatcattcct ggggccgttt aaggaatgaa       840
     gttaaactgt gggaggtgga aggagagctg agatctgaac agcaggtggg agcctccgaa       900
     gaaagggcca ccaaaggaag aggcctcccc acagccccgt tgtctagaag tcactctgtg       960
     ggattaaatc ccagaccagg agacagaggt gcttgaggta agtagcttgg atttgaagaa      1020
     ggaatgcaaa ctgttctctt gactttcttt cttgggatgg ctgccttctc gacaaagccc      1080
     ttcttcatgt ggggcagcct gagggaagag gaaaaccgag tgttgtgcta aatggagatt      1140
     gccttaaggt ttccaagtgc tctctagcca ggaagaggga agcaaacgtt tgtgtgtgag      1200
     aaggaagtga ggttggttgg tgttttacac ggacctgcac accgctgtct gactctcggt      1260
     ttcataagac gatatgtaca tatgacacag agttcttctc ctctttggtg attggaatga      1320
     ttgttttggt gctgaggcta gcttgatctt gaagccttgg gctttcagcc cagatcttct      1380
     gggacccttt tttccatgtc ctatatcata agttaagatg aagatctgta tttccaacta      1440
     ccagtgactg agcctttttt tttttttttt ttttttgggt ttgatttcag tctcagagtg      1500
     ctccttaaag tctgccctgt tgctataata cacttcccgg ggctgtaaga attatttgaa      1560
     aggtaggctt attgaccagg tagcccgtca acttgtgaat tagttattcg ttccagctat      1620
     tcaggcagct gctcttctaa tccctccaga tttcattgta ttggatactt tctggatgct      1680
     gtgataaaac actatgacca agatggcttc cagaaggagg accttaactt ggctcacagt      1740
     gacagagggc tgagagtcga tcccagcagg taggctaggc gtggcagcga accaccagga      1800
     gcagggagct gagagctcac gtctcagcca ggtttaggaa gcagaatgag aaaactgcaa      1860
     gtagggagag gctgtatagt cccaggctca cctgctgtgc tccctccaag gctgcacctc      1920
     ctaaacctcc acagtctcac caactggagc ctaagggttc aaatccctga acctccggag      1980
     aacagctctc acgtaaaatc catagtaatc actcatcaaa tcaggcttcc cgcccaggtc      2040
     tcccctgggt gttcagatgc acagccatca tctggggtta tttggccaga gaactttcca      2100
     gattccttgg ttgcccatta gttttctctc tgtcagcttc tgttggcctt acatcacttt      2160
     tctgaattta tttctgtggc tggagcccag cttctacatt ctgcctc                    2207
//