Dbfetch

ID   AK166931; SV 1; linear; mRNA; HTC; MUS; 2185 BP.
XX
AC   AK166931;
XX
DT   09-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 10)
XX
DE   Mus musculus blastocyst blastocyst cDNA, RIKEN full-length enriched
DE   library, clone:I1C0023I18 product:Spinster-like protein, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2185
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (14-APR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; a6681e08a5162fe29eb70333d5d3ef20.
DR   Ensembl-Gn; ENSMUSG00000030741; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0032976; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0032959; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0032891; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0032964; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0032674; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0033470; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0032002; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0032647; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0032799; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0032752; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0032893; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0032784; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0033490; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0031710; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0032115; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000032994; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0086430; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0086513; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0086452; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0086474; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0086036; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0086969; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0086553; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0085976; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0086173; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0086013; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0086208; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0086055; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0087262; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0085965; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0085075; mus_musculus_wsbeij.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group at the
CC   RIKEN Genomic Sciences Center and the Genome Science Laboratory at RIKEN.
CC   The Division of Experimental Animal Research at RIKEN contributed by
CC   preparing mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=I1C0023I18
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2185
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="blastocyst"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I1C0023I18"
FT                   /cell_type="blastocyst"
FT                   /db_xref="taxon:10090"
FT   CDS             350..1936
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="Spinster-like protein (UniProt|Q9EQK0, evidence:
FT                   FASTY, 98.9%ID, 100%length, match=1584)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q8R0G7"
FT                   /db_xref="InterPro:IPR011701"
FT                   /db_xref="InterPro:IPR020846"
FT                   /db_xref="MGI:MGI:1920908"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q8R0G7"
FT                   /protein_id="BAE39125.1"
FT                   /translation="MAGSDTAPFLSQADDPDDGPAPGHPGLPGPMGNPKSGELEVPDCE
FT                   GLQRITGLSRGHSTIIVVVLCYINLLNYMDRFTVAGVLTDIEQFFNIGDGSTGLIQTVF
FT                   ISSYMVLAPVFGYLGDRYNRKYLMCGGIAFWSLVTLGSSFIPREHFWLLLLTRGLVGVG
FT                   EASYSTIAPTLIADLFVADQRSRMLSIFYFAIPVGSGLGYIAGSKVKDVAGDWHWALRV
FT                   TPGLGVLAVLLLFLVVQEPPRGAVERHSGSPPLSPTSWWADLKALARNPSFVLSSLGFT
FT                   SVAFVTGSLALWAPAFLLRSRVVLGETPPCLPGDSCSSSDSLIFGLITCLTGVLGVGLG
FT                   VEISRRLRRFNPRADPLVCAAGLLGSAPFLFLALACARGSIVATYIFIFIGETLLSMNW
FT                   AIVADILLYVVIPTRRSTAEAFQIVLSHLLGDAGSPYLIGLISDRLRRSWPPSFLSEFR
FT                   ALQFSLMLCAFVGALGGAAFLGTAMFIEDDRRRAQLHVQGLLHESGPSDDRIVVPQRGR
FT                   STRVPVSSVLI"
FT   regulatory      2170..2175
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2185
FT                   /note="putative"
XX
SQ   Sequence 2185 BP; 364 A; 659 C; 642 G; 520 T; 0 other;
     gtcacatgac ctgctcttgg gcaacatggc ggcagttgtg ttgctgagct tggactgagc        60
     aacagcgagt gtggcgcgct ccttcccggg ctttggagtg cgtcagcgtg agaagaggga       120
     ctgggtcctg gtcttcctgc tcctgtccca gcgtcacctg cacctcctgt gtgctctcct       180
     ctgtcggaac cagagggaat gacgaagccc agggcttttc gcagtggtgc actgtgcatc       240
     ggaccacgct cctaggcgat agtgggcaag tcttctcgca gtcctctctc caacctcctt       300
     ccgtcggagg taggaccgaa gcgtgggcgg ttgcgattcc ccagggacca tggccgggtc       360
     cgacacggcg cccttcctca gccaagcaga tgatcctgat gacgggccag cgcccggcca       420
     tccggggttg ccaggaccca tggggaatcc aaagtccggg gaactcgagg tcccagactg       480
     tgaggggcta cagcgcatca ctggcttatc tcggggccat tcgaccatca tagtggtggt       540
     tctgtgctac attaacctcc tgaactacat ggaccgcttc accgtggcag gggttcttac       600
     agacatcgag cagttcttta acatcggaga tggtagtact ggcctcatcc agactgtgtt       660
     catctccagt tacatggtgt tggcaccagt gtttggctac ctgggtgaca ggtacaatcg       720
     aaagtacctc atgtgcgggg gcattgcctt ctggtccctg gtgacactgg gatcatcctt       780
     catccccaga gagcatttct ggctgcttct cctgacccgg ggcctggtgg gggtcgggga       840
     ggccagttac tccaccattg cgcccaccct gatcgccgac ctcttcgtgg cagaccagcg       900
     gagtcggatg ctcagtatct tctactttgc catccctgtg ggcagtggtc taggttacat       960
     tgctggctcc aaagtgaaag acgtggctgg agactggcac tgggctctac gggtgacacc      1020
     aggtctagga gtgctggctg tcctgctgct gttcctggtg gtccaggagc ccccaagagg      1080
     agccgtggag cgccactcag gttcaccacc cctgagcccc acctcttggt gggcagatct      1140
     gaaggcactg gcacgaaatc ctagtttcgt cctgtcttcc cttggcttca cctctgtggc      1200
     ctttgtcacg ggctccctgg ctctctgggc cccagcgttc ctgctgcgct cccgggttgt      1260
     tctgggagag actccgccct gtctccctgg agattcatgc tcttcctctg acagtctcat      1320
     ctttggactc atcacttgcc tgactggagt cctgggtgtg ggcctgggag tggagatcag      1380
     ccgccgcctt cgccgcttca accctcgggc tgacccactc gtctgtgcag ctggcctcct      1440
     gggttcggcg cctttcctct tcctggccct ggcctgtgcc cgaggtagca tcgtggccac      1500
     ctatattttt atctttattg gggagaccct gttgtccatg aactgggcca ttgtggctga      1560
     catcctgttg tacgtggtga tcccaactcg acggtccacg gctgaggcct tccagatagt      1620
     gctgtcccac ttgctaggag atgcagggag cccttacctc attggtctaa tctctgaccg      1680
     cctccgacgg agctggcccc cttccttcct gtccgagttc cgggctctgc agttctcgct      1740
     catgctctgt gctttcgttg gggcactggg tggtgcggcc ttcctgggca ccgccatgtt      1800
     cattgaagat gaccgccggc gggctcaact ccacgtgcag ggtctgttgc atgagtctgg      1860
     gccctcagat gaccggattg tagtacctca gcgaggccgt tctacccgag tccccgtgtc      1920
     cagcgtgctc atctgaggag ccggtgctta cccggccact gatgcatcgc agctgggcct      1980
     tgggcccacc caagacggtt cccaggcaga agccctcacc aggcccaggt ccaagaagga      2040
     agccctggga tatctcccag ctcccagaca ctacatgggc agcacaggga agagatggga      2100
     gtccagaaac ggggaagggg tgtcctctct actaggacag cccaaggggt ttggtgctat      2160
     ttgtaatgga ataaaatttg taatc                                            2185
//