Dbfetch

ID   AK141312; SV 1; linear; mRNA; HTC; MUS; 3267 BP.
XX
AC   AK141312;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 12)
XX
DE   Mus musculus 7 days embryo whole body cDNA, RIKEN full-length enriched
DE   library, clone:C430026L23 product:septin 9, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3267
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 7fd909ec2156be72d580d1d68aeb8f41.
DR   Ensembl-Gn; ENSMUSG00000059248; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0019446; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0019414; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0019384; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0019389; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0019195; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0019838; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0018749; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0019166; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0019278; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0019270; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0019348; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0019303; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0019877; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0018514; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0018801; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000019038; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000093907; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000100193; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000106349; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0033790; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0033724; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0033714; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0033718; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0033498; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0034218; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0033365; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0033412; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0033555; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0033550; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0033688; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0033559; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0034264; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0033060; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0032970; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=C430026L23
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3267
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="7 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="C430026L23"
FT                   /tissue_type="whole body"
FT                   /db_xref="taxon:10090"
FT   CDS             36..1787
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="septin 9 (MGD|MGI:1858222 GB|BC046524, evidence:
FT                   BLASTN, 99%, match=3266)"
FT                   /db_xref="GOA:Q80UG5"
FT                   /db_xref="InterPro:IPR016491"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="InterPro:IPR030379"
FT                   /db_xref="InterPro:IPR030645"
FT                   /db_xref="MGI:MGI:1858222"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q80UG5"
FT                   /protein_id="BAE24646.1"
FT                   /translation="MKKSYSGVTRTSSGRLRRLADPTGPALKRSFEVEEIEPPNSTPPR
FT                   RVQTPLLRATVASSSQKFQDLGVKNSEPAARLVDSLSQRSPKPSLRRVELAGAKAPEPM
FT                   SRRTEISIDISSKQVESTASAAGPSRFGLKRAEVLGHKTPEPVPRRTEITIVKPQESVL
FT                   RRVETPASKIPEGSAVPATDAAPKRVEIQVPKPAEAPNCPLPSQTLENSEAPMSQLQSR
FT                   LEPRPSVAEVPYRNQEDSEVTPSCVGDMADNPRDAMLKQAPASRNEKAPMEFGYVGIDS
FT                   ILEQMRRKAMKQGFEFNIMVVGQSGLGKSTLINTLFKSKISRKSVQPTSEERIPKTIEI
FT                   KSITHDIEEKGVRMKLTVIDTPGFGDHINNENCWQPIMKFINDQYEKYLQEEVNINRKK
FT                   RIPDTRVHCCLYFIPATGHSLRPLDIEFMKRLSKVVNIVPVIAKADTLTLEERVYFKQR
FT                   ITADLLSNGIDVYPQKEFDEDAEDRLVNEKFREMIPFAVVGSDHEYQVNGKRILGRKTK
FT                   WGTIEVENTTHCEFAYLRDLLIRTHMQNIKDITSNIHFEAYRVKRLNEGNSAMANGIEK
FT                   EPEAQEM"
XX
SQ   Sequence 3267 BP; 763 A; 970 C; 904 G; 630 T; 0 other;
     gacactttcc tgggagcggc ggccacggcg gcaccatgaa gaagtcctac tcaggagtca        60
     cacggacctc cagcggtcgc ctccggaggc ttgcggatcc cactggccca gccttaaaga       120
     gatcgtttga agtcgaggag attgagccgc ccaactccac gccaccccgg agggtccaga       180
     cccctctgct ccgcgccacg gtggccagct ccagccagaa attccaggac ctgggggtga       240
     agaactcaga gcccgctgcc cgccttgtag actccctgag ccagcgctcc cccaagcctt       300
     ccctgcggag ggtggagctg gcgggagcca aggcgcctga gcccatgtct cgacgcacag       360
     agatctccat cgacatctcc tccaagcagg tggagagcac ggcctccgct gccgggccct       420
     cacggttcgg ccttaagagg gctgaagtcc tgggccacaa gaccccagag cctgtccctc       480
     ggaggacgga gatcaccatt gtcaagcctc aggagtcagt gctccgcagg gtggagaccc       540
     ctgcctccaa gattcccgag ggctctgccg tgcctgccac agatgcagcc cccaagaggg       600
     tagagatcca ggtgcccaag ccagctgagg cacccaactg tccgctcccg tcccagaccc       660
     tggagaactc cgaggccccg atgtctcaac tgcagagcag gctggagccc aggccttccg       720
     tggctgaggt cccatatcgg aaccaggaag actccgaggt gactcccagc tgtgttggcg       780
     acatggctga caaccctaga gatgccatgc tcaagcaggc ccccgcgtca cggaacgaaa       840
     aggcccccat ggagttcggc tatgtgggga tcgactccat cctggagcag atgcgcagga       900
     aggctatgaa acagggcttc gagtttaaca tcatggtggt tgggcagagc ggcctcggga       960
     agtccacttt aatcaatacc ctcttcaagt ccaaaatcag ccggaagtcg gtgcagccca      1020
     cctcggagga acgcatcccc aagacgatcg aaatcaagtc gatcacccac gatattgaag      1080
     agaagggggt tcgaatgaag ctgacagtga ttgacacgcc gggcttcgga gaccacatca      1140
     acaatgagaa ctgctggcag cccattatga agttcatcaa tgaccaatat gagaagtacc      1200
     tgcaggagga agtcaacatc aaccggaaga agcgcattcc cgacacccgt gtccactgct      1260
     gcctctactt catcccagcc accggccact cgctcaggcc cctggacatt gaattcatga      1320
     agcgcctaag caaagtggtg aacattgtcc cagtcatcgc caaggctgac acgctgaccc      1380
     tggaggagag ggtctacttc aaacagcgga tcactgcaga cctgctgtcc aacggcattg      1440
     acgtgtaccc gcagaaggag tttgatgagg acgcagaaga ccggctggtg aacgaaaagt      1500
     tccgggagat gatcccattt gctgtggtgg gcagcgacca tgagtatcaa gtcaatggca      1560
     agaggattct gggaaggaag accaagtggg gcactatcga agttgagaat accactcact      1620
     gtgaatttgc ttacctgcgg gatctcctta tcaggacgca catgcaaaac atcaaagaca      1680
     tcaccagcaa catccacttc gaagcctacc gagtgaaacg cctcaacgag ggcaacagcg      1740
     ccatggccaa cgggatcgag aaggagccgg aagcccagga gatgtagatg cgtcccgccc      1800
     ctggacccca cccccagatc ttttcatcat ccctggccca cccacctacc atgtcttatt      1860
     ttatataatt atctccttgt cacctgcctc catccatctc ttcccacact ttgccaggta      1920
     acaagagagg gtttacctcc caagtgtgct cttattggct gcagcagcag ggtgggcggg      1980
     gctaagcctg ggcttgcctc tgtgctctat ttccacccgg gctcagcccc tgaggggtta      2040
     gaagagctat gtgtccgtcc cccgctctga gttctaagct gaagcctgtg ggggccaagt      2100
     cctagggggt gcagaggagc ccgttagacc acaagacccc atggccgcag cctcaagcag      2160
     gttagagact gccccaaagg aggatggagc tggccgggta ttcctgaaac ctcacctgcc      2220
     cctccggggg cgtttcttac agcgccctca gctgcctgcc cctcaaggga actagaggcg      2280
     tcacagccaa agttgccaat cacttagaca aagtggcaac cgtgcccctc gagcttgtcc      2340
     cagagcagaa agtgccttga tctacaagag ccagtcacct cttcccagat gtccctttgg      2400
     gtgaaaagca gggacgtgct ggagagaggg aggtatcttt tctccctcgc ccttgggttc      2460
     tctctcccct gtgctgtaga tatcgctact acactgggct ttaattataa aagacgaagc      2520
     gtgaaagatg ctccccgatg ttaggaagcc ccgcccccaa tgtaaggaag gtcaaagcaa      2580
     gaagatgagt cgaagccatg agggaggaag ccgtggaagg gaggcataag agtgtgtggg      2640
     agctctcctg cccaggtgcc gcggaaggca tatccgcgtg gtcctcagtt tgggccaaga      2700
     tattctggtt acattgatgc tccgctgcct caccctgtcc caccccacac accccaggct      2760
     caagccttga tgattcagtg actgtactgg gtgggagcca gaaacctgac cattttgttg      2820
     tctacatgag cctagactag ccctgtgccc cagaacccat caaaaatacc cctaaagagg      2880
     gaaagatgag ggggtcagag atggatagcc aggctcactc atcttctctc agagggaaca      2940
     ttagggacca tccatgcaca gctgaccaaa gccgtgtcct tcctgcctgc ctcccattct      3000
     catttgcccc tgaggagaaa gtttggtgag gtgcgttagg ttggacccgc ttggggaagg      3060
     tgcctctaca gacccagggc tagctttctg cagcccagaa gtgcagtggg aggggtgggg      3120
     tgcagacaga tggagacgaa cattgtttcc tgctttgggc gtctactccc tcatccaagc      3180
     atggaggggc ccctcgtcat gccctgtggc cgaacagttc gcttgccagc ttgccaagtt      3240
     cttttgccaa aatcaggact ttgaagg                                          3267
//