Dbfetch

ID   AK083350; SV 1; linear; mRNA; HTC; MUS; 2543 BP.
XX
AC   AK083350;
XX
DT   19-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 14)
XX
DE   Mus musculus 2 days neonate thymus thymic cells cDNA, RIKEN full-length
DE   enriched library, clone:C920014F14 product:lectin, galactose binding,
DE   soluble 8, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2543
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-APR-2002) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; bdcb69771c0ccd7adbc64804f305fa98.
DR   Ensembl-Gn; ENSMUSG00000057554; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0020231; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0020187; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0020162; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0020170; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0019974; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_CBAJ_G0019941; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0020058; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0020043; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0020079; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0020645; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0019275; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0019572; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000099820; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000099821; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000124888; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0036309; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0036232; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0036203; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0036235; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0035999; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_CBAJ_T0035910; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0036055; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0036036; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0036043; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0036755; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0035610; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0035447; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=C920014F14
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2543
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="2 days neonate"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="C920014F14"
FT                   /cell_type="thymic cells"
FT                   /tissue_type="thymus"
FT                   /db_xref="taxon:10090"
FT   CDS             167..1117
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="lectin, galactose binding, soluble 8
FT                   (MGD|MGI:1928481 GB|NM_018886, evidence: BLASTN, 100%,
FT                   match=1086)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q542M5"
FT                   /db_xref="InterPro:IPR001079"
FT                   /db_xref="InterPro:IPR013320"
FT                   /db_xref="InterPro:IPR030638"
FT                   /db_xref="MGI:MGI:1928481"
FT                   /db_xref="UniProtKB/TrEMBL:Q542M5"
FT                   /protein_id="BAC38880.1"
FT                   /translation="MLSLNNLQNIIYNPIIPYVGTITEQLKPGSLIVIRGHVPKDSERF
FT                   QVDFQLGNSLKPRADVAFHFNPRFKRSSCIVCNTLTQEKWGWEEITYDMPFRKEKSFEI
FT                   VFMVLKNKFQVAVNGRHVLLYAHRISPEQIDTVGIYGKVNIHSIGFRFSSDLQSMETSA
FT                   LGLTQINRENIQKPGKLQLSLPFEARLNASMGPGRTVVIKGEVNTNARSFNVDLVAGKT
FT                   RDIALHLNPRLNVKAFVRNSFLQDAWGEEERNITCFPFSSGMYFEMIIYCDVREFKVAI
FT                   NGVHSLEYKHRFKDLSSIDTLSVDGDIRLLDVRSW"
FT   regulatory      2523..2528
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2543
FT                   /note="putative"
XX
SQ   Sequence 2543 BP; 746 A; 541 C; 572 G; 684 T; 0 other;
     gtccgggaac cggacggcgt ccgtcagggg acacaagtcg gagcaaagcg cttactacca        60
     ccggggacaa gtttttactt tgagtaatcc ttaaatgaag agtgggtaaa gtgtgtatac       120
     ggaagagaga ctccaatcaa caatatcaat aagttgaaaa agaaaaatgt tgtccttaaa       180
     taacctacaa aatatcatct ataacccgat aatcccctat gttggcacca ttactgagca       240
     attgaagcct ggctctctga ttgtaatccg tgggcatgtc cctaaagatt cagaaagatt       300
     ccaggttgac tttcagctgg gcaacagcct gaagccaaga gcagacgtgg ccttccactt       360
     taaccctcgg ttcaaaaggt ctagctgcat tgtttgtaac acactgacac aggagaagtg       420
     gggctgggag gagatcacct acgacatgcc cttcagaaaa gaaaagtcct ttgagatcgt       480
     gttcatggtg ctcaagaaca aattccaggt ggctgtgaac ggaaggcatg ttctgctgta       540
     cgcccacagg atcagcccgg agcagatcga cacagtgggc atctacggca aagtgaacat       600
     ccactccatc gggttcagat tcagctcgga tttacagagt atggaaacat ctgctctggg       660
     actgacacag ataaacagag agaatataca aaagccaggc aagctccagc tgagcctgcc       720
     atttgaagca aggttgaatg cctccatggg tcctggacga accgttgtca ttaaagggga       780
     agtgaacacc aatgcccgaa gctttaatgt tgacctagtg gcaggaaaaa caagggatat       840
     cgctctgcac ttgaacccac gcctcaatgt gaaagcattt gtaagaaatt cctttcttca       900
     ggatgcctgg ggagaagagg agagaaatat tacctgcttc ccatttagtt ctgggatgta       960
     ctttgagatg ataatctact gtgatgtccg ggaattcaag gttgctataa atggtgtgca      1020
     cagcctggag tacaaacaca gatttaaaga cctaagcagt attgatacac tatcagtcga      1080
     tggtgatatc cgtttgctgg atgtaaggag ctggtagcta ccatgactgc caaaaccccc      1140
     gaaatacaaa atggcttatc cggtactggc catgtcaaat gcatctcgct ttcaccatat      1200
     tgtttatatt gctaagttga gctcctccaa catcaagtcc tactggtgtt gtcaggtctg      1260
     gccatgcagt acattcagag gaacagagcc ggggcaatca cagctcactg ccagagaggc      1320
     tctgcacact gggtccctct tataaaccac actcagcaaa tatttaagtg cctaatatac      1380
     tacatatact agctaatagg gatggcaagc atacttcctt tgtatattct ctgagccggg      1440
     cacagacatg gcagggccca gaacttgtgt ggtccatgtt ttctagcact tcgtaccagt      1500
     ttctggcctc ctaatgtagg gtcttcttgc tggcattgca ttaaccccac taggggcctt      1560
     tgcagttaag gtcagaaaaa tatactaatg gatggcaaac actacttccc cagcaaccct      1620
     tttcataatc agcattctat catatctcat aattgaagac tgcatagcat ttacttagct      1680
     ctcaccgctt taaactttat aaaatgtatg atgctgaaca cagcagaaaa actgaggcca      1740
     aaaccctgaa ttatgacaaa acaagtgttc tgctccaagc agatttctgc tggttgattg      1800
     gcgctcaagt ccagggtgtg tgggtacctg tggcaaagta aggcagaagt tggataaacc      1860
     gtgtgtgtaa aaccctcttg gacgtatata taaaacaagc actatcaaag caaacccagg      1920
     ggcgtagtgt gaaaggctta gttggtgtgg agactagccc tcgtgccttt gggtctgaag      1980
     atgtcggtct gcagcagcag cgaggtcagg cacattaaac cacaggacag aattctgctg      2040
     gggtttagga gtattcagcc aatctgctta gatatatgca cttgtgcata tgaaataata      2100
     ccatcagcag tcttactcaa ggcagcaact gctagtcctt cattttcatg caaatttatt      2160
     atgttcactc ttcaatatgg ggtggtgggt ggaactgggt aatttgtacg gaggccagga      2220
     ggctctagaa ttctccagag ttatgtcttc ataaagaatg agtcttcata aagaatgcac      2280
     tactgagata ttgggggctc aaaggcactc aggaaaaaaa aataaaagcc tattcataca      2340
     acagcttatt ttcatttcta tttttacaca attaagactg attctaagta gattcagtct      2400
     taagttctta gatttttttt ttttaaaaag ctgctggaat ttaggttgtg agaccttctg      2460
     ttgtatattc cgaaattcta tctctaaact gcaaaatgcc tttttgcttg tcctaattct      2520
     gcattaaagt tttatattaa att                                              2543
//