Dbfetch

ID   AK171680; SV 1; linear; mRNA; HTC; MUS; 3742 BP.
XX
AC   AK171680;
XX
DT   09-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 9)
XX
DE   Mus musculus activated spleen cDNA, RIKEN full-length enriched library,
DE   clone:F830003K24 product:heparan sulfate 6-O-sulfotransferase 2, full
DE   insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3742
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (14-APR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; e3baa73627e7923335d9d95d7cb581c3.
DR   Ensembl-Gn; ENSMUSG00000062184; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0035712; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0035695; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0035615; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0035681; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0035385; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0036211; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0034676; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0035359; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0035514; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0035462; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0035601; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0035501; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0036228; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0034375; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0034820; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000088172; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000114871; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0096103; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0096193; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0096131; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0096112; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0095666; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0096624; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0096263; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0095587; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0095791; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0095637; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0095825; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0095607; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0096965; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0095681; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0094651; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Tissues were provided by Dr. John Todd (Dept. of Medical Genetics
CC   Wellcome Trust Centre for Molecular Mechanisms in Disease Wellcome
CC   Trust/MRC building Addenbrookes Hospital Cambridge) whose
CC   assistance we gratefully acknowledge.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=F830003K24
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3742
FT                   /organism="Mus musculus"
FT                   /strain="NOD"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="F830003K24"
FT                   /tissue_type="activated spleen"
FT                   /db_xref="taxon:10090"
FT   CDS             83..1483
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="heparan sulfate 6-O-sulfotransferase 2
FT                   (MGD|MGI:1354959 GB|BC037659, evidence: BLASTN, 99%,
FT                   match=3454)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q80UW0"
FT                   /db_xref="InterPro:IPR005331"
FT                   /db_xref="InterPro:IPR010635"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="MGI:MGI:1354959"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q80UW0"
FT                   /protein_id="BAE42608.1"
FT                   /translation="MDEKSNKLLLALVMLFLFAVIVLQYVCPGTECQLLRLQAFSSPVP
FT                   DPYRSEDESSARFVPRYNFSRGDLLRKVDFDIKGDDLIVFLHIQKTGGTTFGRHLVRNI
FT                   QLEQPCECRVGQKKCTCHRPGKRETWLFSRFSTGWSCGLHADWTELTSCVPAVVDGKRD
FT                   ARLRPSRNFHYITILRDPVSRYLSEWRHVQRGATWKASLHVCDGRPPTSEELPSCYTGD
FT                   DWSGCPLKEFMDCPYNLANNRQVRMLSDLTLVGCYNLSVMPEKQRNKVLLDSAKSNLKH
FT                   MAFFGLTEFQRKTQYLFEKTFNMNFISPFTQYNTTRASSVEINEEIQKRIEGLNFLDME
FT                   LYSYAKDLFLQRYQFMRQKEHQDARRKRQEQRKFLKGRFLQTHFQSQSQGQSQSQSPGQ
FT                   NLSQNPNPNPNQNLTQNLSHNLTPSSNPNSTQRENRGSQKQGSGQGQGDSGTSNGTNDY
FT                   IGSVETWR"
FT   regulatory      3726..3731
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      3742
FT                   /note="putative"
XX
SQ   Sequence 3742 BP; 1116 A; 768 C; 830 G; 1028 T; 0 other;
     ggcgcggtgc gggagcgagg cgagcacggg ccaagctgcg ggattgacgt aacgagcttg        60
     ggtttccccc agtgtcggga acatggatga gaaatctaac aagctgctgc tggctttggt       120
     gatgctcttc ctatttgcgg tgatcgtcct ccaatacgtg tgccccggca cagaatgcca       180
     gctcctccgc ctgcaggcgt tcagctcccc ggtgccggat ccgtaccgct cggaggatga       240
     gagttcggcc aggtttgtac cccgctacaa tttcagccgc ggcgatctcc tgcgcaaggt       300
     agacttcgac atcaagggcg atgacctgat cgtgttcctg cacatccaga agactggggg       360
     caccactttt ggccgtcacc tggtgcgcaa catccagctg gagcagccat gtgagtgccg       420
     cgtggggcag aagaaatgca cttgccaccg gccgggtaag agggagacct ggctcttctc       480
     caggttctcc accggctgga gctgcgggct gcatgccgac tggaccgagc tcaccagctg       540
     cgtgccggcg gtggtggatg gcaagcgcga cgccaggctg agaccttcca ggaacttcca       600
     ttacattacc atcctgagag acccagtgtc acggtacttg agtgaatgga ggcatgtcca       660
     gagaggagca acttggaaag catccctgca cgtctgtgat ggaaggcccc caacctctga       720
     agagctgccc agctgctaca ccggtgatga ctggtctgga tgccctctca aagagttcat       780
     ggactgtccc tataatctgg ccaacaaccg ccaagttcgc atgctatctg acctgactct       840
     agtgggatgc tacaacctct ctgtcatgcc tgaaaagcaa agaaacaaag tccttctgga       900
     cagtgccaaa tccaatctga agcacatggc gttctttggc ctcactgagt ttcagcgcaa       960
     gacccagtac ctgtttgaga agaccttcaa catgaacttt atctcgccgt ttacccagta      1020
     taataccact agggcctcta gtgttgagat caatgaggaa atccaaaagc gtattgaggg      1080
     actgaatttt ctggatatgg agttgtacag ctatgctaaa gacctctttc tgcaaaggta      1140
     tcagttcatg aggcagaaag aacatcagga tgccaggcgg aagcgtcagg agcaacgcaa      1200
     atttctgaag ggaaggttcc ttcagaccca tttccagagt cagagtcagg gtcagagcca      1260
     gagccagagt ccaggtcaga atctgagtca gaatccaaat cctaacccaa atcagaacct      1320
     gactcagaac ctgagtcaca atctgactcc gagttcaaat cccaattcga cccagaggga      1380
     gaaccgggga agtcagaagc agggctcagg ccagggacaa ggtgatagcg gcaccagcaa      1440
     tggcaccaat gactacatag ggagcgtaga gacatggcgc taactggcac tcaaagggct      1500
     tgtgcacact gcttcccaaa gcatcaccaa aaagatggcg tgtatcttac aagatgaaaa      1560
     tgtccaaaca catcctgctt ccttcatttg ggaagttcaa aaaaaaaaag tgtagacgtt      1620
     gcctttacag ttgccttttg attcagtgtt atgctgtgtg tgggtacaac aaatctcagt      1680
     gtgtaattaa attgtctgtt ttgggatgaa ctacttctga aatccaagag ccaaaccaga      1740
     ctcaccagaa attgcagctt agatatttta agacgttctt aaattaggta tgggagacac      1800
     aaagaaaaca tgaaatgtgg ccgtttaaac ttatggctaa gaaacagact ttaagtgatt      1860
     ccggatactc tgttaaaacc cagtcttgaa agcacccccc ctcgctgcag ggaagtataa      1920
     gtaaaaacat gaacagaact aaaagacact caaaactgtt catttccttg ttttaacaag      1980
     gagcctattt aaacagaaaa acaactgatg ataaatttgg ccaaactgta ggaaataaac      2040
     aatttctcac ttaaatgtag ctgggtttat gtaaaaataa tggcttttca aatagatcgg      2100
     gattcgtgtc ttgtaactta acagatgttt aaaatttaga atttttctac cttctgctga      2160
     ttaaaaggtt ttaaacaggg tatatctcat attaacaaag cattttttcc aaatggaata      2220
     ccaatggaaa gatccagttt ccagatggtt tctgggcact gtaattccaa caatttcgat      2280
     ctttttggcg ctgcttctca ttcactttaa ggcttctccg agttgtgctc attccaaatg      2340
     aatccatgta tagaattttt cttcttcatc taaacaacgt gttgctaaat ctcttgtgct      2400
     atttaatcct ggactaaagt caggactttc tatagaattg tcttatcaat gtctgtgtag      2460
     taaggtcctt aagaagaact ctcaaacaga ataatttgtc tcttaatatt gctgaagtta      2520
     ttattcttca tcagattgaa ctaaacctga atgtgaactg ttaattatct tgtttattgt      2580
     aatgtttcca gtttggctgg ctatttctgg cccatggttc ttttcttgta ttgacagatt      2640
     tgtacacaat aataagagca aaaaattttc tcaaagacaa cagtagacta atcattaaga      2700
     tagccagagt tgctcaataa caccaaaagc tgcctcttga taattgaggg gaacacttct      2760
     accaggattt tgtttgtgaa agatcaaaaa taaagaatgt tttgcagcga agctaagtgt      2820
     cagggacgtg caagtagagg agaatagttt gatagtctgc atctgtgaag agtattttga      2880
     tggtttcaat tacaaatgga acaaaggaag aattatatac acactgtgtt gtgaaatatc      2940
     atactgcttg gcagtttaca cctcatggtt gttgctacac ttctataaga gcttacatta      3000
     atcccttggc tttcaatcaa aacaaggtga ccagaagtgg tattcatgct ttcaagaggg      3060
     acatgcttag aaatatgatc ataaagtcac atctgtcata gtctgcccaa gtgcatgacc      3120
     tggtttcatc tgtaataata tggccacaag tttggcaaga agttgagtat ctggctctaa      3180
     agagattttc ttgactctga gctctgtgac aatatttgca gcttttgttt tgtggtattt      3240
     gcagtgttga gaatgatgat acctaaaaaa aaaatcaaaa aacaaaaaca aaaaaatgaa      3300
     gaaaaaacaa aaacaaacaa aaaaaaactg gaaggaattt ttaaaagcac atcttgaagt      3360
     caaagtattt ggtgtacatt gtgttcttga aaacttctgt agatcactat ggtatttcag      3420
     taattttagt tttaattgaa aatgattgtt cttaaaactg acagtgttgg aaagcaggac      3480
     gaagtctgag ccagtggaaa agctcattac agaaaaaaaa aagtgttatt gctgtggtgt      3540
     gcatggtttg cagagattaa gtgcattttc tctgtctcta ctgattattg tatatagaga      3600
     atgttataaa tatacatttt tgtcatcatc atgtaaatcc cacgatttca aactgtaaac      3660
     atctgttcag tggtgtagct ttacaaactg ttcgctgatt ttgtgtaatt tatcaataga      3720
     tgtgaaataa agtttcaatt gc                                               3742
//