EBI Dbfetch
ID AK170758; SV 1; linear; mRNA; HTC; MUS; 2365 BP.
XX
AC AK170758;
XX
DT 09-SEP-2005 (Rel. 85, Created)
DT 24-SEP-2008 (Rel. 97, Last updated, Version 9)
XX
DE Mus musculus NOD-derived CD11c +ve dendritic cells cDNA, RIKEN full-length
DE enriched library, clone:F630118J17 product:alpha-N-acetylglucosaminidase
DE (Sanfilippo disease IIIB), full insert sequence.
XX
KW CAP trapper; HTC; HTC_FLI.
XX
OS Mus musculus (house mouse)
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC Muridae; Murinae; Mus.
XX
RN [1]
RP 1-2365
RA Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT ;
RL Submitted (14-APR-2004) to the EMBL/GenBank/DDBJ databases.
RL Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL
RL :http://www.osc.riken.jp/
XX
RN [2]
RX DOI; 10.1126/science.1112014.
RX PUBMED; 16141072.
RG The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG Science Group (Genome Network Project Core Group)
RA ;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309(5740):1559-1563(2005).
XX
RN [3]
RG RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG Network Project Core Group) and the FANTOM Consortium
RA ;
RT "Antisense Transcription in the Mammalian Transcriptome";
RL Science 309:1564-1566(2005).
XX
RN [4]
RG The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG I and II Team
RA ;
RT "Analysis of the mouse transcriptome based on functional annotation of
RT 60,770 full-length cDNAs";
RL Nature 420:563-573(2002).
XX
RN [5]
RX DOI; 10.1038/35055500.
RX PUBMED; 11217851.
RA RIKEN FANTOM Consortium;
RT "Functional annotation of a full-length mouse cDNA collection";
RL Nature 409(6821):685-690(2001).
XX
RN [6]
RX DOI; 10.1016/S0076-6879(99)03004-9.
RX PUBMED; 10349636.
RA Carninci P., Hayashizaki Y.;
RT "High-efficiency full-length cDNA cloning";
RL Meth. Enzymol. 303:19-44(1999).
XX
RN [7]
RX DOI; 10.1101/gr.145100.
RX PUBMED; 11042159.
RA Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT full-length cDNA libraries for rapid discovery of new genes";
RL Genome Res. 10(10):1617-1630(2000).
XX
RN [8]
RX DOI; 10.1101/gr.152600.
RX PUBMED; 11076861.
RA Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT pipeline with 384 multicapillary sequencer";
RL Genome Res. 10(11):1757-1771(2000).
XX
DR ASTD; TRAN00000165928.
DR Ensembl-Gn; ENSMUSG00000001751; Mus_musculus.
DR Ensembl-Tr; ENSMUST00000001802; Mus_musculus.
XX
CC cDNA library was prepared and sequenced in Mouse Genome
CC Encyclopedia Project of Genome Exploration Research Group in Riken
CC Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC Division of Experimental Animal Research in Riken contributed to
CC prepare mouse tissues.
CC Tissues were provided by Dr. John Todd (Dept. of Medical Genetics
CC Wellcome Trust Centre for Molecular Mechanisms in Disease Wellcome
CC Trust/MRC building Addenbrookes Hospital Cambridge) whose
CC assistance we gratefully acknowledge.
CC Please visit our web site for further details.
CC URL:http://www.osc.riken.jp/
CC URL:http://fantom.gsc.riken.jp/
XX
FH Key Location/Qualifiers
FH
FT source 1..2365
FT /organism="Mus musculus"
FT /strain="NOD"
FT /mol_type="mRNA"
FT /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT /clone="F630118J17"
FT /cell_type="NOD-derived CD11c +ve dendritic cells"
FT /db_xref="taxon:10090"
FT CDS 15..2234
FT /codon_start=1
FT /transl_table=1
FT /note="alpha-N-acetylglucosaminidase (Sanfilippo disease
FT IIIB) (MGD|MGI:1351641 GB|BC055733, evidence: BLASTN, 99%,
FT match=2356)"
FT /note="putative"
FT /db_xref="InterPro:IPR007781"
FT /db_xref="MGI:1351641"
FT /db_xref="UniProtKB/TrEMBL:O88325"
FT /protein_id="BAE42009.1"
FT /translation="MEAAGLAVILGFLLLAGGSVGDEAREAKAVRELVVRLLGPGPAAN
FT FLVSVERALADESGLDTYSLSGGGGVPVLVRGSTGVAAAAGLHRYLRDFCGCQVAWSSA
FT QLHLPWPLPAVPDGLTETTPNRYRYYQNVCTHSYSFVWWDWARWEQEIDWMALNGINLA
FT LAWNGQEAIWQRVYLALGLTQSEIDTYFTGPAFLAWGRMGNLHTWDGPLPRSWHLSQVY
FT LQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVIKLGSWGHFNCSYSCSFLLAP
FT GDPMFPLIGNLFLRELTKEFGTDHIYGADTFNEMQPPFSDPSYLAATTAAVYEAMVTVD
FT PDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLVLDLFAESHPVYMHTASFHGQP
FT FIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAE
FT LGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRSVYNCSGEACSGHNRSPLVK
FT RPSLQMSTAVWYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEE
FT ARTAYLKQELDLLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQF
FT YEQNSRYQITLWGPEGNILDYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHE
FT FEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKYHPQPDSL"
XX
SQ Sequence 2365 BP; 440 A; 723 C; 701 G; 501 T; 0 other;
ggccggttgg catcatggaa gccgcggggc tggccgtgat tctggggttc ctactcctgg 60
ctgggggctc tgtgggtgat gaggctcggg aggcaaaggc cgtgcgggaa ctggtggtcc 120
ggctactggg acccgggccg gcggccaatt tcttggtgtc cgtggagcgc gcgctggcgg 180
acgagtcggg cctggacacc tacagcctga gcggcggcgg cggggtgcca gtgcttgtgc 240
gcggctccac gggcgtggcg gcagccgcgg ggctgcaccg ctacctgcgt gacttctgtg 300
gctgtcaggt ggcctggtcc agcgctcagc tgcacctgcc gtggccgctg cccgctgtgc 360
ccgacgggct gaccgaaacc acgcccaaca ggtaccgcta ttaccagaat gtgtgcacgc 420
acagctactc cttcgtgtgg tgggactggg cccgttggga gcaagagatt gactggatgg 480
cactgaatgg catcaacctg gctctggctt ggaatggcca ggaggccatc tggcaaaggg 540
tatacctggc cttgggcctg acccagtcag agatcgatac gtacttcacc ggtcctgcct 600
tcctggcctg gggacgcatg ggtaacttgc acacctggga tggccccctg ccccgttcct 660
ggcacctcag tcaagtctac ctgcagcatc gaatcctgga ccggatgcgc tcctttggca 720
tgatcccagt gctgcctgcc ttcgcagggc atgtccccaa ggccatcacc agggtgtttc 780
cacaggtcaa tgtcatcaag ttggggagct ggggacattt caactgctcc tacagctgct 840
ccttccttct ggctccagga gaccccatgt ttcccctcat cgggaacctc ttcctacggg 900
agctgaccaa ggagtttggc acagatcata tctatggggc tgacaccttc aatgagatgc 960
agcctccctt ctctgacccc tcctacctcg ccgctaccac tgcagccgtc tatgaagcca 1020
tggtcactgt ggaccctgat gctgtttggc tgctccaagg ctggctcttc cagcaccagc 1080
cccagttctg gggcccctct caaatcaggg ctgtgctgga ggccgtgccc cgtggtcgtc 1140
tcctggttct ggacctgttt gctgaaagcc atcctgttta catgcacaca gcctccttcc 1200
atggccagcc cttcatctgg tgtatgctcc acaactttgg gggcaaccat ggcctgtttg 1260
gagccctcga ggatgtgaac cgaggccccc aggcagctcg cctcttccct aactccacca 1320
tggtcggcac tggcatagcc cccgagggca ttggccagaa tgaagtggtc tatgctctca 1380
tggctgagct gggctggcgc aaggaccctg taccagattt gatggcctgg gtgagcagct 1440
ttgccatccg ccgatacggg gtctcccagc ctgatgccgt ggcagcttgg aagctcctac 1500
tcagaagtgt ctacaactgc tctggggagg cgtgcagtgg gcacaatcga agcccgttgg 1560
tcaagcggcc gtccctacag atgagtaccg ctgtctggta caacagatca gatgtgtttg 1620
aggcttggcg actgctgtta acagctgccc caaacctgac caccagccca gccttccgct 1680
atgacctgct ggatgtcacc cgccaagccg tgcaggagtt ggtcagcctg tgctatgagg 1740
aggcaaggac cgcctacctg aagcaagagc ttgatctcct gctcagggct ggaggcctcc 1800
tggtctataa actcctgcct acactagatg agctgctggc tagcagcagc cacttcttgc 1860
tgggtacctg gttggatcag gcccggaaag cggccgtaag tgaggccgag gcccagttct 1920
atgaacaaaa cagccgctac cagattaccc tgtgggggcc cgagggcaac attttggatt 1980
atgccaataa gcaactggca ggactggtgg ctgattacta ccagcctcgc tggtgcctct 2040
tcttggggac tctggctcac agcctagcca gaggtgtccc cttccaacag cacgagtttg 2100
agaagaacgt tttcccactc gagcaggctt tcgtttacaa caagaagagg taccccagtc 2160
agccccgagg ggacaccgtg gacctctcca agaagatctt cctcaaatat cacccccagc 2220
ctgactcttt gtgacagatt agccatcgca gggacctgct ggaataggtc ctcaaatcca 2280
gacaagccca gaatgcgccc caccccaccc ccgggcctgg gaggagacag ggtatgacag 2340
tgagtgacaa tgatggcttg gaggg 2365
//
 |