Dbfetch

ID   AK146620; SV 1; linear; mRNA; HTC; MUS; 5169 BP.
XX
AC   AK146620;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus 16 days embryo kidney cDNA, RIKEN full-length enriched
DE   library, clone:I920035I21 product:Unc-51 like kinase 2 (C. elegans), full
DE   insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-5169
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; dad962ac0b80f5d44a22475b53066853.
DR   Ensembl-Gn; ENSMUSG00000004798; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0018447; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0018416; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0018386; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0018387; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0018200; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0018839; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0017757; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0018174; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0018283; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0018274; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0018356; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0018298; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0018880; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0017530; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0017809; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000004920; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0029595; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0029546; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0029520; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0029539; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0029338; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0030038; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0029031; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0029254; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0029399; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0029373; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0029508; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0029359; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0030073; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0028738; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0028828; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=I920035I21
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..5169
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="16 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I920035I21"
FT                   /tissue_type="kidney"
FT                   /db_xref="taxon:10090"
FT   CDS             479..3592
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="Unc-51 like kinase 2 (C. elegans) (MGD|MGI:1352758
FT                   GB|AK122331, evidence: BLASTN, 99%, match=5160)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q9QY01"
FT                   /db_xref="InterPro:IPR000719"
FT                   /db_xref="InterPro:IPR008271"
FT                   /db_xref="InterPro:IPR011009"
FT                   /db_xref="InterPro:IPR016237"
FT                   /db_xref="InterPro:IPR017441"
FT                   /db_xref="InterPro:IPR022708"
FT                   /db_xref="MGI:MGI:1352758"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9QY01"
FT                   /protein_id="BAE27309.1"
FT                   /translation="MEVVGDFEYCKRDLVGHGAFAVVFRGRHRQKTDWEVAIKSINKKN
FT                   LSKSQILLGKEIKILKELQHENIVALYDVQELPNSVFLVMEYCNGGDLADYLQAKGTLS
FT                   EDTIRVFLHQIAAAMRILHSKGIIHRDLKPQNILLSYANRRKSNVSGIRIKIADFGFAR
FT                   YLHSNTMAATLCGSPMYMAPEVIMSQHYDAKADLWSIGTVIYQCLVGKPPFQANSPQDL
FT                   RMFYEKNRSLMPSIPRETSPYLANLLLGLLQRNQKDRMDFEAFFSHPFLEQVPVKKSCP
FT                   VPVPVYSGPVPGSSCSSSPSCRFASPPSLPDMQHIQEENLSSPPLGPPNYLQVSKDSAS
FT                   NSSKNSSCDTDDFVLVPHNISSDHSYDMPMGTTARRASNEFFMCGGQCQPTVSPHSETA
FT                   PIPVPTQVRNYQRIEQNLISTASSGTNPHGSPRSAVVRRSNTSPMGFLRVGSCSPVPGD
FT                   TVQTGGRRLSTGSSRPYSPSPLVGTIPEQFSQCCCGHPQGHEARSRHSSGSPVPQTQAP
FT                   QSLLLGARLQSAPTLTDIYQNKQKLRKQHSDPVCPSHAGAGYSYSPQPSRPGSLGTSPT
FT                   KHTGSSPRNSDWFFKTPLPTIIGSPTKTTAPFKIPKTQASSNLLALVTRHGPAESQSKD
FT                   GNDPRECSHCLSVQGSERHRSEQQQSKAVFGRSVSTGKLSEQQVKAPLGGHQGSTDSLN
FT                   TERPMDVAPAGACGVMLALPAGTAASARAVLFTVGSPPHSATAPTCTHMVLRTRTTSVG
FT                   SSSSGGSLCSASGRVCVGSPPGPGLGSSPPGAEGAPSLRYVPYGASPPSLEGLITFEAP
FT                   ELPEETLMEREHTDTLRHLNMMLMFTECVLDLTAVRGGNPELCTSAVSLYQIQESVVVD
FT                   QISQLSKDWGRVEQLVLYMKAAQLLAASLHLAKAQVKSGKLSPSMAVKQVVKNLNERYK
FT                   FCITMCKKLTEKLNRFFSDKQRFIDEINSVTAEKLIYNCAVEMVQSAALDEMFQQTEDI
FT                   VYRYHKAALLLEGLSKILQDPTDVENVHKYKCSIERRLSALCCSTATV"
FT   regulatory      5145..5150
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      5169
FT                   /note="putative"
XX
SQ   Sequence 5169 BP; 1348 A; 1248 C; 1223 G; 1350 T; 0 other;
     ggggtgctcc cgcgaagcag gcagaagccg ccaccgcggc tgtgcgggtt ccagggcggc        60
     ggccgctaca cagcagcgcc gcgggcgcaa caagcggagg cagccgcacc tgcggggatg       120
     gcgcggcccc ggccctgcac cgctgtcagg cgcgcgggta gctccgggta cgcccagtga       180
     cggcggcggt gccctcggcc gggcagtgcc caggccgctt ggctagagcc ccgcggccga       240
     cggcggcccg ggccatgaga agcgaccggg cccaggcctc gctgaccccg gctccgcgcg       300
     gcggcccgtc ccgtttccgc tccggcctct cggcatgagt gtccatccgg gcccggggcc       360
     gtggctgccc ctgtcccgcc gcccgcgggc gtgcttgggg ccctcgggcc cgcggtgctg       420
     atcccggttc tccgccacgc ctgcccaccc tagcgctcta tgtcccgggg gcgcggccat       480
     ggaggtggtg ggcgacttcg agtactgcaa gcgggacctc gtgggacacg gggccttcgc       540
     tgtggtcttc cgggggcggc accgccagaa aactgattgg gaggtggcta ttaaaagtat       600
     taataaaaag aacttgtcaa aatcacaaat tctgcttgga aaggaaataa aaatcttaaa       660
     ggagcttcag catgaaaaca tcgtagcgct ctatgatgtt caggaattgc ccaactctgt       720
     ctttctggtg atggagtatt gcaatggtgg agacctggca gattatttgc aagctaaagg       780
     aactctgagt gaagatacta tcagagtgtt tctccatcag attgcggcag ccatgcgaat       840
     cctgcacagc aaagggataa tccacaggga tctcaaacca cagaatatcc tgttgtctta       900
     tgccaatcga aggaagtcga atgtcagtgg tattcgtatt aaaatagctg attttggttt       960
     cgcacggtac ctacatagta acacaatggc agcgacactg tgtggatccc caatgtacat      1020
     ggctcccgag gttattatgt ctcaacatta tgatgctaag gcagatttat ggagcatagg      1080
     aacagtgatc tatcaatgcc tagttggaaa accacctttt caggctaata gtcctcagga      1140
     cctaaggatg ttttatgaaa aaaacaggag cttaatgcct agtattccca gagaaacatc      1200
     accttacttg gctaatctcc ttttgggttt gcttcagaga aatcaaaagg atagaatgga      1260
     ctttgaagca tttttcagcc atcctttcct tgagcaagtt ccagttaaaa aatcttgccc      1320
     agtcccagtg cctgtgtatt ctggccctgt ccctggaagc tcctgcagca gctcaccatc      1380
     ttgtcgcttt gcttctccac catcccttcc agatatgcag catattcagg aagaaaactt      1440
     atcctcccca ccgttgggtc ctcccaacta tctacaggtg tccaaagact ctgcgagtaa      1500
     tagtagcaag aactcttctt gtgacacgga tgactttgtt ttggttccac acaacatctc      1560
     gtcagaccac tcatatgaca tgccaatggg gactacggcc agacgtgctt caaatgaatt      1620
     ctttatgtgt ggagggcagt gtcaacctac tgtgtcacct cacagcgaaa cagccccaat      1680
     tccagttcct actcaagtaa ggaattatca gcgcatagaa cagaatctta tatccactgc      1740
     cagctctggc acaaacccac atggttctcc aagatctgca gtagtacgaa ggtctaatac      1800
     cagccccatg ggcttcctcc gggttgggtc ctgctcccct gtaccaggag acacagtgca      1860
     gacaggagga cgaagactct ctactggctc ttccaggcct tactcaccat cccctttggt      1920
     tggtaccatt cctgaacagt ttagtcagtg ctgctgtgga catcctcagg gccatgaagc      1980
     caggagtagg cactcctcag gttctccagt gccacagacc caggcaccac agtcactctt      2040
     actgggtgct agactgcaga gtgcacccac cctcaccgat atctatcaga acaagcagaa      2100
     gctcagaaag cagcactctg accctgtgtg tccgtcccat gctggagctg ggtatagtta      2160
     ctcacctcag cctagtcggc ctggcagcct tgggacctct cccaccaagc acacggggtc      2220
     ctctccacgg aattctgact ggttctttaa aactcctttg ccaacaatca ttggctctcc      2280
     tactaagact acagctcctt tcaaaatccc taaaacacaa gcatcttcta acctgttagc      2340
     cttggttact cgtcatgggc ctgctgaaag ccagtccaaa gatgggaatg accctcgtga      2400
     gtgttcccac tgcctctcag tacaaggaag cgagaggcat cgatctgagc agcagcagag      2460
     caaggcagtg tttggcagat ctgtcagtac tgggaagtta tcagaacaac aagtaaaggc      2520
     acctttaggt ggacaccagg gcagcacgga tagtttaaac acagaacgac caatggatgt      2580
     agctcctgca ggagcctgtg gtgttatgct ggcattgcca gcaggaacag cagcaagcgc      2640
     cagagctgtc ctcttcaccg tggggtctcc tccacacagt gccacagccc ccacttgtac      2700
     tcatatggtc cttcgaacaa gaaccacctc agtggggtcc agcagctcag gaggttcctt      2760
     gtgttctgca agtggccgag tatgtgtggg ctcccctcct ggaccagggt tgggctcttc      2820
     cccaccagga gcagagggag ctcccagcct aagatacgtg ccttatggtg cttcaccacc      2880
     cagcctagag ggtctcatca cctttgaagc ccctgaacta ccagaggaga cactgatgga      2940
     gcgagagcac acagacacct tacgccatct gaacatgatg ttaatgttta ctgagtgtgt      3000
     gctggacctg acggcagtga ggggtgggaa ccctgagctg tgcacatctg ctgtgtcctt      3060
     gtaccagatt caggagagtg tagttgtgga ccagatcagc cagctaagca aagattgggg      3120
     gcgggtggag cagctggtgt tgtacatgaa ggcagcacag ctgctggcgg cttccctgca      3180
     tctcgccaaa gctcaggtca agtctgggaa gctgagccca tccatggctg tgaaacaagt      3240
     tgttaaaaat ctgaatgaaa gatacaaatt ctgcatcacc atgtgcaaga aacttacaga      3300
     aaagctgaat cgcttcttct ccgataaaca gagatttatt gatgaaatca acagtgtgac      3360
     tgcagagaaa ctcatctata attgtgctgt ggaaatggtt caatctgcag ccctggatga      3420
     gatgtttcag cagactgaag acatcgttta tcgctaccac aaggcagccc ttcttttgga      3480
     aggcttaagt aagatcctgc aggaccctac agatgttgaa aatgtgcata agtataaatg      3540
     tagtattgaa agaagattgt cagcactctg ctgtagcact gcaactgtgt gagtagcagg      3600
     cttgtctgtg gactggcatg gaacaggagg tgatacattt gggattacat cttggttctg      3660
     tcacccatcc caggacagtg tggtgactac caaagaacaa gcagcagctt aagaaggaag      3720
     aacaatacaa aactactaca tattgtagaa aacctgcctt attggagaag tcactccccc      3780
     tttcctttct cttcatagaa gcagaacaaa aagttttcca catggctcaa gttatttgaa      3840
     cctggcaaat aataaatgta ccttagaact agcatcataa gtacagttat tcttgtggat      3900
     aattaaacag gaaacaaagt gagtggcttc attcagctct ctgagcaata catgaaattc      3960
     cgtgttttgc taaagcatac acaagcaaaa gcgtatacag ctcgtccaga cgatctgagg      4020
     tttggggtaa gtttgtctga atctgagcct gcatctttaa acagctcagg aggaggaatc      4080
     ttaagaagaa acaaaaggtg actcttggat gaattgtgaa atcttcaact tgatctaatg      4140
     tggacatgat tttaatcttc caaaaatctt tcatattgca ctaatttatt aaaataactg      4200
     tgtattggat tttgcaattt aaaactaact gaggcacaat ggacttgttt aaatatttta      4260
     cttgattata tacatgtcct tttcagaatt catgtgtaat ctccactgaa cttttaaatg      4320
     gttggaaatt gcgttcatgt gaacctttgc atttttctca cctgttatct tcaccaacag      4380
     atcttagtgg aattttgagt tgctggttgt ttgcattttt tgttgtgcat gcagaatgtg      4440
     catcgactcc tgtgatctta ggtttactaa aggctaagtt tatttgggca gtattagaaa      4500
     gctatcatga atcaagaaat actcttgaga attttaatag ggccaaatca aagtgatgaa      4560
     taatggcttg ttagtgatgt ggagtttcta catgaaatta gtaagaaatg aggttcgttt      4620
     tcccttagga aatgggcagc tctctccaga ctaatgtgta cttctcagtg ttaaccctga      4680
     gttcaccata gctagtcatg gcagatcagc acctctccaa gaatggttcc tgtgttgtat      4740
     attatttggt atcttttact tacctgcttg aatacttgaa taaaccattc accggtttta      4800
     atccttttac ttcaaaactt acacatactg acctactctc tgatagctgc acagaaactc      4860
     tttggcgcca cacttgcttt agtgtgctga ttaaagttaa cagagaaaac atggttttca      4920
     tttacttggt gaacaaagta atgtaatttt tacattattt atctgtatga aattccagca      4980
     gttaatttga acgtttatgt ataggatgtt tgtattctta ggtctttact acagtgtttc      5040
     tacctctcat ttgtaactgc attatcttca aaataggtaa aacccactaa gtcattgtgg      5100
     aatagccttt tttaaattgc ttgtacaaat gtatattaag gttaaataaa actgacagtg      5160
     tttttaggt                                                              5169
//