ID   AK169140; SV 1; linear; mRNA; HTC; MUS; 2812 BP.
XX
AC   AK169140;
XX
DT   09-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 10)
XX
DE   Mus musculus 17 days embryo kidney cDNA, RIKEN full-length enriched
DE   library, clone:I920083B16 product:valosin containing protein, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2812
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (14-APR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 9ffea8e5dd6d51bd405dc83d6bb2a286.
DR   Ensembl-Gn; ENSMUSG00000028452; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0028196; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0028152; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0028104; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0028172; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0027894; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0028613; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0027334; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_FVBNJ_G0027976; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0028108; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0028649; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0027060; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0027414; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000030164; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0065244; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0065290; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0065186; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0065220; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0064895; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0065689; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0065206; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_FVBNJ_T0064901; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0065077; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0065865; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0064742; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0064104; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=I920083B16
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2812
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="17 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I920083B16"
FT                   /tissue_type="kidney"
FT                   /db_xref="taxon:10090"
FT   CDS             232..2652
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="valosin containing protein (MGD|MGI:99919
FT                   GB|BC049114, evidence: BLASTN, 99%, match=2809)"
FT                   /db_xref="GOA:Q01853"
FT                   /db_xref="InterPro:IPR003338"
FT                   /db_xref="InterPro:IPR003593"
FT                   /db_xref="InterPro:IPR003959"
FT                   /db_xref="InterPro:IPR003960"
FT                   /db_xref="InterPro:IPR004201"
FT                   /db_xref="InterPro:IPR005938"
FT                   /db_xref="InterPro:IPR009010"
FT                   /db_xref="InterPro:IPR015415"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="InterPro:IPR029067"
FT                   /db_xref="MGI:MGI:99919"
FT                   /db_xref="PDB:1E32"
FT                   /db_xref="PDB:1R7R"
FT                   /db_xref="PDB:1S3S"
FT                   /db_xref="PDB:2PJH"
FT                   /db_xref="PDB:3CF0"
FT                   /db_xref="PDB:3CF1"
FT                   /db_xref="PDB:3CF2"
FT                   /db_xref="PDB:3CF3"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q01853"
FT                   /protein_id="BAE40919.1"
FT                   /translation="MASGADSKGDDLSTAILKQKNRPNRLIVDEAINEDNSVVSLSQPK
FT                   MDELQLFRGDTVLLKGKKRREAVCIVLSDDTCSDEKIRMNRVVRNNLRVRLGDVISIQP
FT                   CPDVKYGKRIHVLPIDDTVEGITGNLFEVYLKPYFLEAYRPIRKGDIFLVRGGMRAVEF
FT                   KVVETDPSPYCIVAPDTVIHCEGEPIKREDEEESLNEVGYDDIGGCRKQLAQIKEMVEL
FT                   PLRHPALFKAIGVKPPRGILLYGPPGTGKTLIARAVANETGAFFFLINGPEIMSKLAGE
FT                   SESNLRKAFEEAEKNAPAIIFIDELDAIAPKREKTHGEVERRIVSQLLTLMDGLKQRAH
FT                   VIVMAATNRPNSIDPALRRFGRFDREVDIGIPDATGRLEILQIHTKNMKLADDVDLEQV
FT                   ANETHGHVGADLAALCSEAALQAIRKKMDLIDLEDETIDTEVMNSLAVTMDDFRWALSQ
FT                   SNPSALRETVVEVPQVTWEDIGGLEDVKRELQELVQYPVEHPDKFLKFGMTPSKGVLFY
FT                   GPPGCGKTLLAKAIANECQANFISIKGPELLTMWFGESEANVREIFDKARQAAPCVLFF
FT                   DELDSIAKARGGNIGDGGGAADRVINQILTEMDGMSTKKNVFIIGATNRPDIIDPAILR
FT                   PGRLDQLIYIPLPDEKSRVAILKANLRKSPVAKDVDLEFLAKMTNGFSGADLTEICQRA
FT                   CKLAIRESIESEIRRERERQTNPSAMEVEEDDPVPEIRRDHFEEAMRFARRSVSDNDIR
FT                   KYEMFAQTLQQSRGFGSFRFPSGNQGGAGPSQGSGGGTGGSVYTEDNDDDLYG"
XX
SQ   Sequence 2812 BP; 699 A; 653 C; 783 G; 677 T; 0 other;
     ggttttgatt agtcgctgcc cccctcgcgg attaggagct agcgtctccc gcccgcccgc        60
     ctgccgcccc ggtgcccctg ggaggaagcg agagggaggc tgccagaggg tttgtcactg       120
     ctgttgctcc tccgcctcag cgagtccagc cccggcctag tcggtcgcct gcctttctca       180
     tagccgttac cctcaggccg ccacagccgc cgaccgggag aggcgcgcgc catggcctct       240
     ggagccgatt caaaaggtga tgatttatca acagccattc tcaaacagaa gaaccgaccc       300
     aatcggttaa ttgttgatga agccatcaat gaagataaca gcgtggtgtc cttgtcccag       360
     cccaagatgg atgaactgca gttgttccga ggtgacacgg tgttgctaaa aggaaagaaa       420
     agacgggaag ctgtatgcat tgttctttct gatgacacgt gttctgatga gaagattcga       480
     atgaatagag ttgttcggaa taacctccga gttcgcctag gagatgtcat cagcatccag       540
     ccatgccctg atgtaaagta tggcaaacgt atccacgttc tacccatcga tgacacagtg       600
     gaaggcatca ctggcaatct ctttgaggta taccttaagc cgtacttcct ggaagcttat       660
     cggcccatcc gtaaaggaga tatttttctt gtccggggtg ggatgcgtgc tgtggagttc       720
     aaagttgtag agacagatcc cagcccttac tgtattgttg ctccagacac agtgatccac       780
     tgtgaggggg agccaatcaa gcgagaggat gaggaggaat ccttgaatga agtaggctat       840
     gatgacatcg gtggttgcag gaagcagcta gctcagataa aggagatggt ggagctgcca       900
     ctgagacatc ctgcgctctt taaggcgatt ggtgtaaagc ctcctcgggg aatcttgttg       960
     tatgggcctc ctgggacagg gaagaccctg attgctcgag ctgtggcaaa tgaaactgga      1020
     gccttcttct ttctgatcaa tggtcctgaa atcatgagca aattggctgg tgagtctgag      1080
     agcaaccttc gtaaagcctt tgaggaagct gaaaagaatg ctcctgctat catcttcatc      1140
     gatgagcttg atgccattgc acccaaaaga gagaaaactc atggggaagt ggagcgtcgc      1200
     atcgtgtctc agttgttgac cctcatggat ggcctaaagc agagagcaca tgtgatagtt      1260
     atggcagcaa ccaatagacc caacagcatt gacccagccc tacggcgatt tggtcgcttt      1320
     gacagagagg tagatattgg aatacctgat gctacaggac gtttggagat tcttcagatc      1380
     cataccaaga acatgaaact ggcagatgat gtggacttgg aacaggtagc caatgagact      1440
     catggtcatg ttggtgctga tttggcagcc ctatgttcag aggctgctct gcaggccatc      1500
     cggaaaaaaa tggacctcat tgacctagaa gatgagacca ttgatactga ggtcatgaat      1560
     tccctggcag ttactatgga tgacttccgg tgggctttga gtcaaagcaa cccatcagca      1620
     cttcgggaaa ctgtggtaga ggtgccacaa gtaacctggg aagatattgg aggcctggag      1680
     gatgtcaaac gtgagcttca ggagttggtt cagtatcctg tggaacatcc agacaaattc      1740
     ctcaaatttg gcatgactcc ctccaaaggc gttcttttct atggacctcc tggctgtggg      1800
     aaaaccttac tggctaaagc cattgctaat gaatgccagg ccaacttcat ctccatcaag      1860
     ggtcctgagc tgcttaccat gtggtttggg gaatctgagg ccaatgtccg ggaaattttt      1920
     gacaaggcac ggcaagctgc cccctgtgta ctcttctttg atgagttaga ttcaattgcc      1980
     aaggctcgag gtggtaatat tggagatggt ggtggagctg ctgaccgagt catcaatcag      2040
     atcctgacag aaatggatgg catgtctaca aaaaagaatg tgtttatcat tggagctacc      2100
     aacaggcctg acatcattga tcctgctatc ctaagacctg gccgtctaga tcagctcatt      2160
     tatatcccac ttcctgatga gaagtcccgt gttgccatcc taaaagccaa tctgcgaaag      2220
     tccccagttg ccaaggatgt ggatttggag ttcctggcta agatgactaa tggcttttct      2280
     ggagctgatt tgacagaaat ttgccaacgg gcttgtaaac tggccattcg tgaatctatt      2340
     gagagtgaga ttaggcgaga acgagagagg cagacaaatc catcggctat ggaggtagaa      2400
     gaggacgatc cagtgcctga gatccgcaga gatcactttg aggaagccat gcgttttgcc      2460
     cgacgttctg tcagcgataa tgacattcgg aagtatgaaa tgttcgccca gacactgcag      2520
     cagagtcgag gttttggcag cttcagattc ccttcaggga accagggtgg agctggtccc      2580
     agccagggca gtggaggtgg cacaggtggc agtgtgtaca cagaagacaa tgacgatgac      2640
     ctgtatggct aagtgatgtg ccagcatgca gcgagctggc ctggctggac cttgttccct      2700
     gggggagggg gcgcttgccc aagagggacc aggggtgtgc ccatggcctg ttccattcct      2760
     cagtctgaac agttcagccc cagtcagact ctggacaggg gtttcctgtt gc              2812
//