ID   AK085458; SV 1; linear; mRNA; HTC; MUS; 4088 BP.
XX
AC   AK085458;
XX
DT   19-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 14)
XX
DE   Mus musculus 0 day neonate kidney cDNA, RIKEN full-length enriched library,
DE   clone:D630029M03 product:WISKOTT-ALDRICH SYNDROME PROTEIN FAMILY MEMBER 2
DE   (WASP-FAMILY PROTEIN MEMBER 2) (VERPROLIN HOMOLOGY DOMAIN-CONTAINING
DE   PROTEIN 2) homolog [Homo sapiens], full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-4088
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-APR-2002) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 27596ca6ed253b2ba3f9b9833f259775.
DR   Ensembl-Gn; ENSMUSG00000028868; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0028886; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0028845; mus_musculus_aj.
DR   Ensembl-Gn; MGP_BALBcJ_G0028868; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0028582; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0029310; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0028015; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0028548; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0028697; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0028664; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0028798; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0028690; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0029348; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0027736; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0028095; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000084241; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000105912; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0068334; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0068390; mus_musculus_aj.
DR   Ensembl-Tr; MGP_BALBcJ_T0068313; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0067978; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0068802; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0068424; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0067926; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0068060; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0067971; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0068157; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0067976; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0068983; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0067929; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0067165; mus_musculus_wsbeij.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group at the
CC   RIKEN Genomic Sciences Center and the Genome Science Laboratory at RIKEN.
CC   The Division of Experimental Animal Research at RIKEN contributed by
CC   preparing the mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   Clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=D630029M03
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4088
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="0 day neonate"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="D630029M03"
FT                   /tissue_type="kidney"
FT                   /db_xref="taxon:10090"
FT   CDS             140..1633
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="WISKOTT-ALDRICH SYNDROME PROTEIN FAMILY MEMBER 2
FT                   (WASP-FAMILY PROTEIN MEMBER 2) (VERPROLIN HOMOLOGY
FT                   DOMAIN-CONTAINING PROTEIN 2) homolog [Homo sapiens]
FT                   (SWISSPROT|Q9Y6W5, evidence: FASTY, 87%ID, 100%length,
FT                   match=1491)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q8BH43"
FT                   /db_xref="InterPro:IPR003124"
FT                   /db_xref="InterPro:IPR028288"
FT                   /db_xref="MGI:MGI:1098641"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q8BH43"
FT                   /protein_id="BAC39451.1"
FT                   /translation="MPLVTRNIEPRHLCRQTLPSDTSELECRTNITLANVIRQLGSLSK
FT                   YAEDIFGEICTQASAFASRVNSLAERVDRVQVKVTQLDPKEEEVSLQGINTRKAFRSST
FT                   TQDQKLFDRNSLPVPVLETYNSCDAPPPLNNLSPYRDDGKEALKFYTNPSYFFDLWKEK
FT                   MLQDTKDIMKEKRKHRKEKKDNPNRGNVNPRKIKTRKEEWEKMKMGQEFVESKERLGPS
FT                   GYSSTLVYQNGSIGSVENVDAASYPPPPQSDSASSPSPSFSEDNLPPPPAEFSYPADNQ
FT                   RGSVLAGPKRTSMVSPSHPPPAPPLSSPPGPKPGFAPPPAPPPPPPMSVPPPLPSMGFG
FT                   SPGTPPPPSPPSFPPHPDFAAPPPPPPPPAADYPMPPPPLSQPSGGAPPPPPPPPPPGP
FT                   PPLPFSGADGQPAAPPPPPPSEATKPKSSLPAVSDARSDLLSAIRQGFQLRRVEEQREQ
FT                   EKRDVVGNDVATILSRRIAVEYSDSEDDSSEFDEDDWSD"
XX
SQ   Sequence 4088 BP; 887 A; 1220 C; 1054 G; 927 T; 0 other;
     gagcagcccg cgaagcagta ggctggagcg ttctccgtgg ggacgcggac cggctgccca        60
     gcccttcatg tcgctgaggg cgactgcgcg aggtcaggtt tttttcatca ttgagaacct       120
     cgcctgaagc aggtccacta tgccgttagt aaccaggaac atcgagccaa ggcacctgtg       180
     ccgtcagacg ttgcctagcg atacaagcga gctggaatgc aggaccaaca tcaccctggc       240
     aaatgtcatc cgacagctgg gcagcctgag taagtatgca gaggacattt tcggagagat       300
     ttgtactcag gcaagtgcct ttgcctcccg agtaaactcc cttgctgagc gggttgaccg       360
     agtacaagtt aaagtcactc agctggatcc caaggaagaa gaagtgtcac tacaaggaat       420
     caacactcgg aaggccttca gaagttctac cacccaagac cagaagctct ttgacaggaa       480
     ctctctcccg gtgcccgtct tagagaccta taacagctgt gacgctcctc cccctcttaa       540
     caatctcagt ccttacaggg acgatggaaa agaggcactc aaattctaca ccaacccctc       600
     atacttcttt gatctttgga aggagaagat gctgcaggac accaaggata tcatgaaaga       660
     gaagagaaag cataggaaag aaaagaaaga taatccaaat agagggaatg tgaacccacg       720
     taaaatcaag acacgcaagg aagagtggga gaagatgaaa atgggacaag aatttgtaga       780
     gtccaaggag aggctggggc cttctgggta ttcgtccacc ttggtgtacc agaatggcag       840
     cattggctct gttgaaaatg tggatgcggc cagctaccca ccaccaccac agtcggactc       900
     tgcctcctca ccttcccctt cattctctga agacaacttg cctcccccgc cagcagaatt       960
     cagctaccct gcagacaacc aaagaggatc tgtcttggct ggacccaaaa gaaccagcat      1020
     ggtcagccca agccatccac ccccagctcc tcctctgagc tctccgccag gtcccaagcc      1080
     tggcttcgct cctccacctg cccctccacc tccacctccc atgagtgtgc cacctccact      1140
     accatcgatg ggattcgggt ctcctgggac ccctccaccc ccatcacctc catctttccc      1200
     tcctcaccct gattttgctg ctcctccacc acctccccca ccaccagcag ctgactaccc      1260
     aatgccacca cctcctttgt cccaaccgtc tggaggtgct cctccgcctc ctcctccccc      1320
     tccccctcca gggcccccgc ctctcccctt cagcggtgct gatggccagc ctgctgcacc      1380
     accaccacca ccaccttctg aggccaccaa gcccaagtct tcattgcctg ctgtgagtga      1440
     cgcgcgcagt gacctgcttt ctgccattcg ccaaggcttt cagctgcgaa gggttgaaga      1500
     gcaacgggag caagagaagc gtgatgtggt gggcaatgat gtggccacca tcctgtcccg      1560
     tcggatcgct gttgagtaca gcgactcgga agatgattct tctgaatttg atgaagacga      1620
     ctggtcggat taactccgcc tgctgcccac attcctcttt ctagtcctcc cacctgcctt      1680
     cctccataca aactaacagt ctcatggggg gaagaaagca agggacaaag aactgcaagg      1740
     ggccaaagcc aaagatctat ccgagtgctt ccttcaactg accatatgct tcttcctcgt      1800
     gtctgcctgc gggctcctga gggtccacag ctgagctgat ccatctttcc ctcaagtgac      1860
     cgtatctagt gacaggaagg gagaacccta cagtagatcc ctctgcttgg gtttcaactg      1920
     gagatagtgc cttcccctga taaccatttt aacccatggc cttcgtcaaa accctttcat      1980
     ttagctactt caaataatgt tatccacctc ctgtctcaca gcagtgggta cagcctgtgt      2040
     gggcgcaggt ggtccctttt ctgtgccctg acaagtagac tggcacactc ctgccccaca      2100
     ctcagccttc tgcagcaggt aggcttgctt ctgttagctc tgcctggatg cctttgttcc      2160
     agaaggcatc aggcctctaa agactgtctc acaagtgatg cccttctgga gactgagggt      2220
     gtggccttcc tcccatgctc tctctgggtg agtgtggggt ctgggtgtcc tttgcattcc      2280
     cctcctcacc ccaggtgcct taagacagcc cagttctaac ctgatattca tgaccaaact      2340
     agccctgaca ctcagggacc tgggcctctg ctgactgtca ggagcaaaac ttgaaggtgt      2400
     ggcaccgcag gactggaaga gagtggaaca ccttgggtcc tgcctgttca gggtactgag      2460
     gccaagatga gaagtgactc tctcaaggtc acacagatag gtgatagaac aagagcccag      2520
     ggttcctgtt tccaagtcac ctgtgtttcc tgggaccaaa taatgagcac ctgctggagt      2580
     ctgggcagag cagcctggct cccttctacc ccagagctcc caaatgccgg ccttgcattt      2640
     ggagggttca tggcctgtgc ggggaggtgg gggggggggc tctccggtgc agctttaact      2700
     gcagccgctc tggtaagccg ctcttatttc cccttctgtc tgcagccagt catgtggcat      2760
     gggagctgtg ctggggaggc agtgtagaga ggagctgtgg ggcagccagc cttggcaggt      2820
     gcagtttgca gttcactgca gctccctccc ccttttgttt cctcaattta agcagaggtt      2880
     gtgccagctg tagaaacttc agtccctcag gctggcagcc gctgcaggta cctgcctggg      2940
     ggggcctggc aggccatgga gaaggctgag aggcagaagg acacgggtct tcctgcctca      3000
     ccaccttgct gtgcatgatg atgctgaccc acccccttta cctcccttcc cagaactgat      3060
     acacgggtgt tattattcag gaaaaaaact gagcagctct ggccagcagc aaggtttctt      3120
     ctctactgcc ccaactattg tgtggcctct tgtgctgaaa tctctggctt cagagcctga      3180
     aacaaagaga aacaggatct gtctctaccc agcacagcaa atggttgtag tattgccaaa      3240
     gccctcataa acccctccgg cttgaggaga gtgcaatcat ggctctgctg cctgcacttg      3300
     ctgggttcct cccctctgcc tcctttcttg aactcagagg tagggactaa gcaccaggtg      3360
     ccagcaggcc ctcctgctgt ggccactcca gagtgctccg tgtctccatg tgtccgtggc      3420
     ctgccccccc cccaaccccc caaatccagt gtctggcacc gggtttggca gtcaactcca      3480
     tgctgtgcat gtttcccacc ccgtgcctta gaagctgaag gtgctttttc atcagaacct      3540
     taacatggct gttgatggta ccctggctgc agcccagcgg gggctggaga ggcaggggag      3600
     ggcagtggtc cttccaaata gaagtcctgg ggcctggcca ggccagggtt tgggcctaat      3660
     ggtttttact aaattactcc cctcctgctc tccgagaaag gggagccaga gccgctcact      3720
     gtggttctgt tccgaccttg aaggggcggt attggcctgg cttctggaat ggacagagtc      3780
     caccaggaaa gactgggggc gggaggagct ggggaggggc cctgtctgcg gaaggtagga      3840
     ttagatcatt agctcagtga cctcctaggg tttcgatgtg ctgtgttctc atcctacagc      3900
     tggtttggta atgatctgca agtcccggag agcaacagca cagctccgcc tgacgctctc      3960
     attaaaatcc atgcagccaa gctctgcgct ttgtagcagc cggccttgca gagcctcctc      4020
     agcttggggg gctggggacc cagtcagctg agaggtcctc taggctctac ttaggcatat      4080
     gtgtaccc                                                               4088
//