ID   BC059838; SV 1; linear; mRNA; STD; MUS; 4265 BP.
XX
AC   BC059838;
XX
DT   11-OCT-2003 (Rel. 77, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 5)
XX
DE   Mus musculus PR domain containing 16, mRNA (cDNA clone MGC:69644
DE   IMAGE:6409778), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-4265
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-4265
RA   Strausberg R.;
RT   ;
RL   Submitted (07-OCT-2003) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Cancer
RL   Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03,
RL   Bethesda, MD 20892-2590, USA
XX
DR   MD5; c36e13a296bbe1f4033d83b333d280fb.
DR   Ensembl-Gn; ENSMUSG00000039410; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000030902; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000070313; mus_musculus.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Dr. Jim Lin, University of Iowa
CC   cDNA Library Preparation: M. Bento Soares, University of Iowa
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Genome Sequence Centre,
CC   BC Cancer Agency, Vancouver, BC, Canada
CC   info@bcgsc.bc.ca
CC   Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
CC   Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
CC   Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
CC   Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
CC   Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
CC   Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
CC   Michael Thorne, Miranda Tsai, Natasja van den Bosch, Jill Vardy,
CC   George Yang, Scott Zuyderduyn, Marco Marra.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 132 Row: b Column: 5
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: Hexamer frequency ORF
CC   analysis, Similarity but not identity to protein.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4265
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="C57BL/6"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_BMAP_FO0"
FT                   /clone="MGC:69644 IMAGE:6409778"
FT                   /tissue_type="Brain, enriched mouse brain 12.5dp"
FT                   /note="Vector: pYX-ASC"
FT                   /db_xref="taxon:10090"
FT   gene            1..4265
FT                   /gene="Prdm16"
FT                   /note="synonym: mel1"
FT   CDS             93..3629
FT                   /codon_start=1
FT                   /product="Prdm16 protein"
FT                   /db_xref="GOA:A2A935"
FT                   /db_xref="InterPro:IPR001214"
FT                   /db_xref="InterPro:IPR007087"
FT                   /db_xref="InterPro:IPR013087"
FT                   /db_xref="InterPro:IPR015880"
FT                   /db_xref="InterPro:IPR030413"
FT                   /db_xref="MGI:MGI:1917923"
FT                   /db_xref="UniProtKB/Swiss-Prot:A2A935"
FT                   /protein_id="AAH59838.1"
FT                   /translation="MRSKARARKLAKSDGDVVNNMYEPDPDLLAGQSAEEETEDGILSP
FT                   IPMGPPSPFPTSEDFTPKEGSPYEAPVYIPEDIPIPPDFELRESSIPGAGLGIWAKRKM
FT                   EIGERFGPYVVTPRAALKEADFGWEQMLTDTEVSSQESCIKKQISEDLGSEKFCVDANQ
FT                   AGSGSWLKYIRVACSCDDQNLAMCQINEQIYYKVIKDIEPGEELLVHVKEGAYSLGVMA
FT                   PSLDEDPTFRCDECDELFQCRLDLRRHKKYACSSAGAQLYEGLGEELKPEGLGVGSDGQ
FT                   AHECKDCERMFPNKYSLEQHMIVHTEEREYKCDQCPKAFNWKSNLIRHQMSHDSGKRFE
FT                   CENCVKVFTDPSNLQRHIRSQHVGARAHACPDCGKTFATSSGLKQHKHIHSTVKPFICE
FT                   VCHKSYTQFSNLCRHKRMHADCRTQIKCKDCGQMFSTTSSLNKHRRFCEGKNHYTPGSI
FT                   FTPGLPLTPSPMMDKTKPSPTLNHGGLGFSEYFPSRPHPGSLPFSAAPPAFPALTPGFP
FT                   GIFPPSLYPRPPLLPPTPLLKSPLNHAQDAKLPSPLGNPALPLVSAVSNSSQGATAATG
FT                   SEEKFDGRLEDAYAEKVKNRSPDMSDGSDFEDINTTTGTDLDTTTGTGSDLDSDLDSDR
FT                   DKGKDKGKPVESKPEFGGASVPPGAMNSVAEVPAFYSQHSFFPPPEEQLLTASGAAGDS
FT                   IKAIASIAEKYFGPGFMSMQEKKLGSLPYHSVFPFQFLPNFPHSLYPFTDRALAHNLLV
FT                   KAEPKSPRDALKVGGPSAECPFDLTTKPKEAKPALLAPKVPLIPSSGEEQPLDLSIGSR
FT                   ARASQNGGGREPRKNHVYGERKPGVSEGLPKVCPAQLPQQPSLHYAKPSPFFMDPIYSR
FT                   VEKRKVADPVGVLKEKYLRPSPLLFHPQMSAIETMTEKLESFAAMKADSGSSLQPLPHH
FT                   PFNFRSPPPTLSDPILRKGKERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRS
FT                   FSISSNLQRHVRNIHNKEKPFKCHLCNRCFGQQTNLDRHLKKHEHEGAPVSQHSGVLTN
FT                   HLGTSASSPTSESDNHALLDEKEDSYFSEIRNFIANSEMNQASTRMDKRPEIQDLDSNP
FT                   PCPGSASAKPEDVEEEEEEELEEEDDDSLAGKSQEDTVSPTPEPQGVYEDEEDEEPPSL
FT                   TMGFDHTRRHMQ"
FT   misc_feature    342..722
FT                   /note="SET; Region: SET (Su(var)3-9, Enhancer-of-zeste,
FT                   Trithorax) domain"
FT   misc_feature    1023..1091
FT                   /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2
FT                   zinc finger is the classical zinc finger domain. The two
FT                   conserved cysteines and histidines co-ordinate a zinc ion.
FT                   The following pattern describes the zinc finger.
FT                   #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
FT                   any amino acid, and numbers in brackets indicate the number
FT                   of residues. The positions marked # are those that are
FT                   important for the stable fold of the zinc finger. The final
FT                   position can be either his or cys. The C2H2 zinc finger is
FT                   composed of two short beta strands followed by an alpha
FT                   helix. The amino terminal part of the helix binds the major
FT                   groove in DNA binding zinc fingers"
FT   misc_feature    1278..1346
FT                   /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2
FT                   zinc finger is the classical zinc finger domain. The two
FT                   conserved cysteines and histidines co-ordinate a zinc ion.
FT                   The following pattern describes the zinc finger.
FT                   #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
FT                   any amino acid, and numbers in brackets indicate the number
FT                   of residues. The positions marked # are those that are
FT                   important for the stable fold of the zinc finger. The final
FT                   position can be either his or cys. The C2H2 zinc finger is
FT                   composed of two short beta strands followed by an alpha
FT                   helix. The amino terminal part of the helix binds the major
FT                   groove in DNA binding zinc fingers"
FT   misc_feature    2949..3017
FT                   /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2
FT                   zinc finger is the classical zinc finger domain. The two
FT                   conserved cysteines and histidines co-ordinate a zinc ion.
FT                   The following pattern describes the zinc finger.
FT                   #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
FT                   any amino acid, and numbers in brackets indicate the number
FT                   of residues. The positions marked # are those that are
FT                   important for the stable fold of the zinc finger. The final
FT                   position can be either his or cys. The C2H2 zinc finger is
FT                   composed of two short beta strands followed by an alpha
FT                   helix. The amino terminal part of the helix binds the major
FT                   groove in DNA binding zinc fingers"
XX
SQ   Sequence 4265 BP; 1062 A; 1308 C; 1156 G; 739 T; 0 other;
     ggtgtccaaa ctgacaatgc tagggagatg aagatagtgt gtagctgctt ctgggctcaa        60
     ggaggaggag agagattccg cgagccgaca ccatgcgatc caaggcgagg gcgaggaagc       120
     tagccaaaag tgacggtgac gttgtaaata atatgtatga acctgacccg gacctgctgg       180
     ccggccagag tgccgaggag gagaccgaag acggcatcct gtcccccatc cccatggggc       240
     caccgtcccc cttccccacc agcgaggact tcactcccaa ggagggctcg ccctatgagg       300
     ctcctgtcta cattcctgaa gacattccaa tcccaccaga cttcgagcta cgagagtcct       360
     ccataccagg agctggcctg gggatctggg ccaagcggaa gatggaaatc ggggagaggt       420
     ttggccccta cgtggtgacg ccccgggccg cactgaagga ggccgacttt ggatgggagc       480
     agatgctgac ggatacagag gtgtcatccc aggagagctg catcaaaaag cagatctctg       540
     aagacttggg tagcgagaag ttctgcgtgg atgccaatca ggcggggtct ggcagctggc       600
     tcaagtacat ccgtgtagcg tgttcctgtg atgaccaaaa cctcgccatg tgtcagatca       660
     acgaacagat ttactataaa gtcattaagg acatcgagcc tggagaggaa ctgttggtgc       720
     atgtgaaaga aggtgcctac tccttgggtg tcatggcccc cagcttggat gaggacccca       780
     cattccgctg tgatgagtgt gatgagctct tccagtgcag gctggacctg aggcgccaca       840
     agaagtacgc gtgcagctct gcaggagccc agctctacga gggcctaggg gaggaactca       900
     agcccgaggg ccttggcgtg ggcagcgacg ggcaagcgca tgagtgcaag gattgcgagc       960
     ggatgttccc caacaagtac agcttggagc aacacatgat cgtccacacg gaagagcgtg      1020
     agtacaaatg tgaccagtgt cccaaggcct tcaactggaa gtccaacctc atccgccacc      1080
     agatgtctca cgacagtggc aagcgcttcg aatgtgaaaa ctgtgtcaag gtgttcacgg      1140
     accccagcaa cctccagcgt cacatccgct cacagcatgt cggtgcccgg gcccatgcct      1200
     gccctgactg tggcaagacc ttcgccacat cctctggcct caaacagcac aagcatatcc      1260
     acagcacggt gaagccattc atatgcgagg tctgccacaa gtcctacacg cagttctcca      1320
     acctgtgccg gcacaagcgg atgcacgccg actgcaggac gcagatcaag tgcaaggact      1380
     gtgggcagat gttcagcact acctcctccc tcaacaagca tcggagattc tgcgagggca      1440
     agaaccatta cacgcctggc agcatcttca ccccaggcct gcccttgacc cccagcccca      1500
     tgatggacaa gacaaaaccc tccccgaccc tcaaccacgg gggcctaggc ttcagcgagt      1560
     acttcccctc cagacctcat cctgggagcc tgcccttctc ggctgctcct ccggccttcc      1620
     ccgcactcac tccgggcttc ccgggcatct ttcctccatc cctgtaccca cgaccacctc      1680
     tgctacctcc cacgccgctg ctcaagagcc ccctgaacca cgcgcaggac gccaagctac      1740
     ccagcccgct gggaaaccca gccctgcccc ttgtctccgc ggtcagcaat agcagccagg      1800
     gtgccacagc ggccaccggg tcagaggaga aatttgatgg ccgcttggaa gacgcatatg      1860
     cggagaaggt caaaaatagg agccctgaca tgtcggatgg cagtgacttt gaggatatca      1920
     acaccacgac cgggacagac ttggacacta ccacgggcac ggggtcagac ctggacagcg      1980
     acctggacag tgacagagac aaaggcaagg acaaggggaa gccagtggag agcaaacctg      2040
     agtttggggg tgcatctgtg ccccctgggg ccatgaacag tgtggccgag gtaccggcct      2100
     tctactcaca gcattccttc ttcccgccac ccgaggaaca gctgctgacg gcctcgggag      2160
     ctgccggcga ctccatcaag gccatcgcgt ccatcgcgga gaaatacttc ggtcctggct      2220
     tcatgagcat gcaggagaag aagctgggct cactacccta ccactccgtg ttccccttcc      2280
     agttcctgcc taactttccc cactccctct acccctttac ggaccgagcc ctcgcccaca      2340
     acttgctggt caaggctgag ccaaagtcac cccgggatgc cctcaaggtg ggcggcccca      2400
     gtgcggagtg ccccttcgac ctcaccacca aaccaaaaga ggccaaaccc gccctgctcg      2460
     cacccaaggt ccccctcatc ccctcatctg gcgaggaaca gccactggac ctgagcatcg      2520
     gcagcagggc cagggcaagc cagaacggag gtggccgtga gccgcggaag aaccacgtct      2580
     acggtgaacg gaagccgggg gtcagcgagg ggctgcctaa ggtgtgccca gcacagctgc      2640
     cccagcagcc ctccttgcat tatgctaagc cttcaccgtt cttcatggat cccatctaca      2700
     gcagggtaga aaagcggaag gtggcagacc ctgtgggagt cctgaaagag aagtacctgc      2760
     ggccgtcccc acttctgttc cacccccaga tgtcagccat agaaaccatg acggagaagc      2820
     tggagagctt tgcagccatg aaggccgact caggcagctc cctgcagccc ctgcctcacc      2880
     acccgttcaa cttccgctcc ccacccccaa cgctctcgga tcccatcctc aggaagggga      2940
     aggagagata cacgtgcagg tactgtggca agatcttccc cagatctgca aatctcacaa      3000
     gacatctgag gacacacaca ggggagcagc catacaggtg caagtactgt gaccggtcat      3060
     tcagcatctc ctccaacctc cagcggcacg tgaggaacat ccacaacaaa gagaagccgt      3120
     tcaagtgcca tctgtgcaac cgctgcttcg ggcagcagac caacctagac cggcacctga      3180
     agaagcacga acacgagggc gcaccagtga gccagcactc cggggtgctc acgaaccacc      3240
     tgggcaccag cgcctcctcc cccacctccg agtcggacaa ccatgcactt ttagatgaga      3300
     aggaagattc ttacttctcc gagatccgaa acttcatcgc caacagcgag atgaaccagg      3360
     catccactcg aatggacaaa cggcctgaga tccaagacct ggacagcaac ccaccgtgtc      3420
     caggctcagc cagtgcaaag ccagaggacg tagaggagga ggaagaggag gagctggagg      3480
     aagaggatga tgacagctta gccgggaagt cacaggagga cacggtgtcc cccacacctg      3540
     agccccaagg agtctatgaa gatgaagagg atgaggaacc acccagcctg accatgggct      3600
     ttgaccatac ccggaggcat atgcaatgat gctgtccctc tctgaagaca ctcctctcca      3660
     cgccccctcc cagagctcac tggatgcttg gttgaacatc acaggaccct cgtcagagtc      3720
     cggagccttt aaccccatca accacctctg aggtcctgga ggccccaggg ccagagtcca      3780
     aagccagggt accagctgca ggagatggag aaggggccag ggagcagccc cccaccctca      3840
     acacctccac tttgcaaagt ccagcttctc cattgaaact cagaacccga aggtcccttg      3900
     agtagccgct ggccttcatc acctctcaga actggcctca aggacacgat ctgcagtggg      3960
     tgcggtgcac gggccaccca ggagctgctc acaggagcca tggatcagaa aactcgtggg      4020
     caagggtggg gtctctatcc cagcaggagc cagttggcca catccaggca actgcatggt      4080
     atgaaagagg aaatcggaaa gacgtgggca agtgctatgg agagagacct catcaatgat      4140
     ttttataatg agaatcacat gattaagcct tttggtaatc ttattgacta tagagtctat      4200
     ttaagcatgt gggttttaaa aaaaatagac ggtatttttt aaaaatcaaa aaaaaaaaaa      4260
     aaaaa                                                                  4265
//
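
The three zf-C2H2 misc_feature notes in the record above quote the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]. The sketch below is one possible way to check that pattern against the annotated regions; it is not part of the record. It assumes that '#' and 'X' can both be treated as "any residue", and the names C2H2, ZINC_FINGERS and feature_to_residues are hypothetical. The three peptides were read from the /translation qualifier at the residue ranges implied by the CDS start (93) and each feature location.

```python
import re

# One reading of the pattern quoted in the zf-C2H2 notes.
# Assumption: '#' (fold-stabilising position) and 'X' both match any residue.
C2H2 = re.compile(
    r"""
    .        # fold-stabilising position
    .        # X
    C        # first conserved cysteine
    .{1,5}   # X(1-5)
    C        # second conserved cysteine
    .{3}     # X3
    .        # fold-stabilising position
    .{5}     # X5
    .        # fold-stabilising position
    .{2}     # X2
    H        # first conserved histidine
    .{3,6}   # X(3-6)
    [HC]     # final position: His or Cys
    """,
    re.VERBOSE,
)


def feature_to_residues(cds_start, feat_start, feat_end):
    """Map an in-frame nucleotide range to 1-based, inclusive residue numbers."""
    first = (feat_start - cds_start) // 3 + 1
    last = (feat_end - cds_start) // 3 + 1
    return first, last


# Peptides copied from the /translation qualifier, keyed by feature location.
ZINC_FINGERS = {
    (1023, 1091): "YKCDQCPKAFNWKSNLIRHQMSH",
    (1278, 1346): "FICEVCHKSYTQFSNLCRHKRMH",
    (2949, 3017): "YTCRYCGKIFPRSANLTRHLRTH",
}

if __name__ == "__main__":
    for (start, end), peptide in ZINC_FINGERS.items():
        residues = feature_to_residues(93, start, end)
        matched = C2H2.fullmatch(peptide) is not None
        print(f"{start}..{end} -> residues {residues}, pattern match: {matched}")
```

Each feature spans 69 nucleotides, so each maps to 23 residues. Treating '#' as a wildcard makes this a permissive check; a stricter version could restrict those positions to hydrophobic residues.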