ID   BC052701; SV 1; linear; mRNA; STD; MUS; 4116 BP.
XX
AC   BC052701;
XX
DT   21-MAY-2003 (Rel. 75, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 7)
XX
DE   Mus musculus Pbx/knotted 1 homeobox, mRNA (cDNA clone MGC:64705
DE   IMAGE:5721441), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-4116
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-4116
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (19-MAY-2003) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 8f17b59d755ed06184eb108f8d2f8d47.
DR   Ensembl-Gn; ENSMUSG00000006705; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0023491; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0023449; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0023414; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0023454; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0023215; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0023896; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0022716; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0023190; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0023320; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0023290; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0023398; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0023308; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0023938; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0022466; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0022778; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000097352; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000175806; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000176701; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0046754; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0046730; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0046686; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0046693; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0046426; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0047175; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0046522; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0046363; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0046478; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0046440; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0046579; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0046419; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0047268; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0046114; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0045791; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Dr. Jim Lin, University of Iowa
CC   cDNA Library Preparation: M. Bento Soares, University of Iowa
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: University of Iowa, Dr. M. Bento Soares and Dr.
CC   Thomas L. Casavant.
CC   Web site: http://genome.uiowa.edu
CC   Contact: bento-soares@uiowa.edu; tom-casavant@uiowa.edu
CC   Bonaldo,M.F., Akabogu,I., Bair,T., Bair,J., Crouch,K., Davis,A.,
CC   Fishler,K., Keppel,C., Kucaba,T., Lebeck,M., Melo,A., Schaefer,K.,
CC   Scheetz,T., Smith,C., Snir,E., Tack,D., Trout,K., Walters,J.,
CC   Casavant,T., Soares,M.B.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series:  Plate:  Row:  Column: 0
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 34368579.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4116
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="C57BL/6"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_BMAP_FI0"
FT                   /clone="MGC:64705 IMAGE:5721441"
FT                   /tissue_type="Brain, enriched mouse brain 12.5dpc"
FT                   /note="Vector: pYX-ASC"
FT                   /db_xref="taxon:10090"
FT   gene            1..4116
FT                   /gene="Pknox1"
FT                   /note="synonym: PREP1"
FT   misc_difference 1..11
FT                   /gene="Pknox1"
FT                   /note="11 bases at the 5' end do not align to the mouse
FT                   genome."
FT   CDS             69..1379
FT                   /codon_start=1
FT                   /gene="Pknox1"
FT                   /product="Pbx/knotted 1 homeobox"
FT                   /db_xref="GOA:O70477"
FT                   /db_xref="InterPro:IPR001356"
FT                   /db_xref="InterPro:IPR008422"
FT                   /db_xref="InterPro:IPR009057"
FT                   /db_xref="InterPro:IPR032453"
FT                   /db_xref="MGI:MGI:1201409"
FT                   /db_xref="UniProtKB/Swiss-Prot:O70477"
FT                   /protein_id="AAH52701.1"
FT                   /translation="MMATQTLSIDSYQDGQQMQVVTELKTEQDPNCSDPDAEGVSPPPI
FT                   ESQTPMDADKQAIYRHPLFPLLALLFEKCEQSTQGSEGTTSASFDVDIENFVRKQEKDG
FT                   KPFFCEDPETDNLMVKAIQVLRIHLLELEKVNELCKDFCSRYIACLKTKMNSETLLSGE
FT                   PGSPYSPVQSQQIQSAITGTLSPQGIVVPASALQQGNVTMATVAGGTVYQPVTVVTPQG
FT                   QVVTQALSPGTIRIQNSQLQLQLNQDLSILHQEDGSSKNKRGVLPKHATNVMRSWLFQH
FT                   IGHPYPTEDEKKQIAAQTNLTLLQVNNWFINARRRILQPMLDSSCSETPKTKKKPAQNR
FT                   PVQRFWPDSLASGVAQATPSELAMSEGAVVTITTPVNMNVDSLQSLSSDGATLAVQQVM
FT                   MAGQSEDESVDSTEDEGGALAPTHISGLVLENSDSLQ"
FT   misc_difference 2255^2256
FT                   /gene="Pknox1"
FT                   /note="1 base in the mouse genome, T, is not found in
FT                   cDNA."
FT   misc_difference 2458
FT                   /gene="Pknox1"
FT                   /note="'T' in cDNA is 'C' in the mouse genome."
FT   misc_difference 3132
FT                   /gene="Pknox1"
FT                   /note="'T' in cDNA is 'C' in the mouse genome."
FT   misc_difference 3539^3540
FT                   /gene="Pknox1"
FT                   /note="1 base in the mouse genome, A, is not found in
FT                   cDNA."
FT   misc_difference 4098..4116
FT                   /gene="Pknox1"
FT                   /note="polyA tail: 19 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 4116 BP; 1078 A; 1010 C; 1005 G; 1023 T; 0 other;
     ttggcaccca gacaccgtgt gcttctcgct caagatgatc tgatgtctga agtggactct        60
     cactaaccat gatggcgaca cagacgctaa gtatagacag ctatcaagat ggacagcaaa       120
     tgcaggtggt cacggagtta aaaacagagc aagatcccaa ctgctctgac ccagatgcag       180
     aaggagtgag tcctcctccc atcgagtctc agaccccaat ggatgccgac aagcaggcca       240
     tttataggca tccactattt ccgttgctag ctttgttgtt tgagaagtgt gagcagtcca       300
     cacagggctc agaaggcacg acgtctgcca gcttcgatgt ggacattgag aactttgtca       360
     ggaagcaaga aaaggatggg aaacccttct tctgtgaaga tccggaaact gacaacctaa       420
     tggtgaaagc aatccaggtc ctgcgcattc atcttcttga actggagaag gttaatgagc       480
     tctgtaaaga tttctgtagt cggtacattg cttgtctgaa gacaaaaatg aacagcgaga       540
     ccttgttgag tggggagcct ggaagtccgt actcccctgt gcaatcccag cagattcaga       600
     gtgccatcac aggcacgctc agcccccagg gaatcgtggt gccagcatca gccctacaac       660
     agggaaatgt aaccatggca acagtggcag gtggcacagt gtaccagcct gtcaccgtcg       720
     tcactccgca aggccaagtg gtcacgcagg cattatctcc tgggacaatt aggatccaga       780
     actcacagct gcagttgcag ttgaaccaag acctcagcat cttgcatcaa gaggatggct       840
     cctccaagaa caagaggggt gtcctgccga agcacgccac caacgtgatg cggtcctggc       900
     tctttcagca catagggcat ccctacccaa cagaggatga gaaaaagcag attgctgctc       960
     agacgaatct gaccctgctt caagtcaaca actggttcat caacgcgaga agacgaattc      1020
     tccagccaat gttggattcc agctgctcag agactccaaa aacgaagaaa aaacctgctc      1080
     agaacaggcc agttcagagg ttttggccag attctcttgc atcaggagtg gcacaagcaa      1140
     cacccagcga gcttgccatg tcagaaggtg ccgttgtgac catcaccaca cctgtaaaca      1200
     tgaatgtgga cagccttcag tccctgtcct cagacggggc caccctggcc gtacagcagg      1260
     tcatgatggc agggcagagt gaggacgagt cggtggatag cacagaggat gagggcggcg      1320
     ccctggcgcc cacacacatc agcgggctgg tgctggagaa cagcgactcc cttcagtagg      1380
     aagcaccaac ggggtcagtg tggagctggg cgctggactc ttcactgttt gcacagcaag      1440
     catcttacag ttgtctttgt aacctgtttt atatgtagat atagaaggtg cacttttgta      1500
     tttcgcagca agcttcaaga cgtctttgcc ggtgcagcga cttccttcag atgtgcgtgt      1560
     atgggttttt aatgctagaa acgtggcccc tgccccttga gtccttgacg agattaagga      1620
     acagtgctgc cattttctaa aattctgcag ttgcaattga gtcgtgtggt ttacagcagg      1680
     tctgggggcg gctgctggtg ccgctcacag gtctgaaaga tgacctgcac gcaggctgca      1740
     gagcgagagg gctctggccc cggtgggctg tgggtgccca ccacacactt cagttctctc      1800
     aggcagcatt tgcctgtttc atttgctata tagaaaaaga aactcctatt tttaccttgc      1860
     tgggattatt ggataaaaag ctatttttat aaatcagtta ttaattggat tatgactata      1920
     ttgaggataa atttctagag aagcaacagc acatgcttgt ctttcgattc ggaatgttct      1980
     gaaaggcggc cacctgctca tggtctcctg ctctctgggg aggagagggc tcgcctcagc      2040
     tgctaaaagg aaacaactgt caagtgagct tttgacatct ccagtttctg ctcatgtgtg      2100
     atgcagactg ctacatttta cactcgctag agcatcttac acagtgtact cagtaggacg      2160
     tacggtaagt gagggctctg ccacgcctag tctgactgca gttccatggc tgcctggcta      2220
     tgttaactgg gtggtaattt tagatttttt tttttagtta aaacctgtta tatagagaat      2280
     gtttattata aactaatata aagtgtgttc tgccccactg gctcagagct ggtgtaaaca      2340
     gcaaacacct aaactgtctt tcttactggc agacacctag aagttagaac catggctgta      2400
     acgtggcggt ggggcccact ggtggctctc actgtcactc actctccagg ctcctcctcc      2460
     tctctgttac cactacagac tcaggacagt ttgaggacct catttctgag tgtcaaggct      2520
     gagatttggt ttggtttgtt tgggtctact tgggaagctt tttcttagga aacaggcatc      2580
     tgtcttcaca gtggtaccta ggccacctgc agatagggaa cagctgtcta ggggtccctg      2640
     ccttgggccc cacggttacc ccacggttac ctctttagct aattcaggaa tagttcacca      2700
     tcaccccaaa tgtcttgatt ctcttcagcc attgagcaca cctgaacagc caagtagtca      2760
     gcaaaaagaa tgggggctcc gtttgaacca cttaaggtgc gtccaagaca gcaaaataaa      2820
     acccccagaa cttgccagca actcctttga ggagagaata ctatttcaag cagtctttcc      2880
     cttacctatt tacaaactgt tttaacagtt taagtccggc atacaaaagc ctgcacgatc      2940
     agctggctct attaggttta gagcagaagt acatctctct tgtgagccct gcagagctct      3000
     gccagtcccc aaaatggaag cacagcccct gctgcaccag ctggtggcgc tctagctggg      3060
     attcgaacac catgctctac cgtgtccttt ccagttgaaa tcatctttcc taagcacaga      3120
     actctagcct gttaaatcaa aggaatatca tgaagatcgt aagagaaccc caggcacggt      3180
     gacacactcc tgtaatccca acactcagga acctgagaca ggaggcaagc ctgggctact      3240
     tgaccctgtc ttaaaaaaaa aaagctagta aaagagctgg cacagggtca tatcctttca      3300
     cctccccgtg cacagagatg cttttaagtg ctccccatga ggcgagccat gctggttcat      3360
     agagtccctg cccagcagat ttggtgtctg ccagttgcac ctttgagaca cctgcttgct      3420
     gggcacaatg gcatattcct gtaaacaagc acttaggagg ctgaagcagg aggattgctg      3480
     caaggtcgag ttggcctctg gtgtgtagtg aaacccagtt tcaaaaaaaa aaaaaaaaat      3540
     cgagcaaaac tcctgctcgc ctttccttgg ccagtgcaga aaacctttga gctttgttta      3600
     aatggtcaga ctcccaaaca ttggagcctt ttgaatgtgt tctgagacct acagtcaact      3660
     cgtgtcattc ctgctgtttg gctgtctagc agaaatctag actggctgac ccactcctcc      3720
     ggtgggctca ggttttctgc tttcttgatt ccagattgtc aaatagaagt gtatatgcaa      3780
     tatgcactgt gacccttgcg gctagggctg acgggccact gtgtagcctg ctgggtggcc      3840
     agtgcacctg tgggggcagc gggagacttg ctcctccccc atggctggca gggaaatgcc      3900
     catcccagag gcccctcccg caggttgtga ctggaacaca gcaggctgcc ttcttcagtt      3960
     aagttcggtc tttgaaactg acaatcttta aaatgtgaat actgtaacaa tatgttttct      4020
     tggattgttg tctttaaaag gatttttgtg aagcaattga tttatcaaag aaaaaaaaaa      4080
     ttaaaaccag aaacatgaaa aaaaaaaaaa aaaaaa                                4116
//