ID BC052701; SV 1; linear; mRNA; STD; MUS; 4116 BP.
XX
AC BC052701;
XX
DT 21-MAY-2003 (Rel. 75, Created)
DT 24-SEP-2008 (Rel. 97, Last updated, Version 7)
XX
DE Mus musculus Pbx/knotted 1 homeobox, mRNA (cDNA clone MGC:64705
DE IMAGE:5721441), complete cds.
XX
KW MGC.
XX
OS Mus musculus (house mouse)
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
XX
RN [1]
RP 1-4116
RX DOI; 10.1073/pnas.242603899.
RX PUBMED; 12477932.
RG Mammalian Gene Collection Program Team
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human and
RT mouse cDNA sequences";
RL Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN [2]
RC NIH-MGC Project URL: http://mgc.nci.nih.gov
RP 1-4116
RG NIH MGC Project
RA ;
RT ;
RL Submitted (19-MAY-2003) to the INSDC.
RL National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL MD 20892-2590, USA
XX
DR MD5; 8f17b59d755ed06184eb108f8d2f8d47.
DR Ensembl-Gn; ENSMUSG00000006705; mus_musculus.
DR Ensembl-Gn; MGP_129S1SvImJ_G0023491; mus_musculus_129s1svimj.
DR Ensembl-Gn; MGP_AJ_G0023449; mus_musculus_aj.
DR Ensembl-Gn; MGP_AKRJ_G0023414; mus_musculus_akrj.
DR Ensembl-Gn; MGP_BALBcJ_G0023454; mus_musculus_balbcj.
DR Ensembl-Gn; MGP_C3HHeJ_G0023215; mus_musculus_c3hhej.
DR Ensembl-Gn; MGP_C57BL6NJ_G0023896; mus_musculus_c57bl6nj.
DR Ensembl-Gn; MGP_CASTEiJ_G0022716; mus_musculus_casteij.
DR Ensembl-Gn; MGP_CBAJ_G0023190; mus_musculus_cbaj.
DR Ensembl-Gn; MGP_DBA2J_G0023320; mus_musculus_dba2j.
DR Ensembl-Gn; MGP_FVBNJ_G0023290; mus_musculus_fvbnj.
DR Ensembl-Gn; MGP_LPJ_G0023398; mus_musculus_lpj.
DR Ensembl-Gn; MGP_NODShiLtJ_G0023308; mus_musculus_nodshiltj.
DR Ensembl-Gn; MGP_NZOHlLtJ_G0023938; mus_musculus_nzohlltj.
DR Ensembl-Gn; MGP_PWKPhJ_G0022466; mus_musculus_pwkphj.
DR Ensembl-Gn; MGP_WSBEiJ_G0022778; mus_musculus_wsbeij.
DR Ensembl-Tr; ENSMUST00000097352; mus_musculus.
DR Ensembl-Tr; ENSMUST00000175806; mus_musculus.
DR Ensembl-Tr; ENSMUST00000176701; mus_musculus.
DR Ensembl-Tr; MGP_129S1SvImJ_T0046754; mus_musculus_129s1svimj.
DR Ensembl-Tr; MGP_AJ_T0046730; mus_musculus_aj.
DR Ensembl-Tr; MGP_AKRJ_T0046686; mus_musculus_akrj.
DR Ensembl-Tr; MGP_BALBcJ_T0046693; mus_musculus_balbcj.
DR Ensembl-Tr; MGP_C3HHeJ_T0046426; mus_musculus_c3hhej.
DR Ensembl-Tr; MGP_C57BL6NJ_T0047175; mus_musculus_c57bl6nj.
DR Ensembl-Tr; MGP_CASTEiJ_T0046522; mus_musculus_casteij.
DR Ensembl-Tr; MGP_CBAJ_T0046363; mus_musculus_cbaj.
DR Ensembl-Tr; MGP_DBA2J_T0046478; mus_musculus_dba2j.
DR Ensembl-Tr; MGP_FVBNJ_T0046440; mus_musculus_fvbnj.
DR Ensembl-Tr; MGP_LPJ_T0046579; mus_musculus_lpj.
DR Ensembl-Tr; MGP_NODShiLtJ_T0046419; mus_musculus_nodshiltj.
DR Ensembl-Tr; MGP_NZOHlLtJ_T0047268; mus_musculus_nzohlltj.
DR Ensembl-Tr; MGP_PWKPhJ_T0046114; mus_musculus_pwkphj.
DR Ensembl-Tr; MGP_WSBEiJ_T0045791; mus_musculus_wsbeij.
XX
CC Contact: MGC help desk
CC Email: cgapbs-r@mail.nih.gov
CC Tissue Procurement: Dr. Jim Lin, University of Iowa
CC cDNA Library Preparation: M. Bento Soares, University of Iowa
CC cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC DNA Sequencing by: University of Iowa, Dr. M. Bento Soares and Dr.
CC Thomas L. Casavant.
CC Web site: http://genome.uiowa.edu
CC Contact: bento-soares@uiowa.edu; tom-casavant@uiowa.edu
CC Bonaldo,M.F., Akabogu,I., Bair,T., Bair,J., Crouch,K., Davis,A.,
CC Fishler,K., Keppel,C., Kucaba,T., Lebeck,M., Melo,A., Schaefer,K.,
CC Scheetz,T., Smith,C., Snir,E., Tack,D., Trout,K., Walters,J.,
CC Casavant,T., Soares,M.B.
CC Clone distribution: MGC clone distribution information can be found
CC through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC Series: Plate: Row: Column: 0
CC This clone was selected for full length sequencing because it
CC passed the following selection criteria: matched mRNA gi: 34368579.
CC Differences found between this sequence and the mouse C57BL/6J
CC genome (build 36) are described in misc_difference features below.
XX
FH Key Location/Qualifiers
FH
FT source 1..4116
FT /organism="Mus musculus"
FT /lab_host="DH10B"
FT /strain="C57BL/6"
FT /mol_type="mRNA"
FT /clone_lib="NIH_BMAP_FI0"
FT /clone="MGC:64705 IMAGE:5721441"
FT /tissue_type="Brain, enriched mouse brain 12.5dpc"
FT /note="Vector: pYX-ASC"
FT /db_xref="taxon:10090"
FT gene 1..4116
FT /gene="Pknox1"
FT /note="synonym: PREP1"
FT misc_difference 1..11
FT /gene="Pknox1"
FT /note="11 bases at the 5' end do not align to the mouse
FT genome."
FT CDS 69..1379
FT /codon_start=1
FT /gene="Pknox1"
FT /product="Pbx/knotted 1 homeobox"
FT /db_xref="GOA:O70477"
FT /db_xref="InterPro:IPR001356"
FT /db_xref="InterPro:IPR008422"
FT /db_xref="InterPro:IPR009057"
FT /db_xref="InterPro:IPR032453"
FT /db_xref="MGI:MGI:1201409"
FT /db_xref="UniProtKB/Swiss-Prot:O70477"
FT /protein_id="AAH52701.1"
FT /translation="MMATQTLSIDSYQDGQQMQVVTELKTEQDPNCSDPDAEGVSPPPI
FT ESQTPMDADKQAIYRHPLFPLLALLFEKCEQSTQGSEGTTSASFDVDIENFVRKQEKDG
FT KPFFCEDPETDNLMVKAIQVLRIHLLELEKVNELCKDFCSRYIACLKTKMNSETLLSGE
FT PGSPYSPVQSQQIQSAITGTLSPQGIVVPASALQQGNVTMATVAGGTVYQPVTVVTPQG
FT QVVTQALSPGTIRIQNSQLQLQLNQDLSILHQEDGSSKNKRGVLPKHATNVMRSWLFQH
FT IGHPYPTEDEKKQIAAQTNLTLLQVNNWFINARRRILQPMLDSSCSETPKTKKKPAQNR
FT PVQRFWPDSLASGVAQATPSELAMSEGAVVTITTPVNMNVDSLQSLSSDGATLAVQQVM
FT MAGQSEDESVDSTEDEGGALAPTHISGLVLENSDSLQ"
FT misc_difference 2255^2256
FT /gene="Pknox1"
FT /note="1 base in the mouse genome, T, is not found in
FT cDNA."
FT misc_difference 2458
FT /gene="Pknox1"
FT /note="'T' in cDNA is 'C' in the mouse genome."
FT misc_difference 3132
FT /gene="Pknox1"
FT /note="'T' in cDNA is 'C' in the mouse genome."
FT misc_difference 3539^3540
FT /gene="Pknox1"
FT /note="1 base in the mouse genome, A, is not found in
FT cDNA."
FT misc_difference 4098..4116
FT /gene="Pknox1"
FT /note="polyA tail: 19 bases do not align to the mouse
FT genome."
XX
SQ Sequence 4116 BP; 1078 A; 1010 C; 1005 G; 1023 T; 0 other;
ttggcaccca gacaccgtgt gcttctcgct caagatgatc tgatgtctga agtggactct 60
cactaaccat gatggcgaca cagacgctaa gtatagacag ctatcaagat ggacagcaaa 120
tgcaggtggt cacggagtta aaaacagagc aagatcccaa ctgctctgac ccagatgcag 180
aaggagtgag tcctcctccc atcgagtctc agaccccaat ggatgccgac aagcaggcca 240
tttataggca tccactattt ccgttgctag ctttgttgtt tgagaagtgt gagcagtcca 300
cacagggctc agaaggcacg acgtctgcca gcttcgatgt ggacattgag aactttgtca 360
ggaagcaaga aaaggatggg aaacccttct tctgtgaaga tccggaaact gacaacctaa 420
tggtgaaagc aatccaggtc ctgcgcattc atcttcttga actggagaag gttaatgagc 480
tctgtaaaga tttctgtagt cggtacattg cttgtctgaa gacaaaaatg aacagcgaga 540
ccttgttgag tggggagcct ggaagtccgt actcccctgt gcaatcccag cagattcaga 600
gtgccatcac aggcacgctc agcccccagg gaatcgtggt gccagcatca gccctacaac 660
agggaaatgt aaccatggca acagtggcag gtggcacagt gtaccagcct gtcaccgtcg 720
tcactccgca aggccaagtg gtcacgcagg cattatctcc tgggacaatt aggatccaga 780
actcacagct gcagttgcag ttgaaccaag acctcagcat cttgcatcaa gaggatggct 840
cctccaagaa caagaggggt gtcctgccga agcacgccac caacgtgatg cggtcctggc 900
tctttcagca catagggcat ccctacccaa cagaggatga gaaaaagcag attgctgctc 960
agacgaatct gaccctgctt caagtcaaca actggttcat caacgcgaga agacgaattc 1020
tccagccaat gttggattcc agctgctcag agactccaaa aacgaagaaa aaacctgctc 1080
agaacaggcc agttcagagg ttttggccag attctcttgc atcaggagtg gcacaagcaa 1140
cacccagcga gcttgccatg tcagaaggtg ccgttgtgac catcaccaca cctgtaaaca 1200
tgaatgtgga cagccttcag tccctgtcct cagacggggc caccctggcc gtacagcagg 1260
tcatgatggc agggcagagt gaggacgagt cggtggatag cacagaggat gagggcggcg 1320
ccctggcgcc cacacacatc agcgggctgg tgctggagaa cagcgactcc cttcagtagg 1380
aagcaccaac ggggtcagtg tggagctggg cgctggactc ttcactgttt gcacagcaag 1440
catcttacag ttgtctttgt aacctgtttt atatgtagat atagaaggtg cacttttgta 1500
tttcgcagca agcttcaaga cgtctttgcc ggtgcagcga cttccttcag atgtgcgtgt 1560
atgggttttt aatgctagaa acgtggcccc tgccccttga gtccttgacg agattaagga 1620
acagtgctgc cattttctaa aattctgcag ttgcaattga gtcgtgtggt ttacagcagg 1680
tctgggggcg gctgctggtg ccgctcacag gtctgaaaga tgacctgcac gcaggctgca 1740
gagcgagagg gctctggccc cggtgggctg tgggtgccca ccacacactt cagttctctc 1800
aggcagcatt tgcctgtttc atttgctata tagaaaaaga aactcctatt tttaccttgc 1860
tgggattatt ggataaaaag ctatttttat aaatcagtta ttaattggat tatgactata 1920
ttgaggataa atttctagag aagcaacagc acatgcttgt ctttcgattc ggaatgttct 1980
gaaaggcggc cacctgctca tggtctcctg ctctctgggg aggagagggc tcgcctcagc 2040
tgctaaaagg aaacaactgt caagtgagct tttgacatct ccagtttctg ctcatgtgtg 2100
atgcagactg ctacatttta cactcgctag agcatcttac acagtgtact cagtaggacg 2160
tacggtaagt gagggctctg ccacgcctag tctgactgca gttccatggc tgcctggcta 2220
tgttaactgg gtggtaattt tagatttttt tttttagtta aaacctgtta tatagagaat 2280
gtttattata aactaatata aagtgtgttc tgccccactg gctcagagct ggtgtaaaca 2340
gcaaacacct aaactgtctt tcttactggc agacacctag aagttagaac catggctgta 2400
acgtggcggt ggggcccact ggtggctctc actgtcactc actctccagg ctcctcctcc 2460
tctctgttac cactacagac tcaggacagt ttgaggacct catttctgag tgtcaaggct 2520
gagatttggt ttggtttgtt tgggtctact tgggaagctt tttcttagga aacaggcatc 2580
tgtcttcaca gtggtaccta ggccacctgc agatagggaa cagctgtcta ggggtccctg 2640
ccttgggccc cacggttacc ccacggttac ctctttagct aattcaggaa tagttcacca 2700
tcaccccaaa tgtcttgatt ctcttcagcc attgagcaca cctgaacagc caagtagtca 2760
gcaaaaagaa tgggggctcc gtttgaacca cttaaggtgc gtccaagaca gcaaaataaa 2820
acccccagaa cttgccagca actcctttga ggagagaata ctatttcaag cagtctttcc 2880
cttacctatt tacaaactgt tttaacagtt taagtccggc atacaaaagc ctgcacgatc 2940
agctggctct attaggttta gagcagaagt acatctctct tgtgagccct gcagagctct 3000
gccagtcccc aaaatggaag cacagcccct gctgcaccag ctggtggcgc tctagctggg 3060
attcgaacac catgctctac cgtgtccttt ccagttgaaa tcatctttcc taagcacaga 3120
actctagcct gttaaatcaa aggaatatca tgaagatcgt aagagaaccc caggcacggt 3180
gacacactcc tgtaatccca acactcagga acctgagaca ggaggcaagc ctgggctact 3240
tgaccctgtc ttaaaaaaaa aaagctagta aaagagctgg cacagggtca tatcctttca 3300
cctccccgtg cacagagatg cttttaagtg ctccccatga ggcgagccat gctggttcat 3360
agagtccctg cccagcagat ttggtgtctg ccagttgcac ctttgagaca cctgcttgct 3420
gggcacaatg gcatattcct gtaaacaagc acttaggagg ctgaagcagg aggattgctg 3480
caaggtcgag ttggcctctg gtgtgtagtg aaacccagtt tcaaaaaaaa aaaaaaaaat 3540
cgagcaaaac tcctgctcgc ctttccttgg ccagtgcaga aaacctttga gctttgttta 3600
aatggtcaga ctcccaaaca ttggagcctt ttgaatgtgt tctgagacct acagtcaact 3660
cgtgtcattc ctgctgtttg gctgtctagc agaaatctag actggctgac ccactcctcc 3720
ggtgggctca ggttttctgc tttcttgatt ccagattgtc aaatagaagt gtatatgcaa 3780
tatgcactgt gacccttgcg gctagggctg acgggccact gtgtagcctg ctgggtggcc 3840
agtgcacctg tgggggcagc gggagacttg ctcctccccc atggctggca gggaaatgcc 3900
catcccagag gcccctcccg caggttgtga ctggaacaca gcaggctgcc ttcttcagtt 3960
aagttcggtc tttgaaactg acaatcttta aaatgtgaat actgtaacaa tatgttttct 4020
tggattgttg tctttaaaag gatttttgtg aagcaattga tttatcaaag aaaaaaaaaa 4080
ttaaaaccag aaacatgaaa aaaaaaaaaa aaaaaa 4116
//