Dbfetch

ID   BC012873; SV 1; linear; mRNA; STD; MUS; 2743 BP.
XX
AC   BC012873;
XX
DT   24-AUG-2001 (Rel. 68, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 10)
XX
DE   Mus musculus TAR DNA binding protein, mRNA (cDNA clone MGC:19284
DE   IMAGE:4016437), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2743
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2743
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (20-AUG-2001) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 034171ee975ef1984251cf1f4a965773.
DR   Ensembl-Gn; ENSMUSG00000041459; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0029129; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0029094; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0029042; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0029110; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0028831; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0029559; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0028244; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0028793; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0028942; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0028906; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0029032; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0028933; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0029591; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0027964; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0028324; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000084125; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0069263; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0069327; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0069233; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0069244; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0068913; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0069740; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0069381; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0068853; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0068989; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0068889; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0069076; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0068906; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0069924; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0068877; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0068077; mus_musculus_wsbeij.
DR   EuropePMC; PMC129717; 12361981.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Gilbert Smith, Ph.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Baylor College of Medicine Human Genome
CC   Sequencing Center
CC   Center code: BCM-HGSC
CC   Web site: http://www.hgsc.bcm.tmc.edu/cdna/
CC   Contact: amg@bcm.tmc.edu
CC   Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
CC   Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
CC   A.N., Gibbs, R.A.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 24 Row: a Column: 21.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2743
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="Czech II"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Lu30"
FT                   /clone="MGC:19284 IMAGE:4016437"
FT                   /tissue_type="Mammary tumor metastatized to lung.
FT                   MMTV-LTR/Wnt1 model. Expression driven by an MMTV-LTR
FT                   enhancer."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..2743
FT                   /gene="Tardbp"
FT   misc_difference 51
FT                   /gene="Tardbp"
FT                   /note="'C' in cDNA is 'T' in the mouse genome."
FT   CDS             89..1333
FT                   /codon_start=1
FT                   /gene="Tardbp"
FT                   /product="TAR DNA binding protein"
FT                   /db_xref="GOA:Q921F2"
FT                   /db_xref="InterPro:IPR000504"
FT                   /db_xref="MGI:MGI:2387629"
FT                   /db_xref="PDB:3D2W"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q921F2"
FT                   /protein_id="AAH12873.1"
FT                   /translation="MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRYRN
FT                   PVSQCMRGVRLVEGILHAPDAGWGNLVYVVNYPKDNKRKMDETDASSAVKVKRAVQKTS
FT                   DLIVLGLPWKTTEQDLKDYFSTFGEVLMVQVKKDLKTGHSKGFGFVRFTEYETQVKVMS
FT                   QRHMIDGRWCDCKLPNSKQSPDEPLRSRKVFVGRCTEDMTAEELQQFFCQYGEVVDVFI
FT                   PKPFRAFAFVTFADDKVAQSLCGEDLIIKGISVHISNAEPKHNSNRQLERSGRFGGNPG
FT                   GFGNQGGFGNSRGGGAGLGNNQGGNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGML
FT                   ASQQNQSGPSGNNQSQGSMQREPNQAFGSGNNSYSGSNSGAPLGWGSASNAGSGSGFNG
FT                   GFGSSMDSKSSGWGM"
FT   misc_difference 226
FT                   /gene="Tardbp"
FT                   /note="'G' in cDNA is 'C' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 397
FT                   /gene="Tardbp"
FT                   /note="'G' in cDNA is 'A' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 418
FT                   /gene="Tardbp"
FT                   /note="'G' in cDNA is 'T' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 436
FT                   /gene="Tardbp"
FT                   /note="'A' in cDNA is 'T' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 1362
FT                   /gene="Tardbp"
FT                   /note="'G' in cDNA is 'A' in the mouse genome."
FT   misc_difference 1387
FT                   /gene="Tardbp"
FT                   /note="'G' in cDNA is 'T' in the mouse genome."
FT   misc_difference 2730..2743
FT                   /gene="Tardbp"
FT                   /note="polyA tail: 14 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 2743 BP; 743 A; 465 C; 715 G; 820 T; 0 other;
     gcggtagcgc ggctgttgtc ggattccttc ccgtctgtgc ttcctccttg cgcttcctag        60
     cagtggccta gcggagattt aagcaaagat gtctgaatat attcgggtaa cagaagatga       120
     gaacgatgaa cccattgaaa taccatcaga agacgatggg acggtgttgc tgtccacagt       180
     tacagcccag tttccagggg catgcggcct gcgctaccgg aatccggtgt ctcagtgtat       240
     gagaggagtc cgactggtgg aaggaattct gcatgcccca gatgctggct ggggcaatct       300
     ggtatatgtt gtcaactatc ccaaagataa caaaaggaaa atggatgaga cagatgcttc       360
     ctctgcagtg aaagtgaaaa gagcagtcca gaaaacgtct gacctcatag tgttggggct       420
     cccctggaaa acaacagagc aggatctgaa agactatttc agtacttttg gagaggttct       480
     tatggttcag gtcaagaaag atcttaaaac tggtcactcg aaagggtttg gctttgttcg       540
     atttacagaa tatgaaaccc aagtgaaagt aatgtcacaa cgacatatga tagatgggcg       600
     atggtgtgac tgtaaacttc ccaactctaa gcaaagccca gacgagcctt tgagaagcag       660
     aaaggtgttt gttggacgtt gtacagagga catgactgct gaagagcttc agcagttttt       720
     ctgtcagtat ggagaagtgg tagatgtctt cattcccaaa ccattcagag cttttgcctt       780
     cgtcaccttt gcagatgata aggttgccca gtctctttgt ggagaggatt tgatcattaa       840
     aggaatcagc gtgcatatat ccaatgctga acctaagcat aatagcaata gacagttaga       900
     aagaagtgga agatttggtg gtaatccagg tggctttggg aatcagggtg ggtttggtaa       960
     cagtagaggg ggtggagctg gcttgggaaa taaccagggt ggtaatatgg gtggagggat      1020
     gaactttggt gcttttagca ttaacccagc gatgatggct gcggctcagg cagcgttgca      1080
     gagcagttgg ggtatgatgg gcatgttagc cagccagcag aaccagtcgg gcccatctgg      1140
     gaataaccaa agccagggca gcatgcagag ggaaccaaat caggcttttg gttctggaaa      1200
     taattcctac agtggttcta attctggtgc cccccttggt tgggggtcag catcaaatgc      1260
     aggatcgggc agtggtttta atgggggctt tggctcgagc atggattcta agtcttctgg      1320
     ctggggaatg taggtggtgg ggggtggtta gtaggttggt tgttaggtta ggtagattta      1380
     gaatgggggg attcaaattt ttctaaactc atggtaagta tattgtaaaa tacatatgta      1440
     ctaaaatttt cagattggtt tgttcagtgt ggagtatatt cagcagtatt tttgacattt      1500
     ttctttagaa aaaaagaggg gaaagctaaa tgaattttat aagttttgtt atataaaggg      1560
     ttaaaatact gagtgggtga aagtgaactg ctgtttgcct aattggtaaa ccaacactac      1620
     aattgatctc agaaggtttc tctgtaatat tctatcattg aaattgttaa tgaattcttt      1680
     gcatgttcag agtagaaacc attggttaga actacattct tttctcctta ttttaatttg      1740
     aatcccaccc tatgaatttt ttccttagga aaatctccat ttgggagatc atgatgtcat      1800
     ggtgtttgat tcttttggtt ttgtttttaa cacttgtctt ccttcatata cgaaagtaca      1860
     atatgaagcc ttcatttaat ctctgcagtt catctcattt caaatgttta tggaagaagc      1920
     acttcattga aagtagtgct gtaaatattc tgccatagga atacttctgt ctacatgctt      1980
     tctcatccaa gaattcgtca tcacgctgca caggctgcgt ctttgacggt gggtgttcca      2040
     tttttatccg ctactcttta tttcatggag tcgtatcaac gctatgaacg caaggctgtg      2100
     atatggaacc agaaggctgt ttgaactttt gaaaccttgt gtgggattga tggtggtgcc      2160
     gaggcatgaa aggctagtat gagcgagaaa aggagagcgc gtgcagagac ttggtggtgg      2220
     aaaatggata ttttttaact tggagagatg tgtcactcaa tcctgtggct ttggtgagag      2280
     agtgtgcaga gagcaatgat agcaaataac gtacgaatgt tttacatcaa aggacatcca      2340
     catcagttgg aagactttga gttttgttct taggaaaccc actttagttg aatgtgttaa      2400
     gtgaaatact tgtacttccc tccccctctg tcaactgctg tgaatgctgt atggtgtgtg      2460
     ttctcctctg ttactgatct ggaagtgtgg gaacgtgaac tgaagctgat gggctgcgaa      2520
     catggactga gcttgtggtg tgctttgcag gagaacttgg aagcagagtt caccagtgag      2580
     ctcaggtgtc tcaaagaagg gtggaagttc tcatgtctgt tagctattca taagaatgct      2640
     gtttgctgca gttctgtgtc ctgtgcttgg atgctttttt ataagagttg tcattgttgg      2700
     aaattcttaa ataaaactga tttaaataaa aaaaaaaaaa aaa                        2743
//