Dbfetch

ID   BC025544; SV 1; linear; mRNA; STD; MUS; 2908 BP.
XX
AC   BC025544;
XX
DT   13-MAR-2002 (Rel. 71, Created)
DT   16-JUL-2006 (Rel. 88, Last updated, Version 15)
XX
DE   Mus musculus TAR DNA binding protein, mRNA (cDNA clone MGC:36405
DE   IMAGE:5321007), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2908
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2908
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (06-MAR-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 642db693199d93565a67bf15c92dcee8.
DR   Ensembl-Gn; ENSMUSG00000041459; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0029129; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0029094; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0029042; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0029110; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0028831; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0029559; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0028244; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0028793; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0028942; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0028906; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0029032; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0028933; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0029591; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0027964; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0028324; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000084125; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0069263; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0069327; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0069233; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0069244; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0068913; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0069740; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0069381; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0068853; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0068989; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0068889; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0069076; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0068906; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0069924; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0068877; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0068077; mus_musculus_wsbeij.
DR   EuropePMC; PMC2661999; 18703504.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Jeffrey Green M.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: National Institutes of Health Intramural
CC   Sequencing Center (NISC),
CC   Gaithersburg, Maryland;
CC   Web site: http://www.nisc.nih.gov/
CC   Contact: nisc_mgc@nhgri.nih.gov
CC   Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
CC   Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
CC   Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
CC   Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
CC   Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
CC   McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
CC   Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
CC   Young,A., Zhang,L.-H. and Green,E.D.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 56 Row: f Column: 1
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 31543841.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2908
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="FVB/N"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Mam6"
FT                   /clone="MGC:36405 IMAGE:5321007"
FT                   /tissue_type="Mammary tumor. C3(1)-Tag model. Infiltrating
FT                   ductal carcinoma. 5 month old virgin mouse."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..2908
FT                   /gene="Tardbp"
FT   misc_difference 1..9
FT                   /gene="Tardbp"
FT                   /note="9 bases at the 5' end do not align to the mouse
FT                   genome."
FT   misc_difference 11
FT                   /gene="Tardbp"
FT                   /note="'G' in cDNA is 'A' in the mouse genome."
FT   misc_difference 52
FT                   /gene="Tardbp"
FT                   /note="'C' in cDNA is 'G' in the mouse genome."
FT   CDS             102..1346
FT                   /codon_start=1
FT                   /gene="Tardbp"
FT                   /product="TAR DNA binding protein"
FT                   /db_xref="GOA:Q921F2"
FT                   /db_xref="InterPro:IPR000504"
FT                   /db_xref="MGI:MGI:2387629"
FT                   /db_xref="PDB:3D2W"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q921F2"
FT                   /protein_id="AAH25544.1"
FT                   /translation="MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRYRN
FT                   PVSQCMRGVRLVEGILHAPDAGWGNLVYVVNYPKDNKRKMDETDASSAVKVKRAVQKTS
FT                   DLIVLGLPWKTTEQDLKDYFSTFGEVLMVQVKKDLKTGHSKGFGFVRFTEYETQVKVMS
FT                   QRHMIDGRWCDCKLPNSKQSPDEPLRSRKVFVGRCTEDMTAEELQQFFCQYGEVVDVFI
FT                   PKPFRAFAFVTFADDKVAQSLCGEDLIIKGISVHISNAEPKHNSNRQLERSGRFGGNPG
FT                   GFGNQGGFGNSRGGGAGLGNNQGGNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGML
FT                   ASQQNQSGPSGNNQSQGSMQREPNQAFGSGNNSYSGSNSGAPLGWGSASNAGSGSGFNG
FT                   GFGSSMDSKSSGWGM"
FT   misc_difference 2376
FT                   /gene="Tardbp"
FT                   /note="'C' in cDNA is 'T' in the mouse genome."
FT   misc_difference 2881..2908
FT                   /gene="Tardbp"
FT                   /note="polyA tail: 28 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 2908 BP; 796 A; 496 C; 736 G; 880 T; 0 other;
     ccacgcgtcc gcggagcggt agcgcggctg ttgtcggatt ccttcccgtc tctgcttcct        60
     ccttgtgctt cctagcagtg gcctagcgga tttaagcaaa gatgtctgaa tatattcggg       120
     taacagaaga tgagaacgat gaacccattg aaataccatc agaagacgat gggacggtgt       180
     tgctgtccac agttacagcc cagtttccag gggcatgcgg cctgcgctac cggaatcccg       240
     tgtctcagtg tatgagagga gtccgactgg tggaaggaat tctgcatgcc ccagatgctg       300
     gctggggcaa tctggtatat gttgtcaact atcccaaaga taacaaaagg aaaatggatg       360
     agacagatgc ttcctctgca gtgaaagtga aaagagcagt ccagaaaaca tctgacctca       420
     tagtgttggg tctcccctgg aaaacaactg agcaggatct gaaagactat ttcagtactt       480
     ttggagaggt tcttatggtt caggtcaaga aagatcttaa aactggtcac tcgaaagggt       540
     ttggctttgt tcgatttaca gaatatgaaa cccaagtgaa agtaatgtca caacgacata       600
     tgatagatgg gcgatggtgt gactgtaaac ttcccaactc taagcaaagc ccagacgagc       660
     ctttgagaag cagaaaggtg tttgttggac gttgtacaga ggacatgact gctgaagagc       720
     ttcagcagtt tttctgtcag tatggagaag tggtagatgt cttcattccc aaaccattca       780
     gagcttttgc cttcgtcacc tttgcagatg ataaggttgc ccagtctctt tgtggagagg       840
     atttgatcat taaaggaatc agcgtgcata tatccaatgc tgaacctaag cataatagca       900
     atagacagtt agaaagaagt ggaagatttg gtggtaatcc aggtggcttt gggaatcagg       960
     gtgggtttgg taacagtaga gggggtggag ctggcttggg aaataaccag ggtggtaata      1020
     tgggtggagg gatgaacttt ggtgctttta gcattaaccc agcgatgatg gctgcggctc      1080
     aggcagcgtt gcagagcagt tggggtatga tgggcatgtt agccagccag cagaaccagt      1140
     cgggcccatc tgggaataac caaagccagg gcagcatgca gagggaacca aatcaggctt      1200
     ttggttctgg aaataattcc tacagtggtt ctaattctgg tgcccccctt ggttgggggt      1260
     cagcatcaaa tgcaggatcg ggcagtggtt ttaatggggg ctttggctcg agcatggatt      1320
     ctaagtcttc tggctgggga atgtaggtgg tggggggtgg ttagtaggtt ggttattagg      1380
     ttaggtagat ttagaatggt gggattcaaa tttttctaaa ctcatggtaa gtatattgta      1440
     aaatacatat gtactaaaat tttcagattg gtttgttcag tgtggagtat attcagcagt      1500
     atttttgaca tttttcttta gaaaaaaaga ggggaaagct aaatgaattt tataagtttt      1560
     gttatataaa gggttaaaat actgagtggg tgaaagtgaa ctgctgtttg cctaattggt      1620
     aaaccaacac tacaattgat ctcagaaggt ttctctgtaa tattctatca ttgaaattgt      1680
     taatgaattc tttgcatgtt cagagtagaa accattggtt agaactacat tcttttctcc      1740
     ttattttaat ttgaatccca ccctatgaat tttttcctta ggaaaatctc catttgggag      1800
     atcatgatgt catggtgttt gattcttttg gttttgtttt taacacttgt cttccttcat      1860
     atacgaaagt acaatatgaa gccttcattt aatctctgca gttcatctca tttcaaatgt      1920
     ttatggaaga agcacttcat tgaaagtagt gctgtaaata ttctgccata ggaatacttc      1980
     tgtctacatg ctttctcatc caagaattcg tcatcacgct gcacaggctg cgtctttgac      2040
     ggtgggtgtt ccatttttat ccgctactct ttatttcatg gagtcgtatc aacgctatga      2100
     acgcaaggct gtgatatgga accagaaggc tgtttgaact tttgaaacct tgtgtgggat      2160
     tgatggtggt gccgaggcat gaaaggctag tatgagcgag aaaaggagag cgcgtgcaga      2220
     gacttggtgg tggaaaatgg atatttttta acttggagag atgtgtcact caatcctgtg      2280
     gctttggtga gagagtgtgc agagagcaat gatagcaaat aacgtacgaa tgttttacat      2340
     caaaggacat ccacatcagt tggaagactt tgagtcttgt tcttaggaaa cccactttag      2400
     ttgaatgtgt taagtgaaat acttgtactt ccctccccct ctgtcaactg ctgtgaatgc      2460
     tgtatggtgt gtgttctcct ctgttactga tctggaagtg tgggaacgtg aactgaagct      2520
     gatgggctgc gaacatggac tgagcttgtg gtgtgctttg caggagaact tggaagcaga      2580
     gttcaccagt gagctcaggt gtctcaaaga agggtggaag ttctcatgtc tgttagctat      2640
     tcataagaat gctgtttgct gcagttctgt gtcctgtgct tggatgcttt tttataagag      2700
     ttgtcattgt tggaaattct taaataaaac tgatttaaat aatatgtgtc tttgttttgc      2760
     agccctgaat gcaaagaatt catagcagtt aattcccctt tttgaccctt ttgagatgga      2820
     actttcataa agtttcttgg cagtagttta ttttgcttca aataaactta tttgaaaagt      2880
     aaaaaaaaaa aaaaaaaaaa aaaaaaaa                                         2908
//