Dbfetch

ID   BC031126; SV 1; linear; mRNA; STD; MUS; 2753 BP.
XX
AC   BC031126;
XX
DT   14-JUN-2002 (Rel. 72, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 12)
XX
DE   Mus musculus TAR DNA binding protein, mRNA (cDNA clone MGC:36366
DE   IMAGE:4977114), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2753
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2753
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (03-JUN-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; a8d666aa920ec03fa0f9b3e7b6e8e55f.
DR   Ensembl-Gn; ENSMUSG00000041459; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0029129; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0029094; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0029042; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0029110; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0028831; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0029559; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0028244; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0028793; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0028942; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0028906; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0029032; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0028933; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0029591; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0027964; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0028324; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000084125; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0069263; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0069327; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0069233; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0069244; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0068913; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0069740; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0069381; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0068853; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0068989; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0068889; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0069076; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0068906; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0069924; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0068877; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0068077; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Jeffrey Green M.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Sequencing Group at the Stanford Human Genome
CC   Center, Stanford University School of Medicine, Stanford, CA  94305
CC   Web site:       http://www-shgc.stanford.edu
CC   Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
CC   Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
CC   R. M.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 59 Row: k Column: 13
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 31543841.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2753
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="FVB/N"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Mam6"
FT                   /clone="MGC:36366 IMAGE:4977114"
FT                   /tissue_type="Mammary tumor. C3(1)-Tag model. Infiltrating
FT                   ductal carcinoma. 5 month old virgin mouse."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..2753
FT                   /gene="Tardbp"
FT   CDS             93..1337
FT                   /codon_start=1
FT                   /gene="Tardbp"
FT                   /product="TAR DNA binding protein"
FT                   /db_xref="GOA:Q921F2"
FT                   /db_xref="InterPro:IPR000504"
FT                   /db_xref="InterPro:IPR012677"
FT                   /db_xref="InterPro:IPR035979"
FT                   /db_xref="InterPro:IPR041105"
FT                   /db_xref="MGI:MGI:2387629"
FT                   /db_xref="PDB:3D2W"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q921F2"
FT                   /protein_id="AAH31126.1"
FT                   /translation="MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRYRN
FT                   PVSQCMRGVRLVEGILHAPDAGWGNLVYVVNYPKDNKRKMDETDASSAVKVKRAVQKTS
FT                   DLIVLGLPWKTTEQDLKDYFSTFGEVLMVQVKKDLKTGHSKGFGFVRFTEYETQVKVMS
FT                   QRHMIDGRWCDCKLPNSKQSPDEPLRSRKVFVGRCTEDMTAEELQQFFCQYGEVVDVFI
FT                   PKPFRAFAFVTFADDKVAQSLCGEDLIIKGISVHISNAEPKHNSNRQLERSGRFGGNPG
FT                   GFGNQGGFGNSRGGGAGLGNNQGGNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGML
FT                   ASQQNQSGPSGNNQSQGSMQREPNQAFGSGNNSYSGSNSGAPLGWGSASNAGSGSGFNG
FT                   GFGSSMDSKSSGWGM"
FT   misc_difference 2736..2753
FT                   /gene="Tardbp"
FT                   /note="polyA tail: 18 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 2753 BP; 750 A; 466 C; 712 G; 825 T; 0 other;
     cggagcggta gcgcggctgt tgtcggattc cttcccgtct gtgcttcctc cttgtgcttc        60
     ctagcagtgg cctagcggag atttaagcaa agatgtctga atatattcgg gtaacagaag       120
     atgagaacga tgaacccatt gaaataccat cagaagacga tgggacggtg ttgctgtcca       180
     cagttacagc ccagtttcca ggggcatgcg gcctgcgcta ccggaatccc gtgtctcagt       240
     gtatgagagg agtccgactg gtggaaggaa ttctgcatgc cccagatgct ggctggggca       300
     atctggtata tgttgtcaac tatcccaaag ataacaaaag gaaaatggat gagacagatg       360
     cttcctctgc agtgaaagtg aaaagagcag tccagaaaac atctgacctc atagtgttgg       420
     gtctcccctg gaaaacaact gagcaggatc tgaaagacta tttcagtact tttggagagg       480
     ttcttatggt tcaggtcaag aaagatctta aaactggtca ctcgaaaggg tttggctttg       540
     ttcgatttac agaatatgaa acccaagtga aagtaatgtc acaacgacat atgatagatg       600
     ggcgatggtg tgactgtaaa cttcccaact ctaagcaaag cccagacgag cctttgagaa       660
     gcagaaaggt gtttgttgga cgttgtacag aggacatgac tgctgaagag cttcagcagt       720
     ttttctgtca gtatggagaa gtggtagatg tcttcattcc caaaccattc agagcttttg       780
     ccttcgtcac ctttgcagat gataaggttg cccagtctct ttgtggagag gatttgatca       840
     ttaaaggaat cagcgtgcat atatccaatg ctgaacctaa gcataatagc aatagacagt       900
     tagaaagaag tggaagattt ggtggtaatc caggtggctt tgggaatcag ggtgggtttg       960
     gtaacagtag agggggtgga gctggcttgg gaaataacca gggtggtaat atgggtggag      1020
     ggatgaactt tggtgctttt agcattaacc cagcgatgat ggctgcggct caggcagcgt      1080
     tgcagagcag ttggggtatg atgggcatgt tagccagcca gcagaaccag tcgggcccat      1140
     ctgggaataa ccaaagccag ggcagcatgc agagggaacc aaatcaggct tttggttctg      1200
     gaaataattc ctacagtggt tctaattctg gtgcccccct tggttggggg tcagcatcaa      1260
     atgcaggatc gggcagtggt tttaatgggg gctttggctc gagcatggat tctaagtctt      1320
     ctggctgggg aatgtaggtg gtggggggtg gttagtaggt tggttattag gttaggtaga      1380
     tttagaatgg tgggattcaa atttttctaa actcatggta agtatattgt aaaatacata      1440
     tgtactaaaa ttttcagatt ggtttgttca gtgtggagta tattcagcag tatttttgac      1500
     atttttcttt agaaaaaaag aggggaaagc taaatgaatt ttataagttt tgttatataa      1560
     agggttaaaa tactgagtgg gtgaaagtga actgctgttt gcctaattgg taaaccaaca      1620
     ctacaattga tctcagaagg tttctctgta atattctatc attgaaattg ttaatgaatt      1680
     ctttgcatgt tcagagtaga aaccattggt tagaactaca ttcttttctc cttattttaa      1740
     tttgaatccc accctatgaa ttttttcctt aggaaaatct ccatttggga gatcatgatg      1800
     tcatggtgtt tgattctttt ggttttgttt ttaacacttg tcttccttca tatacgaaag      1860
     tacaatatga agccttcatt taatctctgc agttcatctc atttcaaatg tttatggaag      1920
     aagcacttca ttgaaagtag tgctgtaaat attctgccat aggaatactt ctgtctacat      1980
     gctttctcat ccaagaattc gtcatcacgc tgcacaggct gcgtctttga cggtgggtgt      2040
     tccattttta tccgctactc tttatttcat ggagtcgtat caacgctatg aacgcaaggc      2100
     tgtgatatgg aaccagaagg ctgtttgaac ttttgaaacc ttgtgtggga ttgatggtgg      2160
     tgccgaggca tgaaaggcta gtatgagcga gaaaaggaga gcgcgtgcag agacttggtg      2220
     gtggaaaatg gatatttttt aacttggaga gatgtgtcac tcaatcctgt ggctttggtg      2280
     agagagtgtg cagagagcaa tgatagcaaa taacgtacga atgttttaca tcaaaggaca      2340
     tccacatcag ttggaagact ttgagttttg ttcttaggaa acccacttta gttgaatgtg      2400
     ttaagtgaaa tacttgtact tccctccccc tctgtcaact gctgtgaatg ctgtatggtg      2460
     tgtgttctcc tctgttactg atctggaagt gtgggaacgt gaactgaagc tgatgggctg      2520
     cgaacatgga ctgagcttgt ggtgtgcttt gcaggagaac ttggaagcag agttcaccag      2580
     tgagctcagg tgtctcaaag aagggtggaa gttctcatgt ctgttagcta ttcataagaa      2640
     tgctgtttgc tgcagttctg tgtcctgtgc ttggatgctt ttttataaga gttgtcattg      2700
     ttggaaattc ttaaataaaa ctgatttaaa taataaaaaa aaaaaaaaaa aaa             2753
//