Dbfetch

ID   BC033475; SV 1; linear; mRNA; STD; MUS; 2746 BP.
XX
AC   BC033475;
XX
DT   24-SEP-2002 (Rel. 73, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 10)
XX
DE   Mus musculus TAR DNA binding protein, mRNA (cDNA clone MGC:36341
DE   IMAGE:4953334), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2746
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2746
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (26-JUN-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; e32321773266808ede0c362b076190ec.
DR   Ensembl-Gn; ENSMUSG00000041459; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000084125; mus_musculus.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Jeffrey Green M.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Baylor College of Medicine Human Genome
CC   Sequencing Center
CC   Center code: BCM-HGSC
CC   Web site: http://www.hgsc.bcm.tmc.edu/cdna/
CC   Contact: amg@bcm.tmc.edu
CC   Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
CC   Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
CC   A.N., Gibbs, R.A.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 60 Row: n Column: 5
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 31543841.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2746
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="FVB/N"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Mam6"
FT                   /clone="MGC:36341 IMAGE:4953334"
FT                   /tissue_type="Mammary tumor. C3(1)-Tag model. Infiltrating
FT                   ductal carcinoma. 5 month old virgin mouse."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..2746
FT                   /gene="Tardbp"
FT   CDS             89..1333
FT                   /codon_start=1
FT                   /gene="Tardbp"
FT                   /product="TAR DNA binding protein"
FT                   /db_xref="GOA:Q921F2"
FT                   /db_xref="InterPro:IPR000504"
FT                   /db_xref="InterPro:IPR012677"
FT                   /db_xref="MGI:MGI:2387629"
FT                   /db_xref="PDB:3D2W"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q921F2"
FT                   /protein_id="AAH33475.1"
FT                   /translation="MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRYRN
FT                   PVSQCMRGVRLVEGILHAPDAGWGNLVYVVNYPKDNKRKMDETDASSAVKVKRAVQKTS
FT                   DLIVLGLPWKTTEQDLKDYFSTFGEVLMVQVKKDLKTGHSKGFGFVRFTEYETQVKVMS
FT                   QRHMIDGRWCDCKLPNSKQSPDEPLRSRKVFVGRCTEDMTAEELQQFFCQYGEVVDVFI
FT                   PKPFRAFAFVTFADDKVAQSLCGEDLIIKGISVHISNAEPKHNSNRQLERSGRFGGNPG
FT                   GFGNQGGFGNSRGGGAGLGNNQGGNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGML
FT                   ASQQNQSGPSGNNQSQGSMQREPNQAFGSGNNSYSGSNSGAPLGWGSASNAGSGSGFNG
FT                   GFGSSMDSKSSGWGM"
FT   misc_difference 2730..2746
FT                   /gene="Tardbp"
FT                   /note="polyA tail: 17 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 2746 BP; 747 A; 465 C; 710 G; 824 T; 0 other;
     gcggtagcgc ggctgttgtc ggattccttc ccgtctgtgc ttcctccttg tgcttcctag        60
     cagtggccta gcggagattt aagcaaagat gtctgaatat attcgggtaa cagaagatga       120
     gaacgatgaa cccattgaaa taccatcaga agacgatggg acggtgttgc tgtccacagt       180
     tacagcccag tttccagggg catgcggcct gcgctaccgg aatcccgtgt ctcagtgtat       240
     gagaggagtc cgactggtgg aaggaattct gcatgcccca gatgctggct ggggcaatct       300
     ggtatatgtt gtcaactatc ccaaagataa caaaaggaaa atggatgaga cagatgcttc       360
     ctctgcagtg aaagtgaaaa gagcagtcca gaaaacatct gacctcatag tgttgggtct       420
     cccctggaaa acaactgagc aggatctgaa agactatttc agtacttttg gagaggttct       480
     tatggttcag gtcaagaaag atcttaaaac tggtcactcg aaagggtttg gctttgttcg       540
     atttacagaa tatgaaaccc aagtgaaagt aatgtcacaa cgacatatga tagatgggcg       600
     atggtgtgac tgtaaacttc ccaactctaa gcaaagccca gacgagcctt tgagaagcag       660
     aaaggtgttt gttggacgtt gtacagagga catgactgct gaagagcttc agcagttttt       720
     ctgtcagtat ggagaagtgg tagatgtctt cattcccaaa ccattcagag cttttgcctt       780
     cgtcaccttt gcagatgata aggttgccca gtctctttgt ggagaggatt tgatcattaa       840
     aggaatcagc gtgcatatat ccaatgctga acctaagcat aatagcaata gacagttaga       900
     aagaagtgga agatttggtg gtaatccagg tggctttggg aatcagggtg ggtttggtaa       960
     cagtagaggg ggtggagctg gcttgggaaa taaccagggt ggtaatatgg gtggagggat      1020
     gaactttggt gcttttagca ttaacccagc gatgatggct gcggctcagg cagcgttgca      1080
     gagcagttgg ggtatgatgg gcatgttagc cagccagcag aaccagtcgg gcccatctgg      1140
     gaataaccaa agccagggca gcatgcagag ggaaccaaat caggcttttg gttctggaaa      1200
     taattcctac agtggttcta attctggtgc cccccttggt tgggggtcag catcaaatgc      1260
     aggatcgggc agtggtttta atgggggctt tggctcgagc atggattcta agtcttctgg      1320
     ctggggaatg taggtggtgg ggggtggtta gtaggttggt tattaggtta ggtagattta      1380
     gaatggtggg attcaaattt ttctaaactc atggtaagta tattgtaaaa tacatatgta      1440
     ctaaaatttt cagattggtt tgttcagtgt ggagtatatt cagcagtatt tttgacattt      1500
     ttctttagaa aaaaagaggg gaaagctaaa tgaattttat aagttttgtt atataaaggg      1560
     ttaaaatact gagtgggtga aagtgaactg ctgtttgcct aattggtaaa ccaacactac      1620
     aattgatctc agaaggtttc tctgtaatat tctatcattg aaattgttaa tgaattcttt      1680
     gcatgttcag agtagaaacc attggttaga actacattct tttctcctta ttttaatttg      1740
     aatcccaccc tatgaatttt ttccttagga aaatctccat ttgggagatc atgatgtcat      1800
     ggtgtttgat tcttttggtt ttgtttttaa cacttgtctt ccttcatata cgaaagtaca      1860
     atatgaagcc ttcatttaat ctctgcagtt catctcattt caaatgttta tggaagaagc      1920
     acttcattga aagtagtgct gtaaatattc tgccatagga atacttctgt ctacatgctt      1980
     tctcatccaa gaattcgtca tcacgctgca caggctgcgt ctttgacggt gggtgttcca      2040
     tttttatccg ctactcttta tttcatggag tcgtatcaac gctatgaacg caaggctgtg      2100
     atatggaacc agaaggctgt ttgaactttt gaaaccttgt gtgggattga tggtggtgcc      2160
     gaggcatgaa aggctagtat gagcgagaaa aggagagcgc gtgcagagac ttggtggtgg      2220
     aaaatggata ttttttaact tggagagatg tgtcactcaa tcctgtggct ttggtgagag      2280
     agtgtgcaga gagcaatgat agcaaataac gtacgaatgt tttacatcaa aggacatcca      2340
     catcagttgg aagactttga gttttgttct taggaaaccc actttagttg aatgtgttaa      2400
     gtgaaatact tgtacttccc tccccctctg tcaactgctg tgaatgctgt atggtgtgtg      2460
     ttctcctctg ttactgatct ggaagtgtgg gaacgtgaac tgaagctgat gggctgcgaa      2520
     catggactga gcttgtggtg tgctttgcag gagaacttgg aagcagagtt caccagtgag      2580
     ctcaggtgtc tcaaagaagg gtggaagttc tcatgtctgt tagctattca taagaatgct      2640
     gtttgctgca gttctgtgtc ctgtgcttgg atgctttttt ataagagttg tcattgttgg      2700
     aaattcttaa ataaaactga tttaaataaa aaaaaaaaaa aaaaaa                     2746
//