Dbfetch

ID   BC012707; SV 1; linear; mRNA; STD; MUS; 941 BP.
XX
AC   BC012707;
XX
DT   21-AUG-2001 (Rel. 68, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 10)
XX
DE   Mus musculus glutathione S-transferase, theta 2, mRNA (cDNA clone MGC:13991
DE   IMAGE:3994154), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-941
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-941
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (15-AUG-2001) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; eba1c322728fe8303820aec7eb6f89f2.
DR   Ensembl-Gn; ENSMUSG00000033318; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0017367; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0017344; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0017305; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0017304; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0017128; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0017762; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0016699; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0017100; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0017205; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0017198; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0017278; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0017226; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0017801; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0016482; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0016762; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000038257; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0025792; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0025769; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0025732; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0025734; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0025544; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0026230; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0025108; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0025479; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0025593; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0025586; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0025706; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0025587; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0026276; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0024819; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0025084; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Gilbert Smith, Ph.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Institute for Systems Biology
CC   http://www.systemsbiology.org
CC   contact: amadan@systemsbiology.org
CC   Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
CC   Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 18 Row: e Column: 7
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 6754087.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..941
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="Czech II"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Lu29"
FT                   /clone="MGC:13991 IMAGE:3994154"
FT                   /tissue_type="Mammary tumor metastatized to lung. Tumor
FT                   arose spontaneously from a senescent normal mammary
FT                   (clonal) outgrowth infected with the virus MMTV."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..941
FT                   /gene="Gstt2"
FT                   /note="synonyms: mGSTT2, Yrs"
FT   misc_difference 1..7
FT                   /gene="Gstt2"
FT                   /note="7 bases at the 5' end do not align to the mouse
FT                   genome."
FT   misc_difference 11
FT                   /gene="Gstt2"
FT                   /note="'G' in cDNA is 'T' in the mouse genome."
FT   misc_difference 12^13
FT                   /gene="Gstt2"
FT                   /note="1 base in the mouse genome, C, is not found in
FT                   cDNA."
FT   misc_difference 46
FT                   /gene="Gstt2"
FT                   /note="'T' in cDNA is 'A' in the mouse genome."
FT   misc_difference 150
FT                   /gene="Gstt2"
FT                   /note="'C' in cDNA is 'G' in the mouse genome."
FT   CDS             154..888
FT                   /codon_start=1
FT                   /gene="Gstt2"
FT                   /product="glutathione S-transferase, theta 2"
FT                   /db_xref="GOA:Q61133"
FT                   /db_xref="InterPro:IPR004045"
FT                   /db_xref="InterPro:IPR004046"
FT                   /db_xref="InterPro:IPR010987"
FT                   /db_xref="InterPro:IPR012336"
FT                   /db_xref="MGI:MGI:106188"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q61133"
FT                   /protein_id="AAH12707.1"
FT                   /translation="MGLELYLDLLSQPSRAVYIFAKKNGIPFQTRTVDILKGQHMSEQF
FT                   SQVNCLNKVPVLKDGSFVLTESTAILIYLSSKYQVADHWYPADLQARAQVHEYLGWHAD
FT                   NIRGTFGVLLWTKVLGPLIGVQVPQEKVERNRDRMVLVLQQLEDKFLRDRAFLVGQQVT
FT                   LADLMSLEELMQPVALGYNLFEGRPQLTAWRERVEAFLGAELCQEAHSTILSILGQAAK
FT                   KMLPVPPPEVHASMQLRIARIP"
FT   misc_difference 924..941
FT                   /gene="Gstt2"
FT                   /note="polyA tail: 18 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 941 BP; 225 A; 255 C; 260 G; 201 T; 0 other;
     ccacgcgtcc gcaaagcgtc tccctcccca agatcagcag gtgtctgcta tccagaggag        60
     gaaatcgttt ggcttggcca actgaggctg tgctggaccc cagcttgctg ttatcgaacg       120
     cagtcggcac accatcttgt gtcgctaccc gcaatgggct tggagctcta cctggacctg       180
     ctgtcacaac ccagccgcgc tgtctacatc ttcgccaaga agaatggcat ccccttccag       240
     acgcgtaccg tggatatact caaagggcag cacatgagcg agcaattctc ccaggtgaac       300
     tgcttaaaca aagttcctgt actcaaagac ggaagcttcg tgttgaccga aagcacagcc       360
     atcttgattt acctgagttc caagtaccag gtggcagacc actggtaccc ggccgaccta       420
     caggcccgtg cccaagtcca cgaatacctg ggctggcatg ccgacaacat ccgtggtact       480
     ttcggagtgc tcctatggac caaggtgttg gggccactca ttggggtcca ggttccccag       540
     gagaaggtgg aacggaacag agatagaatg gtcctggttc tgcaacagct ggaggacaag       600
     ttcctcaggg acagggcctt ccttgttggc cagcaggtga cgctagcgga tctcatgtct       660
     ctggaggagt tgatgcagcc cgtggctctt ggctataacc tgtttgaggg acggcctcag       720
     ctgacagcat ggcgagagag ggtggaggcg ttcttgggtg ctgagctgtg tcaggaggct       780
     catagcacca tcctgagcat cctgggacag gcagccaaga aaatgttacc agtaccccct       840
     ccggaggtcc atgccagcat gcagcttcga attgctagga ttccttgagt ggtctgacca       900
     gcaataaaga ctcattttgt gttaaaaaaa aaaaaaaaaa a                           941
//