Dbfetch

ID   BC004850; SV 1; linear; mRNA; STD; MUS; 2350 BP.
XX
AC   BC004850;
XX
DT   25-MAR-2001 (Rel. 67, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 18)
XX
DE   Mus musculus twisted gastrulation homolog 1 (Drosophila), mRNA (cDNA clone
DE   MGC:6913 IMAGE:2810960), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2350
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2350
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (21-MAR-2001) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 0ee3ef25de78414df55f7650a4af6be5.
DR   Ensembl-Gn; ENSMUSG00000024098; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0023982; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0023940; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0023909; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0023941; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0023705; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0024387; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0023187; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0023681; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0023809; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0023775; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0023891; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0023803; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0024431; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0022933; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0023253; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000024906; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0048414; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0048397; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0048354; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0048357; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0048103; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0048847; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0048167; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0048028; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0048128; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0048095; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0048239; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0048100; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0048943; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0047755; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0047406; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Lothar Hennighausen Ph.D., Robin Humphreys
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Baylor College of Medicine Human Genome
CC   Sequencing Center
CC   Center code: BCM-HGSC
CC   Web site: http://www.hgsc.bcm.tmc.edu/cdna/
CC   Contact: amg@bcm.tmc.edu
CC   Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
CC   Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
CC   A.N., Gibbs, R.A.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 5 Row: n Column: 4
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 12746427.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2350
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="mix FVB/N, C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Mam5"
FT                   /clone="MGC:6913 IMAGE:2810960"
FT                   /tissue_type="Mammary tumor. WAP-TGF alpha model. 7 months
FT                   old, gross tissue."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..2350
FT                   /gene="Twsg1"
FT                   /note="synonyms: Twg, Tsg"
FT   misc_difference 1..5
FT                   /gene="Twsg1"
FT                   /note="5 bases at the 5' end do not align to the mouse
FT                   genome."
FT   misc_difference 7
FT                   /gene="Twsg1"
FT                   /note="'C' in cDNA is 'T' in the mouse genome."
FT   misc_difference 15..16
FT                   /gene="Twsg1"
FT                   /note="2 bases in cDNA are not found in the mouse genome."
FT   CDS             114..782
FT                   /codon_start=1
FT                   /gene="Twsg1"
FT                   /product="twisted gastrulation homolog 1 (Drosophila)"
FT                   /db_xref="GOA:Q9EP52"
FT                   /db_xref="InterPro:IPR006761"
FT                   /db_xref="MGI:MGI:2137520"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9EP52"
FT                   /protein_id="AAH04850.1"
FT                   /translation="MKSHYIVLALASLTFLLCLPVSQSCNKALCASDVSKCLIQELCQC
FT                   RPGEGNCPCCKECMLCLGALWDECCDCVGRCNPRNYSDTPPTSKSTVEELHEPIPSLFR
FT                   ALTEGDTQLNWNIVSFPVAEELSHHENLVSFLETVNQLHHQNVSVPSNNVHAPFPSDKE
FT                   RMCTVVYFDDCMSIHQCKISCESMGASKYRWFHNACCECIGPECIDYGSKTVKCMNCMF
FT                   "
FT   misc_difference 337
FT                   /gene="Twsg1"
FT                   /note="'G' in cDNA is 'T' in the mouse genome; amino acid
FT                   difference: 'R' in cDNA, 'M' in the mouse genome."
FT   misc_difference 1957
FT                   /gene="Twsg1"
FT                   /note="'A' in cDNA is 'G' in the mouse genome."
FT   misc_difference 2329..2350
FT                   /gene="Twsg1"
FT                   /note="polyA tail: 22 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 2350 BP; 619 A; 540 C; 568 G; 623 T; 0 other;
     cccacgcgtc cgcgcggcgc cggggttcgc gggagctgct tggaggctcg gcggccggga        60
     ggaggccggg gccacgcttc ttggaagcta ctgagtgact tctttgaaga accatgaagt       120
     cacactatat tgtgctagct ctagcctccc tgacgttcct gctgtgtctc cccgtgtccc       180
     agagctgtaa caaagcactc tgtgccagcg atgtgagcaa atgcctcatt caggagctct       240
     gccagtgccg gcctggagaa gggaactgcc cctgctgtaa ggagtgcatg ctgtgcctcg       300
     gggccctgtg ggacgagtgc tgcgactgtg tcggtaggtg caaccctcgg aattacagcg       360
     acaccccgcc cacatccaag agcaccgtgg aggagctgca cgagcccatt ccgtccctgt       420
     tcagggcgct gacggagggc gacacccagc tgaactggaa catcgtctcc ttccctgtgg       480
     cagaggagct gtcacaccat gaaaacctag tctccttcct agaaactgtg aaccagctgc       540
     accaccaaaa cgtgtctgtt cccagcaaca atgtccacgc ccccttcccc agcgacaaag       600
     agcgcatgtg cacagtggtt tactttgatg actgcatgtc catccaccag tgtaagatat       660
     cctgcgaatc catgggtgca tccaagtatc gctggtttca caacgcctgc tgcgagtgca       720
     tcggtccaga gtgcattgac tatgggagta aaactgtcaa gtgtatgaac tgcatgtttt       780
     aaagaggggg aagaaatgca aaccaaagca gtaagtcatg aagtgtgcag aaatcttggt       840
     tctggtatgc taggagtgtg ttaagttata tgattgtaac tgtgcttttt atatctggtg       900
     cctattagtg taggtctttt ccattggatt caatggaact ttagtcacat gaggatcggg       960
     agttcagagg agtcctggga aaacctgaca tgctgacaga aggtgccgtc ttcttccagc      1020
     tttccaaaca cttctcgttt tgaacgtgat agcacaagcc tggtacatgt gtggttctca      1080
     cctgccagtt gtagaacact aggtccctat agtcacacat ctcttaattg tgccttggct      1140
     ggcttacctg ttttgtatga gtaaatatta cagtttataa ttctaacaac tcacattcaa      1200
     gccatgctga aacttaattt caaaccactt tacattggtt ttagaaagta aatatttact      1260
     atattttaca acagaagagt tttgcctagg gccagcgagc tgactcagtg gataaaggcg      1320
     cttgctacca agcctgataa cctgagttcc atccccagag cccgtacagt ggaaggacag      1380
     gaccagctgc tgggagttgt cctctgacct ccagacaggc acagtatcat gcgtggaggt      1440
     gtgcttgtgt gtgcacacac ataactaact gtttttaaaa atataaacct cttacatggt      1500
     gaaatctaaa tctgtcgtgt agctctcaca ctgacagtgg tttggatgtt atgtcccctg      1560
     tccgcctgta gtgctggtgt ggtgagacac agagtcgtca ctgctctggt atagaagagt      1620
     tttgtctacc aagagtgtca tggcatacct ttggaacttc atcaaatgca cttgaggatg      1680
     acctgggtca ggaagtagcc aggtaaaagc agcgggactg taggcgatgc tccattagac      1740
     tccgtgcaga gcagcaggtg cacagcatag ctgggtgtgc ggctgaccag gagagggtct      1800
     gactccgcac cagcagaaca gcagggtctc cagcacgtgt gggaagcacg tgggagaggg      1860
     ttgaggaagg atgcacagat gtggacagag aagcataaaa atgtcgggaa ctcctagtag      1920
     ggtccacctt aaaatcgctt tatagtctct ggctttatta ctctgtaaga ttacacttgt      1980
     ttctggatat ctgaatccaa ataagcatca tattttaaga agctctgttt ctgaacttcc      2040
     agggggaaat ctgtttaatg tgtttactcc tagcatacta cagaattttc tagctctata      2100
     gcttcttacc tagcgtttcc atagtgctga gcttcattac tacacgccct tcctagtaat      2160
     aaaattctca ccttcaagca tgaatcaaaa acaaatatct ataatacaca ggttcaattt      2220
     tatagaattg ctattttctc tagtgcatat ctcattaaaa gtaacttttt aggaataatc      2280
     tttatatggg tacatatttt ggtacataaa atagaaaatg ttcttaaaaa aaaaaaaaaa      2340
     aaaaaaaaaa                                                             2350
//