Dbfetch
ID BC004850; SV 1; linear; mRNA; STD; MUS; 2350 BP. XX AC BC004850; XX DT 25-MAR-2001 (Rel. 67, Created) DT 24-SEP-2008 (Rel. 97, Last updated, Version 18) XX DE Mus musculus twisted gastrulation homolog 1 (Drosophila), mRNA (cDNA clone DE MGC:6913 IMAGE:2810960), complete cds. XX KW MGC. XX OS Mus musculus (house mouse) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; OC Murinae; Mus; Mus. XX RN [1] RP 1-2350 RX DOI; 10.1073/pnas.242603899. RX PUBMED; 12477932. RG Mammalian Gene Collection Program Team RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D., RA Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F., RA Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H., RA Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K., RA Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F., RA Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S., RA Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J., RA Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J., RA Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M., RA Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C., RA Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I., RA Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.; RT "Generation and initial analysis of more than 15,000 full-length human and RT mouse cDNA sequences"; RL Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002). XX RN [2] RC NIH-MGC Project URL: http://mgc.nci.nih.gov RP 1-2350 RG NIH MGC Project RA ; RT ; RL Submitted (21-MAR-2001) to the INSDC. RL National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, RL MD 20892-2590, USA XX DR MD5; 0ee3ef25de78414df55f7650a4af6be5. DR Ensembl-Gn; ENSMUSG00000024098; mus_musculus. DR Ensembl-Gn; MGP_129S1SvImJ_G0023982; mus_musculus_129s1svimj. DR Ensembl-Gn; MGP_AJ_G0023940; mus_musculus_aj. DR Ensembl-Gn; MGP_AKRJ_G0023909; mus_musculus_akrj. DR Ensembl-Gn; MGP_BALBcJ_G0023941; mus_musculus_balbcj. DR Ensembl-Gn; MGP_C3HHeJ_G0023705; mus_musculus_c3hhej. DR Ensembl-Gn; MGP_C57BL6NJ_G0024387; mus_musculus_c57bl6nj. DR Ensembl-Gn; MGP_CASTEiJ_G0023187; mus_musculus_casteij. DR Ensembl-Gn; MGP_CBAJ_G0023681; mus_musculus_cbaj. DR Ensembl-Gn; MGP_DBA2J_G0023809; mus_musculus_dba2j. DR Ensembl-Gn; MGP_FVBNJ_G0023775; mus_musculus_fvbnj. DR Ensembl-Gn; MGP_LPJ_G0023891; mus_musculus_lpj. DR Ensembl-Gn; MGP_NODShiLtJ_G0023803; mus_musculus_nodshiltj. DR Ensembl-Gn; MGP_NZOHlLtJ_G0024431; mus_musculus_nzohlltj. DR Ensembl-Gn; MGP_PWKPhJ_G0022933; mus_musculus_pwkphj. DR Ensembl-Gn; MGP_WSBEiJ_G0023253; mus_musculus_wsbeij. DR Ensembl-Tr; ENSMUST00000024906; mus_musculus. DR Ensembl-Tr; ENSMUST00000233580; mus_musculus. DR Ensembl-Tr; MGP_129S1SvImJ_T0048414; mus_musculus_129s1svimj. DR Ensembl-Tr; MGP_AJ_T0048397; mus_musculus_aj. DR Ensembl-Tr; MGP_AKRJ_T0048354; mus_musculus_akrj. DR Ensembl-Tr; MGP_BALBcJ_T0048357; mus_musculus_balbcj. DR Ensembl-Tr; MGP_C3HHeJ_T0048103; mus_musculus_c3hhej. DR Ensembl-Tr; MGP_C57BL6NJ_T0048847; mus_musculus_c57bl6nj. DR Ensembl-Tr; MGP_CASTEiJ_T0048167; mus_musculus_casteij. DR Ensembl-Tr; MGP_CBAJ_T0048028; mus_musculus_cbaj. DR Ensembl-Tr; MGP_DBA2J_T0048128; mus_musculus_dba2j. DR Ensembl-Tr; MGP_FVBNJ_T0048095; mus_musculus_fvbnj. DR Ensembl-Tr; MGP_LPJ_T0048239; mus_musculus_lpj. DR Ensembl-Tr; MGP_NODShiLtJ_T0048100; mus_musculus_nodshiltj. DR Ensembl-Tr; MGP_NZOHlLtJ_T0048943; mus_musculus_nzohlltj. DR Ensembl-Tr; MGP_PWKPhJ_T0047755; mus_musculus_pwkphj. DR Ensembl-Tr; MGP_WSBEiJ_T0047406; mus_musculus_wsbeij. XX CC Contact: MGC help desk CC Email: cgapbs-r@mail.nih.gov CC Tissue Procurement: Lothar Hennighausen Ph.D., Robin Humphreys CC cDNA Library Preparation: Life Technologies, Inc. CC cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) CC DNA Sequencing by: Baylor College of Medicine Human Genome CC Sequencing Center CC Center code: BCM-HGSC CC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ CC Contact: amg@bcm.tmc.edu CC Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., CC Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, CC A.N., Gibbs, R.A. CC Clone distribution: MGC clone distribution information can be found CC through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov CC Series: IRAK Plate: 5 Row: n Column: 4 CC This clone was selected for full length sequencing because it CC passed the following selection criteria: matched mRNA gi: 12746427. CC Differences found between this sequence and the mouse C57BL/6J CC genome (build 36) are described in misc_difference features below. XX FH Key Location/Qualifiers FH FT source 1..2350 FT /organism="Mus musculus" FT /lab_host="DH10B" FT /strain="mix FVB/N, C57BL/6J" FT /mol_type="mRNA" FT /clone_lib="NCI_CGAP_Mam5" FT /clone="MGC:6913 IMAGE:2810960" FT /tissue_type="Mammary tumor. WAP-TGF alpha model. 7 months FT old, gross tissue." FT /note="Vector: pCMV-SPORT6" FT /db_xref="taxon:10090" FT gene 1..2350 FT /gene="Twsg1" FT /note="synonyms: Twg, Tsg" FT misc_difference 1..5 FT /gene="Twsg1" FT /note="5 bases at the 5' end do not align to the mouse FT genome." FT misc_difference 7 FT /gene="Twsg1" FT /note="'C' in cDNA is 'T' in the mouse genome." FT misc_difference 15..16 FT /gene="Twsg1" FT /note="2 bases in cDNA are not found in the mouse genome." FT CDS 114..782 FT /codon_start=1 FT /gene="Twsg1" FT /product="twisted gastrulation homolog 1 (Drosophila)" FT /db_xref="GOA:Q9EP52" FT /db_xref="InterPro:IPR006761" FT /db_xref="MGI:MGI:2137520" FT /db_xref="UniProtKB/Swiss-Prot:Q9EP52" FT /protein_id="AAH04850.1" FT /translation="MKSHYIVLALASLTFLLCLPVSQSCNKALCASDVSKCLIQELCQC FT RPGEGNCPCCKECMLCLGALWDECCDCVGRCNPRNYSDTPPTSKSTVEELHEPIPSLFR FT ALTEGDTQLNWNIVSFPVAEELSHHENLVSFLETVNQLHHQNVSVPSNNVHAPFPSDKE FT RMCTVVYFDDCMSIHQCKISCESMGASKYRWFHNACCECIGPECIDYGSKTVKCMNCMF FT " FT misc_difference 337 FT /gene="Twsg1" FT /note="'G' in cDNA is 'T' in the mouse genome; amino acid FT difference: 'R' in cDNA, 'M' in the mouse genome." FT misc_difference 1957 FT /gene="Twsg1" FT /note="'A' in cDNA is 'G' in the mouse genome." FT misc_difference 2329..2350 FT /gene="Twsg1" FT /note="polyA tail: 22 bases do not align to the mouse FT genome." XX SQ Sequence 2350 BP; 619 A; 540 C; 568 G; 623 T; 0 other; cccacgcgtc cgcgcggcgc cggggttcgc gggagctgct tggaggctcg gcggccggga 60 ggaggccggg gccacgcttc ttggaagcta ctgagtgact tctttgaaga accatgaagt 120 cacactatat tgtgctagct ctagcctccc tgacgttcct gctgtgtctc cccgtgtccc 180 agagctgtaa caaagcactc tgtgccagcg atgtgagcaa atgcctcatt caggagctct 240 gccagtgccg gcctggagaa gggaactgcc cctgctgtaa ggagtgcatg ctgtgcctcg 300 gggccctgtg ggacgagtgc tgcgactgtg tcggtaggtg caaccctcgg aattacagcg 360 acaccccgcc cacatccaag agcaccgtgg aggagctgca cgagcccatt ccgtccctgt 420 tcagggcgct gacggagggc gacacccagc tgaactggaa catcgtctcc ttccctgtgg 480 cagaggagct gtcacaccat gaaaacctag tctccttcct agaaactgtg aaccagctgc 540 accaccaaaa cgtgtctgtt cccagcaaca atgtccacgc ccccttcccc agcgacaaag 600 agcgcatgtg cacagtggtt tactttgatg actgcatgtc catccaccag tgtaagatat 660 cctgcgaatc catgggtgca tccaagtatc gctggtttca caacgcctgc tgcgagtgca 720 tcggtccaga gtgcattgac tatgggagta aaactgtcaa gtgtatgaac tgcatgtttt 780 aaagaggggg aagaaatgca aaccaaagca gtaagtcatg aagtgtgcag aaatcttggt 840 tctggtatgc taggagtgtg ttaagttata tgattgtaac tgtgcttttt atatctggtg 900 cctattagtg taggtctttt ccattggatt caatggaact ttagtcacat gaggatcggg 960 agttcagagg agtcctggga aaacctgaca tgctgacaga aggtgccgtc ttcttccagc 1020 tttccaaaca cttctcgttt tgaacgtgat agcacaagcc tggtacatgt gtggttctca 1080 cctgccagtt gtagaacact aggtccctat agtcacacat ctcttaattg tgccttggct 1140 ggcttacctg ttttgtatga gtaaatatta cagtttataa ttctaacaac tcacattcaa 1200 gccatgctga aacttaattt caaaccactt tacattggtt ttagaaagta aatatttact 1260 atattttaca acagaagagt tttgcctagg gccagcgagc tgactcagtg gataaaggcg 1320 cttgctacca agcctgataa cctgagttcc atccccagag cccgtacagt ggaaggacag 1380 gaccagctgc tgggagttgt cctctgacct ccagacaggc acagtatcat gcgtggaggt 1440 gtgcttgtgt gtgcacacac ataactaact gtttttaaaa atataaacct cttacatggt 1500 gaaatctaaa tctgtcgtgt agctctcaca ctgacagtgg tttggatgtt atgtcccctg 1560 tccgcctgta gtgctggtgt ggtgagacac agagtcgtca ctgctctggt atagaagagt 1620 tttgtctacc aagagtgtca tggcatacct ttggaacttc atcaaatgca cttgaggatg 1680 acctgggtca ggaagtagcc aggtaaaagc agcgggactg taggcgatgc tccattagac 1740 tccgtgcaga gcagcaggtg cacagcatag ctgggtgtgc ggctgaccag gagagggtct 1800 gactccgcac cagcagaaca gcagggtctc cagcacgtgt gggaagcacg tgggagaggg 1860 ttgaggaagg atgcacagat gtggacagag aagcataaaa atgtcgggaa ctcctagtag 1920 ggtccacctt aaaatcgctt tatagtctct ggctttatta ctctgtaaga ttacacttgt 1980 ttctggatat ctgaatccaa ataagcatca tattttaaga agctctgttt ctgaacttcc 2040 agggggaaat ctgtttaatg tgtttactcc tagcatacta cagaattttc tagctctata 2100 gcttcttacc tagcgtttcc atagtgctga gcttcattac tacacgccct tcctagtaat 2160 aaaattctca ccttcaagca tgaatcaaaa acaaatatct ataatacaca ggttcaattt 2220 tatagaattg ctattttctc tagtgcatat ctcattaaaa gtaacttttt aggaataatc 2280 tttatatggg tacatatttt ggtacataaa atagaaaatg ttcttaaaaa aaaaaaaaaa 2340 aaaaaaaaaa 2350 //