ID   AK033504; SV 1; linear; mRNA; HTC; MUS; 4003 BP.
XX
AC   AK033504;
XX
DT   18-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 14)
XX
DE   Mus musculus adult male colon cDNA, RIKEN full-length enriched library,
DE   clone:9030407H12 product:twisted gastrulation protein, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-4003
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-JUL-2001) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 2f54ab3e37115207b88b95c5c94d38a2.
DR   Ensembl-Gn; ENSMUSG00000024098; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000024906; mus_musculus.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group in the RIKEN
CC   Genomic Sciences Center and the Genome Science Laboratory in RIKEN.
CC   The Division of Experimental Animal Research in RIKEN contributed by
CC   preparing mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=9030407H12
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4003
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="9030407H12"
FT                   /tissue_type="colon"
FT                   /db_xref="taxon:10090"
FT   CDS             118..786
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="twisted gastrulation protein (MGD|MGI:2137520
FT                   GB|NM_023053, evidence: BLASTN, 99%, match=3987)"
FT                   /db_xref="GOA:Q9EP52"
FT                   /db_xref="InterPro:IPR006761"
FT                   /db_xref="MGI:MGI:2137520"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9EP52"
FT                   /protein_id="BAC28326.1"
FT                   /translation="MKSHYIVLALASLTFLLCLPVSQSCNKALCASDVSKCLIQELCQC
FT                   RPGEGNCPCCKECMLCLGALWDECCDCVGMCNPRNYSDTPPTSKSTVEELHEPIPSLFR
FT                   ALTEGDTQLNWNIVSFPVAEELSHHENLVSFLETVNQLHHQNVSVPSNNVHAPFPSDKE
FT                   RMCTEVYFDDCMSIHQCKISCESMGASKYRWFHNACCECIGPECIDYGSKTVKCMNCMF
FT                   "
FT   regulatory      3987..3992
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      4003
FT                   /note="putative"
XX
SQ   Sequence 4003 BP; 1060 A; 880 C; 910 G; 1153 T; 0 other;
     ggcgggcctg tgtgtccgcg gcgccggggt tcgcgggagc tgcttggagg ctcggcggcc        60
     gggaggaggc cggggccacg cttcttggaa gctactgagt gacttctttg aagaaccatg       120
     aagtcacact atattgtgct agctctagcc tccctgacgt tcctgctgtg tctccccgtg       180
     tcccagagct gtaacaaagc actctgtgcc agcgatgtga gcaaatgcct cattcaggag       240
     ctctgccagt gccggcctgg agaagggaac tgcccctgct gtaaggagtg catgctgtgc       300
     ctcggggccc tgtgggacga gtgctgcgac tgtgtcggta tgtgcaaccc tcggaattac       360
     agcgacaccc cgcccacatc caagagcacc gtggaggagc tgcacgagcc cattccgtcc       420
     ctgttcaggg cgctgacgga gggcgacacc cagctgaact ggaacatcgt ctccttccct       480
     gtggcagagg agctgtcaca ccatgaaaac ctagtctcct tcctagaaac tgtgaaccag       540
     ctgcaccacc aaaacgtgtc tgttcccagc aacaatgtcc acgccccctt ccccagcgac       600
     aaagagcgca tgtgcacaga ggtttacttt gatgactgca tgtccatcca ccagtgtaag       660
     atatcctgcg aatccatggg tgcatccaag tatcgctggt ttcacaacgc ctgctgcgag       720
     tgcatcggtc cagagtgcat tgactatggg agtaaaactg tcaagtgtat gaactgcatg       780
     ttttaaagag ggggaagaaa tgcaaaccaa agcagtaagt catgaagtgt gcagaaatct       840
     tggttctggt atgctaggag tgtgttaagt tatatgattg taactgtgct ttttatatct       900
     ggtgcctatt agtgtaggtc ctttccattg gattcaatgg aactttactc ccatgaggat       960
     cgggagttca gaggagtcct gggaaaacct gacatgctga cagaaggtgc cgtcttcttc      1020
     cagctttcca aacacttctc gttttgaacg tgatagcaca agcctggtac atgtgtggtt      1080
     ctcacctgcc acttgtagaa cactaggtcc ctactagtca cacatctctt aattgtgcct      1140
     tggctggctt acctgttttg tatgagtaaa tattacagtt tataattcta acaactcaca      1200
     ttcaagccat gctgaaactt aatttcaaac cactttacat tggttttaga aagtaaatat      1260
     ttactatatt ttacaacaga agagttttgc ctagggccag cgagctgact cagtggataa      1320
     aggcgcttgc taccaagcct gataacctga gttccatccc cagagcccgt acagtggaag      1380
     gacaggacca cctgctggga gttgtcctct gacctccaga caggcacagt atcatgcgtg      1440
     gaggtgtgct tgtgtgtgca cacacataac taactgtttt taaaaatata aacctcttac      1500
     atggtgaaat ctaaatctgt cgtgtagctc tcacactgac agtggtttgg atgttatgtc      1560
     ccctgtccgc ctgtagtgct ggtgtggtga gacacagagt cgtcactgct ctggtataga      1620
     agagttttgt ctaccaagag tgtcatggca tacctttgga acttcatcaa atgcacttga      1680
     ggatgacctg ggtcaggaag tagccaggta aaagcagcgg gactgtaggc gatgctccat      1740
     tacactccgt gcagagcagg aggtgcacag catagctggg tgtgcggctg accaggagag      1800
     ggtctgactc cgcaccagca gaacagcagg gtctccagca cgtgtgggaa gcacgtggga      1860
     gagggttgag gaaggatgca cagatgtgga cagagaagca taaaaatgtc gggaactcct      1920
     agtagggtcc accttaaaat cgctttatag tctctggctt tgttactctg taagattaca      1980
     cttgtttctg gatatctgaa tccaaataag catcatattt taagaagctc tgtttctgaa      2040
     cttccagggg gaaatctgtt taatgtgttt actcctagca tactacagaa ttttctagct      2100
     ctatagcttc ttacctagcg tttccatagt gctgagcttc attactacac gcccttccta      2160
     gtaataaaat tctcaccttc aagcatgaat caaaaacaaa tatctataat acacaggttc      2220
     aattttatag aattgctatt ttctctagtg catatctcat taaaagtaac tttttaggaa      2280
     taatctttat atgggtacat attttggtac ataaaataga aaatgttgtt aaactcattt      2340
     tgtattattt gaatagttac aagatgattt gtggtatcat gggtacccat tataaaccat      2400
     gctcttccca gtagctgacg aactcaaggt atcacagcct tctaagaagc cgacttagaa      2460
     catggctgta catgaatatt atacattaag gtgtcctctc acttctaccc agagtgcctc      2520
     tgttcaaagg tgccttggaa acatttcagc cccttccttc ttagctccca cagggctgtg      2580
     ggtgttcttg aaatcaggag gcgttttgaa ggaccacagc tgctccattt cagccggtga      2640
     ttcttaggaa agttcatgcc tctgacagaa gtgtgctttg atggcttcta gcggtgcatc      2700
     tcgtctcgtt ttctttgttt gtttttggtt gttgctatca tgggtttggt ttggttttga      2760
     gacaggatct ctgtgcagcc ctggctggcc tggaatgtac tatgtagacc aggctggctc      2820
     tcctcatgtt ttcttagtga tggccataaa cattgttaaa atacatcacc atcttttaaa      2880
     aacttttcat tattaaaatt taaaatatag catgtcattt ttttacccca tacatttgct      2940
     atgaaaaatt ttttaaacca cctgctttaa cttttttatt gccctgtttt tcctattaga      3000
     attgatcccc actgaggtaa attttataat catgttttgt gtatttttcc tggctcgcca      3060
     aggcttatga agaaatagca gccattccct gacaggtttg cgctcccacc acagagaggc      3120
     tgagcaagat gatcagagga tcaaggccag ccagagcaag gcactgccca gaaagcacaa      3180
     gtcctgtgct cagcgttttg cgtagcgttt tattcctaat tgaaatgtaa tatttcagaa      3240
     gctagcagcc tcgctcagtc tagaccttcc acaccaatct agcagcgatt ctcccgtact      3300
     aaagcctttg taagagttta cggttcttcc tcagtgaaaa atgatcttgt ttttcttaca      3360
     gccggatcca aagacgctag atgttaaggg ctgaggctga agcccggtga cggggcgctc      3420
     acctgtcatg gtgcagccct cgttccaccg tgagcaccag caagagacaa acacaagctt      3480
     gtgagtcaga ggccgttatt aaattcatac gcacatactc cctatagcga gacatgggct      3540
     tatgggcagg cttttttttc ataacattta tgagaaaaca atgttttccc cataacattt      3600
     aattaggact gtagcttatt ggtaattaag gtacaaaatc aaagtcgagt agaatgtact      3660
     gttcacacag cgtgttgtga aaggggtcct cacaccaaag tttaactgta aagtttagaa      3720
     aaataacatt gtcattagca tatttgaaca catatttgga atttctaaaa agcatcaaaa      3780
     tagaaaaaga aagtgaaact ctggagaatg agatgctgaa gatgggctat gatttaaagg      3840
     tctgttctgt agttagaaag caccttttaa agactttgtt cattcccaag agtctatgtt      3900
     gattgcattt aacatgaccg acaacttata tatgtaattg tgtacatttt cattggttgt      3960
     ctctgtagtc caaaagaagg tattttaata aaaaatagaa atg                        4003
//