Dbfetch

ID   AK029966; SV 1; linear; mRNA; HTC; MUS; 3707 BP.
XX
AC   AK029966;
XX
DT   18-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 15)
XX
DE   Mus musculus adult male testis cDNA, RIKEN full-length enriched library,
DE   clone:4932411G09 product:large tumor suppressor 2, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3707
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-JUL-2001) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 7390aa3274a1fdb665a32fd477a1c35e.
DR   Ensembl-Gn; ENSMUSG00000021959; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0021410; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0021371; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0021346; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0021374; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0021153; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0021813; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0020671; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0021120; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0021241; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0021222; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0021316; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0021250; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0021837; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0020414; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0020723; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000022531; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0039701; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0039672; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0039625; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0039672; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0039409; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0040146; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0039409; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0039328; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0039473; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0039443; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0039561; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0039431; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0040180; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0039022; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0038804; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=4932411G09
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3707
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="4932411G09"
FT                   /tissue_type="testis"
FT                   /db_xref="taxon:10090"
FT   CDS             397..3540
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="large tumor suppressor 2 (MGD|MGI:1354386
FT                   GB|NM_015771, evidence: BLASTN, 99%, match=3444)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q7TSJ6"
FT                   /db_xref="InterPro:IPR000719"
FT                   /db_xref="InterPro:IPR000961"
FT                   /db_xref="InterPro:IPR008271"
FT                   /db_xref="InterPro:IPR009060"
FT                   /db_xref="InterPro:IPR011009"
FT                   /db_xref="InterPro:IPR015940"
FT                   /db_xref="InterPro:IPR017441"
FT                   /db_xref="InterPro:IPR017892"
FT                   /db_xref="InterPro:IPR028742"
FT                   /db_xref="MGI:MGI:1354386"
FT                   /db_xref="PDB:2COS"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q7TSJ6"
FT                   /protein_id="BAC26704.1"
FT                   /translation="MRPKTFPATTYSGNSRQRLQEIREGLKQPSKASTQGLLVGPNSDT
FT                   SLDAKVLGSKDASRQQQMRATPKFGPYQKALREIRYSLLPFANESGTSAAAEVNRQMLQ
FT                   ELVNAGCDQEMAGRALKQTGSRSIEAALEYISKMGYLDPRNEQIVRVIKQTSPGKGLAP
FT                   TPVTRRPSFEGTGEALPSYHQLGGANYEGPAALEEMPRQYLDFLFPGAGAGTHGAQAHQ
FT                   HPPKGYSTAVEPSAHFPGTHYGRGHLLSEQPGYGVQRSSSFQNKTPPDAYSSMAKARGG
FT                   PPASLTFPAHAGLYTASHHKPAATPPGAHPLHVLGTRGPTFTGESSAQAVLAPSRNSLN
FT                   ADLYELGSTVPWSAAPLARRDSLQKQGLEASRPHVAFRAGPSRTNSFNNPQPEPSLPAP
FT                   NTVTAVTAAHILHPVKSVRVLRPEPQTAVGPSHPAWVAAPTAPATESLETKEGSAGPHP
FT                   LDVDYGGSERRCPPPPYPKHLLLPSKSEQYSVDLDSLCTSVQQSLRGGTEQDRSDKSHK
FT                   GAKGDKAGRDKKQIQTSPVPVRKNSRDEEKRKSRIKSYSPYAFKFFMEQHVENVIKTYQ
FT                   QKVSRRLQLEQEMAKAGLCEAEQEQMRKILYQKESNYNRLKRAKMDKSMFVKIKTLGIG
FT                   AFGEVCLACKLDTHALYAMKTLRKKDVLNRNQVAHVKAERDILAEADNEWVVKLYYSFQ
FT                   DKDSLYFVMDYIPGGDMMSLLIRMEVFPEHLARFYIAELTLAIESVHKMGFIHRDIKPD
FT                   NILIDLDGHIKLTDFGLCTGFRWTHNSKYYQKGNHMRQDSMEPGDLWDDVSNCRCGDRL
FT                   KTLEQRAQKQHQRCLAHSLVGTPNYIAPEVLLRKGYTQLCDWWSVGVILFEMLVGQPPF
FT                   LAPTPTETQLKVINWESTLHIPTQVRLSAEARDLITKLCCAADCRLGRDGADDLKAHPF
FT                   FNTIDFSRDIRKQPAPYVPTISHPMDTSNFDPVDEESPWQRGPAERAPRPGTRWPPPAA
FT                   SIQSTPSMSSPSAGSSMTTAIPSGARSPQSPQRVQSPGDADLEGAAEGLPAGVRVSLS"
FT   regulatory      3683..3688
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      3707
FT                   /note="putative"
XX
SQ   Sequence 3707 BP; 890 A; 1082 C; 1053 G; 682 T; 0 other;
     ggcgcttcgg aaccgcggcg tgagcgcccc gggaagatgg agccgccgcc gccagcgccg        60
     ccgccgcttc cccgggctcc cccgaccctg ccggggtcag caaccactgc cgccaccggc       120
     gcccggtctc ccggcgcgcg aggtcccgga gccgcgggcc aggacgccgc cgagggtgta       180
     gagcgctccc ggagagagtg atggtcttca aaatgaaaac tctggaaaat tttaggttct       240
     ctttaggaac tacaaaaatt gaaggacagc aattttttga aaggaagttg ttctgaaagc       300
     atgtttacaa ttcactgaca ctgttgactg ttctctttaa aataataaga cgctttgaga       360
     agattgtatt tatggtaaaa ggaaactgga ctaacaatga ggccaaagac ttttcctgcc       420
     acaacttact ctggaaatag ccggcagcga ttgcaagaga ttcgagaggg gctgaagcag       480
     ccatccaagg cttccaccca ggggctgctg gtgggaccaa acagtgacac ttccctggat       540
     gccaaagtcc tggggagcaa agatgcctcc aggcagcagc aaatgagagc caccccgaag       600
     tttggacctt atcaaaaagc tctcagggaa atccgatatt ccctcctgcc ttttgccaac       660
     gagtcaggca cttcggcagc tgcagaggtg aaccggcaga tgcttcagga gttggtgaat       720
     gcgggatgtg accaggagat ggctggcaga gcgctcaagc agacgggcag taggagtatc       780
     gaagctgcct tggagtacat cagtaagatg ggctacctgg accccaggaa tgagcagatt       840
     gtgcgagtca tcaagcagac ctccccagga aagggcctgg cgcccacccc ggtgactcgg       900
     cggcccagtt tcgagggcac aggggaagca ctcccatcct accaccagct gggtggtgca       960
     aactacgagg gccccgccgc actggaggag atgccgcggc aatatttaga ctttctcttc      1020
     cctggagccg gagccggcac ccacggtgcc caggctcacc agcatcctcc caaagggtac      1080
     agcacagcag tagagccaag tgcgcacttt ccgggcacac actatggtcg tggtcatcta      1140
     ctatcggagc agcctgggta tggggtgcag cgcagttcct ccttccagaa caagacgcca      1200
     ccagatgcct attccagcat ggccaaggcc cggggtggcc ctcccgccag cctcaccttt      1260
     cctgcccatg ctgggctgta cactgcctcg caccacaagc cggcggctac cccacctggg      1320
     gcccacccat tacatgtgtt gggcacccgg ggtcccacgt ttactggcga aagctctgca      1380
     caggctgtgc tggcaccgtc caggaacagc ctcaatgctg acttgtacga gctgggctcc      1440
     acggtgccct ggtctgcagc tccactggca cgccgcgact cgctgcagaa gcagggtcta      1500
     gaagcctcgc ggccgcatgt ggcttttcgg gctggcccca gcaggaccaa ctccttcaac      1560
     aacccacaac ctgagccctc actgcccgcc cccaacacgg tcaccgccgt gacggccgca      1620
     cacatccttc accctgtgaa gagcgtgcgt gtgctgcggc ccgagcccca gacagccgtg      1680
     gggccctcgc accccgcctg ggtggctgcg cccacagcac ctgccactga gagcctggag      1740
     acgaaggagg gcagcgcagg cccacacccg ctggatgtgg actatggcgg ctccgagcgc      1800
     aggtgcccac cgcctccgta cccaaagcac ttgctgctgc ccagtaagtc tgagcagtac      1860
     agcgtggacc tggacagcct gtgcaccagt gtgcagcaga gtctgcgagg gggcactgag      1920
     caagacagga gtgacaagag ccacaaaggt gcgaagggag acaaagctgg cagagacaaa      1980
     aagcagattc agacctcccc ggtgcctgtc cgcaagaata gcagagatga agagaagaga      2040
     aagtctcgca tcaagagtta ctccccttat gccttcaaat tcttcatgga gcaacacgtg      2100
     gagaatgtca tcaaaaccta ccagcagaag gtcagccgga ggctacagct ggagcaggaa      2160
     atggccaaag ctgggctctg tgaggccgag caggagcaga tgaggaagat cctctaccag      2220
     aaggagtcta actacaaccg gctgaagagg gccaagatgg acaagtccat gtttgtgaaa      2280
     atcaagactc taggcatcgg tgcctttggg gaagtgtgcc tcgcttgtaa gctggacact      2340
     cacgctctgt acgccatgaa gactctcagg aagaaggatg tcctgaaccg gaatcaagtg      2400
     gcccatgtca aggctgagag ggacatcctg gctgaagcag acaatgagtg ggtggtcaaa      2460
     ctctactact ccttccagga caaggacagc ctgtactttg tgatggacta cataccaggc      2520
     ggggatatga tgagcctgct gatcaggatg gaggtcttcc ctgagcacct ggcccgcttc      2580
     tacattgcag agttgaccct ggccattgaa agtgtccaca agatgggctt tatccaccgg      2640
     gacatcaagc ctgacaacat actcatcgac ctggatggtc atattaagct gacagatttt      2700
     ggcctctgca ctggattcag gtggactcac aattccaagt actaccagaa agggaaccac      2760
     atgagacagg acagcatgga gcccggtgac ctctgggacg atgtttccaa ctgtcgctgt      2820
     ggagacaggt taaagaccct ggagcagagg gcgcagaagc agcaccagag gtgcctggca      2880
     cattctcttg tcgggacacc aaattacatc gctccggagg tgcttctccg caaagggtac      2940
     acgcagctct gtgactggtg gagcgtcggt gtgattctct ttgagatgct ggttgggcag      3000
     ccgcctttct tggcccccac ccccacagag acgcagctga aggtgatcaa ctgggagagc      3060
     acgctgcata tccctacgca ggtgaggctc agcgctgagg cccgagacct catcacgaag      3120
     ctgtgctgcg cggctgactg ccgcctgggc agggatgggg cagatgacct caaggcacac      3180
     ccgttcttca acaccatcga cttttcccgt gacatccgaa agcagcctgc accctacgtc      3240
     cccaccatca gccaccccat ggacacctcc aattttgacc cggtggatga agaaagcccc      3300
     tggcagcgag ggccagcgga gagagcgcca aggcctggga cacgctggcc tcccccagca      3360
     gcaagcatcc agagcacgcc ttctatgagt tcaccttccg caggttcttc gatgacaacg      3420
     gctatccctt ccggtgcccg aagccctcag agcccgcaga gagtgcagag cccaggggat      3480
     gcggacttgg aaggtgcggc cgaggggctg ccagccggtg tacgtgtaag cctcagttaa      3540
     ccacaactcg aggaaaccca aaatgagatt tcttttcaga agacaaactc aagcttagga      3600
     atccttcatt tttagttctg gtaaatgggc aacaggaaga gtcaacatga tttcaaatta      3660
     gccctctgag gaccttcact gcattaaaac agtattttta aaaaatt                    3707
//