spacer

EBI Dbfetch

ID   AK141612; SV 1; linear; mRNA; HTC; MUS; 4953 BP.
XX
AC   AK141612;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 12)
XX
DE   Mus musculus adult male hippocampus cDNA, RIKEN full-length enriched
DE   library, clone:C630044E17 product:sema domain, seven thrombospondin repeats
DE   (type 1 and type 1-like), transmembrane domain (TM) and short cytoplasmic
DE   domain, (semaphorin) 5A, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-4953
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 9c2d133910ca240c65e07a010eec46a1.
DR   Ensembl-Gn; ENSMUSG00000022231; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000067458; mus_musculus.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=C630044E17
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4953
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="C630044E17"
FT                   /tissue_type="hippocampus"
FT                   /db_xref="taxon:10090"
FT   CDS             603..3827
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="sema domain, seven thrombospondin repeats (type 1
FT                   and type 1-like), transmembrane domain (TM) and short
FT                   cytoplasmic domain, (semaphorin) 5A (MGD|MGI:107556
FT                   GB|NM_009154, evidence: BLASTN, 99%, match=3641)"
FT                   /db_xref="GOA:Q3UPZ0"
FT                   /db_xref="InterPro:IPR000884"
FT                   /db_xref="InterPro:IPR001627"
FT                   /db_xref="InterPro:IPR002165"
FT                   /db_xref="InterPro:IPR015943"
FT                   /db_xref="InterPro:IPR016201"
FT                   /db_xref="InterPro:IPR027231"
FT                   /db_xref="MGI:MGI:107556"
FT                   /db_xref="UniProtKB/TrEMBL:Q3UPZ0"
FT                   /protein_id="BAE24765.1"
FT                   /translation="MKGACILAWLFSSLGVWRLARPETQDPAKCQRAEHPVVSYKEIGP
FT                   WLREFRAENAVDFSRLTFDPGQKELVVGARNYLFRLELEDLSLIQAVEWECDEATKKAC
FT                   YSKGKSKEECQNYIRVLLVGGDRLFTCGTNAFTPVCTIRSLSNLTEIHDQISGMARCPY
FT                   SPQHNSTALLTASGELYAATAMDFPGRDPAIYRSLGTLPPLRTAQYNSKWLNEPNFVSS
FT                   YDIGNFTYFFFRENAVEHDCGKTVFSRAARVCKNDIGGRFLLEDTWTTFMKARLNCSRP
FT                   GEVPFYYNELQGTFFLPELDLIYGIFTTNVNSIAASAVCVFNLSAISQAFNGPFKYQEN
FT                   SRSAWLPYPNPNPNFQCGTMDQGLYVNLTERNLQDAQKFILMHEVVQPVTTVPSFMEDN
FT                   SRFSHLAVDVVQGRETLVHIIYLATDYGTIKKVRAPLSQSSGSCLLEEIELFPERRSEP
FT                   IRSLQILHSQSVLFVGLQEHVAKIPLKRCHFHQTRSACIGAQDPYCGWDAVMKKCTSLE
FT                   ESLSMTQWDQSIPTCPTRNLTVDGSFGPWSPWTPCTHTDGTAVGSCLCRSRSCDSPAPQ
FT                   CGGWQCEGPRMEITNCSRNGGWTPWTSWSPCSTTCGIGFQVRQRSCSNPTPRHGGRVCV
FT                   GQNREERYCNEHLLCPPHVFWTGWGPWERCTAQCGGGIQARRRTCENGPDCAGCNVEYQ
FT                   PCNTNACPELKKTTPWTPWTPVNISDNGGHYEQRFRYTCKARLPDPNLLEVGRQRIEMR
FT                   YCSSDGTSGCSTDGLSGDFLRAGRYSAHTVNGAWSAWTSWSQCSRDCSRGIRNRKRVCN
FT                   NPEPKFGGMPCLGPSLEFQECNILPCPVDGVWSCWSSWSKCSATCGGGHYMRTRSCSNP
FT                   APAYGGDICLGLHTEEALCNTQTCPESWSEWSDWSVCDASGTQVRARQCILLFPVGSQC
FT                   SGNTTESRPCVFDSNFIPEVSVARSSSVEEKRCGEFNMFHMMAVGLSSSILGCLLTLLV
FT                   YTYCQRYQQQSHDATVIHPVSPAALNSSITNHINKLDKYDSVEAIKAFNKNNLILEERN
FT                   KYFNPHLTGKTYSNAYFTDLNNYDEY"
FT   polyA_signal    4933..4938
FT                   /note="putative"
FT   polyA_site      4953
FT                   /note="putative"
XX
SQ   Sequence 4953 BP; 1176 A; 1337 C; 1270 G; 1170 T; 0 other;
     gactcccgcg cgcgctgcgc tcccgccgcg ggtgggctcc tcagcccttg gctgacggtg        60
     gccccagcct tgtggcgacc tctgtatccc accttcccac tgcgggggcg tcccggcacg       120
     tgcaatcctt tttgggggtg ttcttggttc ccacgcgcag ctactaagca gagggtaccc       180
     aactttgcca cctcaccggt cgccctctct cggcgcgagc ctgtccactc agctgcacaa       240
     ctggagactc cggcagacta gctggtcccc agcctggcgc gctgcgagag gaggatagct       300
     ctggagaagt ggcggggtgc agtggtggcg gctgcctcga cttccctcgt agtacactga       360
     agaggaccct tggagcagct ccgggccccg gggcgctgga tgactagcag gaggaacgcg       420
     cccggcatct gggctggtgt gtgacaactg ggcctgggta cccatgaggc tgtgcagttc       480
     agtgtgattc atgaccgagg ctacacgtct ttgccctctc tctcctcggt acttttcaca       540
     catgaggaga aggtgagctt cgcagaagac acgttcccag agtcagagac cccttgccca       600
     ccatgaaggg agcctgcatc cttgcatggc tgttctcaag cctgggagtg tggagacttg       660
     ctaggcccga gacccaggac cctgccaagt gccagagagc tgagcaccct gtcgtctctt       720
     acaaagaaat tggcccctgg ttacgggagt tcagagccga gaatgctgtg gatttctcga       780
     ggttaacatt tgacccagga cagaaagaac ttgtcgtagg agcgagaaac tatctcttca       840
     gattagagct tgaggatctg tctctcatcc aggctgtgga gtgggagtgt gacgaagcca       900
     ccaagaaggc ctgttacagc aagggcaaat cgaaggagga atgtcagaac tacatacgag       960
     tgctcttagt tggtggggac cggctattca cctgtgggac caatgccttc acacctgtct      1020
     gtaccatccg ctcgttaagt aacctgactg agatccatga tcagatcagt ggcatggccc      1080
     gctgtcccta cagtccccag cacaattcta ccgccctact cactgccagt ggtgagctct      1140
     atgctgcaac agccatggat ttcccagggc gagatccagc catttaccga agtctgggca      1200
     ctttgcctcc tcttcgaact gcacagtaca actccaaatg gctcaatgaa ccaaactttg      1260
     tgtcttccta cgacatcgga aatttcacct acttcttctt ccgagaaaac gccgtggaac      1320
     atgactgtgg gaagacggtg ttctccaggg ctgcccgggt ctgtaaaaat gacattggag      1380
     ggcgatttct tctggaggac acctggacta ccttcatgaa ggctcgcctc aactgctccc      1440
     ggcctggtga ggtgcccttc tactacaatg agctgcaggg tactttcttc ctgccggagc      1500
     tggatctgat ctatggtatc ttcaccacca atgtaaacag catcgctgcc tctgctgtct      1560
     gtgtcttcaa cctgagcgcc atctcacagg ccttcaatgg acccttcaag taccaagaaa      1620
     actctcgctc agcctggctg ccttatccca accccaaccc caattttcag tgcggcacca      1680
     tggaccaggg cctgtacgta aacctgacgg aaagaaacct acaggatgct cagaagttca      1740
     ttctgatgca tgaagtggtg cagccagtga ccacagtgcc ctctttcatg gaggacaaca      1800
     gccgcttctc ccacttagct gttgatgttg tgcaaggcag ggagactctg gtccacatca      1860
     tttatctggc cacagattat ggcaccatta agaaagtacg agctcccctg agtcagagct      1920
     caggcagctg tttgctggaa gagattgagc ttttcccaga gaggaggagt gagcccatca      1980
     ggagcctgca gatcctccac agccagagtg tcctatttgt gggactacag gagcatgtgg      2040
     ccaagatccc cctgaagagg tgccacttcc atcaaacacg cagtgcctgc attggcgctc      2100
     aggaccctta ctgtggctgg gatgcggtga tgaagaaatg caccagcctg gaggagagtc      2160
     tgagcatgac ccagtgggat cagagcatcc ccacctgtcc gaccagaaat ctcactgtag      2220
     atgggagctt tggcccatgg tcaccgtgga caccctgtac acacactgat ggcactgctg      2280
     tgggctcctg cctctgccgg tcccgctcct gtgacagccc agctcctcag tgtggtggtt      2340
     ggcagtgtga gggccctaga atggagatca ccaactgttc caggaatgga ggctggactc      2400
     cctggacctc ctggtctccg tgcagcacga cctgtggcat tggcttccaa gtgcggcagc      2460
     gatcctgcag caaccccacg cccaggcatg gcgggcgcgt gtgcgtgggc cagaacaggg      2520
     aggaaagata ctgcaatgaa catttgctct gcccaccaca cgtgttctgg acaggctggg      2580
     gaccttggga acggtgcaca gcccagtgcg ggggtggcat tcaagcccgc cggaggacct      2640
     gtgaaaatgg gcctgactgt gcaggatgca atgtggaata ccagccttgt aacaccaatg      2700
     catgcccgga gctgaagaag accacaccct ggacaccctg gacccctgtc aacatctctg      2760
     acaatggagg tcactatgag cagcgtttcc gctatacctg taaagctcgc ctgccagatc      2820
     caaatttgct ggaagtagga agacagagga tagaaatgcg gtactgttcc agcgatggaa      2880
     ccagtggctg ctccacagac ggactttctg gagactttct aagagctggg agatactctg      2940
     ctcatacagt caatggggca tggtcagcct ggacttcctg gtcacagtgc agcagagact      3000
     gcagcagggg cattcggaac cggaagcgtg tttgcaacaa cccagaaccc aagtttgggg      3060
     gcatgccatg ccttggccca tcgctggagt tccaggaatg caacatttta ccttgtccag      3120
     tggatggtgt gtggtcttgc tggtcatcct ggtctaaatg ttcagcaacc tgtggaggtg      3180
     gccactacat gagaacccgt tcttgttcga atccagcccc agcatatgga ggggacatct      3240
     gcctgggact gcacacagaa gaggcactct gcaacacaca gacctgccca gaaagctggt      3300
     cagagtggtc agactggtct gtgtgcgatg catctggtac ccaggtccgt gctcggcagt      3360
     gcatccttct gtttccagtg ggcagccaat gttctggaaa tactacagag agccggcctt      3420
     gtgtatttga ctctaatttc atcccagaag tatctgtggc aagatccagc agcgtagaag      3480
     aaaaaaggtg tggagagttc aacatgttcc acatgatggc cgtggggctt agcagttcca      3540
     ttcttggctg cctcctcaca ctgcttgtct acacctactg ccagaggtac cagcagcagt      3600
     cccatgatgc aactgtcatc caccctgtct ctcctgccgc cctcaacagc agcataacta      3660
     accacatcaa caaactggac aaatatgatt ctgtggaggc catcaaggca tttaacaaaa      3720
     acaacttgat cctagaggag agaaacaaat acttcaaccc acatctcact gggaagacct      3780
     attccaacgc ctactttaca gatctcaaca attatgatga atactagcag ctcttatact      3840
     ttgggctcgt tgtaaactcg ctgctcctca aaaccgtgct ccatggctgc ccatatttct      3900
     gaggcttcag agatgacgtg tggaaccatt tcaagtgcat ttcaaaccag gactttccca      3960
     tcgctaccaa aaaagtgcac acccttaaat gcctaaatgt ggtgttgtga aaatctggtc      4020
     tatagaaaac atttggttgt tgcgaagtga gccatggtat gacattttgt attgtgctca      4080
     tcctaaactc taactggtct actgtttttc aggagtttta tttcataaat gtgccaacca      4140
     tattgaaaag tgtctttgga aacatagcat cgtcatgctt ttgagtgtag aggatcagga      4200
     tgttttaact tggaaacaaa caaacaaata accaggcact ttaatctgag agatgcctcc      4260
     ccccctgccc cccccccaca cacacacaga gtccagagta attcacagaa tgaaggtgtg      4320
     gctaatgcca actcccccaa cctggtgact ctaggttctt agaaacagct tgtttgcact      4380
     tggagtgatt ctgcctaggg attccacctg ccctggagca ccaagtcgtg gtctactgag      4440
     acccgctgct ttggtctggg gcagtgactg tcatcactgt ttggtgaaca catgaagcag      4500
     agagcagcga gtgagacaca caacagtgtt ctacagcttg cagtgaaact aaccttactc      4560
     tgacttttgg attccatgtc atcccaggaa gctccttgcc acgctgtgag cctgaagacc      4620
     aacttcccag acagtgagct tcctggctga agagccactc tgagcatttg ggtagactcc      4680
     acaccctttc ctccctccct ccatgaggag aaaattcttc aacatttatg atggaaaata      4740
     atgaaagtac caaactgtaa ttgaaagcga ctttaaaaaa aattgtgtgt tatgcagtgt      4800
     ttaattaact gatgtaattg aaagctactt ttcatttttt atttttttgt gttatgcagt      4860
     gctgttaatg gctttctttt gtactcagca tctctgatgt ggctaatgtg cagcaaggcc      4920
     cctatcaaga gcattaaagg tgtgtaaagc tgt                                   4953
//


spacer
spacer