Dbfetch

ID   AK143026; SV 1; linear; mRNA; HTC; MUS; 5530 BP.
XX
AC   AK143026;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 12)
XX
DE   Mus musculus 0 day neonate lung cDNA, RIKEN full-length enriched library,
DE   clone:E030043B19 product:sema domain, seven thrombospondin repeats (type 1
DE   and type 1-like), transmembrane domain (TM) and short cytoplasmic domain,
DE   (semaphorin) 5A, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-5530
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; b3112ad4a8d5f79d8e97e91ea9e4a6a2.
DR   Ensembl-Gn; ENSMUSG00000022231; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0021771; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0021732; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0021709; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0021738; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0021513; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0022177; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0021029; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0021482; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0021606; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0021584; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0021675; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0021609; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0022200; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0020773; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0021085; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000067458; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0040740; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0040703; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0040662; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0040702; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0040438; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0041169; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0040487; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0040357; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0040508; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0040471; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0040573; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0040460; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0041216; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0040083; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0039843; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=E030043B19
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..5530
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="0 day neonate"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="E030043B19"
FT                   /tissue_type="lung"
FT                   /db_xref="taxon:10090"
FT   CDS             670..3894
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="sema domain, seven thrombospondin repeats (type 1
FT                   and type 1-like), transmembrane domain (TM) and short
FT                   cytoplasmic domain, (semaphorin) 5A (MGD|MGI:107556
FT                   GB|NM_009154, evidence: BLASTN, 99%, match=3641)"
FT                   /db_xref="GOA:Q3UPZ0"
FT                   /db_xref="InterPro:IPR000884"
FT                   /db_xref="InterPro:IPR001627"
FT                   /db_xref="InterPro:IPR015943"
FT                   /db_xref="InterPro:IPR016201"
FT                   /db_xref="InterPro:IPR027231"
FT                   /db_xref="MGI:MGI:107556"
FT                   /db_xref="UniProtKB/TrEMBL:Q3UPZ0"
FT                   /protein_id="BAE25254.1"
FT                   /translation="MKGACILAWLFSSLGVWRLARPETQDPAKCQRAEHPVVSYKEIGP
FT                   WLREFRAENAVDFSRLTFDPGQKELVVGARNYLFRLELEDLSLIQAVEWECDEATKKAC
FT                   YSKGKSKEECQNYIRVLLVGGDRLFTCGTNAFTPVCTIRSLSNLTEIHDQISGMARCPY
FT                   SPQHNSTALLTASGELYAATAMDFPGRDPAIYRSLGTLPPLRTAQYNSKWLNEPNFVSS
FT                   YDIGNFTYFFFRENAVEHDCGKTVFSRAARVCKNDIGGRFLLEDTWTTFMKARLNCSRP
FT                   GEVPFYYNELQGTFFLPELDLIYGIFTTNVNSIAASAVCVFNLSAISQAFNGPFKYQEN
FT                   SRSAWLPYPNPNPNFQCGTMDQGLYVNLTERNLQDAQKFILMHEVVQPVTTVPSFMEDN
FT                   SRFSHLAVDVVQGRETLVHIIYLATDYGTIKKVRAPLSQSSGSCLLEEIELFPERRSEP
FT                   IRSLQILHSQSVLFVGLQEHVAKIPLKRCHFHQTRSACIGAQDPYCGWDAVMKKCTSLE
FT                   ESLSMTQWDQSIPTCPTRNLTVDGSFGPWSPWTPCTHTDGTAVGSCLCRSRSCDSPAPQ
FT                   CGGWQCEGPRMEITNCSRNGGWTPWTSWSPCSTTCGIGFQVRQRSCSNPTPRHGGRVCV
FT                   GQNREERYCNEHLLCPPHVFWTGWGPWERCTAQCGGGIQARRRTCENGPDCAGCNVEYQ
FT                   PCNTNACPELKKTTPWTPWTPVNISDNGGHYEQRFRYTCKARLPDPNLLEVGRQRIEMR
FT                   YCSSDGTSGCSTDGLSGDFLRAGRYSAHTVNGAWSAWTSWSQCSRDCSRGIRNRKRVCN
FT                   NPEPKFGGMPCLGPSLEFQECNILPCPVDGVWSCWSSWSKCSATCGGGHYMRTRSCSNP
FT                   APAYGGDICLGLHTEEALCNTQTCPESWSEWSDWSVCDASGTQVRARQCILLFPVGSQC
FT                   SGNTTESRPCVFDSNFIPEVSVARSSSVEEKRCGEFNMFHMMAVGLSSSILGCLLTLLV
FT                   YTYCQRYQQQSHDATVIHPVSPAALNSSITNHINKLDKYDSVEAIKAFNKNNLILEERN
FT                   KYFNPHLTGKTYSNAYFTDLNNYDEY"
XX
SQ   Sequence 5530 BP; 1315 A; 1477 C; 1405 G; 1333 T; 0 other;
     gccgctcacc cagcgctggt ctactcgcta ggaaggcggc gtcagcagcg gctgctcacg        60
     ctgtgcccac tcccgcgcgc gctgcgctcc cgccgcgggt gggctcctca gcccttggct       120
     gacggtggcc ccagccttgt ggcgacctct gtatcccacc ttcccactgc gggggcgtcc       180
     cggcacgtgc aatccttttt gggggtgttc ttggttccca cgcgcagcta ctaagcagag       240
     ggtacccaac tttgccacct caccggtcgc cctctctcgg cgcgagcctg tccactcagc       300
     tgcacaactg gagactccgg cagactagct ggtccccagc ctggcgcgct gcgagaggag       360
     gatagctctg gagaagtggc ggggtgcagt ggtggcggct gcctcgactt ccctcgtagt       420
     acactgaaga ggacccttgg agcagctccg ggccccgggg cgctggatga ctagcaggag       480
     gaacgcgccc ggcatctggg ctggtgtgtg acaactgggc ctgggtaccc atgaggctgt       540
     gcagttcagt gtgattcatg accgaggcta cacgtctttg ccctctctct cctcggtact       600
     tttcacacat gaggagaagg tgagcttcgc agaagacacg ttcccagagt cagagacccc       660
     ttgcccacca tgaagggagc ctgcatcctt gcatggctgt tctcaagcct gggagtgtgg       720
     agacttgcta ggcccgagac ccaggaccct gccaagtgcc agagagctga gcaccctgtc       780
     gtctcttaca aagaaattgg cccctggtta cgggagttca gagccgagaa tgctgtggat       840
     ttctcgaggt taacatttga cccaggacag aaagaacttg tcgtaggagc gagaaactat       900
     ctcttcagat tagagcttga ggatctgtct ctcatccagg ctgtggagtg ggagtgtgac       960
     gaagccacca agaaggcctg ttacagcaag ggcaaatcga aggaggaatg tcagaactac      1020
     atacgagtgc tcttagttgg tggggaccgg ctattcacct gtgggaccaa tgccttcaca      1080
     cctgtctgta ccatccgctc gttaagtaac ctgactgaga tccatgatca gatcagtggc      1140
     atggcccgct gtccctacag tccccagcac aattctaccg ccctactcac tgccagtggt      1200
     gagctctatg ctgcaacagc catggatttc ccagggcgag atccagccat ttaccgaagt      1260
     ctgggcactt tgcctcctct tcgaactgca cagtacaact ccaaatggct caatgaacca      1320
     aactttgtgt cttcctacga catcggaaat ttcacctact tcttcttccg agaaaacgcc      1380
     gtggaacatg actgtgggaa gacggtgttc tccagggctg cccgggtctg taaaaatgac      1440
     attggagggc gatttcttct ggaggacacc tggactacct tcatgaaggc tcgcctcaac      1500
     tgctcccggc ctggtgaggt gcccttctac tacaatgagc tgcagggtac tttcttcctg      1560
     ccggagctgg atctgatcta tggtatcttc accaccaatg taaacagcat cgctgcctct      1620
     gctgtctgtg tcttcaacct gagcgccatc tcacaggcct tcaatggacc cttcaagtac      1680
     caagaaaact ctcgctcagc ctggctgcct tatcccaacc ccaaccccaa ttttcagtgc      1740
     ggcaccatgg accagggcct gtacgtaaac ctgacggaaa gaaacctaca ggatgctcag      1800
     aagttcattc tgatgcatga agtggtgcag ccagtgacca cagtgccctc tttcatggag      1860
     gacaacagcc gcttctccca cttagctgtt gatgttgtgc aaggcaggga gactctggtc      1920
     cacatcattt atctggccac agattatggc accattaaga aagtacgagc tcccctgagt      1980
     cagagctcag gcagctgttt gctggaagag attgagcttt tcccagagag gaggagtgag      2040
     cccatcagga gcctgcagat cctccacagc cagagtgtcc tatttgtggg actacaggag      2100
     catgtggcca agatccccct gaagaggtgc cacttccatc aaacacgcag tgcctgcatt      2160
     ggcgctcagg acccttactg tggctgggat gcggtgatga agaaatgcac cagcctggag      2220
     gagagtctga gcatgaccca gtgggatcag agcatcccca cctgtccgac cagaaatctc      2280
     actgtagatg ggagctttgg cccatggtca ccgtggacac cctgtacaca cactgatggc      2340
     actgctgtgg gctcctgcct ctgccggtcc cgctcctgtg acagcccagc tcctcagtgt      2400
     ggtggttggc agtgtgaggg ccctagaatg gagatcacca actgttccag gaatggaggc      2460
     tggactccct ggacctcctg gtctccgtgc agcacgacct gtggcattgg cttccaagtg      2520
     cggcagcgat cctgcagcaa ccccacgccc aggcatggcg ggcgcgtgtg cgtgggccag      2580
     aacagggagg aaagatactg caatgaacat ttgctctgcc caccacacgt gttctggaca      2640
     ggctggggac cttgggaacg gtgcacagcc cagtgcgggg gtggcattca agcccgccgg      2700
     aggacctgtg aaaatgggcc tgactgtgca ggatgcaatg tggaatacca gccttgtaac      2760
     accaatgcat gcccggagct gaagaagacc acaccctgga caccctggac ccctgtcaac      2820
     atctctgaca atggaggtca ctatgagcag cgtttccgct atacctgtaa agctcgcctg      2880
     ccagatccaa atttgctgga agtaggaaga cagaggatag aaatgcggta ctgttccagc      2940
     gatggaacca gtggctgctc cacagacgga ctttctggag actttctaag agctgggaga      3000
     tactctgctc atacagtcaa tggggcatgg tcagcctgga cttcctggtc acagtgcagc      3060
     agagactgca gcaggggcat tcggaaccgg aagcgtgttt gcaacaaccc agaacccaag      3120
     tttgggggca tgccatgcct tggcccatcg ctggagttcc aggaatgcaa cattttacct      3180
     tgtccagtgg atggtgtgtg gtcttgctgg tcatcctggt ctaaatgttc agcaacctgt      3240
     ggaggtggcc actacatgag aacccgttct tgttcgaatc cagccccagc atatggaggg      3300
     gacatctgcc tgggactgca cacagaagag gcactctgca acacacagac ctgcccagaa      3360
     agctggtcag agtggtcaga ctggtctgtg tgcgatgcat ctggtaccca ggtccgtgct      3420
     cggcagtgca tccttctgtt tccagtgggc agccaatgtt ctggaaatac tacagagagc      3480
     cggccttgtg tatttgactc taatttcatc ccagaagtat ctgtggcaag atccagcagc      3540
     gtagaagaaa aaaggtgtgg agagttcaac atgttccaca tgatggccgt ggggcttagc      3600
     agttccattc ttggctgcct cctcacactg cttgtctaca cctactgcca gaggtaccag      3660
     cagcagtccc atgatgcaac tgtcatccac cctgtctctc ctgccgccct caacagcagc      3720
     ataactaacc acatcaacaa actggacaaa tatgattctg tggaggccat caaggcattt      3780
     aacaaaaaca acttgatcct agaggagaga aacaaatact tcaacccaca tctcactggg      3840
     aagacctatt ccaacgccta ctttacagat ctcaacaatt atgatgaata ctagcagctc      3900
     ttatactttg ggctcgttgt aaactcgctg ctcctcaaaa ccgtgctcca tggctgccca      3960
     tatttctgag gcttcagaga tgacgtgtgg aaccatttca agtgcatttc aaaccaggac      4020
     tttcccatcg ctaccaaaaa agtgcacacc cttaaatgcc taaatgtggt gttgtgaaaa      4080
     tctggtctat agaaaacatt tggttgttgc gaagtgagcc atggtatgac attttgtatt      4140
     gtgctcatcc taaactctaa ctggtctact gtttttcagg agttttattt cataaatgtg      4200
     ccaaccatat tgaaaagtgt ctttggaaac atagcatcgt catgcttttg agtgtagagg      4260
     atcaggatgt tttaacttgg aaacaaacaa acaaataacc aggcacttta atctgagaga      4320
     tgcctccccc cctgcccccc ccccacacac acacagagtc cagagtaatt cacagaatga      4380
     aggtgtggct aatgccaact cccccaacct ggtgactcta ggttcttaga aacagcttgt      4440
     ttgcacttgg agtgattctg cctagggatt ccacctgccc tggagcacca agtcgtggtc      4500
     tactgagacc cgctgctttg gtctggggca gtgactgtca tcactgtttg gtgaacacat      4560
     gaagcagaga gcagcgagtg agacacacaa cagtgttcta cagcttgcag tgaaactaac      4620
     cttactctga cttttggatt ccatgtcatc ccaggaagct ccttgccacg ctgtgagcct      4680
     gaagaccaac ttcccagaca gtgagcttcc tggctgaaga gccactctga gcatttgggt      4740
     agactccaca ccctttcctc cctccctcca tgaggagaaa attcttcaac atttatgatg      4800
     gaaaataatg aaagtaccaa actgtaattg aaagcgactt taaaaaaaat tgtgtgttat      4860
     gcagtgttta attaactgat gtaattgaaa gctacttttc attttttatt tttttgtgtt      4920
     atgcagtgct gttaatggct ttcttttgta ctcagcatct ctgatgtggc taatgtgcag      4980
     caaggcccct atcaagagca ttaaaggtgt gtaaagctgt tcctccttgc agtcacactg      5040
     gagtggcact ccagtaccat gctctcctca tttgtcccct gctgccccaa gaccagagga      5100
     gggaaaggct ctggtgtgct ttacctcagt tgctgccttc cttgtcctcc ggcttactga      5160
     gtcccacttc tgctctaaga cagtagctgt cagaatgtat catgagaggt aatggagaag      5220
     attagggttt cagtgaccct cctctctcat cctaatggtt gaatgttcta aggctactct      5280
     tgaggtcctc agaggagtaa agagtggtta gcatctgatc taatgctttc taaggtgagg      5340
     acaaaaggtt tgtctacacc aatggtcact atgataaagg cctgggtagg cactcatgtg      5400
     ttcttatgaa tatataaact ttcatgccta tgcagatata gagcacaggg agtgaagatt      5460
     acagtttttt ttttaatgag tatatgtttt cccccatgga atataaagaa gcaatctcag      5520
     aacacattcc                                                             5530
//