ID   AK033387; SV 1; linear; mRNA; HTC; MUS; 3522 BP.
XX
AC   AK033387;
XX
DT   18-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 14)
XX
DE   Mus musculus 16 days embryo lung cDNA, RIKEN full-length enriched library,
DE   clone:8430425B13 product:procollagen, type IV, alpha 3, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3522
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-JUL-2001) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; bf4be66010b0ba87e41c078f105860ae.
DR   Ensembl-Gn; ENSMUSG00000079465; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000113457; mus_musculus.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=8430425B13
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3522
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="16 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="8430425B13"
FT                   /tissue_type="lung"
FT                   /db_xref="taxon:10090"
FT   CDS             <1..341
FT                   /codon_start=3
FT                   /transl_table=1
FT                   /note="procollagen, type IV, alpha 3 (MGD|MGI:104688
FT                   GB|NM_007734, evidence: BLASTN, 99%, match=2731)"
FT                   /note="putative"
FT                   /note="start codon is not identified"
FT                   /db_xref="GOA:Q9QZS0"
FT                   /db_xref="InterPro:IPR001442"
FT                   /db_xref="InterPro:IPR008160"
FT                   /db_xref="InterPro:IPR016187"
FT                   /db_xref="MGI:MGI:104688"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9QZS0"
FT                   /protein_id="BAC28260.1"
FT                   /translation="AVHSQTTAIPPCPQDWVSLWKGFSFIMFTSAGSEGAGQALASPGS
FT                   CLEEFRASPFIECHGRGTCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGDLEKIISRC
FT                   QVCMKKRH"
FT   regulatory      3497..3502
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      3522
FT                   /note="putative"
XX
SQ   Sequence 3522 BP; 1046 A; 757 C; 634 G; 1085 T; 0 other;
     tagctgttca cagtcaaact actgctatcc ctccgtgtcc ccaggactgg gtttctctct        60
     ggaaaggttt ttctttcatt atgttcacaa gtgcaggctc tgagggtgct ggacaagcac       120
     ttgcctcgcc tggctcctgc ctggaagaat tccgagccag tccatttata gaatgccatg       180
     gacgagggac atgtaactac tactcaaact cctacagttt ctggctggct tcgctgaacc       240
     cagaaagaat gttcagaaaa cctattccat caactgtgaa agctggagac ttagagaaaa       300
     tcataagccg ctgtcaggtg tgcatgaaga aaagacattg acgaaatcaa agaaaccaga       360
     accgtttttt tttttttttt tttttttttt ggttccttct taagtaacaa agtcatgacc       420
     ttaaaacatg aatttactta ggtctttctc tctaacctta aagatgtagt agtgcaaagt       480
     ttcactacac aagaaagcat ttcattcgtt agtctgtgat gtgggtctct aaacctgtag       540
     gttccaaagt tccctgctgg aaaggcaacc agcttgccca cagaatatcg ctgaaatcac       600
     attccagttg tacccaaaga gctgactaaa tgtcgattgt gtggaacttg aactacaagc       660
     taagagaggt ctgtcctgtg gatgcacggt gtgttccttg tctcccactc ctgagcaaag       720
     gagtgcggtt ccttccatct cctccctgac accttagact tcaactacag aaggagaagg       780
     gagtatacca tcaagcacac cgtgaagaga actggagggt cagagctggt gagtggtgta       840
     aggactagaa cacagaataa atgaggcccc atgatgtttc aggactcatt taaatatgca       900
     caaggctagg accaaaggaa acttcctgaa gcgaatcata acaggcttgt atataaagtg       960
     tagttttgtt ttctgttttg ttgacgatgt ctatttgctt attcactgag ctctttttac      1020
     tatgtcaata gattcaaagc tggtaaaacg tttacccttg gggaactagg ttttactagg      1080
     atggcataca catacattga aacatttctc tttattgttt tcacaagtaa caaggacact      1140
     gcttactaaa tgtgcacttt atccccactc ctatcaacgc cgaccttaaa taatttcacc      1200
     aggaaataat caggtttatc cgacgcaaac atttttttct ttgctctaaa cagtacataa      1260
     catggattta cagatttgtt cctttgttag ttttataact tttcctttgg aaaacacaca      1320
     cgtacgccat gatttttatg ttatagattc ttcgcaacca tgtggaaaga gaccactggg      1380
     taaaacctta ctgtactgta cacagctgct atatgtctta aaacatgttc ttaagtatac      1440
     agatacattc cacagttact taattgagct taacaggttg ttttcttcag ttccaaacta      1500
     tttcccatat actggactgg accggcggtg cagggggtag tgagagagac ttgcttctga      1560
     tctaccctca aaggagataa caaaccccgt gtgtggagac tctcacgcag gcaggaagcc      1620
     catcctctgc tgcttcaccc ccttccctac acggaaaaca aacaaagagt ctcagctatg      1680
     gttgctgttt tagttttccc ttacttctat ttcacagttc agcaggcctg agagaagatt      1740
     taaaaataag agcacttgtt gctctgcaga agcccctagt ttagttccta gtacccacac      1800
     tggtactgtt tcacagtagc ctcttccaga tgacccaaca tccacttctg gactctgcac      1860
     ccacgctcag ctagatatac aaaagtttta aaacttttaa aataagatct atgtccaggc      1920
     taagttcaaa acctctaact tctcttaggc atgccagaag ctaatgtctg aaaaatatta      1980
     agaaatcggg aactttttaa taatcagata gtgtcgtact atttgaagac tcaaagaggc      2040
     caggcattgt gaggcaaacc tttgagtgta tttcttgacc cacggaagtg gacgtgatgg      2100
     tgctgatact taagttcagg ctggtgctgt aaactcaact gctttgtatt tggaaaatgt      2160
     gacccaagag cacagacaag ggttgtcatt ctttaccatt cgtgtgacca ctacaagaca      2220
     ttttacacct gagctgagac atgcaacatc tcattacaag tatgtatttc agctaacatt      2280
     tttgtttgaa aaatatcctg ttgtcccaca tacattccag tagtgaaaac aaggactaat      2340
     ctccttggcc agtgtaggag tagcagctgt ggcattcagg agcttgtcct aattgcccat      2400
     tgtagatcat tctaaacctg caaagtttga aactgtaact agcaaacaga aatctgtgtc      2460
     ttgtcttgat gagaagcatc tcattgtaca acccaggatg acccccaaat caccacaagt      2520
     tttctgcctc agccacccag aattttgttt tcaaagtttt catcatcctc tcatgttctc      2580
     cactcaacta ggcacacacc caagaaactg gccacaatcc ttttcacctt tgtctctctt      2640
     taaaactcac tggtgacttt cagggtaatc tatttcacta ctagactggt gctgatgtca      2700
     caacaggtac ccttcatttg ctcataggcc cctttcttct tctgtcctct tatcagtatt      2760
     taaaacaggt aatactggac accttaatgc ccaaagaagc catatcaaca gtcagtcatt      2820
     tgtacttcta agagtgagca ataaaacaca gcccgtgcat agtacttccc agatgtttat      2880
     gagtaaatta tacatttaaa attcaatttc ctatgtggct ggagaaatgg ttcagtagtt      2940
     cagagtgctg gtttctctgc agagaaccaa agtatagttc tcaataccca ggtcatgcag      3000
     ctcaggactg ctcaaaggga ccccgctggc ctccaaggat acccacatat aggtggcatt      3060
     caaggatacc cacatatagg tggcattcac tagcacaggc actgccacat aaatacaaat      3120
     aaatctttca acatttttaa ctgattaaca aaacgtttgt ttcaattttg gtgcttcaac      3180
     tttggagttt cacagtgtgc taaaacaacc aacttgtcaa ttgttctttt tttgcaagat      3240
     aacacattat taatttattt ataaaagtat taaaaatgtt taaatttgaa ttttattgtt      3300
     aattttacaa aggttttcta tatttcaaat gtagcttgct accactaaac ttaatttaat      3360
     attcttctca ataaccactt gaaatttatt taaaatatat caaatatttt atattgttat      3420
     atcctgacaa aattataatg tattacaatg tactaatatt tctgtaaata taactaaagt      3480
     tctattgttt tgtcaaaata aaagaaaagg aacaattcta tc                         3522
//