spacer

EBI Dbfetch

ID   K00650; SV 1; linear; genomic DNA; STD; HUM; 6210 BP.
XX
AC   K00650; M16287;
XX
DT   26-JUL-1991 (Rel. 28, Created)
DT   14-NOV-2006 (Rel. 89, Last updated, Version 4)
XX
DE   Human fos proto-oncogene (c-fos), complete cds.
XX
KW   c-myc proto-oncogene; fos oncogene; proto-oncogene.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-4165
RX   DOI; 10.1073/pnas.80.11.3183.
RX   PUBMED; 6574479.
RA   van Straaten F., Muller R., Curran T., Van Beveren C., Verma I.M.;
RT   "Complete nucleotide sequence of a human c-onc gene: deduced amino acid
RT   sequence of the human c-fos protein";
RL   Proc. Natl. Acad. Sci. U.S.A. 80(11):3183-3187(1983).
XX
RN   [2]
RX   DOI; 10.1016/0092-8674(85)90285-5.
RX   PUBMED; 2414012.
RA   Treisman R.;
RT   "Transient accumulation of c-fos RNA following serum stimulation requires a
RT   conserved 5' element and c-fos 3' sequences";
RL   Cell 42(3):889-902(1985).
XX
RN   [3]
RP   4166-6210
RX   PUBMED; 3555978.
RA   Verma I.M., Deschamps J., Van Beveren C., Sassone-Corsi P.;
RT   "Human fos gene";
RL   Cold Spring Harb. Symp. Quant. Biol. 51:0-0(0).
XX
DR   MD5; 2cdb1f2aa6c13c384c8522058c958a7c.
DR   EPD; EP11145; HS_FOS.
DR   Ensembl-Gn; ENSG00000170345; homo_sapiens.
DR   Ensembl-Tr; ENST00000303562; homo_sapiens.
DR   EuropePMC; PMC116128; 11711622.
DR   EuropePMC; PMC148030; 9837981.
DR   EuropePMC; PMC1752353; 9166000.
DR   EuropePMC; PMC19553; 9012824.
DR   EuropePMC; PMC293562; 8325983.
DR   EuropePMC; PMC3220474; 21937452.
XX
CC   [2]  sites; promoter region.
CC   C-fos is the human cellular homolog of the v-fos oncogene of
CC   Finkel-Biskis-Jinkins murine osteosarcoma virus (FBJ-MuSV).  [2] It
CC   was found that both human and murine c-fos genes contained an
CC   enhancer-like element in their 5' noncoding regions that was
CC   necessary for increased transcription following serum activation.
CC   The FBJ-MuSV v-fos oncogene contains a deletion relative to murine
CC   and human c-fos proto-oncogenes that causes complete divergence of
CC   the COOH terminal protein sequences encoded.  That deletion
CC   corresponds to positions 3182-3285 inclusive of this sequence.  The
CC   FBJ-MuSV v-fos sequence is more closely related to murine than
CC   human c-fos sequences.  The FBJ-MuSV v-fos coding sequence ends at
CC   a 'tag' stop codon coresponding to positions 3434-2436 of this
CC   sequence [1].  [1] notes two alu repeats beginning aproximately 500
CC   and 1700 nucleotides downstream of the last base in this sequence.
CC   A TATA box is located at positions 701-707.  Two potential
CC   polyadenylation signals are present in the 3' untranslated region.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..6210
FT                   /organism="Homo sapiens"
FT                   /map="14q24.3"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:9606"
FT   misc_feature    402..453
FT                   /note="transcriptional activator region [2]"
FT   prim_transcript 734..>3329
FT                   /note="c-fos mRNA [1]"
FT   gene            889..1029
FT                   /gene="FOS"
FT   CDS             join(889..1029,1783..2034,2466..2573,2688..3329)
FT                   /codon_start=1
FT                   /note="c-fos protein"
FT                   /db_xref="GOA:P01100"
FT                   /db_xref="HGNC:HGNC:3796"
FT                   /db_xref="InterPro:IPR000837"
FT                   /db_xref="InterPro:IPR004827"
FT                   /db_xref="PDB:1A02"
FT                   /db_xref="PDB:1FOS"
FT                   /db_xref="PDB:1S9K"
FT                   /db_xref="UniProtKB/Swiss-Prot:P01100"
FT                   /protein_id="AAA52471.1"
FT                   /translation="MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPVN
FT                   AQDFCTDLAVSSANFIPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPSAG
FT                   AYSRAGVVKTMTGGRAQSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRELTD
FT                   TLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSVASLD
FT                   LTGGLPEVATPESEEAFTLPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPASSRPS
FT                   GSETARSVPDMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSCTAYTSS
FT                   FVFTYPEADSFPSCAAAHRKGSSSNEPSSDSLSSPTLLAL"
FT   exon            <889..1029
FT                   /gene="FOS"
FT                   /number=1
FT                   /note="c-fos protein; G00-119-917"
FT   intron          1030..1782
FT                   /note="c-fos intron A"
FT   exon            1783..2034
FT                   /number=2
FT   intron          2035..2465
FT                   /note="c-fos intron B"
FT   exon            2466..2573
FT                   /number=3
FT   intron          2574..2687
FT                   /note="c-fos intron C"
FT   exon            2688..>3329
FT                   /number=4
FT                   /note="c-fos protein"
XX
SQ   Sequence 6210 BP; 1497 A; 1571 C; 1619 G; 1523 T; 0 other;
     GCAGGAACAG TGCTAGTATT GCTCGAGCCC GAGGGCTGGA GGTTAGGGGA TGAAGGTCTG        60
     CTTCCACGCT TTGCACTGAA TTAGGGCTAG AATTGGGGAT GGGGGTAGGG GCGCATTCCT       120
     TCGGGAGCCG AGGCTTAAGT CCTCGGGGTC CTGTACTCGA TGCCGTTTCT CCTATCTCTG       180
     AGCCTCAGAA CTGTCTTCAG TTTCCGTACA AGGGTAAAAA GGCGCTCTCT GCCCCATCCC       240
     CCCCGACCTC GGGAACAAGG GTCCGCATTG AACCAGGTGC GAATGTTCTC TCTCATTCTG       300
     CGCCGTTCCC GCCTCCCCTC CCCCAGCCGC GGCCCCCGCC TCCCCCCGCA CTGCACCCTC       360
     GGTGTTGGCT GCAGCCCGCG AGCAGTTCCC GTCAATCCCT CCCCCCTTAC ACAGGATGTC       420
     CATATTAGGA CATCTGCGTC AGCAGGTTTC CACGGCCTTT CCCTGTAGCC CTGGGGGGAG       480
     CCATCCCCGA AACCCCTCAT CTTGGGGGGC CCACGAGACC TCTGAGACAG GAACTGCGAA       540
     ATGCTCACGA GATTAGGACA CGCGCCAAGG CGGGGGCAGG GAGCTGCGAG CGCTGGGGAC       600
     GCAGCCGGGC GGCCGCAGAA GCGCCCAGGC CCGCGCGCCA CCCCTCTGGC GCCACCGTGG       660
     TTGAGCCCGT GACGTTTACA CTCATTCATA AAACGCTTGT TATAAAAGCA GTGGCTGCGG       720
     CGCCTCGTAC TCCAACCGCA TCTGCAGCGA GCAACTGAGA AGCCAAGACT GAGCCGGCGG       780
     CCGCGGCGCA GCGAACGAGC AGTGACCGTG CTCCTACCCA GCTCTGCTTC ACAGCGCCCA       840
     CCTGTCTCCG CCCCTCGGCC CCTCGCCCGG CTTTGCCTAA CCGCCACGAT GATGTTCTCG       900
     GGCTTCAACG CAGACTACGA GGCGTCATCC TCCCGCTGCA GCAGCGCGTC CCCGGCCGGG       960
     GATAGCCTCT CTTACTACCA CTCACCCGCA GACTCCTTCT CCAGCATGGG CTCGCCTGTC      1020
     AACGCGCAGG TAAGGCTGGC TTCCCGTCGC CGCGGGGCCG GGGGCTTGGG GTCGCGGAGG      1080
     AGGAGACACC GGGCGGGACG CTCCAGTAGA TGAGTAGGGG GCTCCCTTGT GCCTGGAGGG      1140
     AGGCTGCCGT GGCCGGAGCG GTGCCGGCTC GGGGGCTCGG GACTTGCTCT GAGCGCACGC      1200
     ACGCTTGCCA TAGTAAGAAT TGGTTCCCCC TTCGGGAGGC AGGTTCGTTC TGAGCAACCT      1260
     CTGGTCTGCA CTCCAGGACG GATCTCTGAC ATTAGCTGGA GCAGACGTGT CCCAAGCACA      1320
     AACTCGCTAA CTAGAGCCTG GCTTCTTCGG GGAGGTGGCA GAAAGCGGCA ATCCCCCCTC      1380
     CCCCGGCAGC CTGGAGCACG GAGGAGGGAT GAGGGAGGAG GGTGCAGCGG GCGGGTGTGT      1440
     AAGGCAGTTT CATTGATAAA AAGCGAGTTC ATTCTGGAGA CTCCGGAGCG GCGCCTGCGT      1500
     CAGCGCAGAC GTCAGGGATA TTTATAACAA ACCCCCTTTC AAGCAAGTGA TGCTGAAGGG      1560
     ATAACGGGAA CGCAGCGGCA GGATGGAAGA GACAGGCACT GCGCTGCGGA ATGCCTGGGA      1620
     GGAAAAGGGG GAGACCTTTC ATCCAGGATG AGGGACATTT AAGATGAAAT GTCCGTGGCA      1680
     GGATCGTTTC TCTTCACTGC TGCATGCGGC ACTGGGAACT CGCCCCACCT GTGTCCGGAA      1740
     CCTGCTCGCT CACGTCGGCT TTCCCCTTCT GTTTTGTTCT AGGACTTCTG CACGGACCTG      1800
     GCCGTCTCCA GTGCCAACTT CATTCCCACG GTCACTGCCA TCTCGACCAG TCCGGACCTG      1860
     CAGTGGCTGG TGCAGCCCGC CCTCGTCTCC TCTGTGGCCC CATCGCAGAC CAGAGCCCCT      1920
     CACCCTTTCG GAGTCCCCGC CCCCTCCGCT GGGGCTTACT CCAGGGCTGG CGTTGTGAAG      1980
     ACCATGACAG GAGGCCGAGC GCAGAGCATT GGCAGGAGGG GCAAGGTGGA ACAGGTGAGG      2040
     AACTCTAGCG TACTCTTCCT GGGAATGTGG GGGCTGGGTG GGAAGCAGCC CCGGAGATGC      2100
     AGGAGCCCAG TACAGAGGAT GAAGCCACTG ATGGGGCTGG CTGCACATCC GTAACTGGGA      2160
     GCCCTGGCTC CAAGCCCATT CCATCCCAAC TCAGACTCTG AGTCTCACCC TAAGAAGTAC      2220
     TCTCATAGTT TCTTCCCTAA GTTTCTTACC GCATGCTTTC AGACTGGGCT CTTCTTTGTT      2280
     CTCTTGCTGA GGATCTTATT TTAAATGCAA GTCACACCTA TTCTGCAACT GCAGGTCAGA      2340
     AATGGTTTCA CAGTGGGGTG CCAGGAAGCA GGGAAGCTGC AGGAGCCAGT TCTACTGGGG      2400
     TGGGTGAATG GAGGTGATGG CAGACACTTT TACTGAATGT CGGTCTTTTT TTGTGATTAT      2460
     TCTAGTTATC TCCAGAAGAA GAAGAGAAAA GGAGAATCCG AAGGGAAAGG AATAAGATGG      2520
     CTGCAGCCAA ATGCCGCAAC CGGAGGAGGG AGCTGACTGA TACACTCCAA GCGGTAGGTA      2580
     CTCTGTGGGT TGCTCCTTTT TAAAACTTAA GGGAAAGTTG GAGATTGAGC ATAAGGGCCC      2640
     TTGAGTAAGA CTGTGTCTTA TGCTTTCCTT TATCCCTCTG TATACAGGAG ACAGACCAAC      2700
     TAGAAGATGA GAAGTCTGCT TTGCAGACCG AGATTGCCAA CCTGCTGAAG GAGAAGGAAA      2760
     AACTAGAGTT CATCCTGGCA GCTCACCGAC CTGCCTGCAA GATCCCTGAT GACCTGGGCT      2820
     TCCCAGAAGA GATGTCTGTG GCTTCCCTTG ATCTGACTGG GGGCCTGCCA GAGGTTGCCA      2880
     CCCCGGAGTC TGAGGAGGCC TTCACCCTGC CTCTCCTCAA TGACCCTGAG CCCAAGCCCT      2940
     CAGTGGAACC TGTCAAGAGC ATCAGCAGCA TGGAGCTGAA GACCGAGCCC TTTGATGACT      3000
     TCCTGTTCCC AGCATCATCC AGGCCCAGTG GCTCTGAGAC AGCCCGCTCC GTGCCAGACA      3060
     TGGACCTATC TGGGTCCTTC TATGCAGCAG ACTGGGAGCC TCTGCACAGT GGCTCCCTGG      3120
     GGATGGGGCC CATGGCCACA GAGCTGGAGC CCCTGTGCAC TCCGGTGGTC ACCTGTACTC      3180
     CCAGCTGCAC TGCTTACACG TCTTCCTTCG TCTTCACCTA CCCCGAGGCT GACTCCTTCC      3240
     CCAGCTGTGC AGCTGCCCAC CGCAAGGGCA GCAGCAGCAA TGAGCCTTCC TCTGACTCGC      3300
     TCAGCTCACC CACGCTGCTG GCCCTGTGAG GGGGCAGGGA AGGGGAGGCA GCCGGCACCC      3360
     ACAAGTGCCA CTGCCCGAGC TGGTGCATTA CAGAGAGGAG AAACACATCT TCCCTAGAGG      3420
     GTTCCTGTAG ACCTAGGGAG GACCTTATCT GTGCGTGAAA CACACCAGGC TGTGGGCCTC      3480
     AAGGACTTGA AAGCATCCAT GTGTGGACTC AAGTCCTTAC CTCTTCCGGA GATGTAGCAA      3540
     AACGCATGGA GTGTGTATTG TTCCCAGTGA CACTTCAGAG AGCTGGTAGT TAGTAGCATG      3600
     TTGAGCCAGG CCTGGGTCTG TGTCTCTTTT CTCTTTCTCC TTAGTCTTCT CATAGCATTA      3660
     ACTAATCTAT TGGGTTCATT ATTGGAATTA ACCTGGTGCT GGATATTTTC AAATTGTATC      3720
     TAGTGCAGCT GATTTTAACA ATAACTACTG TGTTCCTGGC AATAGTGTGT TCTGATTAGA      3780
     AATGACCAAT ATTATACTAA GAAAAGATAC GACTTTATTT TCTGGTAGAT AGAAATAAAT      3840
     AGCTATATCC ATGTACTGTA GTTTTTCTTC AACATCAATG TTCATTGTAA TGTTACTGAT      3900
     CATGCATTGT TGAGGTGGTC TGAATGTTCT GACATTAACA GTTTTCCATG AAAACGTTTT      3960
     ATTGTGTTTT TAATTTATTT ATTAAGATGG ATTCTCAGAT ATTTATATTT TTATTTTATT      4020
     TTTTTCTACC TTGAGGTCTT TTGACATGTG GAAAGTGAAT TTGAATGAAA AATTTAAGCA      4080
     TTGTTTGCTT ATTGTTCCAA GACATTGTCA ATAAAAGCAT TTAAGTTGAA TGCGACCAAC      4140
     CTTGTGCTCT TTTCATTCTG GAAGTCTTGT AAGTTTCTGA AAGGTATTAT TGGAGACCAG      4200
     TTTGTCAAGA AGGGTAGCTG CTGGAGGGGG ACACACCCTC TGTCTGATCC CTTATCAAAG      4260
     AGGACAAGGA AACTATAGAG CTGATTTTAG AATATTTTAC AAATACATGC CTTCCATTGG      4320
     AATGCTAAGA TTTTCTACTG CTTCTGGGGA CGGGAAACCG CTGTGTAACA GCTTTTGTGG      4380
     GAATACATTT TTTCTGTTTC AGTACTCGCA GGGGGAAATA TTTAAATTTT GTTGTGCTAA      4440
     TATTAAATTC AGATGTTTTG ATCTTAAAGG AACCCTTTAA GCAAACAGAA CCTAGCTTTG      4500
     TACAGACTAT TTTAACTTTT TATTCTCACA AAATCACGTG GAGGGTTATT CTACTTCAAA      4560
     GATGAGCAAA TTGAAGAATG GTTAGAATAA ACAACTTTCT TGATATTCCG TTATCGGCAT      4620
     TAGAATCTTC CTGCTCGTTA TCGTATCCAG CAGGCTGAAC TGCCTCTTGA TACTTGGTTA      4680
     AAAAAAATTT TCAGGCCGGG CGCGGTGGCC CATGCCTGTA ATCCTAGCAC TTTGGGAGGC      4740
     CGAGGCAGGC GGATCACCTG AGGTCGGGAG TTCGAGACCA GCCTGACCAA CATGGAGAAA      4800
     CCCCGTCTTT ACTAAAAATA CAAAATTAGC CTGGTGTGGT GGTGCATGCC TGTAATCCTA      4860
     GCTACTTGAG AGGCTGAGAC AGGAAAATCA CTTGAACTCG GGAGGCGGAT GTTGCAGCGA      4920
     ACTGAGATTG CGCCATTGCA CTCCAGCCTG GGCAACAAGA TTGAAACTCT GTTTAAAAAA      4980
     AAAAGTTTTC ACTAATGTGT ACATTTTTTT GTACTCTTTT ATTCTCGAAA GGGAAGGAGG      5040
     GCTATTGCCC TATCCCTTAT TAATAAATGC ATTGTGGTTT CTGGTTTCTC TAATACCATA      5100
     TGCCCTTCAT TCAGTTTATA GTGGGCGGAA GTGGGGGAGA AAAAGTTGCT CAGAAATCAA      5160
     AAGATATCTC AAACAGCACA AATAATGGCT GATCGTTCTG CAAACAAAAA GTTACATAAT      5220
     AGCTCAAGAA GGAGAAGTCA ACATGACTCT GAACAAGCTT TAACTTAGAA ACTTTATCAT      5280
     CTTAAGGAAG AACGTGACCT TTGTCCAGGA CGTCTCTGGT AATGGGGCAC TTACACACAC      5340
     ATGCACACGT ACAAACCACA GGGAAAGGAG ACCGCCCTTC TGCCTCTGCT CGCGAGTATC      5400
     ACGCAGGCAC CATGCACTAT GTTTTCACAC ACACTGGGTG GAAGAAGAGC TTCAGCGCCA      5460
     GTCTTCTAAT GCTTTGGTGA TAATGAAAAT CACTGGGTGC TTATGGGGTG TCATATTCAA      5520
     TCGAGTTAAA AGTTTTAATT CAAAATGACA GTTTTACTGA GGTTGATGTT CTCGTCTATG      5580
     ATATCTCTGC CCCTCCCATA AAAATGGACA TTTAAAAGCA ACTTACCGCT CTTTAGATCA      5640
     CTCCTATATC ACACACCACT TGGGGTGCTG TTTCTGCTAG ACTTGTGATG ACAGTGGCCT      5700
     TAGGATCCCT GTTTGCTGTT CAAAGGGCAA ATATTTTATA GCCTTTAAAT ATACCTAAAC      5760
     TAAATACAGA ATTAATATAA CTAACAAACA CCTGGTCTGA AATAACAAGG TGATCTACCC      5820
     TGGAAGGAAC CCAGCTGGTG GGCCAGGAGC GGTGGCTCAC ACCTGTAATT CCAGCACTTT      5880
     GGGAGGCTGA GACAGGAGGA TCACTGGAGT CCAGGAGTTT GAGACCAGCC TGGGCAACAT      5940
     GGCAAAACCC AGTGTGCTTC TGTTGTCCCA GCTACACTAC TCAGGAGGCT GAGGCAGGAG      6000
     TATGACTTGA GCCTGGGAGG GGGAGGTTGC AGAGAACTGA TATTGCACCA CCACTGCACT      6060
     CCAGCCTGGG TGACACAGCA AAACCCTATC TCAAAAAAAA AAAAAAAAAA AAGGAACCCA      6120
     GCTGGTTCCT GTAGGTGTGC AATAATAACA ACCAGAGGAA GAAAAGGAAG ACGATTTCCC      6180
     AGATGAAGAA GGGCAGCTGG ACCTTCGGAC                                       6210
//



spacer
spacer