Dbfetch
ID K00650; SV 1; linear; genomic DNA; STD; HUM; 6210 BP.
XX
AC K00650; M16287;
XX
DT 26-JUL-1991 (Rel. 28, Created)
DT 14-NOV-2006 (Rel. 89, Last updated, Version 4)
XX
DE Human fos proto-oncogene (c-fos), complete cds.
XX
KW c-myc proto-oncogene; fos oncogene; proto-oncogene.
XX
OS Homo sapiens (human)
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
XX
RN [1]
RP 1-4165
RX DOI; 10.1073/pnas.80.11.3183.
RX PUBMED; 6574479.
RA van Straaten F., Muller R., Curran T., Van Beveren C., Verma I.M.;
RT "Complete nucleotide sequence of a human c-onc gene: deduced amino acid
RT sequence of the human c-fos protein";
RL Proc. Natl. Acad. Sci. U.S.A. 80(11):3183-3187(1983).
XX
RN [2]
RX DOI; 10.1016/0092-8674(85)90285-5.
RX PUBMED; 2414012.
RA Treisman R.;
RT "Transient accumulation of c-fos RNA following serum stimulation requires a
RT conserved 5' element and c-fos 3' sequences";
RL Cell 42(3):889-902(1985).
XX
RN [3]
RP 4166-6210
RX PUBMED; 3555978.
RA Verma I.M., Deschamps J., Van Beveren C., Sassone-Corsi P.;
RT "Human fos gene";
RL Cold Spring Harb. Symp. Quant. Biol. 51:0(0).
XX
DR MD5; 2cdb1f2aa6c13c384c8522058c958a7c.
DR EPD; EP11145; HS_FOS.
DR Ensembl-Gn; ENSG00000170345; homo_sapiens.
DR Ensembl-Tr; ENST00000303562; homo_sapiens.
DR Ensembl-Tr; ENST00000535987; homo_sapiens.
DR Ensembl-Tr; ENST00000555686; homo_sapiens.
DR EuropePMC; PMC116128; 11711622.
DR EuropePMC; PMC19553; 9012824.
DR EuropePMC; PMC3220474; 21937452.
DR EuropePMC; PMC3750569; 23919306.
DR EuropePMC; PMC4786693; 26786102.
XX
CC [2] sites; promoter region.
CC C-fos is the human cellular homolog of the v-fos oncogene of
CC Finkel-Biskis-Jinkins murine osteosarcoma virus (FBJ-MuSV). [2] It
CC was found that both human and murine c-fos genes contained an
CC enhancer-like element in their 5' noncoding regions that was
CC necessary for increased transcription following serum activation.
CC The FBJ-MuSV v-fos oncogene contains a deletion relative to murine
CC and human c-fos proto-oncogenes that causes complete divergence of
CC the COOH terminal protein sequences encoded. That deletion
CC corresponds to positions 3182-3285 inclusive of this sequence. The
CC FBJ-MuSV v-fos sequence is more closely related to murine than
CC human c-fos sequences. The FBJ-MuSV v-fos coding sequence ends at
CC a 'tag' stop codon coresponding to positions 3434-2436 of this
CC sequence [1]. [1] notes two alu repeats beginning aproximately 500
CC and 1700 nucleotides downstream of the last base in this sequence.
CC A TATA box is located at positions 701-707. Two potential
CC polyadenylation signals are present in the 3' untranslated region.
XX
FH Key Location/Qualifiers
FH
FT source 1..6210
FT /organism="Homo sapiens"
FT /map="14q24.3"
FT /mol_type="genomic DNA"
FT /db_xref="taxon:9606"
FT misc_feature 402..453
FT /note="transcriptional activator region [2]"
FT prim_transcript 734..>3329
FT /note="c-fos mRNA [1]"
FT gene 889..1029
FT /gene="FOS"
FT CDS join(889..1029,1783..2034,2466..2573,2688..3329)
FT /codon_start=1
FT /note="c-fos protein"
FT /db_xref="GOA:P01100"
FT /db_xref="HGNC:HGNC:3796"
FT /db_xref="InterPro:IPR000837"
FT /db_xref="InterPro:IPR004827"
FT /db_xref="InterPro:IPR029816"
FT /db_xref="PDB:1A02"
FT /db_xref="PDB:1FOS"
FT /db_xref="PDB:1S9K"
FT /db_xref="UniProtKB/Swiss-Prot:P01100"
FT /protein_id="AAA52471.1"
FT /translation="MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPVN
FT AQDFCTDLAVSSANFIPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPSAG
FT AYSRAGVVKTMTGGRAQSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRELTD
FT TLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSVASLD
FT LTGGLPEVATPESEEAFTLPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPASSRPS
FT GSETARSVPDMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSCTAYTSS
FT FVFTYPEADSFPSCAAAHRKGSSSNEPSSDSLSSPTLLAL"
FT exon <889..1029
FT /gene="FOS"
FT /number=1
FT /note="c-fos protein; G00-119-917"
FT intron 1030..1782
FT /note="c-fos intron A"
FT exon 1783..2034
FT /number=2
FT intron 2035..2465
FT /note="c-fos intron B"
FT exon 2466..2573
FT /number=3
FT intron 2574..2687
FT /note="c-fos intron C"
FT exon 2688..>3329
FT /number=4
FT /note="c-fos protein"
XX
SQ Sequence 6210 BP; 1497 A; 1571 C; 1619 G; 1523 T; 0 other;
gcaggaacag tgctagtatt gctcgagccc gagggctgga ggttagggga tgaaggtctg 60
cttccacgct ttgcactgaa ttagggctag aattggggat gggggtaggg gcgcattcct 120
tcgggagccg aggcttaagt cctcggggtc ctgtactcga tgccgtttct cctatctctg 180
agcctcagaa ctgtcttcag tttccgtaca agggtaaaaa ggcgctctct gccccatccc 240
ccccgacctc gggaacaagg gtccgcattg aaccaggtgc gaatgttctc tctcattctg 300
cgccgttccc gcctcccctc ccccagccgc ggcccccgcc tccccccgca ctgcaccctc 360
ggtgttggct gcagcccgcg agcagttccc gtcaatccct ccccccttac acaggatgtc 420
catattagga catctgcgtc agcaggtttc cacggccttt ccctgtagcc ctggggggag 480
ccatccccga aacccctcat cttggggggc ccacgagacc tctgagacag gaactgcgaa 540
atgctcacga gattaggaca cgcgccaagg cgggggcagg gagctgcgag cgctggggac 600
gcagccgggc ggccgcagaa gcgcccaggc ccgcgcgcca cccctctggc gccaccgtgg 660
ttgagcccgt gacgtttaca ctcattcata aaacgcttgt tataaaagca gtggctgcgg 720
cgcctcgtac tccaaccgca tctgcagcga gcaactgaga agccaagact gagccggcgg 780
ccgcggcgca gcgaacgagc agtgaccgtg ctcctaccca gctctgcttc acagcgccca 840
cctgtctccg cccctcggcc cctcgcccgg ctttgcctaa ccgccacgat gatgttctcg 900
ggcttcaacg cagactacga ggcgtcatcc tcccgctgca gcagcgcgtc cccggccggg 960
gatagcctct cttactacca ctcacccgca gactccttct ccagcatggg ctcgcctgtc 1020
aacgcgcagg taaggctggc ttcccgtcgc cgcggggccg ggggcttggg gtcgcggagg 1080
aggagacacc gggcgggacg ctccagtaga tgagtagggg gctcccttgt gcctggaggg 1140
aggctgccgt ggccggagcg gtgccggctc gggggctcgg gacttgctct gagcgcacgc 1200
acgcttgcca tagtaagaat tggttccccc ttcgggaggc aggttcgttc tgagcaacct 1260
ctggtctgca ctccaggacg gatctctgac attagctgga gcagacgtgt cccaagcaca 1320
aactcgctaa ctagagcctg gcttcttcgg ggaggtggca gaaagcggca atcccccctc 1380
ccccggcagc ctggagcacg gaggagggat gagggaggag ggtgcagcgg gcgggtgtgt 1440
aaggcagttt cattgataaa aagcgagttc attctggaga ctccggagcg gcgcctgcgt 1500
cagcgcagac gtcagggata tttataacaa accccctttc aagcaagtga tgctgaaggg 1560
ataacgggaa cgcagcggca ggatggaaga gacaggcact gcgctgcgga atgcctggga 1620
ggaaaagggg gagacctttc atccaggatg agggacattt aagatgaaat gtccgtggca 1680
ggatcgtttc tcttcactgc tgcatgcggc actgggaact cgccccacct gtgtccggaa 1740
cctgctcgct cacgtcggct ttccccttct gttttgttct aggacttctg cacggacctg 1800
gccgtctcca gtgccaactt cattcccacg gtcactgcca tctcgaccag tccggacctg 1860
cagtggctgg tgcagcccgc cctcgtctcc tctgtggccc catcgcagac cagagcccct 1920
caccctttcg gagtccccgc cccctccgct ggggcttact ccagggctgg cgttgtgaag 1980
accatgacag gaggccgagc gcagagcatt ggcaggaggg gcaaggtgga acaggtgagg 2040
aactctagcg tactcttcct gggaatgtgg gggctgggtg ggaagcagcc ccggagatgc 2100
aggagcccag tacagaggat gaagccactg atggggctgg ctgcacatcc gtaactggga 2160
gccctggctc caagcccatt ccatcccaac tcagactctg agtctcaccc taagaagtac 2220
tctcatagtt tcttccctaa gtttcttacc gcatgctttc agactgggct cttctttgtt 2280
ctcttgctga ggatcttatt ttaaatgcaa gtcacaccta ttctgcaact gcaggtcaga 2340
aatggtttca cagtggggtg ccaggaagca gggaagctgc aggagccagt tctactgggg 2400
tgggtgaatg gaggtgatgg cagacacttt tactgaatgt cggtcttttt ttgtgattat 2460
tctagttatc tccagaagaa gaagagaaaa ggagaatccg aagggaaagg aataagatgg 2520
ctgcagccaa atgccgcaac cggaggaggg agctgactga tacactccaa gcggtaggta 2580
ctctgtgggt tgctcctttt taaaacttaa gggaaagttg gagattgagc ataagggccc 2640
ttgagtaaga ctgtgtctta tgctttcctt tatccctctg tatacaggag acagaccaac 2700
tagaagatga gaagtctgct ttgcagaccg agattgccaa cctgctgaag gagaaggaaa 2760
aactagagtt catcctggca gctcaccgac ctgcctgcaa gatccctgat gacctgggct 2820
tcccagaaga gatgtctgtg gcttcccttg atctgactgg gggcctgcca gaggttgcca 2880
ccccggagtc tgaggaggcc ttcaccctgc ctctcctcaa tgaccctgag cccaagccct 2940
cagtggaacc tgtcaagagc atcagcagca tggagctgaa gaccgagccc tttgatgact 3000
tcctgttccc agcatcatcc aggcccagtg gctctgagac agcccgctcc gtgccagaca 3060
tggacctatc tgggtccttc tatgcagcag actgggagcc tctgcacagt ggctccctgg 3120
ggatggggcc catggccaca gagctggagc ccctgtgcac tccggtggtc acctgtactc 3180
ccagctgcac tgcttacacg tcttccttcg tcttcaccta ccccgaggct gactccttcc 3240
ccagctgtgc agctgcccac cgcaagggca gcagcagcaa tgagccttcc tctgactcgc 3300
tcagctcacc cacgctgctg gccctgtgag ggggcaggga aggggaggca gccggcaccc 3360
acaagtgcca ctgcccgagc tggtgcatta cagagaggag aaacacatct tccctagagg 3420
gttcctgtag acctagggag gaccttatct gtgcgtgaaa cacaccaggc tgtgggcctc 3480
aaggacttga aagcatccat gtgtggactc aagtccttac ctcttccgga gatgtagcaa 3540
aacgcatgga gtgtgtattg ttcccagtga cacttcagag agctggtagt tagtagcatg 3600
ttgagccagg cctgggtctg tgtctctttt ctctttctcc ttagtcttct catagcatta 3660
actaatctat tgggttcatt attggaatta acctggtgct ggatattttc aaattgtatc 3720
tagtgcagct gattttaaca ataactactg tgttcctggc aatagtgtgt tctgattaga 3780
aatgaccaat attatactaa gaaaagatac gactttattt tctggtagat agaaataaat 3840
agctatatcc atgtactgta gtttttcttc aacatcaatg ttcattgtaa tgttactgat 3900
catgcattgt tgaggtggtc tgaatgttct gacattaaca gttttccatg aaaacgtttt 3960
attgtgtttt taatttattt attaagatgg attctcagat atttatattt ttattttatt 4020
tttttctacc ttgaggtctt ttgacatgtg gaaagtgaat ttgaatgaaa aatttaagca 4080
ttgtttgctt attgttccaa gacattgtca ataaaagcat ttaagttgaa tgcgaccaac 4140
cttgtgctct tttcattctg gaagtcttgt aagtttctga aaggtattat tggagaccag 4200
tttgtcaaga agggtagctg ctggaggggg acacaccctc tgtctgatcc cttatcaaag 4260
aggacaagga aactatagag ctgattttag aatattttac aaatacatgc cttccattgg 4320
aatgctaaga ttttctactg cttctgggga cgggaaaccg ctgtgtaaca gcttttgtgg 4380
gaatacattt tttctgtttc agtactcgca gggggaaata tttaaatttt gttgtgctaa 4440
tattaaattc agatgttttg atcttaaagg aaccctttaa gcaaacagaa cctagctttg 4500
tacagactat tttaactttt tattctcaca aaatcacgtg gagggttatt ctacttcaaa 4560
gatgagcaaa ttgaagaatg gttagaataa acaactttct tgatattccg ttatcggcat 4620
tagaatcttc ctgctcgtta tcgtatccag caggctgaac tgcctcttga tacttggtta 4680
aaaaaaattt tcaggccggg cgcggtggcc catgcctgta atcctagcac tttgggaggc 4740
cgaggcaggc ggatcacctg aggtcgggag ttcgagacca gcctgaccaa catggagaaa 4800
ccccgtcttt actaaaaata caaaattagc ctggtgtggt ggtgcatgcc tgtaatccta 4860
gctacttgag aggctgagac aggaaaatca cttgaactcg ggaggcggat gttgcagcga 4920
actgagattg cgccattgca ctccagcctg ggcaacaaga ttgaaactct gtttaaaaaa 4980
aaaagttttc actaatgtgt acattttttt gtactctttt attctcgaaa gggaaggagg 5040
gctattgccc tatcccttat taataaatgc attgtggttt ctggtttctc taataccata 5100
tgcccttcat tcagtttata gtgggcggaa gtgggggaga aaaagttgct cagaaatcaa 5160
aagatatctc aaacagcaca aataatggct gatcgttctg caaacaaaaa gttacataat 5220
agctcaagaa ggagaagtca acatgactct gaacaagctt taacttagaa actttatcat 5280
cttaaggaag aacgtgacct ttgtccagga cgtctctggt aatggggcac ttacacacac 5340
atgcacacgt acaaaccaca gggaaaggag accgcccttc tgcctctgct cgcgagtatc 5400
acgcaggcac catgcactat gttttcacac acactgggtg gaagaagagc ttcagcgcca 5460
gtcttctaat gctttggtga taatgaaaat cactgggtgc ttatggggtg tcatattcaa 5520
tcgagttaaa agttttaatt caaaatgaca gttttactga ggttgatgtt ctcgtctatg 5580
atatctctgc ccctcccata aaaatggaca tttaaaagca acttaccgct ctttagatca 5640
ctcctatatc acacaccact tggggtgctg tttctgctag acttgtgatg acagtggcct 5700
taggatccct gtttgctgtt caaagggcaa atattttata gcctttaaat atacctaaac 5760
taaatacaga attaatataa ctaacaaaca cctggtctga aataacaagg tgatctaccc 5820
tggaaggaac ccagctggtg ggccaggagc ggtggctcac acctgtaatt ccagcacttt 5880
gggaggctga gacaggagga tcactggagt ccaggagttt gagaccagcc tgggcaacat 5940
ggcaaaaccc agtgtgcttc tgttgtccca gctacactac tcaggaggct gaggcaggag 6000
tatgacttga gcctgggagg gggaggttgc agagaactga tattgcacca ccactgcact 6060
ccagcctggg tgacacagca aaaccctatc tcaaaaaaaa aaaaaaaaaa aaggaaccca 6120
gctggttcct gtaggtgtgc aataataaca accagaggaa gaaaaggaag acgatttccc 6180
agatgaagaa gggcagctgg accttcggac 6210
//