![]() |
EBI DbfetchID M10604; SV 1; linear; genomic DNA; STD; FUN; 3793 BP. XX AC M10604; XX DT 18-NOV-1986 (Rel. 10, Created) DT 17-APR-2005 (Rel. 83, Last updated, Version 4) XX DE Yeast (S.carlsbergensis) MEL1 (alpha-galactosidase) gene, complete cds. XX KW alpha-galactosidase; melibiase. XX OS Saccharomyces pastorianus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes; OC Saccharomycetales; Saccharomycetaceae; Saccharomyces. XX RN [1] RP 1-3793 RA Sumner-Smith M.; RT ; RL Unpublished. XX RN [2] RP 930-3793 RX DOI; 10.1016/0378-1119(85)90188-X RX PUBMED; 3000884. RA Sumner-Smith M., Bozzato R.P., Skipper N., Davies R.W., Hopper J.E.; RT "Analysis of the inducible MEL1 gene of Saccharomyces carlsbergensis and RT its secreted product, alpha-galactosidase (melibiase)"; RL Gene 36(3):333-340(1985). XX CC Draft entry and sequence in computer readable form for [1] kindly CC provided by M.Sumner-Smith, 26-DEC-1985. CC The 5' flank of the MEL1 gene contains a region (UASm, positions CC 1177-1229) having certain areas of sequence homology to similar CC sites found upstream of GAL1, GAL7 and GAL10, which are also CC regulated by the action of the products of genes GAL4 and GAL80. CC Potential TATA boxes are found at positions 1323-1328, 1332-1337 CC and 1379-1384 and a potential poly-adenylation signal at 2950-2960. XX FH Key Location/Qualifiers FH FT source 1..3793 FT /organism="Saccharomyces pastorianus" FT /mol_type="genomic DNA" FT /db_xref="taxon:27292" FT sig_peptide 1442..1495 FT /note="alpha-galactosidase signal peptide" FT CDS 1442..2857 FT /codon_start=1 FT /note="pre-alpha galactosidase (melibiase)" FT /db_xref="GOA:P04824" FT /db_xref="InterPro:IPR006215" FT /db_xref="UniProtKB/Swiss-Prot:P04824" FT /protein_id="AAA34770.1" FT /translation="MFAFYFLTACISLKGVFGVSPSYNGLGLTPQMGWDNWNTFACDVS FT EQLLLDTADRISDLGLKDMGYKYIILDDCWSSGRDSDGFLVADEQKFPNGMGHVADHLH FT NNSFLFGMYSSAGEYTCAGYPGSLGREEEDAQFFANNRVDYLKYDNCYNKGQFGTPEIS FT YHRYKAMSDALNKTGRPIFYSLCNWGQDLTFYWGSGIANSWRMSGDVTAEFTRPDSRCP FT CDGDEYDCKYAGFHCSIMNILNKAAPMGQNAGVGGWNDLDNLEVGVGNLTDDEEKAHFS FT MWAMVKSPLIIGANVNNLKASSYSIYSQASVIAINQDSNGIPATRVWRYYVSDTDEYGQ FT GEIQMWSGPLDNGDQVVALLNGGSVSRPMNTTLEEIFFDSNLGSKKLTSTWDIYDLWAN FT RVDNSTASAILGRNKTATGILYNATEQSYKDGLSKNDTRLFGQKIGSLSPNAILNTTVP FT AHGIAFYRLRPSS" FT mat_peptide 1496..2854 FT /note="alpha-galactosidase" XX SQ Sequence 3793 BP; 1133 A; 728 C; 803 G; 1128 T; 1 other; ggatcaaagg aaaaatattt cttgcacttc acaattttgc gctgtcatat atctaggtgc 60 tgctgtctaa gaggataaaa gagaaaaaaa gacgttccca ttcaaagcaa attattattt 120 tctatttcct aacaatagag tcttaaggac aaacaaaata caaacatacc ttacatgtcc 180 ctcgtctatt aaattcaatt cggtaataaa aatgtgaaat tttccccgat tattgctggt 240 tgtatgttac atggttcaac gactccgtgc gtaatggacg ataaggagct gatattcaag 300 atactcaaaa tattgaagca ttgttacaac caaggttcat gtacttttga cagtgcgaat 360 gtgtacagca atgggaaagg gcgagcgttt gttgggtgag gtttttgaaa cactacaaca 420 tcaacaaaga gactattgtt atactctcta agatttatac ctctgttgat gagtcacttg 480 gcgttcttca ccttgggttt agtgaactca ccacatggcc gccactgaag ttagcaaacc 540 aaaaaggctt atttcgtaag cacattctgg atggtgggaa atctgttgaa aggtcgggag 600 catatattga tggtctggca aattcataga atggaccata aaattccaat ggaagaaaca 660 ataaaggctc tggacgacgt cattgagagc ggtgacgtta gacacattgg cgccttcact 720 atggaggaac tatacacctg atatgttgag gttcacttgt ctcttatgct ttctttttta 780 ctttaatatt atgtatactg aaaaatgcac gtgatgataa gactttggaa atttgtgtaa 840 aacccccatt tttttttgct gctgctattg ccaaagncaa caagtcttca ggagacatca 900 acactaagtt tctaccccgt cttcccctag aattctttct gtacgctcag ggtgggcctt 960 taaaggatag caccctaccg aagtcgactt ctaagtaaac accattacta ggagatgact 1020 aaatctggaa aacacatggt ggtctgaatg cgtctagtct ctgccataaa cataacatgt 1080 ttgttttaat gcattctcgt gtttaatcga cattaatgtg gggggagaaa gacatcccat 1140 ccctgaaagg tttttccagg gaatagtcag gacgcattgg ctttcattcg gccatatgtc 1200 ttccgaaaga agaagaaagg aagacatgta ttacattatc caacaaaaaa tggttcttga 1260 cgtctacaaa tcaagaatct taaagacatt gaacgaagta gctgaataaa aattatgaaa 1320 actataaaaa ctataaaaac tgtacttaag tcctcaataa aacataaact tcttactgta 1380 taaggttttc gataatttct tacttgattc taggagagca acggtaataa aagcaacgac 1440 gatgtttgct ttctactttc tcaccgcatg catcagtttg aagggcgttt ttggggtgtc 1500 tccgagttac aatggccttg gtctcactcc acagatgggt tgggacaact ggaatacgtt 1560 tgcctgcgat gtcagtgaac agctacttct agacaccgct gatagaattt ctgacttggg 1620 gctaaaggat atgggttaca agtatatcat tctggatgac tgctggtcta gcggcagaga 1680 ttccgacggt ttcctcgttg cagatgaaca aaaatttccc aatggtatgg gccatgttgc 1740 agaccacctg cataataaca gctttctttt cggtatgtat tcgtctgctg gtgagtacac 1800 ctgtgctgga tatcctgggt ctctgggtcg tgaggaagaa gatgcacagt tctttgcaaa 1860 taaccgcgtt gactacttga agtacgataa ttgttacaat aagggtcagt ttggtacacc 1920 ggaaatttct taccaccgtt acaaggccat gtcagatgct ttgaataaaa ctggtaggcc 1980 tatattctat tctctatgta actggggtca ggatttaaca ttttactggg gctctggtat 2040 cgccaattct tggagaatga gtggagatgt tactgctgag ttcactcgtc cagatagcag 2100 atgtccctgt gatggcgatg aatacgattg caagtacgcc ggtttccatt gttctattat 2160 gaatattctt aacaaggcag ctccaatggg gcaaaatgca ggtgttggtg gttggaatga 2220 tctggacaat ctagaggttg gtgtcgggaa tttgactgac gatgaggaaa aggcacattt 2280 ctctatgtgg gcaatggtaa agtctccact tatcattggt gccaatgtga ataacttaaa 2340 ggcatcttcg tactcaatct atagtcaagc ctctgtcatc gcaattaatc aagattcaaa 2400 tggtattcca gcaacaagag tctggagata ttatgtttca gacacagatg aatatggaca 2460 aggtgaaatt caaatgtgga gtggtcctct tgacaatggt gatcaagtgg ttgctttatt 2520 gaatggagga agcgtatcta gaccaatgaa cacgaccttg gaagagattt tttttgacag 2580 caatctgggt tcaaagaaac tgacatcgac ttgggatatc tacgacctat gggccaacag 2640 agttgacaac tcgacagcgt ctgctatcct tggacggaat aagacagcca ccggtattct 2700 ctacaatgct acggagcaat cctacaaaga cggtttgtct aagaatgata caagactgtt 2760 tggtcagaaa attggtagtc tttctccaaa tgctatactt aacacgactg ttccagctca 2820 cggtatcgcc ttctataggt tgagaccctc ttcttgagct tattgttgag caaagcaggg 2880 cgagaagtat tgatgattgt taaaaagttc atgaaaaaaa tactactcga atatttattc 2940 agagtaacta aataataaac gacagaatag cctatcaggt attccaatag ttttcgtttt 3000 gtaggtacat aatctgaagc ccttgaactt tttctcgttt acatacttca ttgcattagc 3060 gatatttcac atgtgctata ctagtgactt ttgtaaaata cttttctgcc aatgtgattc 3120 tagaattgta tgacatttca cgaagaggaa caacagcttc aggagtacat acaaacgacg 3180 aaatcatcat atcagcaaaa ccggatatcg aagaagcaga aggcgctagt ctacaggtat 3240 ctctgtagtt gatacaaaca aaaacaaaga gttcgaacac atttctgggt cccgagtagt 3300 ggcgttggtg ttccaccatt ggtcgaacaa ttgttttttg ctaccgctcg ctcatagccg 3360 gtttcagtaa aattgaagta agcccagttt catctgaatt tgagaatagg gtctaaaaag 3420 ggagtatcac catcgaacgc ttcctgatag tccttagaga acagtcatca gatgtccata 3480 tattactact acccttctag acctgaccac gtactgttgg atgcattggt aagaattttt 3540 gcccgtatta tcttcaattt tagaataaac atctaacttg gggttagtag cgttcacaaa 3600 tatgccaaca ttgttgacag acttgttgaa catctcaagt tcttcaggac tagcacggct 3660 ttgagtgaag cccacttgtt cttctttgat gtcgaaatcg cacacagtgt tcaggaagaa 3720 attgtagctc ttcgaaatgt tttcgttctc actctaccag gcttcaccaa gcactctgag 3780 aaactcggga tcc 3793 // ![]() |