EBI Dbfetch
ID   AJ547797; SV 1; linear; genomic DNA; STD; FUN; 6984 BP.
XX
AC   AJ547797;
XX
DT   25-FEB-2003 (Rel. 74, Created)
DT   14-NOV-2006 (Rel. 89, Last updated, Version 3)
XX
DE   Yarrowia lipolytica ugt1 gene for UDP-Glc:glycoprotein glucosyltransferase
DE   precursor, exons 1-2
XX
KW   UDP-Glc:glycoprotein glucosyltransferase precursor; ugt1 gene.
XX
OS   Yarrowia lipolytica
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC   Saccharomycetales; Dipodascaceae; Yarrowia.
XX
RN   [1]
RP   1-6984
RA   Babour A.;
RT   ;
RL   Submitted (24-FEB-2003) to the EMBL/GenBank/DDBJ databases.
RL   Babour A., Genetique moleculaire et cellulaire, INRA, CNRS, INA-PG, Ina,
RL   78850 Thiverval Grignon, FRANCE.
XX
RN   [2]
RA   Babour A., Beckerich J.M., Gaillardin C.;
RT   "Identification of a new UDP-Glc:glycoprotein glucosyltransferase in the
RT   yeast Yarrowia lipolytica";
RL   Unpublished.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..6984
FT                   /organism="Yarrowia lipolytica"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:4952"
FT   CDS             join(572..681,1324..5626)
FT                   /gene="ugt1"
FT                   /product="UDP-Glc:glycoprotein glucosyltransferase
FT                   precursor"
FT                   /db_xref="GOA:Q873M5"
FT                   /db_xref="InterPro:IPR009448"
FT                   /db_xref="UniProtKB/TrEMBL:Q873M5"
FT                   /protein_id="CAD67998.1"
FT                   /translation="MKLLKYAAAALFASSVAANVTVELKAQWQSDFLLELVETLSTEHT
FT                   YFDFINHIAEQVEDGANYTDKEWYDNLTAFAASKSEISDFQKSLTDAALAFRRESPLIE
FT                   TFYQLGDSQESECDIFFTYNGKKYCDSNDLFTLKTTKMPKNPSVYFFDHVARSVQANKH
FT                   LKNIPVTVLYADLRSPEFPLFHKILYQEAQDGKMVYILRYRRSDRSERVTMTGYGAELS
FT                   LKKTDYLVLDDDANTEKLTDNKNPVYTKRELQNMGLNAAQFVLNHRKDPEAALKALKEV
FT                   SFDFPLLSSSLNNTKPVKGFQKALQENTAAGDFMPGANQMFVNGALLSTSASNLQSLFD
FT                   LVALEHSRLEVLAKTLKGAISAEQLASILNDYPLQHALESQPQRIDYRDADALLWLNNL
FT                   ATDIQYQEWPRSVASLLQNQINLAHNAQTVVMPFNMDDFADVKVDKETGELINMHPINR
FT                   GKLTVLFTMLQRSMPIQFGVVPYGSTLKGKKLSQYLHYLARNVDATASLRFLFALGAGT
FT                   PVEEIFTQIPAEITQESVDEALKEESYEPYVTASREWMKKLGMNEAQTQEPAFVMNGIV
FT                   MPFSEEWQNLVGARFQQDLPEMLKLVQKQIIKVSKEAPVGEDDEDEDDENAGNPVDKYF
FT                   ANHEIKDLLVEGSPTRRNLILSPASFTNLQYLESDLNLKDVSTATIAGTTASIFSNSIL
FT                   AGNFASTQFVSQLLALIEAQQEEKDRFAFRTQFVHIGQTDSAKGQVINGVVQKLTELAG
FT                   EEQVSLLKDALAYLEGDNASFSAKKVPFDKTITASEEVKYLKQFKTTESSTLLIIDGLV
FT                   NDISSKAMLLDKSEVIALYEREAVRRLELTSVALSDLKLLTKSVENNLDLAKVHIPLAK
FT                   IFYGDATEQESTLYDVRAFHHIRWNKDVSSFVLGDESTSLVKIVAAVDVLSDGGQRLVS
FT                   QIEAISKVTGVSVRVFPSPKAPDARQEPTLPLKRFYRAHNSVVPEFDAEGAHKVPNLNF
FT                   EGLPAQNLLTFGLDAPSSWIAMPADNTHDLDNILLEEDSEDFVDASYSLQNILIEGSII
FT                   DITKNSYAPGTDLLLKSTLTGESSDTLVMSNLGYFQLQAGPGLWELNLGPSASDVYETE
FT                   HEVIIPVTDVLGPHISLSMERKKGKENVVVGASQDKAKLWSKLKKSTGVSTKKQADINI
FT                   FTVASGHLYERFLSIMTASVMAHTDHTVKFWLIENFLSASFKAFLPHLAAHYGFEYELV
FT                   TYQWPHWLRGQTEKQRQIWGYKILFLDVLFPQDLERVIFIDSDQIVRTDLYELVEMDLE
FT                   GAPYGFTPMCDSRKEMDGFRFWKQGYWDTFLGDDLVYHISALFVVDLKVFRAQQIGDRL
FT                   RVHYHQLSADPASLSNLDQDLPNNLQRQVPIFSLPQDWLWCETWCSDESLKTAKTIDMC
FT                   NNPLTKEPKLDRARRQVPEWTKYDDEIRKLRKEAEGIEGKKKEEEERAGPVEVEVEIDE
FT                   PEADLHDEL"
FT   exon            572..681
FT                   /gene="ugt1"
FT                   /number=1
FT   sig_peptide     572..625
FT                   /gene="ugt1"
FT                   /experiment="experimental evidence, no additional details
FT                   recorded"
FT   mat_peptide     join(626..681,1324..5623)
FT                   /gene="ugt1"
FT                   /product="UDP-Glc:glycoprotein glucosyltransferase"
FT   intron          682..1323
FT                   /gene="ugt1"
FT                   /number=1
FT   exon            1324..5626
FT                   /gene="ugt1"
FT                   /number=2
XX
SQ   Sequence 6984 BP; 1697 A; 1832 C; 1636 G; 1819 T; 0 other;
     acggtatggg ctgttctcag tgtataatcg tgtgtaattt ttatataatc acggctaatg        60
     aaattatgtt aatgagatgg cactggcgtc gtaccagtag gaaagatatc aacataccaa       120
     catacactcc acaatgctgt gtagaataat agagaatata aaaggtcgat acagtcgtgg       180
     ctaccgacac tgacgtcttt cacgccctca tctcgaatga aacaactctg cgtctgcctt       240
     cccttttccc tggctgtcta ttgttgcccc gcaggacccc tgttactaag tacatatagt       300
     tggagtttat ttaggacatg gggtgtcttt atattgggat gtagaggata tagaaagaat       360
     ttgaaagcag attctggata ttccaatcaa caccattcta atggtaccaa aaaacctcac       420
     ctttgtatta caataatcgc cattttccga cgcccccact ttttccaagg acaacacatg       480
     cataagcgag attaagattt ggcgatggtg cgtataccag cctcgacaga gtctcttgtc       540
     tgctactttc actacttgtc atatcagcat catgaaactg ttgaagtatg ccgccgcggc       600
     gctctttgcg tccagcgtgg ctgccaacgt gacggtcgag ctcaaggctc aatggcagag       660
     tgactttctg ctggaattag tgtgagtatc gagcatggca gggcggcgat cttgcgtgtt       720
     tggtttggct atgagttctg gcagtggact ggatcaatta gtggtggcca ataccagtgt       780
     ccaagactac atgtcacaca ggacccagta tgtgcgacag tccaaggtac atgagcgact       840
     tgagtcaggc aatctggatc taaatgatgg cctcggagag ctctgactac gacgatgacc       900
     ttagcaggtt attggacaga gacagtactt gggtgattgt gttaaatgtt gagtgactca       960
     gccatggaag aggcgatcag tgtagacaag caaatccccg gaccgatatt ggcctgtgtc      1020
     taccatcctg gcaccaatgg acggtatgga tccaaggtca gacgattgtt atcggagata      1080
     cgaaccaacc cggttcatta gatcctgctc cccgacgacc tccggtagat tcgatggaga      1140
     gtactatttc tctctacctg acaagttcgc gctaaagatc atcaccggag ttcacaccag      1200
     caaatggtct ccccctcgct ggactgacaa aaagctgatc tcaagactct gtcaccatta      1260
     acgtacgagc tgccgatatt ctgttggctg tcgacaaaat tgtccatcgc ctttctaacc      1320
     cagtgaaacc ctttcaactg aacacaccta ctttgatttc atcaatcaca ttgccgaaca      1380
     ggtcgaggat ggtgccaatt ataccgacaa ggaatggtac gacaacctga cggcctttgc      1440
     ggcctccaaa agcgagatct ccgacttcca gaaatctctg accgacgccg ccctggcctt      1500
     ccggcgagaa tctcctctga ttgagacctt ctaccagctc ggcgacagcc aggaatccga      1560
     gtgcgacatc ttcttcacct acaatggcaa aaaatactgc gactccaacg atctctttac      1620
     tctcaagacc accaagatgc ccaagaaccc cagtgtgtac tttttcgacc acgtggctcg      1680
     atccgtacag gccaacaagc atcttaagaa catccccgtg accgttcttt acgctgatct      1740
     gcgatctccc gagttccctc tgttccacaa gattctgtac caggaggccc aggatggaaa      1800
     gatggtgtac attctgcgat accgacgatc agaccgatct gagcgagtta ccatgaccgg      1860
     ttacggtgcc gagctgtccc tcaagaaaac tgattacctg gttcttgacg acgacgctaa      1920
     cactgagaag ctgactgaca acaagaaccc cgtctacacc aagagggagc tgcagaacat      1980
     gggcctgaac gccgcgcagt ttgttctcaa ccatcgaaag gaccccgagg ctgctctcaa      2040
     ggccctcaag gaggtgtctt tcgacttccc tcttctatca tcttctctga acaacaccaa      2100
     gcctgtcaag ggcttccaga aggccctcca ggaaaataca gccgctggtg atttcatgcc      2160
     tggagccaat cagatgtttg tcaacggtgc cttgctctcc acctcggcca gcaacctgca      2220
     gtctctgttt gatctcgttg ctcttgagca ctctcgactc gaggttctgg ctaagaccct      2280
     aaagggtgca atctctgctg agcagcttgc ttctattctc aacgactatc ctttgcaaca      2340
     cgctttggag tcccagcctc agcgaatcga ctaccgtgac gccgatgctc ttctgtggct      2400
     caacaacctg gctacagata tccagtacca ggagtggccc cgatctgtgg cttctctgct      2460
     ccagaaccag atcaacctgg ctcataacgc ccagacagtg gtgatgcctt tcaacatgga      2520
     cgatttcgct gatgtcaagg ttgacaagga gactggagag ttgatcaaca tgcaccctat      2580
     taaccgaggt aagctcactg tgctgttcac tatgctgcaa cgaagcatgc ccatccagtt      2640
     tggtgtcgtt ccttacggat ctaccctcaa gggtaagaag ctgtctcagt accttcacta      2700
     ccttgcacga aatgtcgacg ccactgcttc tcttcgattc ctgttcgccc tcggagctgg      2760
     tactcctgtc gaagagatct ttactcaaat ccctgccgag attactcaag aaagtgtgga      2820
     cgaagctctc aaggaggaat cctacgaacc ttatgtgact gcctctcgag agtggatgaa      2880
     gaaactgggt atgaatgagg cccagacgca ggagcctgct tttgtaatga acggtattgt      2940
     catgcccttc tctgaggagt ggcagaatct tgtgggagct cggttccagc aggatctgcc      3000
     cgagatgctc aagcttgtcc agaaacagat catcaaggtg tctaaagagg ctcctgttgg      3060
     tgaagatgac gaggacgagg atgacgagaa cgctggaaac ccggttgaca agtactttgc      3120
     caaccacgaa atcaaggatc tgttggtgga gggaagcccc acccgacgaa acctcattct      3180
     gtctcctgca tctttcacca acctgcagta ccttgagagc gacttaaacc tgaaggatgt      3240
     ttccactgct acaatcgctg gtacgactgc ctctatcttc tccaactcca tccttgctgg      3300
     aaactttgcc tctacccagt ttgtttcaca gcttctggct cttattgagg cacagcagga      3360
     ggagaaggac aggttcgctt tccgaaccca atttgttcat atcggacaga ctgatagtgc      3420
     caagggtcag gtgatcaacg gagttgttca gaaactgacc gagctggctg gtgaagagca      3480
     ggttagtctt ctcaaggatg cccttgctta tcttgagggt gacaatgctt ctttctctgc      3540
     caaaaaggtt ccctttgaca agaccatcac tgcttctgag gaggtcaagt atctgaagca      3600
     gttcaagact acagagtctt ctacccttct cattatcgat ggcctcgtca atgacatttc      3660
     ttccaaggcc atgcttcttg ataagtccga ggtgattgcc ctttacgagc gagaggccgt      3720
     tcgacgtctt gagctcactt ctgtcgctct ttctgacctc aagttgctta ccaagtctgt      3780
     cgagaacaac ctcgatcttg ccaaggtgca cattcctctc gctaagatct tctacggtga      3840
     tgcaactgag caggaatcta cgctttatga tgttcgtgcc ttccaccaca ttcgatggaa      3900
     caaggacgtt tcttctttcg tgcttggcga tgagtctact tctctcgtca agattgttgc      3960
     agctgttgat gtgctctctg acggaggaca gcgacttgtg tctcagattg aagccatttc      4020
     aaaggtcact ggtgtttccg ttcgagtctt cccctccccc aaggcccctg atgcccgcca      4080
     ggaacccacc ctgcccctga agcgattcta ccgagctcac aactctgtcg ttccggagtt      4140
     tgatgccgag ggagctcaca aggtgcccaa cctcaacttc gagggtcttc ccgctcagaa      4200
     cctgcttact ttcggacttg acgctccttc ttcttggatt gcaatgcctg ccgacaacac      4260
     gcacgatttg gacaacattt tgcttgagga ggattccgag gactttgtcg acgcttccta      4320
     ctcactccag aacattctca ttgagggatc catcatcgac attaccaaga atagctatgc      4380
     tcctggaacc gacctgcttc tgaagagcac cctaacaggc gagtcttccg atactcttgt      4440
     catgtccaat ctgggttact ttcagcttca ggcaggccct ggtctctggg aacttaacct      4500
     cggaccctct gcttctgacg tctatgaaac tgagcacgag gttatcattc ctgtcactga      4560
     cgttcttggt cctcacattt ccctctccat ggagcgaaag aagggtaagg aaaatgttgt      4620
     cgttggcgcc tctcaggaca aggccaagct gtggagcaag ctcaagaaga gtactggtgt      4680
     ctccaccaag aagcaggcgg acattaacat tttcacggtt gcttctggcc atctctacga      4740
     gcgattcttg agtatcatga ctgcctctgt catggcccac accgaccaca ctgtcaagtt      4800
     ctggctcatt gagaacttcc tttctgcctc cttcaaggcc ttcctccccc accttgcagc      4860
     ccactacggc tttgagtacg aactggtcac ctaccagtgg ccccactggc tacgaggtca      4920
     gaccgagaag cagcggcaaa tctggggata caagattctg ttcctggatg tcctcttccc      4980
     tcaggatctt gaacgagtca tcttcatcga ttccgaccag attgtgcgga ctgatctgta      5040
     cgaacttgtc gagatggatc ttgagggtgc tccctacggt ttcaccccca tgtgtgattc      5100
     tcgaaaggag atggacggct tccgattctg gaagcagggc tactgggata ctttcctggg      5160
     tgatgatctc gtctaccaca tttctgctct gttcgttgtt gatctcaagg tcttccgagc      5220
     ccagcagatt ggagaccgac tccgagtcca ctaccaccag ctttctgccg acccagcttc      5280
     tctgtccaac ttggaccaag atctgcccaa caacctgcag cggcaggttc ccattttctc      5340
     cctgcctcag gactggttgt ggtgcgagac gtggtgttct gacgagtctc tgaagactgc      5400
     gaagaccatt gacatgtgta acaaccctct caccaaggag cccaagctcg accgggctcg      5460
     acgacaggtt cccgagtgga ccaagtacga cgatgagatt cgaaagcttc gaaaggaggc      5520
     tgagggaatt gagggcaaga agaaggagga ggaggagcgc gccggacctg tggaggtcga      5580
     ggtggagatc gatgagcctg aggccgatct tcacgacgag ttgtagacaa cgaatgtatt      5640
     gaataaacgt atgtattgta cctagccata taaatggcga gcttgtgttt ggtttccaaa      5700
     atgtcagctt tttttttttt tttttttttt tttcttctta atctatgaaa tacatttcag      5760
     ctacttgagg ccataagaca aagttatacc cgcaaatcgg atggcatccg tactcgcttg      5820
     ttcacgtatc ctgtcttcca tttcttggtt ttttctcttt ttattctaat gattcattac      5880
     agcctgtttt tttttttctt acataattaa atgttagctt ggttatataa aaaatactct      5940
     aagcatcatt cgcctggctg ctcatccgtt aaaccttaaa ccccttttcc taggtatctc      6000
     aaggctgtgg gttcaagccc cacgtcgggc tattttcttt tttttactta agatggtgtg      6060
     tatgtgacgc cgcgagagaa tccttcaaga ggcttgtagg gtcgcaatgc cgccattgtt      6120
     tctttaacaa cccccgatct ttcttagtca tttgcttcta tatatttggg ttgcacttac      6180
     ctcacccatc aatctttctc cgcacatcac catgccaaaa aagtatacca aacaatataa      6240
     gagatatccg agcgttccga atcagagtac taagccagct gttgcagatt cggcgcaaaa      6300
     accatcgctc tcggccatct ttggcgttcc tctcagtcga gaagagctgg tagagacgaa      6360
     acgccgagca cgaggtggcg gctctctgag caaatctgtg ggtccagcgg tgcaatccga      6420
     accatgtatg gacccggaga ttgctcgctt cgtgcaagat ccgtcgagaa tcacaatgct      6480
     gccgccagtg aaggcaggac cacccgcccc agcaagttgg atggtgaaaa agagcgtcca      6540
     agagcccgaa ttcaagaggg tcttcaaaca gcaagaggag ggcgtgtcaa gtcttaagag      6600
     actatgcatc aagagcgttg cggagttcta ccccgaacac agggcgctgt tggacgtgta      6660
     cagtgagtat ctgtctacga atctagtgct cgacatgttg cctcaaatat cacaactaag      6720
     caagtacaca gtgtctcccc aggcatacag cttgctcttg catcaaccat catacccgga      6780
     aattacccac ctggatctct ctggcctgga cttgagagaa acgagggtac tcgctgactg      6840
     gctgctaagt cgcaagaaag agaaggaaga ggaggaagta gattgggagg atgttcagga      6900
     tgaagaagca aatgtcgaaa agtttcccaa tctgacatct ctgtgtttgg cataccctcg      6960
     catcacaata ctcgtctatg gttt                                             6984
//