EBI Dbfetch

ID   AL049913; SV 1; linear; genomic DNA; STD; PRO; 40055 BP.
XX
AC   AL049913;
XX
DT   19-MAY-1999 (Rel. 59, Created)
DT   23-OCT-2008 (Rel. 97, Last updated, Version 5)
XX
DE   Mycobacterium leprae cosmid B1610.
XX
KW   ABC transporter; acyltransferase; anthranilate synthase component I;
KW   antiporter; cyclase; enoyl-CoA hydratase; glutamine amidotransferase;
KW   glycerol-3-phosphate acyltransferase; hisA; hisB; hisC; hisD; hisF; hisH;
KW   hisI2; histidinol dehydrogenase; histidinol-phosphate aminotransferase;
KW   imidazole glycerol-phosphate dehydratase;
KW   indole-3-glycerol phosphate synthase; phosphoribosyl-AMP cyclohydrolase;
KW   phosphoribosylformimino-5-aminoimidazole carboxamide; plsB; plsC;
KW   pseudogene; ribotide isomerase; RLEP; trpB; trpC; trpE;
KW   tryptophan synthase beta chain.
XX
OS   Mycobacterium leprae
OC   Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales;
OC   Corynebacterineae; Mycobacteriaceae; Mycobacterium.
XX
RN   [1]
RP   1-40055
RA   Seeger K.J., Harris D.;
RT   ;
RL   Unpublished.
XX
RN   [2]
RP   1-40055
RA   James K.D., Parkhill J., Barrell B.G., Rajandream M.A.;
RT   ;
RL   Submitted (01-MAR-1999) to the EMBL/GenBank/DDBJ databases.
RL   Mycobacterium leprae sequencing project, Sanger Centre, Wellcome Trust
RL   Genome Campus, Hinxton, Cambridge CB10 1SA. E-mail: barrell@sanger.ac.uk
RL   Cosmids supplied by Dr. Stewart T. Cole, [3] Unite de Genetique Moleculaire
RL   Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex
RL   15, France. Requests for cosmids should be sent to Karin Eiglmeier
RL   (kei@pasteur.fr).
XX
RN   [3]
RP   1-40055
RX   DOI; 10.1111/j.1365-2958.1993.tb01111.x
RX   PUBMED; 8446027.
RA   Eiglmeier K., Honore N., Woods S.A., Caudron B., Cole S.T.;
RT   "Use of an ordered cosmid library to deduce the genomic organization of
RT   Mycobacterium leprae";
RL   Mol. Microbiol. 7(2):197-206(1993).
XX
DR   RFAM; RF00005; tRNA.
XX
CC   Notes:
CC   
CC   The Sanger Centre is funded to complete the sequence of M. leprae
CC   by the Heiser Program for Research in Leprosy and Tuberculosis of
CC   The New York Community Trust.
CC   
CC   Work in Paris is supported by the Heiser Trust, the Association
CC   Francaise Raoul Follereau and the Groupement de Recherches et des
CC   Etudes des Genomes (GIP-GREG).
CC   
CC   Details of M. leprae sequencing at the Sanger Centre
CC   are available on the World Wide Web.
CC   (URL, http://www.sanger.ac.uk/Projects/)
CC   
CC   CDS are numbered using the following system, e.g. MLCB33.01c:
CC   ML (M. leprae), cB33 (cosmid name), .01 (first CDS),
CC   c (complementary strand).
CC   
CC   The more significant matches with motifs in the PROSITE
CC   database are also included but some of these may be fortuitous.
CC   
CC   The length in codons is given for each CDS.
CC   
CC   Usually the highest scoring match found by fasta -o is given for
CC   CDS which show significant similarity to other CDS in the database.
CC   The positions of possible ribosome binding site sequences are
CC   given where these have been used to deduce the initiation codon.
CC   
CC   All CDS over 100 codons have been analysed.  Gene prediction
CC   is based on positional base preference in codons especially
CC   where there is an increase in the observed/expected third
CC   position G + C.  CAUTION:  We may not have predicted the
CC   correct initiation codon.  Where possible we choose an
CC   initiation codon (atg, gtg, or ttg) which is preceded by an
CC   upstream ribosome binding site sequence (optimally 5-13bp
CC   before the initiation codon).  If this cannot be identified
CC   we choose the most upstream initiation codon.
CC   
CC   IMPORTANT: This sequence MAY NOT be the entire insert of
CC   the sequenced clone.  It may be shorter because we only
CC   sequence overlapping sections once, or longer, because we
CC   arrange for a small overlap between neighbouring submissions.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..40055
FT                   /organism="Mycobacterium leprae"
FT                   /mol_type="genomic DNA"
FT                   /clone="cosmid B1610"
FT                   /db_xref="taxon:1769"
FT   misc_feature    complement(1..1858)
FT                   /note="overlap with EMBL:U00010 M.leprae cosmid B1170 from
FT                   2 to 1859"
FT   CDS             205..1089
FT                   /transl_table=11
FT                   /gene="MLCB1610.01"
FT                   /product="putative enoyl-CoA hydratase"
FT                   /note="MLCB1610.01, possible enoyl-CoA hydratase, len: 294
FT                   aa; similar to many e.g. CRT_CLOAB (EMBL:Z92974)
FT                   Clostridium acetobutylicum 3-hydroxybutyryl-CoA dehydratase
FT                   (crotonase) (261 aa), fasta scores; opt: 384 z-score: 456.9
FT                   E(): 3.8e-18, 31.1% identity in 264 aa overlap. Similar to
FT                   TR:O53163 (EMBL:AL021184) Rv1472 (MTV007.19), echA12,
FT                   M.tuberculosis probable enoyl-CoA hydratase (285 aa).
FT                   Annotated as ECHH_MYCLE (EMBL:U00010), fcbB, putative
FT                   enoyl-CoA hydratase in M.leprae cosmid ML010. Contains Pfam
FT                   match to entry PF00378 ECH, Enoyl-CoA hydratase/isomerase
FT                   family, score 154.20, E-value 2.3e-42 and PS00166 Enoyl-CoA
FT                   hydratase/isomerase signature"
FT                   /db_xref="GOA:P53526"
FT                   /db_xref="InterPro:IPR001753"
FT                   /db_xref="InterPro:IPR018376"
FT                   /db_xref="UniProtKB/Swiss-Prot:P53526"
FT                   /protein_id="CAB43147.1"
FT                   /translation="MSQTDASCTIAELPYRSVTDLVVLDFPRPEVALITLNRPGRMNSM
FT                   ALDLMKSLKQVLKRITYDHSVRVVVLTGAGRGFCSGADQKFTAPVPQVEGLTQPVRALR
FT                   AMELLEEVILALRRLHQPVIAAINGPAIGGGLCLALAADVRVASTRAYFRAAGINNGLS
FT                   ASELGLSYLLPRAVGSSRAFEIMLSGRDVGAEEAEQIGLVSYRVSDDRLLDTCYSIAAR
FT                   MATFSRSGTELTKRALWGGLDAASLDKHMQSESLAQLFIALHTSNFEEAAAPCTEKRPT
FT                   VLVDARGCATSPG"
FT   misc_feature    298..828
FT                   /note="Pfam match to entry PF00378 ECH, Enoyl-CoA
FT                   hydratase/isomerase family, score 154.20, E-value 2.3e-42"
FT   misc_feature    574..636
FT                   /note="PS00166 Enoyl-CoA hydratase/isomerase signature"
FT   CDS             1947..2671
FT                   /pseudo
FT                   /transl_table=11
FT                   /gene="MLCB1610.02"
FT                   /product="putative pseudogene"
FT                   /note="MLCB1610.02, possible regulatory protein pseudogene,
FT                   len: 725 bp; similar to several from M.tuberculosis e.g.
FT                   TR:O53720 (EMBL:AL021931) Rv0386 (MTV036.21) probable
FT                   regulatory protein (1085 aa)"
FT   CDS             complement(2872..3255)
FT                   /transl_table=11
FT                   /gene="MLCB1610.03c"
FT                   /product="hypothetical protein MLCB1610.03c"
FT                   /note="MLCB1610.03c, hypothetical protein, len: 127 aa;
FT                   unknown function, improbable CDS based on ORF alone"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7A6"
FT                   /protein_id="CAB43149.1"
FT                   /translation="MYPTGLHLTTTATASAAPRKKQSTHCHPAEHNRFRPDQQRMGPVR
FT                   RSPQSVLTGHHHADVARSASGNSHLNYQPSQPVKAPEHRPTLAQSPVTNTIQAATDRHC
FT                   SRRVDIDKPEIRLRPVTKNNSMT"
FT   repeat_region   3966..4679
FT                   /note="identical to M.leprae cosmid L536 (EMBL:Z99125) from
FT                   13809 to 14522 and contains inverted repeat of M.leprae
FT                   cosmid ML013 (EMBL:U00013) from 35358 to 35813"
FT   CDS             5790..6248
FT                   /transl_table=11
FT                   /gene="MLCB1610.04c"
FT                   /product="hypothetical protein MLCB1610.04c"
FT                   /note="MLCB1610.04c hypothetical protein, len: 152 aa;
FT                   unknown function, CDS based on frame analysis and GC
FT                   content"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7A7"
FT                   /protein_id="CAB43150.1"
FT                   /translation="MASCLPRKWEGLPDVCLAVRSRCWYRFFVWHWSINIKRGQAYSLP
FT                   NRGMELSGICVLVNGQQGFALCLAGSHLRLKRLSRGYAITLLTAAADSERFNAAAPISY
FT                   TQSEASSCLMLHGEEKNPLAPSAKAHAFCAALLTVGVRTGPCRHPRRD"
FT   CDS             6471..7946
FT                   /transl_table=11
FT                   /gene="MLCB1610.05"
FT                   /product="putative membrane protein"
FT                   /note="MLCB1610.05, possible membrane protein, len: 501 aa;
FT                   unknown function, highly similar to a family of
FT                   hypothetical proteins from M.tuberculosis e.g. TR:O53209
FT                   (EMBL:AL021246) Rv2484c (MTV008.40c), fasta scores; opt:
FT                   2459 z-score: 2790.5 E(): 0, 75.2% identity in 483 aa
FT                   overlap"
FT                   /db_xref="InterPro:IPR004255"
FT                   /db_xref="InterPro:IPR014292"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7A8"
FT                   /protein_id="CAB43151.1"
FT                   /translation="MADSVEGIGPFDELGALDYLLHRGEANPRTRAGIMAVELLDTTPD
FT                   WNRFRSRIEDVSQRVLRLRQKVVVPTLPTAAPRWVVDPDFELDFHVRRVRVPDPGTLRE
FT                   VFDLAEVIQQSPMDVSRPLWTATLVEGLAAGRAAMLLQISHAITDGVGSVEMFAEMYDL
FT                   ERDPPSRPRSPQPIPQDLSRNDLMLQGINHLPVALAGGVVGGLSGVASVAGRAILRPAS
FT                   TVSGVVGYVRSGIRVLSQAAEPSPLLRQRSLATRTEAIEIQLSDLHKAAKAGDGSINDA
FT                   YLAGLVGALRRYHEALGVSISTLPMAVPVNVRTEADVVGSNRFVGVTLAAPLGTNDPAA
FT                   RMQKIRSQMTQRRDEPAMNIIGSLAPLMTVLPASVLDFIVDSVASSDVNASNIPAYPGD
FT                   TYFAGAKILRQYGIGPRPGVAMMAVLMSRGGFCTVTVRYDRASVKSEALFARCLLEGFD
FT                   EVLALAGDPTPHAVPASFAARSSGSPAGWLSSS"
FT   CDS             7943..9682
FT                   /transl_table=11
FT                   /gene="plsC"
FT                   /product="putative acyltransferase"
FT                   /note="MLCB1610.06, plsC, possible acyltransferase, len:
FT                   604 aa; C-terminus similar to many acyltransferases e.g.
FT                   PLSC_LIMAL (EMBL:U32988) Limnanthes alba lysophosphatidic
FT                   acid acyltransferase (281 aa), fasta scores; opt: 367
FT                   z-score: 407.5 E(): 2.2e-15, 30.8% identity in 211 aa
FT                   overlap. N-terminus similar to TR:O69629 (EMBL:AL022121)
FT                   Rv3661 (MTV025.009) M.tuberculosis hypothetical protein
FT                   (287 aa) (26.6% identity in 304 aa overlap) and TR:O33611
FT                   (EMBL:AB004855) Streptomyces cyaneus protein involved in
FT                   morphological differentiation (277 aa) (28.6% identity in
FT                   266 aa overlap). C-terminus slightly similar to N-terminus
FT                   of TR:O69572 (EMBL:AL022602) MLCB268.24c, M.leprae
FT                   hypothetical protein (244 aa) (24.3% identity in 247 aa
FT                   overlap). Equivalent to TR:O53208 (EMBL:AL021246) Rv2483c
FT                   (MTV008.39c) M.tuberculosis possible transferase (580 aa)
FT                   (77.1% identity in 573 aa overlap)"
FT                   /db_xref="GOA:Q9X7A9"
FT                   /db_xref="InterPro:IPR002123"
FT                   /db_xref="InterPro:IPR006383"
FT                   /db_xref="InterPro:IPR006385"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7A9"
FT                   /protein_id="CAB43152.1"
FT                   /translation="MSAFDENGTPKSGPEVRLPGSVAEILASPAGPQVGAFFDLDGTLV
FT                   AGFTAVILTHERLRRCDMGVGELLSMIQAGLNHTLGRIEFEDLIGKVSSALRGRLLTDL
FT                   EEIGERLFAQRIESRIYPEMRKLVQAHVARGHTVVLSSSALTIQVGPVARFLGIAHMLT
FT                   NKFEINEDGMLTGGVVKPILWGPGKAAAVQRFAAEHDIDLKDSYFYADGDEDVALMYLV
FT                   GNPRPTNPEGKMAAVAKRRGWPILKFNSRGGVGIGAQLRTLVGLSSVLPLVAGAVGIGV
FT                   LTGSRRRGANFFTSVFPPLVLAAAGVHLNVIGNENLTMQRPAVFIYNHRNQIDPLIAGA
FT                   LVRGNWIAVAKKELAKNPIIRALGKLTDGVFIDRDDPVGAVETLHAVEDRARKGLSILM
FT                   APEGTRLDTTEVGPFKKGPFRIAMAVGIPIVPIVFRNAEIVAARNSNTINPGMVDVAVL
FT                   PAIPVDDWTLDALSERIAEVRQLYLDILADWPVDELPKAALYTRATKATVKKAEPKKGG
FT                   RTSAAKTTAKKTATKATAAKTANTAVTKSMLTPPTRISDKENPKVPKVVSQDLVVKQPR
FT                   ERS"
FT   CDS             9679..12006
FT                   /transl_table=11
FT                   /gene="plsB"
FT                   /product="putative glycerol-3-phosphate acyltransferase"
FT                   /note="MLCB1610.07, plsB, probable glycerol-3-phosphate
FT                   acyltransferase, len: 775 aa; similar to many e.g.
FT                   PLSB_ECOLI (EMBL:K00127) E.coli glycerol-3-phosphate
FT                   acyltransferase (806 aa), fasta scores; opt: 542 z-score:
FT                   638.1 E(): 3.1e-28, 24.4% identity in 693 aa overlap.
FT                   Similar to Y38C_MYCTU (EMBL:AL021246) Rv2482c (MTV008.38c),
FT                   plsB2, M.tuberculosis probable glycerol-3-phosphate
FT                   acyltransferase (789 aa) (80.7% identity in 783 aa
FT                   overlap). The C-terminus is also similar to Y04E_MYCTU
FT                   (EMBL:Z74020) M.tuberculosis possible acyltransferase (621
FT                   aa) (34.5% identity in 566 aa overlap)"
FT                   /db_xref="GOA:Q9X7B0"
FT                   /db_xref="InterPro:IPR002123"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9X7B0"
FT                   /protein_id="CAB43153.1"
FT                   /translation="MTEPDVEISSVLTGEDTLVLASMDTPAEIELVMDWLCQQRNRNPD
FT                   IKFDVLKLPSRNLAPAALTALVEQLESDEDRSVVPVRVFWMPPAERSKLAKLAGLLPGR
FT                   DPYHPNRRQQRHILKTDARRALVIAGDSAKVSELRQYWRDTTVGENECDFAQFVTRRAI
FT                   LAMERAESRILGPQYKSPRLVKPEILASTRFRAGLEKISGATVEEAGKMLDELATGWSR
FT                   ASVDLVSVLGRMLSRGFEPEIDYDEYQVAAMRAALEAHPAVLLFSHRSYIDGAVVPVAM
FT                   QENRLPPVHVFAGINLSFGLMGPLLRRSGVIFIRRNIGDNPLYKYVLREYVGYIVEKRF
FT                   NLSWSIEGTRSRTGKMLPPKLGLLTYVADAYLDGRSEDILLQPVSISFDQLHETAEYAA
FT                   YARGGEKTPEGVAWLYSFIKAQGERNYGKIYVRFPEAVSMRQYLGAPHGALVQDQDAKR
FT                   LALQKMSFEVAWRILCATPVTATALVSALLLTTRGVALTLDQLHHTLQESLDYLERKQT
FT                   PVSKSALRLRSREGVRAAVDALSSGHPITRVDSGREPVWYITPGNEHAAAFYRNSVIHA
FT                   FLETSIVELALAHARHVEGDRMKVFWAQAMRLRDLLKFDFYFADSAAFRANIAEEIAWH
FT                   QNWEDRVSGDGDDIDAMLLTKRPLISDAMLRVFFEAYDIVADVLRDAPADVGQKELTEL
FT                   ALGVGRQYVAQGRVRSGESVSTLLFATAYQVVVDQNLIAPAPDLAERRMVFRRELRDIR
FT                   RDFDYVEQIARSRFIVREFKSR"
FT   CDS             12373..12601
FT                   /pseudo
FT                   /transl_table=11
FT                   /gene="MLCB1610.08"
FT                   /product="putative pseudogene"
FT                   /note="MLCB1610.08, possible pseudogene, len: 229 bp;
FT                   similar to TR:O53205 (EMBL:AL021246) Rv2478c (MTV008.34c)
FT                   M.tuberculosis hypothetical protein (161 aa)"
FT   CDS             12824..14494
FT                   /transl_table=11
FT                   /gene="MLCB1610.09"
FT                   /product="putative ABC transporter ATP-binding protein"
FT                   /note="MLCB1610.09, probable ABC transporter ATP-binding
FT                   protein, len: 556 aa; similar to many putative ABC
FT                   transporter ATP-binding proteins e.g. YJJK_ECOLI
FT                   (EMBL:U14003) from E.coli (554 aa), fasta scores; opt: 2038
FT                   z-score: 2180.8 E(): 0, 56.6% identity in 557 aa overlap.
FT                   N-terminus slightly similar to TR:Q49707 (EMBL:Z99125)
FT                   MLCL536.31 M.leprae probable ABC-type transporter (315 aa)
FT                   (24.2% identity in 269 aa overlap). Equivalent to TR:O53204
FT                   (EMBL:AL021246) Rv2477c (MTV008.33c) M.tuberculosis
FT                   putative ABC transporter ATP-binding protein (558 aa)
FT                   (92.3% identity in 557 aa overlap). Contains Pfam matches
FT                   to entry PF00005 ABC_tran, ABC transporter, score 173.70,
FT                   E-value 2.9e-48 and to entry PF00005 ABC_tran, ABC
FT                   transporter, score 125.60, E-value 9.3e-34. Also contains
FT                   two PS00017 ATP/GTP-binding site motif A (P-loop) and two
FT                   PS00211 ABC transporters family signatures. Contains
FT                   probable coiled-coil from 273 to 311 (39 residues) (Max
FT                   score: 1.594, probability 0.99)"
FT                   /db_xref="GOA:Q9X7B1"
FT                   /db_xref="HSSP:1US8"
FT                   /db_xref="InterPro:IPR003439"
FT                   /db_xref="InterPro:IPR003593"
FT                   /db_xref="InterPro:IPR017871"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7B1"
FT                   /protein_id="CAB43155.1"
FT                   /translation="MAEFIYTMKKVRKAHGDKVILDDVTLNFFPGAKIGVVGPNGAGKS
FT                   SVLRIMAGMDKPNNGDAFLATGASVGILQQEPPLNEEKTVRGNVEEGLGDIKIKLDRFS
FT                   EVAELMATDSSSELMEEMGRLQEELDHADAWDLDSQLEQAMDVLRCPPPDEPVTNLSGG
FT                   ERRRVALCKLLLSKPDLLLLDEPTNHLDAESVQWLEQHLAGYAGAVLAVTHDRYFLDNV
FT                   AEWILELDRGRAYPYEGNYSTYLEKKAERLTTQGRKDAKLQKRLTDELAWVRSGAKARQ
FT                   AKSKARLQRYEEMAAEAEKNRKFDFEEIQIPVGPRLGNMVVEVEHLDKGYGGRTLIKDL
FT                   SFTLPRNGIVGIIGPNGVGKTTLFKTIVGLEQPDGGAVKIGETVKLSYVDQTRAGIDPK
FT                   KIVWEVVSDGLDHIQVGQTEVSSRAYVSAFGFKGPDQQKLAGVLSGGERNRLNLALTLK
FT                   QGGNLILLDEPTNDLDVETLGSLENALVNFPGCAVVISHDRWFLDRTCTHILAWEGDDE
FT                   GKWFWFEGNFGAYEENKVERLGAEAARPHRVTHRKLTRD"
FT   misc_feature    12914..13519
FT                   /note="Pfam match to entry PF00005 ABC_tran, ABC
FT                   transporter, score 173.70, E-value 2.9e-48"
FT   misc_feature    12935..12958
FT                   /note="PS00017 ATP/GTP-binding site motif A (P-loop)"
FT   misc_feature    13301..13345
FT                   /note="PS00211 ABC transporters family signature"
FT   misc_feature    13862..14368
FT                   /note="Pfam match to entry PF00005 ABC_tran, ABC
FT                   transporter, score 125.60, E-value 9.3e-34"
FT   misc_feature    13883..13906
FT                   /note="PS00017 ATP/GTP-binding site motif A (P-loop)"
FT   misc_feature    14150..14194
FT                   /note="PS00211 ABC transporters family signature"
FT   CDS             14553..19421
FT                   /transl_table=11
FT                   /gene="MLCB1610.10"
FT                   /product="hypothetical protein MLCB1610.10"
FT                   /note="MLCB1610.10, hypothetical protein, len: 1662 aa;
FT                   unknown function, similar to RP758 (EMBL:AJ235273)
FT                   Rickettsia prowazekii hypothetical protein (1581 aa), fasta
FT                   scores; opt: 1845 z-score: 2044.5 E(): 0, 32.9% identity in
FT                   1494 aa overlap and equivalent to TR:O53203 (EMBL:AL021246)
FT                   Rv2476c (MTV008.32c) M.tuberculosis hypothetical protein
FT                   (1624 aa) (81.5% identity in 1634 aa overlap)"
FT                   /db_xref="GOA:Q9X7B2"
FT                   /db_xref="InterPro:IPR007780"
FT                   /db_xref="InterPro:IPR016040"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7B2"
FT                   /protein_id="CAB43156.1"
FT                   /translation="MTIDPGATHVAELCTTFTQGADVPDWISKAYIDSYRGSHGDVREA
FT                   PETSRVNPNALVTPAMLSAHYRLGQCRPNGRNCVRVYPADDPAGFGPALQIVTDHGGMV
FT                   MDSITVLLHRLGVTYTAMMTPVFMVLRSPTGELLGVEPRASSTSHSIEGTWVGEVWIYI
FT                   QLLPAVDSKSLAEVEQLLPRTLVDVQRVAADAAALNATLSGLAADVKTNKEGHFSASDR
FT                   DDVAALLHWLGNGNFLLLGYQRCRVHYGLVSCDRSTGLGVLRARTGSRPRLTDDNELLV
FT                   LAQAAVGNYLRYGAYPYAIAVREYDDGGDGGIIEHRFVGLFTVAAMNADVLEIPSISHR
FT                   VRAALAMANSDPIYPGQLLLDVIQTVPRSELFTLSAERLFTMAKEVVDLGSGRRALLFL
FT                   RADRLQYFVSCLVYVPRDRYTTGVRLQIEDILVREFGGTQVEFTARVSESPWALMHFMV
FT                   RLSEGAATGSVDVSEGNRIRIQAMLSEAARTWSDRLIAAAASFSEGSVSYAEAEHYAAT
FT                   FSETYKQAVTPADAIDHIAIIKELADDSVKLVFFERKADGFAQLTWFLGGRSASLSQLL
FT                   PMLQSMGVVVLEERPFTVARTDGLPVWIYQFKISPHPTIPLASTANERELTAKRFSDAV
FT                   TAIWQGRVEIDRFNELVMRARLTWQQVVLLRAYAKYLRQAGFNYSQSYIESVLNEHPST
FT                   ARSLVALFEALFDPSPLSSSTNCDAQAAAAAVAADIDALVSLDTDRILRAFASLVQATL
FT                   RTNYFVTQKFSARSKGVLVLKLDAQLINELPLPRPKFEIFVYSPRVEGVHLRFGAVARG
FT                   GLRWSDRLDDFRTEILGLVKAQAVKNAVIVPVGAKGGFVLKRPPLPTGDAAADRDAMRA
FT                   EGIACYQLFISGLLDITDNVDHATGKVNAPPQVVRRDSDDAYLVVAADKGTATFSDIAN
FT                   DVAKSYGFWLGDAFASGGSVGYDHKAMGITAKGAWEAVKRHFREMGVDTQNEDFTVVGI
FT                   GDMSGDVFGNGMLLSKHIRLIAAFDHRHVFLDPDPDAAVSWAERQRMFDLPRSSWDDYN
FT                   KSLISEGGGVYSREQKAIPTSPQVRTALGIDGEVTEMAPPNLIRAILQAPVDLLFNGGI
FT                   GTYIKAETESVADVGDRANDPVRVNANQVRAKVIGEGGNLGVTALGRVEFDLSGGRINT
FT                   DAMDNSAGVDCSDHEVNIKILIDSLVTAGKVKVEERKHLLESMTDEVARLVLTDNEDQN
FT                   DLIGTSRANAANMLSVHAMQIKYLVDERGVNRELEALPSEKEIQRRSEAGIGLTSPELS
FT                   TLMAHVKLALKEQMLATELPDQDVFVSRLPRYFPKPLRERFTPEIRSHQLRREIVTTML
FT                   INDLVDTAGISYAFRIAEDIGVGPIDAIRTYVATDAIFGVGDVLRRIRAANLSVVLSDR
FT                   MTLDTRRLIDRAGRWLLNYRPQPLAVGAEINRFAAKVKALTPRMSEWLRGDDQAIVEQQ
FT                   ATEFVSQGAPEDLAYRVAVGLYRYSLLDIIDIADITELDPAEVADTYFSLMDRLGTDGL
FT                   LTAVSKLPQNDRWHSLARLAIRDDIYASLRSLCFDVLAVGEPDESGEEKIAEWEHISAS
FT                   RVERARLMLAEIHASGEKDLATLSVAARQIRRMTRTSGRGSSG"
FT   CDS             19421..19531
FT                   /pseudo
FT                   /transl_table=11
FT                   /gene="MLCB1610.11"
FT                   /product="putative pseudogene"
FT                   /note="MLCB1610.11, possible pseudogene, len: 111 bp;
FT                   similar to TR:O53202 (EMBL:AL021246) Rv2475c (MTV008.31c)
FT                   M.tuberculosis hypothetical protein (138 aa)"
FT   CDS             19876..20282
FT                   /pseudo
FT                   /transl_table=11
FT                   /gene="MLCB1610.12"
FT                   /product="putative pseudogene"
FT                   /note="MLCB1610.12, possible pseudogene, len: 407 bp;
FT                   similar to TR:O53201 (EMBL:AL021246) Rv2474c (MTV008.30c)
FT                   M.tuberculosis hypothetical protein (217 aa)"
FT   CDS             complement(20573..21802)
FT                   /pseudo
FT                   /transl_table=11
FT                   /gene="MLCB1610.13c"
FT                   /product="putative pseudogene"
FT                   /note="MLCB1610.13c, possible maltase pseudogene, len: 1230
FT                   bp; similar to TR:O53198 (EMBL:AL021246) Rv2471 (MTV008.27)
FT                   M.tuberculosis probable maltase (546 aa)"
FT   CDS             complement(21805..22200)
FT                   /transl_table=11
FT                   /gene="MLCB1610.14c"
FT                   /product="putative globin"
FT                   /note="MLCB1610.14c, possible globin, len: 131 aa; similar
FT                   to GLBN_NOSSN (EMBL:L47979) Nostoc sp. (strain mun 8820)
FT                   cyanoglobin (118 aa), fasta scores; opt: 137 z-score: 190.5
FT                   E(): 0.0026, 23.8% identity in 122 aa overlap. Equivalent
FT                   to TR:O53197 (EMBL:AL021246) Rv2470 (MTV008.26), glbO,
FT                   M.tuberculosis possible globin (128 aa) (88.7% identity in
FT                   124 aa overlap)"
FT                   /db_xref="GOA:Q9X7B3"
FT                   /db_xref="HSSP:1NGK"
FT                   /db_xref="InterPro:IPR001486"
FT                   /db_xref="InterPro:IPR009050"
FT                   /db_xref="InterPro:IPR012292"
FT                   /db_xref="InterPro:IPR019795"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7B3"
FT                   /protein_id="CAB43160.1"
FT                   /translation="MDQVQQSFYDAIGGAETFKAIVSRFYAQVPEDEILRELYPADDLA
FT                   GAEERLRMFLEQYWGGPRTYSSQRGHPRLRMRHAPFRITAIERDAWLRCMHTAVASIDS
FT                   HTLDNEHRRELLDYLEMAAHSLVNSAS"
FT   CDS             22360..23007
FT                   /transl_table=11
FT                   /gene="MLCB1610.15"
FT                   /product="hypothetical protein MLCB1610.15"
FT                   /note="MLCB1610.15, hypothetical protein, len: 215 aa;
FT                   unknown function, similar to TR:P72833 (EMBL:D90901)
FT                   Synechocystis sp. PCC6803 hypothetical protein (165 aa),
FT                   fasta scores; opt: 396 z-score: 496.7 E(): 2.3e-20, 36.9%
FT                   identity in 168 aa overlap. Equivalent to TR:O53196
FT                   (EMBL:AL021246) Rv2469c (MTV008.25c) M.tuberculosis
FT                   hypothetical protein (222 aa) (77.9% identity in 222 aa
FT                   overlap)"
FT                   /db_xref="GOA:Q9X7B4"
FT                   /db_xref="InterPro:IPR002711"
FT                   /db_xref="InterPro:IPR003615"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7B4"
FT                   /protein_id="CAB43161.1"
FT                   /translation="MVQRKNRRSHRSSGAAANLIRAANPSSLHNVDIHPSTCYESGSIW
FT                   NRRRVLLLNSTYEPLTALPTRRAIIMVICGKADVVHVDPAGPVVHSATRSITVPSVIQL
FT                   RSYVRVPYRARVPMTRAALMHRDRFCCAYCGAKADTVDHVVPRSRGGDHSWENCVACCS
FT                   TCNHRKGDKLLTELGWVLRRTPVLPTGQHWRLLSTVKELDPAWARYLGGGAA"
FT   CDS             23404..23913
FT                   /transl_table=11
FT                   /gene="MLCB1610.16"
FT                   /product="hypothetical protein MLCB1610.16"
FT                   /note="MLCB1610.16, hypothetical protein, len: 169 aa;
FT                   unknown function, similar to TR:O53195 (EMBL:AL021246)
FT                   Rv2468c (MTV008.24c) M.tuberculosis hypothetical protein
FT                   (167 aa), fasta scores; opt: 859 z-score: 945.2 E(): 0,
FT                   81.2% identity in 165 aa overlap"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7B5"
FT                   /protein_id="CAB43162.1"
FT                   /translation="MTGTGAMGHQSSQSEVAPVVRGDVVTELPKGWVITTSGRVSGVTE
FT                   PGDRSVHYPFPIKDLVALDDALTYSSRASHARFAVYLGDLGNDTAALAREILAQVPTPD
FT                   DAVLVAVSPNQCAIEVVYGSQVRGRGAESAAPLGVAAASSAFEQGNLIDGLISAVRVLS
FT                   AGISRS"
FT   repeat_region   complement(23907..24656)
FT                   /note="RLEP element"
FT   CDS             24178..24516
FT                   /transl_table=11
FT                   /gene="MLCB1610.17"
FT                   /product="hypothetical protein MLCB1610.17"
FT                   /note="MLCB1610.17, hypothetical protein, len: 112 aa;
FT                   unknown function, CDS within RLEP element, highly similar
FT                   to CDS in other RLEP elements e.g. TR:O32969 (EMBL:Z98741)
FT                   MLCB22.36c (113 aa), fasta scores; opt: 781 z-score: 923.0
FT                   E(): 0, 99.1% identity in 112 aa overlap"
FT                   /db_xref="InterPro:IPR009418"
FT                   /db_xref="UniProtKB/TrEMBL:O32969"
FT                   /protein_id="CAB43163.1"
FT                   /translation="MTTPTPQGHDMHTKTPLPRGANNYPHTHACIDIAFSTAQVPSPWH
FT                   HQHVDQAASTTDMLTCAALIVSTAAKHTKPHRKQAVSHPPTKTPQHSKTRQQDSGLPPT
FT                   TTEHIGQS"
FT   CDS             25030..25311
FT                   /transl_table=11
FT                   /gene="MLCB1610.18"
FT                   /product="hypothetical protein MLCB1610.18"
FT                   /note="MLCB1610.18, hypothetical protein, len: 93 aa;
FT                   unknown function, possible CDS based on amino acid
FT                   composition and positional base preference"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7B6"
FT                   /protein_id="CAB43164.1"
FT                   /translation="MASSRREIRSYVTRSGKVGQVLGSAPRYLVYCRRFRDVTRDVADG
FT                   ICTDSGIYYQPVLDDAVILDVHSWVRFVPGHVDARTVIPRILDMLAEV"
FT   CDS             complement(25536..25837)
FT                   /pseudo
FT                   /transl_table=11
FT                   /gene="MLCB1610.19c"
FT                   /product="putative pseudogene"
FT                   /note="MLCB1610.19c, possible pseudogene, len: 302 bp;
FT                   similar to TR:O06592 (EMBL:Z95586) Rv1598c (MTCY336.06)
FT                   M.tuberculosis hypothetical protein (136 aa)"
FT   misc_feature    26005..27297
FT                   /note="Pfam match to entry PF00815 Histidinol_dh,
FT                   Histidinol dehydrogenase, score 736.10, E-value 1.5e-217"
FT   CDS             26005..27300
FT                   /transl_table=11
FT                   /gene="hisD"
FT                   /product="putative histidinol dehydrogenase"
FT                   /note="MLCB1610.20, hisD, probable histidinol
FT                   dehydrogenase, len: 431 aa; similar to many e.g. HISX_MYCSM
FT                   (EMBL:X65542), hisD, Mycobacterium smegmatis histidinol
FT                   dehydrogenase (445 aa), fasta scores; opt: 2232 z-score:
FT                   2569.0 E(): 0, 77.5% identity in 449 aa overlap. Equivalent
FT                   to HISX_MYCTU (EMBL:Z95586) Rv1599 (MTCY336.05c), hisD,
FT                   M.tuberculosis probable histidinol dehydrogenase (438 aa)
FT                   (84.8% identity in 442 aa overlap). Contains Pfam match to
FT                   entry PF00815 Histidinol_dh, Histidinol dehydrogenase,
FT                   score 736.10, E-value 1.5e-217 and PS00611 Histidinol
FT                   dehydrogenase signature"
FT                   /db_xref="GOA:Q9CC57"
FT                   /db_xref="InterPro:IPR001692"
FT                   /db_xref="InterPro:IPR012131"
FT                   /db_xref="InterPro:IPR016161"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9CC57"
FT                   /protein_id="CAB43166.1"
FT                   /translation="MTATQLRATLPRGGADVEAVLPTVWPIVQAVAECGADAALEFGAL
FT                   FDGVRPPTVRVPDAALDAALAGLDPDVRDALQVMIERTRVVHADQRRTDVTTALGPGAT
FT                   VTERWVPVERVGLYVPGGNAVYPSSVVMNVVPAQTAGVDSLVVASPPQFTSGGRFHGLP
FT                   HPTILAAARLLGVDEVWAVGGAQAVALLAYGGTDSDDCELAPVDMITGPGNIYVTAAKR
FT                   LCRSRVGIDGEAGPTEIAILADHTADPAHVAADMISQAEHDEMAASVLVTPSTDLADAT
FT                   DAELAAQLRTTVHRKRVVAALGGRQSAIILVDDLEAGVKVVNLYAAEHLEIQTAEASRV
FT                   ASRIRCAGAIFVGPWAPVSLGDYCAGSNHVLPTAGFARHSGGLSVQTFLRGIHVVNYTK
FT                   TALKDISGHVITLAKAEDLPAHGEAVRRRFAR"
FT   misc_feature    26692..26790
FT                   /note="PS00611 Histidinol dehydrogenase signature"
FT   CDS             27297..28430
FT                   /transl_table=11
FT                   /gene="hisC"
FT                   /product="putative histidinol-phosphate aminotransferase"
FT                   /note="MLCB1610.21, hisC, probable histidinol-phosphate
FT                   aminotransferase, len: 377 aa; similar to many e.g.
FT                   HIS8_STRCO (EMBL:M31628), hisC, Streptomyces coelicolor
FT                   histidinol-phosphate aminotransferase (369 aa), fasta
FT                   scores; opt: 1295 z-score: 1506.1 E(): 0, 56.4% identity in
FT                   360 aa overlap. Equivalent to TR:O06591 (EMBL:Z95586)
FT                   Rv1600 (MTCY336.04c), hisC, M.tuberculosis probable
FT                   histidinol-phosphate aminotransferase (380 aa) (83.6%
FT                   identity in 366 aa overlap). Contains Pfam match to entry
FT                   PF00222 aminotran_2, Aminotransferases class-II, score
FT                   171.60, E-value 1.3e-47"
FT                   /db_xref="GOA:Q9X7B8"
FT                   /db_xref="InterPro:IPR004839"
FT                   /db_xref="InterPro:IPR005861"
FT                   /db_xref="InterPro:IPR015421"
FT                   /db_xref="InterPro:IPR015424"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9X7B8"
FT                   /protein_id="CAB43167.1"
FT                   /translation="MNVPEPTLDDLPLRDNLRGKSPYGAMQLLVPVLLNTNENPHPPTK
FT                   ALVDDVVRSVQKVAVDLHRYPDRDAVALRQDLASYLTAQTGIRLGVENIWAANGSNEIL
FT                   QQLLQAFGGPGRSAIGFVPSYSMHPIIADGTHTEWLETVRADDFSLDVEAAVTAVADRK
FT                   PDVVFIASPNNPSGQSISLADLRRLLDVVPGILIVDEAYGEFSSRPSAVALVGEYPTKI
FT                   VVTRTTSKAFAFAGGRLGYLIATPALVEAMLLVRLPYHLSSVTQAAARAALRHADDTLG
FT                   SVAALIAERERVTKSLVHMGFRVIPSDANFVLFGHFSDAAGAWQHYLDTGVLIRDVGIP
FT                   GYLRATTGLAEENDAFLKASSEIAATELAPATTLGAS"
FT   misc_feature    27558..28328
FT                   /note="Pfam match to entry PF00222 aminotran_2,
FT                   Aminotransferases class-II, score 171.60, E-value 1.3e-47"
FT   CDS             28427..29059
FT                   /transl_table=11
FT                   /gene="hisB"
FT                   /product="putative imidazole glycerol-phosphate
FT                   dehydratase"
FT                   /note="MLCB1610.22, hisB, probable imidazole
FT                   glycerol-phosphate dehydratase, len: 210 aa; similar to
FT                   many e.g. HIS7_STRCO (EMBL:M31628), hisB, Streptomyces
FT                   coelicolor imidazole glycerol-phosphate dehydratase (197
FT                   aa), fasta scores; opt: 731 z-score: 894.4 E(): 0, 56.4%
FT                   identity in 202 aa overlap. Equivalent to HIS7_MYCTU
FT                   (EMBL:Z95586) Rv1601 (MTCY336.03c), hisB, M.tuberculosis
FT                   probable imidazole glycerol-phosphate dehydratase (210 aa)
FT                   (84.8% identity in 210 aa overlap). Contains Pfam matches
FT                   to entry PF00475 IGPD, Imidazoleglycerol-phosphate
FT                   dehydratase, score 191.10, E-value 3e-65 and entry PF00475
FT                   IGPD, Imidazoleglycerol-phosphate dehydratase, score 64.70,
FT                   E-value 1.7e-21. Also contains PS00954
FT                   Imidazoleglycerol-phosphate dehydratase signature 1 and
FT                   PS00955 Imidazoleglycerol-phosphate dehydratase signature
FT                   2"
FT                   /db_xref="GOA:Q9X7B9"
FT                   /db_xref="InterPro:IPR000807"
FT                   /db_xref="InterPro:IPR020565"
FT                   /db_xref="InterPro:IPR020568"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9X7B9"
FT                   /protein_id="CAB43168.1"
FT                   /translation="MTNTEVGKTTRRARIERRTSESDIVVELDLDGTGQVHIDTGVSFY
FT                   DHMLTALGSHASFDLTVCTKGDVEIEAHHTIEDTAIALGQAFGQALGNKKGIRRFGDAF
FT                   IPMDETLVHAVVDVSGRPYCVHTGEPDHLQHNIISGSSVPYSTVINRHVFESLAANARI
FT                   ALHVRVLYGRDPHHITEAQYKAVARALSEAVKFDPRFSGVPSTKGVL"
FT   misc_feature    28544..28807
FT                   /note="Pfam match to entry PF00475 IGPD,
FT                   Imidazoleglycerol-phosphate dehydratase, score 191.10,
FT                   E-value 3e-65"
FT   misc_feature    28634..28675
FT                   /note="PS00954 Imidazoleglycerol-phosphate dehydratase
FT                   signature 1"
FT   misc_feature    28859..28999
FT                   /note="Pfam match to entry PF00475 IGPD,
FT                   Imidazoleglycerol-phosphate dehydratase, score 64.70,
FT                   E-value 1.7e-21"
FT   misc_feature    28940..28978
FT                   /note="PS00955 Imidazoleglycerol-phosphate dehydratase
FT                   signature 2"
FT   CDS             29056..29676
FT                   /transl_table=11
FT                   /gene="hisH"
FT                   /product="putative glutamine amidotransferase"
FT                   /note="MLCB1610.23, hisH, probable glutamine
FT                   amidotransferase, len: 206 aa; similar to many e.g.
FT                   HIS5_STRCO (EMBL:M31628), hisH, Streptomyces coelicolor
FT                   glutamine amidotransferase (222 aa), fasta scores; opt: 827
FT                   z-score: 973.0 E(): 0, 58.8% identity in 211 aa overlap.
FT                   Equivalent to TR:O06589 (EMBL:Z95586) Rv1602 (MTCY336.02c),
FT                   hisH, M.tuberculosis probable amidotransferase (206 aa)
FT                   (79.4% identity in 204 aa overlap). Contains Pfam match to
FT                   entry PF00117 GATase, Glutamine amidotransferases class-I,
FT                   score 50.40, E-value 3.9e-11 and PS00442 Glutamine
FT                   amidotransferases class-I active site"
FT                   /db_xref="GOA:Q9X7C0"
FT                   /db_xref="InterPro:IPR000991"
FT                   /db_xref="InterPro:IPR010139"
FT                   /db_xref="InterPro:IPR016226"
FT                   /db_xref="InterPro:IPR017926"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9X7C0"
FT                   /protein_id="CAB43169.1"
FT                   /translation="MRSKSVVVLDYGSGNLWSVQRALQRVGAAVEVTADSAAGAAADGL
FT                   LVPGVGAFEACMAGLRKIAGERTIAERIVAGRPVLGVCVGMQILFARGVEFGVETTGCR
FT                   QWPGVVTRLDAPVVPHMGWNVVDSASGSALFKGLDAGVRFYFVHSYAAQRWEGSSKALL
FT                   TWATHQVPFLAAVEEGPLVATQFHPEKSGDAGATLLSNWLGEL"
FT   misc_feature    29068..29670
FT                   /note="Pfam match to entry PF00117 GATase, Glutamine
FT                   amidotransferases class-I, score 50.40, E-value 3.9e-11"
FT   misc_feature    29287..29322
FT                   /note="PS00442 Glutamine amidotransferases class-I active
FT                   site"
FT   CDS             29680..30423
FT                   /transl_table=11
FT                   /gene="hisA"
FT                   /product="putative phosphoribosylformimino-5-aminoimidazole
FT                   carboxamide ribotide isomerase"
FT                   /note="MLCB1610.24, hisA, probable
FT                   phosphoribosylformimino-5-aminoimidazole carboxamide
FT                   ribotide isomerase, len: 248 aa; similar to many e.g.
FT                   HIS4_STRCO (EMBL:M31628), hisA, Streptomyces coelicolor
FT                   phosphoribosylformimino-5-aminoimidazole carboxamide
FT                   ribotide isomerase (240 aa), fasta scores; opt: 1059
FT                   z-score: 1268.4 E(): 0, 66.3% identity in 240 aa overlap.
FT                   Equivalent to HIS4_MYCTU (EMBL:Z95586) Rv1603
FT                   (MTV046.01-MTCY336.01c), hisA, M.tuberculosis Probable
FT                   phosphoribosylformimino-5-aminoimidazole ribotide isomerase
FT                   (240 aa) (84.4% identity in 244 aa overlap). Contains Pfam
FT                   match to entry PF00977 His_biosynth, Histidine biosynthesis
FT                   protein, score 160.40, E-value 3.2e-44"
FT                   /db_xref="GOA:Q9CC56"
FT                   /db_xref="InterPro:IPR006062"
FT                   /db_xref="InterPro:IPR010188"
FT                   /db_xref="InterPro:IPR011060"
FT                   /db_xref="InterPro:IPR013785"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9CC56"
FT                   /protein_id="CAB43170.1"
FT                   /translation="MLLMSLILLPAVDVVEGRAVRLVQGKAGSENDYGSALDAALCWQR
FT                   DGADWIHLVDLDAAFGRGSNRELLSEMVGKLDVQVELSGGIRDDDSLNAALATGCARVN
FT                   LGTAACENPHWCAQVIAEHGDKIAVGLDVQIVDGQHRLRGRGWETDGGDLWDVLENLDR
FT                   QGCSRFIVTDVTKDGTLDGPNLDLLASVSDRTNVPVIASGGVSSLDDLRAIAKFTERGI
FT                   EGAIVGKALYAERFTLPQALAVVRM"
FT   misc_feature    29695..30414
FT                   /note="Pfam match to entry PF00977 His_biosynth, Histidine
FT                   biosynthesis protein, score 160.40, E-value 3.2e-44"
FT   CDS             30549..31226
FT                   /pseudo
FT                   /transl_table=11
FT                   /gene="MLCB1610.25"
FT                   /product="putative inositol monophosphate phosphatase
FT                   pseudogene"
FT                   /note="MLCB1610.25, probable inositol monophosphate
FT                   phosphatase pseudogene, len: 678 bp; similar to many e.g.
FT                   TR:O51845 (EMBL:AF005905), impA, Mycobacterium smegmatis
FT                   inositol monophosphate phosphatase (276 aa) and equivalent
FT                   to TR:O53907 (EMBL:AL022001) Rv1604 (MTV046.02), impA,
FT                   M.tuberculosis probable inositol monophosphatase (270 aa).
FT                   The M.leprae sequence is truncated at the N-terminus with
FT                   respect to the M.tuberculosis sequence and contains a stop
FT                   codon"
FT   CDS             31231..32016
FT                   /transl_table=11
FT                   /gene="hisF"
FT                   /product="putative cyclase"
FT                   /note="MLCB1610.26, hisF, probable cyclase, len: 261 aa;
FT                   similar to many e.g. HIS6_AZOBR (EMBL:X61207), hisF,
FT                   Azospirillum brasilense cyclase (261 aa), fasta scores;
FT                   opt: 980 z-score: 1121.0 E(): 0, 61.3% identity in 256 aa
FT                   overlap. Equivalent to HIS6_MYCTU (EMBL:AL022001) Rv1605
FT                   (MTV046.03), hisF, M.tuberculosis cyclase (267 aa) (89.8%
FT                   identity in 256 aa overlap). Contains Pfam match to entry
FT                   PF00977 His_biosynth, Histidine biosynthesis protein, score
FT                   377.70, E-value 1.2e-109"
FT                   /db_xref="GOA:Q9X7C2"
FT                   /db_xref="InterPro:IPR004651"
FT                   /db_xref="InterPro:IPR006062"
FT                   /db_xref="InterPro:IPR011060"
FT                   /db_xref="InterPro:IPR013785"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9X7C2"
FT                   /protein_id="CAB43172.1"
FT                   /translation="MYSGNGLAVRVIPCLDVYCGRVVKGVNFKNLRDAGDLVELAAAYD
FT                   AEGADELAFLDVTASSSGRATMLEVVRCTAEQVFIPLMVGGGVRTVADVDVLLRAGADK
FT                   VAVNTAAIARPELLADMAGQFGSQCIVLSVDARTVPTGSARTPSGWEATTHGGYRGTGI
FT                   DAVEWAARGADLGVGEILLNSMDADGTKAGFDLAMLRAVRAAVTVPVIASGGAGAIEHF
FT                   VPAVTAGADAVLAASVFHFRELTIGQVKDAMAAAGIAVR"
FT   misc_feature    31255..31989
FT                   /note="Pfam match to entry PF00977 His_biosynth, Histidine
FT                   biosynthesis protein, score 377.70, E-value 1.2e-109"
FT   CDS             32013..32360
FT                   /transl_table=11
FT                   /gene="hisI2"
FT                   /product="putative phosphoribosyl-AMP cyclohydrolase"
FT                   /note="MLCB1610.27, hisI2, probable phosphoribosyl-AMP
FT                   cyclohydrolase, len: 115 aa; similar to many e.g.
FT                   HIS3_RHOSH (EMBL:X82010), hisI, Rhodobacter sphaeroides
FT                   phosphoribosyl-AMP cyclohydrolase (119 aa), fasta scores;
FT                   opt: 352 z-score: 480.8 E(): 1.8e-19, 47.7% identity in 109
FT                   aa overlap. Equivalent to HIS3_MYCTU (EMBL:AL022001) Rv1606
FT                   (MTV046.04), hisI2, M.tuberculosis probable
FT                   phosphoribosyl-AMP cyclohydrolase (115 aa) (84.3% identity
FT                   in 115 aa overlap)"
FT                   /db_xref="GOA:Q9X7C3"
FT                   /db_xref="InterPro:IPR002496"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9X7C3"
FT                   /protein_id="CAB43173.1"
FT                   /translation="MTLDPDIAVRLKRNAEGLFTAVVQERSSGDVLMVAWMDDQALART
FT                   LETREANYYSRSRAEQWIKGSTSGNTQHVHSVRLDCDGDTVLLTVDQVGGACHTGAHSC
FT                   FDSAMLLAPQD"
FT   CDS             complement(33173..33835)
FT                   /pseudo
FT                   /transl_table=11
FT                   /gene="MLCB1610.28c"
FT                   /product="putative serine/threonine protein kinase
FT                   pseudogene"
FT                   /note="MLCB1610.28c, possible serine/threonine protein
FT                   kinase pseudogene, len: 663 bp; similar to TR:P72003
FT                   (EMBL:Z95890) Rv1746 (MTCY28.09,MTCY04C12.30), pknF,
FT                   M.tuberculosis probable serine/threonine protein kinase
FT                   (476 aa)"
FT   CDS             34094..35188
FT                   /transl_table=11
FT                   /gene="MLCB1610.29"
FT                   /product="putative antiporter"
FT                   /note="MLCB1610.29, probable antiporter, len: 364 aa;
FT                   similar to many e.g. CHAA_ECOLI (EMBL:L28709), chaA, E.coli
FT                   calcium/proton antiporter (366 aa), fasta scores; opt: 736
FT                   z-score: 857.1 E(): 0, 36.3% identity in 364 aa overlap.
FT                   Possibly calcium/proton antiporter in M.leprae. Equivalent
FT                   to TR:O53910 (EMBL:AL022001) Rv1607 (MTV046.05), chaA,
FT                   M.tuberculosis putative calcium/proton antiporter (360 aa)
FT                   (77.7% identity in 364 aa overlap)"
FT                   /db_xref="GOA:Q9X7C4"
FT                   /db_xref="InterPro:IPR004837"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7C4"
FT                   /protein_id="CAB43175.1"
FT                   /translation="MLKRIAWTALVPLFALAVLALTWGREIGPVVTALQAALLTGAVLA
FT                   AVHHAEVVAHRVGEPFGSLVLAAAVTVIEVALIVTLMASGENESWTLARDTAFAALMIT
FT                   TNGIAGFSLLLGSRRYGVTLFNAHGSGAALATLTTLATLSLVLPTFTTSHRGNEFSPGQ
FT                   LAFAAVASLGLYLLFVFTQTIRHRDFFLPVAQKGQKGLFEEDESHADPPSARSALISLA
FT                   LLLVALIAVVGLAELQSSAIEHLVTAVGFPQPFVGVVIATLVLLPETLAAVRAARRGRI
FT                   QTSLNLAYGSAMASIGLTIPAIALASIWLTGPLILGLGATQLVLLALTVVISVLTVVPG
FT                   RATRLQGEVHLVLLAAFVFLAIIP"
FT   CDS             complement(35235..35684)
FT                   /pseudo
FT                   /transl_table=11
FT                   /gene="MLCB1610.30"
FT                   /product="putative pseudogene"
FT                   /note="MLCB1610.30, probable pseudogene, len: 450 bp,
FT                   similar to TR:O53911 (EMBL:AL022001) Rv1608c (MTV046.06),
FT                   bcpB, M.tuberculosis probable bacterioferritin comigratory
FT                   protein (154 aa)"
FT   CDS             35793..37382
FT                   /transl_table=11
FT                   /gene="trpE"
FT                   /product="putative anthranilate synthase component I"
FT                   /note="MLCB1610.31, trpE, probable anthranilate synthase
FT                   component I, len: 529 aa; similar to many e.g. TRPE_PSEPU
FT                   (EMBL:M33799), trpE, Pseudomonas putida anthranilate
FT                   synthase component I (493 aa), fasta scores; opt: 1300
FT                   z-score: 1494.3 E(): 0, 44.4% identity in 504 aa overlap.
FT                   Equivalent to TR:O06127 (EMBL:Z95554) Rv1609
FT                   (MTCY01B2.01-MTV046.07), trpE, M.tuberculosis probable
FT                   anthranilate synthase component I (516 aa) (88.0% identity
FT                   in 508 aa overlap). Contains Pfam match to entry PF00425
FT                   chorismate_bind, chorismate binding enzyme, score 498.40,
FT                   E-value 5.6e-146"
FT                   /db_xref="GOA:Q9X7C5"
FT                   /db_xref="InterPro:IPR005256"
FT                   /db_xref="InterPro:IPR005801"
FT                   /db_xref="InterPro:IPR006805"
FT                   /db_xref="InterPro:IPR015890"
FT                   /db_xref="InterPro:IPR019999"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9X7C5"
FT                   /protein_id="CAB43177.1"
FT                   /translation="MHAHLAATTSREDFRQLAVDHRVVPVTRKVLADSETPLSAYRKLA
FT                   ANRPSTFLLESAENGRSWSQWSFIGVGAPSALTIRDGEAVWLGTVPQDAPTGGDPLHVL
FT                   QATLELLATAAMPGLPPLSSGMVGFFAYDMVRRLERLPELALNDLQLPDMLLLLATDVA
FT                   AVDHHEGTITLIANAVNWNGTDERVDQAYDDAIARLDVMTAALGQPLPSTIATFSRPDP
FT                   RRRAQCTIEEYGAIVDHLVDQIAAGEAFQVVPSQRFEVDTDVDPIDVYRMLRVTNPSPY
FT                   MYLLHVPNSDGATGFSIVGSSPEALVTVKDGRVTTHPIAGTRWRGQTEEEDQLLEKELL
FT                   ADEKERAEHLMLVDLGRNDLGRVCTPGTVRVEDYSHVERYSHVMHMVSTVTGLLGEGRT
FT                   ALDAVTACFPAGTLSGAPKVRSMELIEEVEKTRRGLYGGVVGYLDFAGNADFAIAIRTA
FT                   LMRDGIAYVQAGGGVVADSNGPYEYIEASNKARAVLNAIAAAETLTSLDFGVALAPGRV
FT                   AARGEAGNQGRL"
FT   misc_feature    36477..37283
FT                   /note="Pfam match to entry PF00425 chorismate_bind,
FT                   chorismate binding enzyme, score 498.40, E-value 5.6e-146"
FT   CDS             37382..38176
FT                   /transl_table=11
FT                   /gene="MLCB1610.32"
FT                   /product="putative membrane protein"
FT                   /note="MLCB1610.32, possible membrane protein, len: 264 aa;
FT                   unknown function, similar to TR:O06128 (EMBL:Z95554) Rv1610
FT                   (MTCY01B2.02) M.tuberculosis hypothetical protein (235 aa),
FT                   fasta scores; opt: 800 z-score: 925.0 E(): 0, 65.4%
FT                   identity in 234 aa overlap"
FT                   /db_xref="InterPro:IPR011746"
FT                   /db_xref="InterPro:IPR019051"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7C6"
FT                   /protein_id="CAB43178.1"
FT                   /translation="MAPDIKSARAGRLTIQIAQLLLVVAAGALWMAARLPWVVIRSFDG
FT                   LGPPKEVALSGASWSAVLLPLALLMLAATVAAIAVRGWPLRVLAGLLAVASFLVGYLGV
FT                   SLWVLPDVTVRGAVLAHVSLLSLVGSQRHHLGAGAAVAASGCTLIAAVLLMRSASVIGS
FT                   ARQGTSKYVVPAQRRSIARRDGAATAISQMSERMIWDALDEDRDPTDRLREPDTEGRWW
FT                   TACRRSLPFMNVVEIGGCTGSVAGRWVTSGKGNDTHVSGNCA"
FT   CDS             38154..38972
FT                   /transl_table=11
FT                   /gene="trpC"
FT                   /product="putative indole-3-glycerol phosphate synthase"
FT                   /note="MLCB1610.33, trpC, probable indole-3-glycerol
FT                   phosphate synthase, len: 272 aa; similar to many e.g.
FT                   TR:O68814 (EMBL:AF054585), trpC, Streptomyces coelicolor
FT                   indoleglycerol phosphate synthase (269 aa), fasta scores;
FT                   opt: 1064 z-score: 1185.8 E(): 0, 64.1% identity in 262 aa
FT                   overlap. Equivalent to TR:O06129 (EMBL:Z95554) Rv1611
FT                   (MTCY01B2.03), trpC, probable indole-3-glycerol phosphate
FT                   synthase (272 aa) (90.8% identity in 272 aa overlap).
FT                   Contains Pfam match to entry PF00218 IGPS,
FT                   Indole-3-glycerol phosphate synthases, score 339.30,
FT                   E-value 4.3e-98 and PS00614 Indole-3-glycerol phosphate
FT                   synthase signature"
FT                   /db_xref="GOA:Q9X7C7"
FT                   /db_xref="InterPro:IPR001468"
FT                   /db_xref="InterPro:IPR011060"
FT                   /db_xref="InterPro:IPR013785"
FT                   /db_xref="InterPro:IPR013798"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9X7C7"
FT                   /protein_id="CAB43179.1"
FT                   /translation="MCPATVLDSILKGVRADVAAREACISLSEIKAAAAAAPAPLDAMA
FT                   ALREPGIGVIAEVKRASPSVGSLATIADPAKLAQAYEDGGARIISVLTEERRFNGSLDD
FT                   LDAVRAAVSVPVLRKDFVVQPYQIHEARAHGADMLLLIVAALDQSALMSMLDRTESLGM
FT                   IALVEVRTEQEADRALKAGAKVIGVNARDLMTLEVDRDCFSRIAPGLPSNVIRIAESGV
FT                   RGPADLLAYAGAGADAVLVGEGLVKSGDPRAAVADLVTAGTHPSCPKPAR"
FT   misc_feature    38169..38927
FT                   /note="Pfam match to entry PF00218 IGPS, Indole-3-glycerol
FT                   phosphate synthases, score 339.30, E-value 4.3e-98"
FT   misc_feature    38313..38357
FT                   /note="PS00614 Indole-3-glycerol phosphate synthase
FT                   signature"
FT   CDS             39036..>40055
FT                   /transl_table=11
FT                   /gene="trpB"
FT                   /product="putative tryptophan synthase beta chain"
FT                   /note="MLCB1610.34, trpB, probable tryptophan synthase beta
FT                   chain, partial CDS, len: >340 aa; similar to many e.g.
FT                   TRPB_PSESY (EMBL:M95710), trpB, Pseudomonas syringae
FT                   tryptophan synthase beta chain (408 aa), fasta scores; opt:
FT                   1387 z-score: 1546.4 E(): 0, 61.6% identity in 320 aa
FT                   overlap. Weak similarity to ilvA, M.leprae putative
FT                   threonine dehydratase biosynthetic in cosmid L458 (427 aa)
FT                   (26.8% identity in 246 aa overlap). Equivalent to
FT                   TRPB_MYCTU (EMBL:Z95554) Rv1612 (MTCY01B2.04), trpB,
FT                   M.tuberculosis probable tryptophan synthase beta chain (410
FT                   aa) (88.5% identity in 331 aa overlap). Contains Pfam match
FT                   to entry PF00247 trp_syntB, Tryptophan synthases, beta
FT                   chain, score 623.90, E-value 8.8e-184"
FT                   /db_xref="GOA:Q9X7C8"
FT                   /db_xref="HSSP:2WSY"
FT                   /db_xref="InterPro:IPR001926"
FT                   /db_xref="InterPro:IPR006653"
FT                   /db_xref="InterPro:IPR006654"
FT                   /db_xref="UniProtKB/TrEMBL:Q9X7C8"
FT                   /protein_id="CAB43180.1"
FT                   /translation="MMPNLSCFSVAISEPTCHDPDSGGHFGGPDGYGGRYVPEALMAVI
FT                   EEVTAAYEKERVNQDFLDLLDKLQANYAGRPSPLYEATRLSEYAGSVRIFLKREDLNHT
FT                   GSHKINNVLGQTLLAQRMGKTRVIAETGAGQHGVATATACALFGLDCVIYMGALDTARQ
FT                   ALNVARMRLLGAEVVSVETGSRTLKDAINDAFRDWVTNADNTYYCFGTASGPHPFPTMV
FT                   RDLQRVIGLETRRQIQYQAGRLPDAVTACIGGGSNAIGIFHAFLDDPGVRLVGFEAAGD
FT                   GVETGRHAATLTGGLPGAFQGTFSYLLQDEDGQTIESHSIAAGLDYPGVGPEHAWLRET
FT                   "
FT   misc_feature    39111..40055
FT                   /note="Pfam match to entry PF00247 trp_syntB, Tryptophan
FT                   synthases, beta chain, score 623.90, E-value 8.8e-184"
FT   misc_feature    39336..39365
FT                   /note="PS00168 Tryptophan synthase beta chain
FT                   pyridoxal-phosphate attachment site"
XX
SQ   Sequence 40055 BP; 7643 A; 10900 C; 12538 G; 8974 T; 0 other;
     taggttagtc accaggacgg tcgcacaccc gctagcgctt cgtgtttcgg atctcatcac        60
     ttcaattcat cgggacaagg tgcgcaggcg ctgcttctta tagtaacttt actgtagtcc       120
     ttttgcacca tatacgtgtt gtttttcgca tttatacgcg tttcagccga tgttttccgg       180
     ttataatggc tgcaagctgt tcgtgtgagt cagactgatg catcgtgcac tattgcagag       240
     cttccctata gatccgtcac ggacttggtc gtgctggatt ttccgcgacc tgaggtcgcg       300
     ttgatcactc ttaatcggcc cggccggatg aattccatgg ctctcgacct tatgaagtcg       360
     ctcaagcagg ttctcaaaag gattacctat gaccactcgg tgagggtggt cgtgcttact       420
     ggcgcgggtc gaggattctg ctctggtgca gatcaaaagt tcacggcacc tgtgccacag       480
     gtcgaggggt tgacacagcc ggttcgcgcg ttgcgtgcca tggagcttct tgaagaagtc       540
     atcctggctc tgcggcgatt gcaccaacct gtgatcgccg cgatcaatgg tccggccatc       600
     ggcggtgggt tgtgtctggc gttggccgca gacgtccggg tggcttcaac tagggcctac       660
     tttcgagctg ctggcatcaa caacggcctt agtgccagtg agctggggct gagctatttg       720
     ctgcctcggg ccgtcggatc gtcgcgggct tttgaaatca tgcttagcgg tcgcgatgtc       780
     ggggcggagg aagccgagca gatcgggctg gtgtcatatc gggtgtcgga cgatcggctt       840
     ctggacactt gctactccat cgccgcgcgg atggcgacgt tctcgcggtc tggaactgag       900
     ttgacgaagc gggctttgtg gggtggcttg gatgccgcca gtctggataa gcacatgcaa       960
     agcgagagcc tggcacagct ctttatagca ttgcatacca gcaattttga agaagcggct      1020
     gccccatgca ccgagaagcg gccaacggtg ttagtcgatg ccagaggctg cgccacaagc      1080
     ccggggtaag gctttcgaat aaccatactt cacgctgatt gtttgctaac aaagattttt      1140
     tgctaacaag catggtcatt ccatgtcttc caagtcacgc gatgaatttt gttggtctcg      1200
     acataataga agcgaactag ttgtcgcagc tgttggttgg cgaccacgtt gtgacattga      1260
     tcggcagccg acagaagtaa aaggctgctg atgtcacaga tagtcgctca agcggtggat      1320
     gacttaatgg tcagaggcac tctggcatgt cgatctaggg ccgaccaccg actcgcaatt      1380
     ggaacagagg gagcacttga caccttggat cttagaggga acacattgag catgcgatga      1440
     cgatacctag gatatcatag gccagtgtcg acctcactgt tacctgtggt gatggtggac      1500
     cgagccgttc gaacctgacc agactgatcc gcgacccctg cgtgcggctc cgggatcggt      1560
     gcgggagggc ggcatatgaa ctcgtcatcg ggcagcgcgt atatcttctt gcagtagtta      1620
     agtatttatg acgtaccacg accgggactg agcggtttgt cgtggattgg ctcaacgcac      1680
     acggctggcg gggtgagggt accgtgctcc cgggatgaga tgttctgact cggccgctgg      1740
     attggtggcg tgccgatagc caactaagga tgagctcgct gacttcgtaa tcgctgaacg      1800
     tcggtagagg tgcgcgcgga cgacttttgc aatgtggccg tgaccaagca ccttctggta      1860
     ggcagggtta cttagcccgc attgttcaat ctatgtagtg agcagcgggc ccaggaagtg      1920
     agcgctgtgt tcgcacgttt aggagaaaac ggtgttcgaa tgtaacggca atccatgaag      1980
     gagtgcggcc ggtcgaacag gacgagagcg ataccttcat gatgctcttc gcttgtgtgt      2040
     cgcggatgct atggcagcgg cattagactt gcagcgagcc cagctggacc gatccggttc      2100
     cgtattgggg tgcccatccc ggggaggtgc ggttgcgtga tgagggtagt tacgttggtt      2160
     cgatcgtcaa ccggactgcg cgcctacgca acttggcgca tggaggccag accttgctgt      2220
     taagtgctac cgagaacttg atggttgctc ggctctttag cggggcctgg ttgacccatc      2280
     tggggtattc accggatgcg tggtctgccc cggcctgagc gggtggtgca gctgtgtaat      2340
     gctgatttgc aggttgagtt cccgccgttg cgggccactt atgacctgct aaactacgtt      2400
     ttcctctgca gctaaacgat tttcgtcggg cactgcggac aacccaacga ggttcagcag      2460
     ttgatggccg gtaatcgggc tgttcaccct gaccggtgtt ggtggtgtga gcaagacacg      2520
     gctatcgttg caggtagcgg tccaatttgt cggtgaggtc ggcggtgggt ggtacgtgga      2580
     tttggtattg atcagcgatt cggatctggc gccggccaag gttgcctgtg agctgggtct      2640
     gcgcgaccaa cctgaccgct ccaccgtgga cgtaggctat atttcttcgc cgggcgtcag      2700
     gcgctactag tgggctggat aactatgaac atctggtggg aggctaccgc ggcgttggtg      2760
     tcgcgtctga cggacgcagg tccacgcgtg agggggtgac tgactatggc gcgaacgttt      2820
     tcgtctcgcg ggcgaggtga gcgggtggat gccggcactg tctgtaaacg attaggtcat      2880
     cgaattgttt tttgtgaccg ggcgcagacg gatttccggc ttatcgatgt ccactcgacg      2940
     gctgcaatgg cgatccgtgg ccgcctgtat ggtgtttgtc acgggcgatt gagctagtgt      3000
     cggtctgtgt tcgggcgctt tcaccggctg agatggttga taatttagat gactgtttcc      3060
     acttgctgac ctggcaacgt ctgcatggtg gtggcccgtc aacacactct gtgggcttcg      3120
     gcggactggt cccatgcgct gttgatcagg ccggaaccgg ttgtgttccg ctgggtggca      3180
     gtgtgtggac tgtttcttcc tgggtgctgc ggaagccgtc gcggttgtgg tgaggtggag      3240
     ccctgtcggg tacacgcgat gcgctcacgc tgcttgttga ccagtcgttg gtggtcaccg      3300
     acgatagtgg ctggtcgagc attatatcgg ctgtcgaaga cggtgtgtca gtacgtgctg      3360
     gataaacttg acgagtctgg ggagatttgc acgctgtgcg gctgcggcgc agcgactact      3420
     acgtgagact ggccgctctg cttaacgtta atgcctcgga tgacactgac tatggcaagt      3480
     gccttaaaca ggcagaggtc gaaataaaca agtcgaaata aacaacttgc gcgccgcctt      3540
     tatgtggacc ttgcgagagc tccgacactg gactcgcgct ggtgctgtcg tcttcactgc      3600
     agcccacatg gctggcatgt ggccacatta cgttaagggc aagcctggct tgatgctgtt      3660
     ctctgccggc aaggatggcg gttaacttat gctaccgctt gtgatgtgcg cgagagtgct      3720
     tgccgagaag gttgtgctcg acatcttttg tcgatgttac cgcgggcata gagtatacca      3780
     accggccttg cagattgctt gtgaagttga cgaccgcgct ttgctgtctt cgggtactca      3840
     cggaatgcgg gttaagcgct gtgatggccg gttacgacgt tgctgaccgt tatgttaccg      3900
     aggtacttgg tcttgcccga gcctcaaacg atctggtgga ggttgagctc agatcccccg      3960
     cgttagcgac acggctgatg tgtatcgccg gtattgacga tgtaggctgc aataagtgca      4020
     gcattcgtac gactgagtac gacaagtctt cgccgccgca gcatcccgac cggctcgatc      4080
     gttcctgctc aatgctgtca ttacccggat cagtggcagg taagggattg cacatatgtg      4140
     tggacgctgc aaagggtgtg ccataccacg ttttgtgtgg acgtgttctc ccaccccatc      4200
     ctgggctggc aggtgatgac cagcaagacc gcgggtgctt ggtcacatcg gtgcttgagc      4260
     agacgctact caccccctat tgcacatagc tccattctct ataaccgatt tgattcatca      4320
     cttggacgcg ggatcaatat acgtcgatcg cgttcaccga ggcgctgcac gaggccggta      4380
     tcgccggatc catcagcagt gtcggcgacg ctttggacaa cgtgtttatg gattctgcta      4440
     tagggacggt acaaatctga attaatcgac caacatgcga ccttcaccgg tcgcgtcgaa      4500
     ctggaacggg aaaccacctc ctgggtgcac tagtacaaca cgatcgagac tccacctcat      4560
     caatatcaat ccaataccga ccaccagtcg catacgaaca gcgctaccag cagacagcca      4620
     ccccgaccga agtgacatga accaataatc tccgacagct ccagaacagt tcaagatgcg      4680
     tccaagattg ccttctcggt ttttggagtt aagccgatct cggtagtggc agccatcgct      4740
     aatcccttgc tgccgtggtt ggccgttact ggccatattg tgcaccacgg cggtgacaac      4800
     ttgtcagctg ttctttccca gcatgttggc gcgtcaagct cggacaagtc gtcagcgacc      4860
     ttgcctccgc agtctgtacc ggcttgacct gtggcaacaa caatgcacag agaatgcatg      4920
     tgggcatctg ctgatcgcaa aacatccttt gaactgcgac ggaacaagtg tgcttggcag      4980
     gatttgaccc tgtagtcgtc ggatgaggag ggtagcaccc tgcatcatga aacagaaggc      5040
     ttgttgctat ataagtgcta cacgctggac tactttggac aggaaaatct gctctgacca      5100
     gtgcccccgg cacgactcga acgtgcgacc tagggattag aaggcccttg ctctatccac      5160
     ctgagctacg gaggcagtgc gcagttcagt ttatccaatg gggatagtca atcggtttcg      5220
     cctgcttaac gcacacgaat tattttcctc gccgcggtta ccataatggg gtgtgcttcg      5280
     atacacgtac aacaactggt ggtagtcatc gataaccgca gtaatgcgtg tgcctgatgt      5340
     ctaaattgtg gtggcgttgt attgggcgta gcgattagcg tgacttcgga atgctcgtag      5400
     gctgcagccc gggtgctgtg acacgaggct cagctggcgg ccgatgtctg atatcctgtc      5460
     acttgttgct ttgcccttag cccgtcgcgg tgagctggct ggccatgagt ttcccgcagt      5520
     gcgtgaccgg gcctgcgctg tcatggctat acgtctgttg cgacgacgtg gcctttgcag      5580
     tgcgcgggtc acacatttaa ctgctgtact cgaggagtcc ttcgggatgg gcttccgcat      5640
     atctggtgtg tagcccgatt agcgattcgt cggcgtgtcc gcgctgcgac ggctgttgca      5700
     gatggctggg ccgcactgct ggtaagcggc acacagctcc cgacctcttc tgcggttcgg      5760
     accggtagcg aaaatctgtt cgacatttgg tggcgtcatg tctcccccgc aagtgggaag      5820
     gactcccaga cgtctgtctg gctgtcagga gccggtgttg gtacaggttc ttcgtgtggc      5880
     attggtctat caacatcaaa cgggggcaag cctattcatt gccgaacagg gggatggaac      5940
     ttagcgggat ttgtgtcttg gtcaacggac agcaagggtt cgcgctttgt ttggccggct      6000
     cacatcttcg attgaaaagg ttatcgcgtg ggtacgcgat aacattgctg actgccgccg      6060
     ccgactccga gcggttcaac gcggctgcgc cgatctcgta cacgcaaagc gaggcttcgt      6120
     cgtgtttgat gctgcacggt gaggaaaaaa atcctttggc tccgagcgcg aaggctcacg      6180
     ccttctgcgc ggcgctgctc acggttggag tccgcacggg cccttgtcgg catccccgac      6240
     gagactgaac cgggtggccg caattctgac gaatccgcca acccggctgc ccattatgga      6300
     gtttcgtatg ccagcattcc aacatccacg caatgagcga cctgtatctg cagttggctt      6360
     tcaccagcgg gtgaacccga ctagcgttgg gtaggcaatt aggtgggggg actgagtgca      6420
     ggaggcgtgg ttggcggtga caacgtactg gtgggcgtgg ggtcgttctg atggctgact      6480
     ccgtcgaggg cattggaccg ttcgatgaac tgggagcgct cgactatctg ctgcatcggg      6540
     gtgaggcaaa tccacggacc cgcgccggga tcatggcggt ggaacttttg gataccacac      6600
     cggactggaa tcggttccgg agccggatcg aggatgtctc ccaacgagtg ttgcggctgc      6660
     ggcagaaggt tgtagtaccg acattgccta ccgccgcacc acgctgggtg gtggatcccg      6720
     acttcgaact ggactttcat gtgcgtcggg tgcgcgtgcc tgatccgggt acgttgcgcg      6780
     aggtattcga tcttgccgag gtgattcagc agtcaccgat ggacgtgtcg cggccactgt      6840
     ggacggctac cttggtcgag ggcctggccg ctggcagggc tgcgatgcta ttgcagatta      6900
     gccatgcgat caccgacggt gtcggcagtg tcgagatgtt cgccgaaatg tatgacttag      6960
     aaagagatcc gccgtctaga cctcggtccc cgcaacctat tccgcaggat ctgtcgcgca      7020
     atgacttaat gctacaaggc atcaaccact tgccggtcgc cctggcaggc ggtgtcgtgg      7080
     gtgggctatc tggggtggcg tcggtggctg ggcgggcaat tctacgacct gcttcgactg      7140
     tatcaggagt cgtcggttac gtccggtcgg ggatcagagt gttgagccag gctgccgaac      7200
     cgtccccgtt gctgcgccag cgcagcctgg ctactcgcac cgaggcaata gagatccagc      7260
     tctccgacct gcacaaggct gcgaaagcgg gcgatgggtc gattaatgac gcttatctgg      7320
     ccggcttggt tggtgccttg cgccgctacc atgaggcact tggtgtgtcg atcagcacgc      7380
     tgccgatggc ggtgccggta aacgtacgga ctgaagccga tgtggttgga agcaaccgat      7440
     ttgttggcgt taccttggcg gcaccgctcg gtaccaatga tccggcagcc cggatgcaga      7500
     agattcgttc gcagatgacg cagcggcgcg acgagcctgc aatgaacatc attggttccc      7560
     tcgcaccgtt gatgacagta ctaccggcat cggtgctaga ttttatcgtt gactctgtgg      7620
     ccagctccga tgtgaatgcc agtaatattc cggcctaccc cggggacacc tacttcgcgg      7680
     gtgcaaaaat cttgcggcag tatggtatag gacctcgccc tggtgtagcg atgatggcgg      7740
     tgctgatgtc tcgaggcggg ttctgcacgg tcactgtgcg atacgatcgg gcttcggtga      7800
     aaagcgaggc attgtttgcc cggtgcctgc tggagggttt cgacgaggtc ctggcgttgg      7860
     ccggtgaccc gacaccgcac gctgtgccgg catcgtttgc tgcgcggtcc tccggttcgc      7920
     cggcgggatg gttgtcgagc tcatgagtgc cttcgacgag aacgggacgc ccaagtcagg      7980
     gccggaagtg cgcttgcctg gttcggtcgc cgagatcctt gccagtcctg ctggtcccca      8040
     ggtaggggcg tttttcgact tggatgggac gctggttgcc ggctttaccg cagtcatcct      8100
     tacgcacgag cgtctgcgac gctgtgatat gggtgtggga gaattgcttt caatgattca      8160
     ggctggtctg aaccatacac tcgggcgcat cgagttcgag gacctcatcg gtaaggtttc      8220
     ctcggcgttg cgcggacggt tattgactga cttggaggag atcggcgagc ggctgtttgc      8280
     ccagcgcatc gagtctcgga tctaccctga aatgcgtaag ctggtgcagg ctcacgtggc      8340
     tcgcggtcac accgtggtgc tcagctcgtc ggccctgacg attcaagtgg ggccggtcgc      8400
     acgttttctg ggtattgccc acatgctcac caacaagttt gagatcaacg aagacggaat      8460
     gctcactggt ggcgtggtta aaccgatctt gtggggtccg ggcaaagccg ctgcggtaca      8520
     acgatttgct gctgagcatg acatcgacct caaggacagc tacttctacg ctgatggtga      8580
     cgaggacgtc gcgctgatgt atctagttgg caatccacgg ccgactaacc ctgaaggaaa      8640
     aatggctgcc gtcgccaaac gccgtggctg gccgatcttg aaattcaaca gccgtggcgg      8700
     cgttggtatc ggagcgcagt tgcgaacact ggtcggcctg agttcagtgc taccactcgt      8760
     ggccggcgcg gtgggaatcg gtgtgttgac tggtagccgt cggcggggtg ccaacttctt      8820
     cacctctgtc tttcccccgt tggtgctcgc ggccgcgggg gtgcatctca atgtgatcgg      8880
     gaacgagaac ctgaccatgc agcgtcccgc cgtgttcatc tataaccatc gcaatcagat      8940
     tgacccgctc atcgccggtg cgctagtgcg tggcaattgg attgctgtgg ccaagaaaga      9000
     attggcaaag aatccgatca tccgcgcgct cggcaaatta accgacggtg tgtttattga      9060
     ccgggacgat ccggtcggcg cggtagagac gttgcacgcg gtcgaagacc gggccaggaa      9120
     gggactctcg attctgatgg ccccagaagg tacccgatta gataccacag aagtcgggcc      9180
     gttcaagaag gggcctttcc gcatcgcgat ggcggtcggg atcccgattg ttccgattgt      9240
     gttccgcaat gcggagatcg tcgccgctcg aaactccaat acgattaatc cgggcatggt      9300
     cgatgtcgcg gttttgccgg cgattccggt tgatgactgg actcttgatg cgttatcgga      9360
     gcgcattgcc gaggtgcgtc agctgtatct ggacattctg gctgactggc cggtcgacga      9420
     gctgcccaag gctgctctgt acacccgggc aacaaaggcg acggtgaaaa aggcggagcc      9480
     aaaaaagggc ggtcgcactt cggccgccaa gaccaccgct aagaagacag cgaccaaggc      9540
     cacggcagct aagactgcta acacagctgt taccaaaagc atgttgacgc cccccacgag      9600
     gatatcggac aaagagaacc cgaaagtacc taaggtcgtt tcccaagacc tcgtcgtgaa      9660
     acagcccaga gagcggtcgt gactgaaccg gatgtagaaa tcagctcagt ccttaccggt      9720
     gaagacacgc tggtgctagc gtctatggac actccggcgg aaattgagct ggtcatggat      9780
     tggctatgcc agcagcgtaa ccgcaacccg gacatcaagt tcgacgtatt gaagcttcct      9840
     tcgcgcaact tagcgcccgc ggcgctgaca gcacttgttg aacagctcga atccgacgaa      9900
     gaccggtcgg tcgtgccggt gcgtgttttc tggatgccgc ctgcggagcg cagtaagttg      9960
     gccaagctgg ctggattgtt gcccggccgg gatccttacc accccaaccg gcgccagcag     10020
     cgccacatct taaaaaccga cgcccggcgt gccctggtga ttgctggcga ctctgctaaa     10080
     gtgtccgagc tccgccaata ctggcgcgat accaccgttg gagaaaacga gtgcgatttc     10140
     gctcagttcg ttactcgccg cgccatcttg gcgatggaac gtgccgagtc tcgaatcctc     10200
     ggaccacagt acaagtctcc gcggctggtg aagccagaaa tcttggcgtc aacgcggttt     10260
     cgtgctggac tggaaaagat ctcgggcgca accgtggaag aagctgggaa gatgcttgac     10320
     gaactcgcca ccgggtggag cagggcgtcg gttgacctcg tttccgtgct cggcaggatg     10380
     ctcagccgcg gcttcgaacc tgagatcgac tacgacgagt atcaagtcgc ggcgatgcgc     10440
     gcggcgttgg aagctcatcc agcggtgctg ctgttctcgc accggtccta cattgacggt     10500
     gcggtggtgc cggtggcgat gcaggagaat cggctaccac cggtgcatgt gttcgccggc     10560
     atcaacctgt cgttcgggtt aatggggcca ctgttgcgcc gctccggcgt cattttcatc     10620
     cgccgtaaca tcggcgacaa tccgctctac aagtatgtct tgcgcgaata cgtcggctac     10680
     atcgtggaga agcgtttcaa cctgagctgg tccattgagg gcactcgttc gcgtactggc     10740
     aagatgctgc cacccaagct cggtctgctc acctacgtgg ccgatgcgta cctggacggc     10800
     cggagtgaag acatcctgtt gcagccggtg tcgatcagtt tcgatcagtt gcacgaaacc     10860
     gccgagtacg ctgcctatgc tcgtggcggc gaaaagacgc ccgaaggtgt cgcttggctg     10920
     tatagtttta tcaaggcgca aggtgaacgt aactacggta agatctacgt ccgtttcccg     10980
     gaagcggtct cgatgcggca gtatctcggt gcgcctcacg gtgcattggt tcaagatcaa     11040
     gacgctaaac ggcttgcgct ccaaaagatg tcgttcgaag ttgcatggcg gattctgtgt     11100
     gcgacgccag tgacggcgac agcgttggtt tccgcgctgc tattgaccac tcgtggagtg     11160
     gccttgacgc ttgatcaact gcatcacacg ttgcaagaat cactggatta cctggaacgc     11220
     aagcaaactc ctgtgtcgaa gagtgcgttg cggctgcgtt cgcgtgaagg cgtgcgtgct     11280
     gcggtcgacg cattgtccag cgggcacccg atcactcggg ttgacagcgg tcgggaaccg     11340
     gtgtggtata ttacccccgg taatgaacat gctgcggcat tctaccggaa ctcggtgata     11400
     cacgccttcc tggagacctc gatagtcgaa ctcgcgttgg cgcatgccag gcatgtcgaa     11460
     ggcgaccgta tgaaggtttt ctgggcgcag gcgatgaggc tgcgtgatct cttgaagttc     11520
     gatttttatt tcgcggattc ggctgctttt cgtgccaata tcgccgaaga gatagcgtgg     11580
     caccagaatt gggaggatcg tgtttccggt gatggtgatg atatcgacgc gatgctgctt     11640
     actaagcgac cgttgatctc agatgcgatg ttgcgggtat tttttgaagc gtacgatatt     11700
     gtcgctgatg tgttgcgcga tgctccggcg gatgttggcc aaaaggaact gactgaattg     11760
     gcacttggtg tcggacgcca gtacgtggca cagggtcggg tccgtagcgg tgaatcggtg     11820
     tctacgctac tattcgccac cgcttaccag gttgttgtcg atcagaatct gatagcgcca     11880
     gctccggatc tcgctgaacg tcggatggtt ttccggcggg agttgcggga tattcggcga     11940
     gatttcgact acgtcgaaca aatcgcgcgc agccggttca tcgtccgtga gttcaaatcg     12000
     cgttagtgac gcttgtaccg ttcagtggcc aagctatcgg tccataccca gggtagctga     12060
     cggttgctcg ctttgctggc aagctcgggt gtgtccttaa ctgactggtg tcgcagcgtt     12120
     aataaggtcg tacggcgact catcgccaac cgtcgggatc gctcgtcact cgcggactga     12180
     tccccagtca cggttctttg cacggggtca gcgggggtgg gctcgtgttg ttcctaattt     12240
     gtcggtagtt ccgcgctgaa tggtcattgg aagctgaatg gtcattggaa aggggccaac     12300
     acgtcacgac tcaaagacga caacagagca agcaacgaag ggaatgcgtc agtgttcgga     12360
     acgccgttaa ctgtagtcgg ttacatcgtc aacgacctgc aacgctgcaa ggtgggcgat     12420
     caggaggtgg tcaagtgtcg ggtggtcagc agttctcgtc gtcgcatcgg tgatcgtggt     12480
     gggtcacgtc tataccagcg agtacgggac cggcacggct tatcgttcgt cgttggagat     12540
     gcgggcaact cctccggtcg gcccggattt gtcgcgggtg attgtacgca tcgagaagcc     12600
     tggataaaaa ccggtccgga cgtggattcg attcccacag ccgccgatgc tgttacccga     12660
     gctcctgacg ggaatccagc ctccaggtcc ggctggtcgc cgctactggg ctcgacgcat     12720
     tacccatgtc accgatgttg gtttaggagt cgaattgcgg ctgtggcgcg atgcctagga     12780
     tgggcgctag taactttgca gactagaaag gcaatacggg ggcatggctg agttcatcta     12840
     cacgatgaaa aaggttcgta aggcgcacgg cgacaaggta atccttgacg acgttacgct     12900
     gaacttcttc cctggagcca agatcggtgt cgtcggcccg aatggtgccg gtaagtcgag     12960
     cgtcttgcgg atcatggccg gtatggacaa gccgaacaac ggtgacgcgt tcctggctac     13020
     cggcgctagt gtcggcattc tgcaacagga accgccgttg aacgaggaga agacggttcg     13080
     tggcaatgtg gaagagggct tgggcgacat caagatcaag cttgaccgct tcagtgaggt     13140
     cgccgaattg atggccaccg actcctccag cgagttaatg gaagagatgg gccggctgca     13200
     agaggaactg gaccacgccg acgcgtggga ccttgactcg caactggagc aggccatgga     13260
     cgtgctgcgc tgcccaccgc ccgacgaacc ggtgacaaac ctgtccggtg gtgagcggcg     13320
     tcgtgtggcg ctgtgcaagc tattgctgtc caaacctgac ttgttgctgc tcgacgagcc     13380
     gaccaaccat ctagacgccg aaagcgtgca gtggctcgaa cagcatcttg ctggttacgc     13440
     tggtgcggtt ttggcggtca cccacgaccg ctacttcctg gacaacgttg ccgaatggat     13500
     cctggaactg gaccgtggcc gcgcatatcc ctatgaaggt aactactcta cttaccttga     13560
     gaaaaaggct gagcgactga ctacacaggg tcgcaaggac gccaagctac agaagcgatt     13620
     gactgacgag ctggcctggg tgaggtccgg agctaaagca cgtcaagcca aaagcaaggc     13680
     gcgtttgcag cgctacgagg aaatggctgc cgaagctgaa aagaatcgta aattcgattt     13740
     cgaagaaatt caaatcccag tcggaccacg cctgggcaac atggtggtcg aggtcgaaca     13800
     cctcgacaaa ggttacggcg gacgcaccct aatcaaggac ttatccttca cgttgccccg     13860
     taacggcatt gtcggcatca tcggccccaa cggggttggc aagaccacac tttttaaaac     13920
     catcgtgggg ctcgagcagc cggacggcgg tgctgtcaag attggcgaaa ccgtcaagct     13980
     gagctacgtg gatcagaccc gcgcgggtat cgatccgaag aagatcgtgt gggaagtggt     14040
     ctcagatggg ttggaccaca ttcaggtcgg ccaaaccgag gtctcgtcgc gggcctatgt     14100
     ttcggcgttc ggattcaagg gtcccgatca acagaagctg gccggggtgc tctccggtgg     14160
     tgagcgaaac cggctgaacc ttgcgctgac gcttaagcag ggcggcaatc tcatcctgct     14220
     tgatgagccg accaacgacc tcgatgtcga gaccttgggt tcgctggaga acgcgttggt     14280
     gaatttcccc ggttgtgcgg tggtgatttc gcacgatcgc tggttcctcg atcgcacgtg     14340
     cacgcacatc ttggcatggg aaggcgacga cgaggggaag tggttctggt tcgagggaaa     14400
     cttcggtgcc tatgaggaaa acaaggtgga acggctcggt gctgaagctg cgcgtccgca     14460
     cagggtgact caccgtaagt tgacgcgcga ctaatgtcag cttcactagc gttggtatcc     14520
     cgggggttca gagtgggttg ggagcaaacg gcatgaccat cgatcccgga gctacgcatg     14580
     ttgccgagct gtgtaccaca ttcacgcagg gtgcggacgt acccgactgg atctcgaagg     14640
     cgtacatcga cagctatcgc ggttcgcacg gcgacgtacg cgaagccccc gaaaccagtc     14700
     gagtcaatcc taacgccttg gtgacgccgg ccatgctcag cgcacactat cgtctaggtc     14760
     agtgccgacc gaacggtagg aactgtgtcc gcgtctatcc ggcagatgac cccgcagggt     14820
     tcgggcccgc gctgcagatc gtcaccgacc atggcggcat ggtgatggat tctattaccg     14880
     tgcttctgca ccggctcggg gttacataca cggccatgat gaccccagtg ttcatggtgc     14940
     ttcgcagccc gacgggggag ttgctcggcg tcgaacctcg agcttctagt acgtcgcact     15000
     ccatcgaagg gacttgggtc ggtgaggtct ggatatatat ccagctcttg cctgctgtcg     15060
     atagtaaatc cctagccgag gttgaacagc tgctgccccg gaccctggtt gacgttcagc     15120
     gggttgccgc tgacgcagca gcattgaacg ccaccttgag cggtctggcc gcagacgtca     15180
     aaacgaacaa agaaggccac ttttcggctt ctgaccgcga tgatgtcgcg gcgttgttgc     15240
     actggctggg taacgggaat tttctgcttt tgggttacca gcgctgccgg gtgcattatg     15300
     gtctggtctc ctgcgacagg tcaaccggtc tcggcgtgct gcgtgcacgc acaggctccc     15360
     ggccccggct gaccgacgat aacgaattgc ttgtcctagc gcaagctgct gtcggcaact     15420
     atctgcgcta cggggcgtac ccgtatgcca tcgcggtccg tgaatacgac gatggtgggg     15480
     acggaggtat tattgaacac cgcttcgtcg ggcttttcac ggttgctgct atgaacgcag     15540
     atgtactgga gattccgtcg atctcacacc gggttcgtgc ggcgcttgcg atggccaaca     15600
     gcgaccccat ttatccgggc cagctattac ttgacgtcat ccagaccgtc ccgcgttcgg     15660
     agcttttcac gctgagcgct gaacggcttt tcacgatggc caaagaggtg gtagatctag     15720
     gatctgggcg gcgtgcgttg ttgttcctgc gcgctgatcg gctgcagtac ttcgtctcct     15780
     gcttggttta tgtacctcgc gatcgctaca ccacaggtgt gcggttgcaa atcgaggata     15840
     tcctcgttcg cgagttcggt ggtacgcaag tggaatttac tgcacgagtt agtgaatcac     15900
     cttgggcgtt aatgcatttt atggtccgtc tgtccgaagg tgctgcgacg ggctcagtcg     15960
     acgtctcgga aggcaatcgg atccggatcc aggcgatgtt gagtgaagct gcacggacat     16020
     ggtctgaccg gctaatcgca gccgcggctt cattctctga gggttccgtc tcgtatgctg     16080
     aagccgagca ttacgcggcg accttctccg aaacctacaa acaagctgtc actccggccg     16140
     atgcgattga ccacatagcc atcatcaaag agttggccga tgattcagtt aagttggtgt     16200
     tcttcgagcg gaaagcggat gggttcgccc agctgacttg gttcttgggt gggcgcagtg     16260
     cctccctgag ccagctgctc cccatgctgc aaagcatggg cgttgttgtg cttgaggagc     16320
     gaccgtttac cgtcgcccga accgatggct tgccggtatg gatctatcag ttcaagatct     16380
     cgccgcatcc gacaattccg ctggcgtcga cagcgaatga gcgggagctt accgcaaaac     16440
     gattctccga tgcagttacc gccatctggc agggccgcgt cgagatcgac cggttcaacg     16500
     agttagtgat gcgtgcaagg ctgacctggc agcaggtcgt gctactgcgt gcttacgcga     16560
     agtacttacg gcaggcgggt ttcaactata gtcagtccta catcgaatcg gtgctcaatg     16620
     agcacccttc caccgctcga tcactggtcg cattgttcga agcgctattc gatcccagcc     16680
     cgttgagttc gtcaacgaac tgtgatgcgc aagcggccgc tgcagctgtc gctgcggaca     16740
     tcgatgcctt ggtgagtttg gacaccgacc gcattcttcg tgcctttgca tctctggttc     16800
     aggccaccct aagaaccaat tactttgtga cacaaaaatt ttctgctcgc agcaaaggtg     16860
     tacttgtcct caaactcgat gcacagctga tcaacgagtt gccgctgccg cggcccaagt     16920
     tcgagatctt cgtgtattcg cctcgtgtcg agggcgtgca cctgaggttc ggtgcggtgg     16980
     cgcgtggtgg gctgcgttgg tccgaccgac tggatgattt ccgcacagaa attctgggtc     17040
     tggttaaggc acaggcggtc aagaacgccg ttatcgtgcc ggtcggcgcc aagggcgggt     17100
     tcgtgctcaa gcggccgccc ctgcccaccg gcgacgccgc tgccgaccgc gacgctatga     17160
     gagccgaagg tatcgcctgc taccagctgt ttatttctgg tctgctcgac attactgaca     17220
     acgtcgacca cgcgaccgga aaagttaacg cgccgcccca ggtggtgcga cgtgacagcg     17280
     acgacgccta cttggtggtg gccgcggaca aaggcactgc cacgttttcc gacatcgcca     17340
     acgatgtcgc caagtcctac ggattctggc tgggcgacgc cttcgcctcc ggtggatcgg     17400
     tgggctatga ccacaaggcc atgggcatta ctgccaaagg cgcgtgggaa gccgtcaaac     17460
     gacacttccg ggagatgggg gtggacactc aaaatgagga cttcaccgtt gtgggtatcg     17520
     gcgatatgag cggcgacgtg ttcggtaacg gcatgctgct tagcaagcac atcaggctga     17580
     tcgctgcttt cgatcatcgg catgttttcc ttgacccgga ccctgatgcc gcggtttcct     17640
     gggctgaacg gcagcggatg tttgatctgc cgcgatccag ctgggacgac tacaacaagt     17700
     cgttgatcag cgagggtggc ggtgtgtata gtcgcgagca gaaagccatc ccgaccagtc     17760
     cgcaggtccg cactgccctg ggcatcgatg gcgaagtaac cgaaatggca ccgcccaact     17820
     tgatccgggc aattctgcaa gcaccggtgg atttgctgtt taacggcggc attggcacct     17880
     acatcaaggc tgaaactgag tccgttgccg acgtcggcga tcgcgcgaac gatccggtgc     17940
     gagtcaacgc aaatcaggta cgcgccaagg ttatcggcga aggtggaaac ctcggggtta     18000
     ccgcgttggg ccgcgtcgag ttcgatctgt ccggtggacg gatcaatacc gacgcgatgg     18060
     acaactctgc cggcgtggac tgttccgacc atgaggttaa catcaagatc ctgatcgact     18120
     cactggtcac cgctggcaag gtgaaagtcg aggagcgtaa acacctgttg gagtcgatga     18180
     ccgacgaagt cgcacgtttg gtacttaccg acaacgagga ccagaatgac ttgatcggca     18240
     ccagtcgcgc caacgcagcc aacatgctct cagtgcacgc gatgcagatc aaatacctgg     18300
     tggacgaacg cggagtcaac cgcgaattgg aagcgctgcc ttcagagaag gagattcaaa     18360
     ggcggtccga ggccggcatc ggccttacct cgcccgagct ctcgacgctg atggcacacg     18420
     tcaagctggc acttaaagag cagatgttgg ccaccgaact gccggaccag gatgtcttcg     18480
     tgtcgaggtt gccacgatac ttccccaagc cgctgcgcga gcgtttcacc ccggaaatcc     18540
     gctcgcacca gctgcgccga gaaatcgtca ccaccatgct gatcaacgat ctggtagaca     18600
     ccgccggcat cagctacgct ttccggatcg ccgaggatat cggggtcggg ccgattgatg     18660
     cgatacgtac ctatgtcgcc accgacgcca tctttggtgt gggtgatgta ttgcggcgta     18720
     ttcgtgcggc aaatttgtcg gtcgtgctgt cggatcgaat gacgttggat acccgtcggc     18780
     tgatcgaccg cgccggacga tggctactta actatcgtcc gcaaccgtta gccgtcggcg     18840
     ccgagatcaa ccgatttgcc gcgaaggtta aggcgctaac gccgcggatg tcagagtggt     18900
     tgcgcggtga cgaccaggcc atcgtcgaac agcaagccac agaattcgtg tcgcagggtg     18960
     ctcccgaaga cttggcctac cgggttgcgg ttgggctata tcgctacagt ctgcttgaca     19020
     tcattgacat cgccgacatc accgagctcg acccggccga ggtcgcagac acttatttct     19080
     ccctgatgga ccggctgggc accgatggat tgctcacggc ggtgtccaaa ctaccccaaa     19140
     atgaccgctg gcattctttg gcgcgtttgg cgattcgtga cgacatctat gcttccctgc     19200
     gatcgttgtg tttcgatgtg cttgccgttg gagagcctga tgaaagcggt gaagagaaga     19260
     tcgccgagtg ggagcacatc agcgcttccc gggtggaaag ggcacgttta atgcttgccg     19320
     agatccacgc tagcggtgag aaggatctcg cgacgttgtc ggttgcggca cgtcagatcc     19380
     gccgcatgac ccgcaccagt ggacgtgggt cgtcggggtg agtgcggggt tcgttgtggc     19440
     cgtgctggtc cggtgatcgg acatcgacat gtatcattac atcaactacg cgactatggt     19500
     cacgattatc gaagaggcgc gtgtttcttt tcgcacaagc gttttggcgt cgacatcact     19560
     ttaactggtt cgctgatgtc cgggtcacta tacacaagcg tcagctgcga ctggaccgat     19620
     tcaccattac aggtgacgat atgagtgtag cggttgcggg cgatggactt caccactcgg     19680
     ttacgaggtg cgttcggtca atgctgatcc cgagtccaaa cctgttgtta ttgttcgagt     19740
     cgcagctggc tgcgtattca catcgacgag cagcggttgg tgtgactctt gccgcatcgt     19800
     cgggagtatt gggtatctgc aacagtggca acgataactg agtggggact gtggttgggc     19860
     ccagacatta gggccgccca tcgagtggat ctggccacgt tcgtcgatcg tgtggtgcgg     19920
     ctcgaccagg ctgcggttat caaacttcgg gtacactgtg ccggattgcg gacggtatgg     19980
     atggcaaccg gtttcgactt gctagccgcg cctagtggta gtttgccaat gtgcagacca     20040
     gcgatttgtt tttcggcgca aacgcatgag tgtgcagcct gtccgcgatg gatgcctcgt     20100
     gctatattag tctggttttc tcgatggact cagcctggcg aggtgtgttg tctccagaat     20160
     ctggcttcgc caacatccat gacgtaccgg ccctggtgat gctggatttg gcgtagtgtg     20220
     gcgcgcggct cgctaaggag cacagcagtt accgtggttc gccggttttc ctgcttggtc     20280
     aggggaggtc attcaggtca gttctagaga cgtcagtgtg ggggccatct atgcgctgtg     20340
     tgttcgcttt gatcgctcgc aatggttttt ctgccgcagt caccggatgc ggtcgacatc     20400
     gtggccatcc ggctataggc actgcgttag tatgctgaaa ctaaccaggc tgcggtatct     20460
     ggcagtaatt ggtcacccat cagtggtgtg ttggcaagta tgcattctcc tgctggcagc     20520
     gccagcgtac ggtcgctggc gttaaatgtg tatactagtc tgccgccgca ccggccggaa     20580
     tatccgcgaa tcgcgtagcg cggtcagccg ctctatcttg ctaccatcaa attcggctca     20640
     ctccttgtgt gatttcaacg tttggctaaa gaacgacaac gttgagttgg cgtccggtgc     20700
     gctatttttc taaggtcagc gtcacccaag ccgtaggcat tggcagccag gtgtcaaaca     20760
     gacggggata aaccgatatg cggagcaaca ccttaccagg gaattggaac ccaataccgg     20820
     gcgcggccaa ggtcggtatg tcctgagcgt tcccaggtcg ggtcctgcaa tgccgcgtca     20880
     tgcaggtcca cgtcgggcaa ctccagtttc tgatcattat agatgaacac cgcgctcggt     20940
     aggacaagca tcaccatcgc catgacccgc gctcggagca gtccgttatt gccgccccca     21000
     tagcgggtga cgtcccgacc tacgtcatga tggccagtgt ccaggttggg atcgcgtttt     21060
     cgatctacgt ggccgtacgt gagttctgca ccgcgtcgtg aatggcggca gcgtcgaagt     21120
     tgatgatctg cgccaggagg aaattgaagc tgaggtgtag ttcgtcagac cgcaggtaat     21180
     tcggtccacc gggcattgtc ggtgacttaa tacttcgccg atgatcgccg cgccaggata     21240
     gtcgtggacg accgtgccaa tatcgcggtt aatcgcttgc acacttgggt ggttgaagcg     21300
     ccggttgtcg tcggtatggt gcaacacccc tggtctcgtt tacctgtgag ttcggcaggc     21360
     cgggcggctt ggccatcccg tgcggccaca tcgatgtgga agccatccac gccgtgtccc     21420
     agccagaagc gcagcgtttt ctcgaagtcg tcgaagacat ctcaggtggt cccagttcaa     21480
     atcgggctgc tcggtgtcga agaggtgcag ataccacctg gccgggggtg ctgtccggtt     21540
     cgtccacccg tttccaggcc ggcctgccga acaacgggtc gatgtcacga ggatcggcga     21600
     cgtcgtaact atggtcggct attggtgaga cagtgacgga gtgggtccaa atcgctttga     21660
     taccgagcaa ttccaggtgc ctctgccggg ttgtgaaacc gtccaggctg cccaccccgt     21720
     agccgttgct gtcagcgaac gactgcggat agaactggta gaacaccgcg gtcgaccacc     21780
     acggctgggt ctcgtggccc gacatcaaga cgcggagttc acgagggagt gtgcggccat     21840
     ctccaaatag tcgagcaact cccgtcgatg ctcgttgtcg agggtatgcg agtcgatgga     21900
     ggcgacggcg gtgtgcatgc accgcagcca agcgtcgcgc tcgatggcag tgatccggaa     21960
     aggagcgtga cgcatacgca accgcggatg gccgcgttga ctggagtatg ttcgcggacc     22020
     accccagtat tgctcgagga acatgcgcaa tcgttcttcg gcgccggcca ggtcgtctgc     22080
     gggatacaac tcacgcagta tctcatcttc aggaacctgc gcataaaagc gtgacacaat     22140
     cgctttgaag gtctcggccc cgccgatagc gtcgtagaaa gattgctgca cctgatccat     22200
     ctcttccatt gtggtgtatc gcctgggtga taaccggagg tgggcttaag ccaataggtg     22260
     gttcctggtg ttcaccagat cgacatggca tacactcagc gattggtgtg cggtcagcgg     22320
     cgaatcatgg tggactgttt cgcggaggac ttagatttaa tggtgcagcg caagaatcgc     22380
     cgcagccatc gcagttccgg ggccgcggca aacctgatca gggctgctaa cccctcatcc     22440
     ctgcacaatg ttgatatcca cccgtccacc tgctatgaga gcggatcaat ttggaatcgc     22500
     cggcgggtgt tgctgctgaa ttccacctac gagccgctca ccgcattgcc gacgcggcgg     22560
     gcgatcatta tggtgatctg cggcaaggcc gacgtcgtgc acgtggatcc ggccgggccg     22620
     gtcgtccatt cggcaaccag gtcgatcacg gtgccgtcgg tgatccaact gcggtcctat     22680
     gttcgggttc cgtatcgagc ccgtgtcccg atgacccgcg ctgcgctaat gcatcgggac     22740
     cgcttttgct gtgcctattg cggagccaag gccgataccg tcgatcacgt ggtgccacgt     22800
     agccgcggtg gtgaccactc ctgggagaac tgtgttgcgt gttgttcgac atgtaaccac     22860
     cgcaagggtg acaagctgct taccgaactt ggctgggtac tgcgccggac gccggtgctg     22920
     ccgacaggtc aacactggcg gctattgtcg acggtcaaag aactggatcc ggcgtgggct     22980
     cgatatcttg gtgggggcgc cgcttaagct ggatttcccg gcatccgcag tggtccgtct     23040
     tagctctagg tggcgcgcgt gcgaccttaa acgtgttgag tttacactca ccgtgctggc     23100
     caaatctgct tccgaagggg ccagcggctt ccgcacgtaa gtaagtaccg ctcgggctca     23160
     cattgccgcc gggctgggtc taaaatttgc tttcacatgc gccggtcgcg ctagtggtgg     23220
     gataggtttg cttcgtgagc atgctgcaaa tccatctttg tttcgttggt atcccattgt     23280
     tgctggtggt catgctggca gtgcccatcc tgtcccgtaa agggccctat cccgcgacct     23340
     atagattggg cgagaggtgg acgcatccgc ccatcctatg ggctgccatc gacgaagttg     23400
     tcagtgacgg gcacgggggc tatggggcat cagagttcac agtcggaggt ggcgccagtg     23460
     gtacgtggtg atgttgtaac tgagctaccc aaggggtggg tgatcaccac cagcggacgg     23520
     gtctccgggg ttaccgagcc cggggaccgg tcagtgcact acccatttcc aatcaaagac     23580
     ctcgtcgccc tggacgacgc gctaacatac agctctcggg catctcacgc ccggttcgct     23640
     gtttacctcg gtgacctggg caatgataca gcagcactag ctcgcgaaat cctagcccag     23700
     gtgccgacgc cggacgacgc ggtgctagtc gccgtttcac cgaaccagtg cgccatcgag     23760
     gtggtttacg gctcgcaggt ccggggtcgt ggcgccgagt cggcagcacc actgggtgtt     23820
     gccgcggctt cttcggcctt cgagcaaggg aacctgatcg atgggctgat cagcgcggtc     23880
     cgcgtgctca gtgcggggat ttcccggtct tagggtggtg gtgttgtggt gggctggtgg     23940
     ggtgtgggtg gttgatctgc gctagaaggt tgccgtatgt gccgccgacg gccggatcat     24000
     cgatgcactg ttcactaaca cgatactgct gcacccggcg gcatgcctgc ttgctggctg     24060
     agggccggtc aacaagccgc cgacaccgat accagcggca gaaatggtgc aagggataac     24120
     atcaggtgcg aatagttata ccgtgcacgg ggacgtgcct gttcaggtgc ggccacattg     24180
     accacgccga cacctcaagg ccatgacatg cacactaaaa cgcccctacc ccgcggcgct     24240
     aacaactatc ctcacactca cgcctgcatc gatatcgcct tcagcacagc ccaggtgccc     24300
     agcccctggc atcatcaaca tgtagaccaa gcagcatcca ccaccgacat gcttacgtgc     24360
     gccgcgctaa tcgtgtccac tgcggcaaag cacacgaagc ctcatcgaaa gcaggcagtc     24420
     agccacccac caacaaaaac cccgcaacac agcaaaacac gtcaacagga ttccgggctg     24480
     ccccccacga ctaccgaaca tatcggccag tcataaaacg caaaaacagc catttcaccc     24540
     accacctatg aaaccaaccg aacaactcac cgccacagac acaagacagc agagcccggg     24600
     aaccgcacga tttttgacgc cactcaggcc aagttggcaa ccacgtggta gggccttttt     24660
     agaccaggct gcgaaacctg ccgcaaatgt tgccgcggct tgacgggttc atgcggtttg     24720
     atgccgttga tggctggcct cgattgccgg cctggattac ggcggcttta tgtcggctct     24780
     ggtgctcgtg ccgtacagcc gacggcgatg ctgaattccg atgtcagaac acctttactt     24840
     tatctagctg ccattgcttt gctgtcgcac agcatgctcg acggttggcg agctgtgggg     24900
     ccactcgcag cggagttggc cagcaccgaa cgggtcgatc gctgtgtcct ggcggttact     24960
     cggctgttga tcgggttgtt atgaggcagc gtttggggtg cacgtcgact atgtcgacac     25020
     cgaccagcag tggctagcag ccgccgagaa attcggagtt atgtcacacg atcgggtaaa     25080
     gttgggcaag tcttgggatc ggcaccccgc taccttgtat attgccgcag atttcgcgac     25140
     gttacgcgcg acgtggcgga cggcatctgt actgatagcg ggatctacta ccaacccgtg     25200
     ttagatgatg ccgttattct tgatgtacac tcgtgggtac ggtttgttcc tgggcacgtc     25260
     gatgcgcgca ccgtcattcc gcgcattttg gacatgcttg ccgaggtatg agctttgttg     25320
     ccggccgttg agcacgtggt tgcataggaa gatgccccat gagcttggct aacgatggac     25380
     gagcaagact gtcttcgcca gggggtaggc ttgcaccggc tcaggcaatg tcgccgacga     25440
     cgaccccggt ccggcacagc tgccgcgcac aatccatggc aactgctgaa tgtctgggcc     25500
     gtcggagctg gcggacataa tctgggtgtt ggtgtgatat gcatgttgcg gcgggtgaag     25560
     tgcacgttgg cttcgttggc ggctagaacg tttttgatcc agtcagtttt tcccgtgacc     25620
     cagcgtgatt gccaacacat tgacttcgca gtaggcggtc acgatgctct ggtatggcat     25680
     gccggatttg cgaccatggt gagtgatggt tctcggcagg taaggtgcga ttggtgttga     25740
     gcgccgggtt catggatttg atctgaaggt gctcgagcca gggcgggaag atcatcggga     25800
     cgcctgggga ttatttgggg tgatccttgg ctgacatggc cgttccttga gcggccatgt     25860
     cagccaagtg gagacaaacg cggtgcaggc ttgttagtgg cactgtaccg gccggctatc     25920
     gactcgagct gctctgggac aatggtcgtt gtgagcatca cgccaccatt cctgctagcc     25980
     tgcattgatc tgcggggagc tgagttgacg gctacccagt tgcgggccac gctgccgcgc     26040
     ggtggcgccg acgtggaggc tgtgctgcct acggtgtggc cgatcgtgca ggccgtcgcc     26100
     gagtgcgggg ccgatgcggc tttggaattc ggggcgttat tcgacggcgt gcggcccccg     26160
     actgtccggg tgcccgacgc cgcgctggac gccgcgctgg caggattgga ccccgatgtc     26220
     cgcgacgcgt tgcaggtgat gatcgagcgg acccgcgttg tgcatgccga ccagcgccgc     26280
     accgacgtca ctaccgcgct cggcccgggt gcgacggtca ccgaacggtg ggttcccgtt     26340
     gagcgggtgg gcctttatgt acccggcggc aatgcggtgt acccgtccag cgtggtgatg     26400
     aacgtggtgc cggcacagac cgcgggtgtc gattccctgg tggtggccag ccctccgcag     26460
     ttcacttccg gtggccggtt tcacggactg ccgcatccga cgattctggc cgcggcacgg     26520
     ctactgggag ttgacgaagt ctgggccgtc ggcggggcac aggcagtggc gttgcttgct     26580
     tacggaggca ccgatagtga cgattgcgag ctggcaccgg tcgacatgat caccgggcca     26640
     ggcaatatct acgtcactgc cgccaagcgg ttatgtcgct ctcgggtggg tatcgacggc     26700
     gaggccggtc cgaccgagat tgccatcctc gccgaccata ccgcagatcc agcacatgtg     26760
     gctgccgaca tgatcagcca ggccgaacat gacgaaatgg ctgccagtgt attggtgacc     26820
     ccaagtacgg atctggctga tgccaccgac gccgaattgg ctgcccagct gcggaccacg     26880
     gtgcatcgga aacgggtggt ggcagcacta ggaggacgcc agtcagctat cattttggtc     26940
     gacgatctgg aggctggtgt caaagttgtc aatctctacg cagctgaaca tctggagatc     27000
     cagactgctg aagcttcacg agtcgcgagc agaatacgtt gtgcaggagc tattttcgtt     27060
     ggcccgtggg caccggtcag cctcggtgac tactgcgctg ggtccaacca cgtattgccg     27120
     accgcgggct ttgcccggca ttccggcggc ctgtcagtgc agaccttcct gcgtggcatc     27180
     catgttgtca actacaccaa gacggctctc aaagacatat ccggacacgt tatcacgttg     27240
     gccaaagccg aagacctgcc ggcgcatggg gaggcggtac ggcggagatt cgcgcgatga     27300
     atgtacctga gcccacactc gacgatctgc cgttgcgcga taatcttcgc ggcaaatcac     27360
     cttatggtgc aatgcaattg ctggttccag tgctactgaa caccaacgag aacccgcacc     27420
     cacccaccaa ggcgttggtc gacgacgtgg tgcggtcagt gcaaaaggtg gcagtcgact     27480
     tgcatcgcta ccctgatcgc gatgcggtgg cactgcgtca ggacttggcc tcctacctca     27540
     ccgcgcagac cggtatccgg cttggtgtcg aaaacatttg ggctgccaat ggctccaacg     27600
     agattctgca gcagcttctg caggcgttcg gtggtccggg gcgcagtgcg atcggctttg     27660
     tcccgtccta ttcgatgcac cctataatcg ctgacggcac tcacaccgag tggctagaga     27720
     ccgttcgtgc tgacgatttc agcctcgatg tcgaggccgc tgtcaccgcc gtcgctgacc     27780
     gcaagcccga tgtagtgttt atagccagcc ccaataaccc ttctgggcag agtatttcgt     27840
     tggctgattt acggaggcta ctcgacgtgg tgccggggat cttgattgtc gacgaggctt     27900
     atggcgaatt ttcctcgcgt cccagcgcag tggcgctggt aggtgagtat ccgaccaaaa     27960
     tcgttgttac ccgcaccacg agtaaggcat ttgccttcgc cggcggcagg cttggatatt     28020
     tgatcgcgac gcctgcgctg gtggaagcga tgttgctggt gcggttgccg tatcacctat     28080
     catcggtcac tcaggcggcg gcccgagcgg cactccggca cgctgatgat accctaggta     28140
     gtgtcgcggc gttgatcgcc gaacgagaaa gggtgacaaa atctttggtc cacatgggat     28200
     ttcgtgtaat ccctagcgac gcaaacttcg tgctattcgg acacttctcc gacgcggctg     28260
     gcgcttggca gcattatttg gacaccggtg tgctgatccg cgatgtcggt attcccggtt     28320
     acctgcgcgc taccacgggt ttggccgaag agaacgacgc gttccttaaa gccagctccg     28380
     agatagctgc caccgaattg gccccagcca ccacactagg agcctcgtga ccaacaccga     28440
     agtagggaaa acaacgcgtc gtgcgcgtat tgaacgccgc accagcgaat ctgatatcgt     28500
     cgttgaactc gacctggacg gcaccgggca ggttcacatc gataccggtg tctcgttcta     28560
     tgaccatatg ctgactgcgc tgggcagcca cgctagtttt gatctgaccg tgtgtaccaa     28620
     aggcgacgtg gagatcgaag cccatcacac catcgaggac accgcgattg cgctgggtca     28680
     ggcgtttggg caggcgctgg gaaacaagaa gggcatccgc aggttcggag acgccttcat     28740
     cccgatggac gagaccctgg tacacgcggt agtcgatgta tcaggtcgtc cttattgtgt     28800
     gcacaccggg gaacctgatc acctgcagca caacattatt tccggaagtt cggtgcccta     28860
     ctccaccgtc atcaatcggc acgtgttcga gtcgctagcg gccaatgccc gtatcgcgtt     28920
     gcatgtgcgc gtcttgtacg ggcgcgatcc gcatcacatc actgaagcgc aatacaaagc     28980
     cgtcgcgcgc gcgttaagtg aggcggtcaa attcgaccct cggttttcgg gcgtgccgtc     29040
     caccaaaggt gtcctgtgag aagtaaatcg gttgtagttc tggattacgg ttcgggtaat     29100
     ttgtggtcgg tgcagcgcgc gctgcagcgg gttggtgccg cggtcgaagt tacggctgac     29160
     tcagctgcag gcgccgccgc tgacgggttg ctggtgcccg gtgtcggcgc gttcgaggcg     29220
     tgcatggcag gtctgcggaa aatcgcgggt gaacggacca tcgccgagcg gatcgtcgcg     29280
     ggacggccgg tattaggggt ctgtgtcggc atgcagattt tgttcgcccg cggtgtcgaa     29340
     ttcggtgtgg aaacaacagg ctgtaggcag tggccgggcg ttgtgacccg gctcgacgcc     29400
     ccggtggttc cccatatggg ctggaacgta gtcgattccg cttctggaag tgcgttgttc     29460
     aagggcctag atgctggagt gcggttctat ttcgtgcatt cttatgccgc gcagcgttgg     29520
     gaaggctcat ccaaggcgct gctgacatgg gcaactcacc aagtaccgtt tttggccgcg     29580
     gtggaagaag gaccactggt tgccacccag ttccatccgg agaagagtgg ggacgctgga     29640
     gcaaccttgc tgagcaattg gcttggggaa ctttaaagga tgctgttgat gtcgctgata     29700
     cttttaccgg ctgtcgacgt ggtcgagggg cgcgccgtgc gcctggttca aggaaaggcc     29760
     ggtagcgaaa acgattacgg ttcagcattg gacgccgcgt tgtgctggca acgcgacggc     29820
     gccgactgga ttcacctagt ggatctggac gcagcgtttg gtcgcggttc caaccgcgaa     29880
     ctgctttctg agatggtggg caagcttgat gtgcaggtcg agctatccgg aggtattcgt     29940
     gacgacgact ctttgaatgc tgcgctagct accggctgtg cacgggtcaa cctaggtacg     30000
     gccgcatgtg agaacccgca ctggtgcgca caggtgattg ccgagcacgg tgacaagatc     30060
     gccgtcggtt tggacgtcca gatcgtcgac ggccagcatc ggctgcgtgg acgcggctgg     30120
     gaaaccgacg gaggagacct gtgggatgtg ttagaaaatc ttgatagaca aggatgttca     30180
     cggttcattg ttaccgatgt cactaaggac ggtacgttgg acggtcccaa ccttgacctg     30240
     ctggcaagtg tctccgatcg caccaacgtc ccggtgatcg cgtccggtgg cgtttccagt     30300
     ctggatgact tgcgtgctat cgctaaattc accgagcgcg ggatcgaggg tgctattgtg     30360
     ggcaaagccc tctatgccga acggttcacc ttgccgcagg cactggccgt ggtgcggatg     30420
     tagccccaca atggatctgt acgcgttgat ggccgaggag tcggcaatcc ttgatgctgc     30480
     gagttcgttc ctggctggcc atcgcaccga tccggcggcc ggcaagaagt gcgacgattt     30540
     tgctaccgaa ggtcaacatt gccatcgaac ggcggggcgt cgctgtgtcg atagaaatca     30600
     cggagatcgg cgtgcacggc gaggagttag agggtccgga cctggaccca ccttgagtgt     30660
     gggtattgga cccgatcgac ggcacatgca actatgcggc gggatcgccg atgaccgtta     30720
     tgttgttagc tctgctgcat tacgacgact cagtggcggg cttgacctgg atgccgttca     30780
     tcgtcgggtg ctacacggcc gtcgcgggtg ggtcgatgat gaaaaatggt gtttcacaac     30840
     catcgctggc tacccgagag tgtagcaagg tgctgttcgg tgtcgttacg ttcaatacgg     30900
     gctcacgggg ctgggtcccc ggacgctgtc ggttggctgt ggtggaaaat ctcagaaggt     30960
     cctgtttgcg gttgcacacg tatggcttca ccggtactga tcttgcctat gttgccaacg     31020
     gaattcttgg tgaggcagta agtgtcggtt gctacgtctg gtatcacgct gccggcgtcg     31080
     tgttggttca ggctgttggt ggtattgtca ccgatctggc tggtgaattg tggacaacca     31140
     cggcgtcgtg tgtggtaaat gcggcacccg gcgtgcatag ccagatcttc gagatccttt     31200
     gcggcatgag cgaaccggag gattaccaag atgtactcgg gtaacggtct tgcggtacga     31260
     gtgattcctt gcctggacgt ctattgtggg cgtgtggtta aaggggtcaa tttcaagaac     31320
     ctccgggatg cgggtgatct cgtggagttg gctgccgcct acgatgctga aggcgccgac     31380
     gagctggctt ttcttgacgt gactgcgtcg tcgtcgggca gggccaccat gctggaggtg     31440
     gtgcgctgca ccgccgagca ggtgttcatc ccgttgatgg tcggcggtgg ggtgcgtacg     31500
     gtcgctgacg tcgacgtgtt gctgcgtgcg ggagctgaca aggttgcggt caacaccgca     31560
     gcaattgctc gtcccgagct attggccgac atggcagggc aattcggttc tcaatgcatc     31620
     gtgttatctg tcgacgcgcg cacggtaccc accggatcag cgcgaacacc gtccgggtgg     31680
     gaggccacta ctcatggcgg ttaccgcggc accggtattg acgctgttga gtgggcggcc     31740
     cgtggggccg accttggggt gggagaaatc ctgctgaact cgatggacgc cgacggcact     31800
     aaagccgggt tcgacttggc gatgttgcgg gctgttcgtg ccgcggtcac ggtgccggtc     31860
     atagctagcg gcggcgccgg ggcaatcgaa cacttcgtgc cagcggttac tgccggtgct     31920
     gatgcggttt tggcggccag tgtctttcat ttccgagagc taacgatcgg gcaggtgaaa     31980
     gacgctatgg ccgcagcagg gatcgcggtg cgatgacact cgacccagac attgctgtac     32040
     gcctcaagcg caacgccgaa ggtttattca ccgccgtcgt acaggagcgc tccagcggag     32100
     acgtgctaat ggtcgcttgg atggacgacc aagcactggc tcgcaccctg gaaacccgtg     32160
     aggcgaatta ttattcgcga tcccgggccg agcagtggat caagggatcg acctccggca     32220
     acacgcagca tgttcactcg gtgcgcctgg attgcgacgg cgacaccgtg ctgttgacgg     32280
     tcgaccaggt cggtggagcc tgccataccg gcgctcacag ttgctttgat tccgcaatgt     32340
     tattagcccc tcaggactag tcggcgcctc acccatgcct ctagcaaggc gcggtatctc     32400
     aatccgttgt ccgcggctat cgccgcgtgg gtccgttcgt gtcggaaaaa cggctggctg     32460
     tattctatag cgtcgaagaa tcttgaaatt caatactttt ccaagcgtta cagccgaggg     32520
     tactcggaag gagaaggatg attcgtctgt ggcgatagcc agaatggagt tggtgtctgg     32580
     taaagccgtt ggcttgtgtg gtgtcctagg tccacttgac tggcttgaag ccgccaaaat     32640
     ttttggcata ctctctccat aagcgggtgc cgcgtccgtt gtagagtttg gttattgctc     32700
     tgtctacggt caggtgtctg accggattga gtacatctga ttgatcggtg acttggctgg     32760
     ttaacttaat cgccggcgtt gattgaggag tgttgcagtg tcgctgttcg gcggagtcac     32820
     cgcagccgga gggttgccga cggcctgttg atgggcgcgg gcgccgctca tccggttacc     32880
     tacgaccgca gtcaccgcaa tttacgtggc agtggccacc accgcaatca gtatcgtcgg     32940
     ttccagccgg ctagcgcgca tttttcgtcg gctaccgcag taggcttcgc gatgaccgat     33000
     gtcggttcga cgatgatttt cggtggcgtc cacctcgccg gccggggtgg ttagcggttg     33060
     gccggccagt gcggacgtga cgtccgcgca cgtcggataa catgtgtcgg gttttgggcc     33120
     agcactttag gtaatgccca ctttggtagt gacggcgttc aagttatgct cattcgggtc     33180
     gccgctcact gggtagcgga ggtggtgcca acaggtgctg gctcataatc atcgccgggt     33240
     tggagtgctg gaacgtggcc gactcggtca gtaggttaga tgccgtgcaa ccgcgtgtat     33300
     gttcagttta gttcttgtgc cgagattttc actctaaaat agttgctcgg acgcgcagta     33360
     ggcgcgatgc tgcccactaa aaacatgttt gtcgcggtta ggccgctaat cttgaccagc     33420
     tcgtggacga taccgaaatc ggccgatcag ggtacctgcc gtagccgact agcatcgccc     33480
     agcgcgatca tagttggccg gcgtgacgtt acaatgaaga aatccgcgtg catgagtgta     33540
     atcaagcgct tcaccaacga ctgaaattac ttcgactacg tttgctcgca gcattccgga     33600
     tgagtattgg ctacgtaaca attttgcggc atcggttcct tccacacaag tccattgaga     33660
     tccacagttg ttcctggtgc ttgccgtgct cgtggatccc aacgatgtgc tcatcgtaca     33720
     cgctcgcggt catgtcggct tcctggttga agcgttgatt aaattagagg ttcctgttca     33780
     ggtcgctggg caagattttg acggcatcgt ggcgtgttag cctcgaatcc tgcgcctaga     33840
     tacaccagac acatatttca ccggcccggg tgcgaaccgc ggtcttgcga agctctatcc     33900
     ggtgaacacc tgtcctctgt ctgttgggcc gtcgctttgc ggcatgtgaa catgctattg     33960
     agggctatgg gcggcactcc agggcaatcg cgggagagcg gcaggtaatc gcggtagagc     34020
     ggcgctggtc atagtggttc cgtgttcgcc ccgctgggga gccggcagat gactttggtg     34080
     agaatggcgg gcgatgttga aacggatagc ctggaccgca ctggtacctc tgtttgctct     34140
     ggccgtgctg gcattgacct ggggaagaga gatcggtcca gtcgtgaccg cgttacaggc     34200
     ggcgttgctg accggcgctg tgctggccgc ggtccatcat gcggaggttg tcgcgcaccg     34260
     ggttggtgag ccgtttgggt cgctggtgct cgctgcagcg gtgacggtca tcgaggtggc     34320
     tctcatcgtc acgctcatgg cctctggtga gaacgaatca tggaccctgg ccagggatac     34380
     cgcgttcgcc gcactaatga tcaccactaa cgggattgcg ggtttttcgc tgctactggg     34440
     ttcccgacgc tacggtgtga cgttgtttaa cgcccacggc agcggggccg cgctggccac     34500
     gctcaccacg ttggcgacgt taagcctggt gctgcccact ttcactacta gtcaccgtgg     34560
     caatgagttc tcacctggcc agctggcctt cgctgccgtc gcttcgctag gactttatct     34620
     gctttttgta ttcacacaaa ctatccggca tcgcgacttt tttttgcctg tcgcacagaa     34680
     gggccaaaaa ggcctgttcg aagaggatga aagccatgcc gatccgccca gcgcacggtc     34740
     tgcgctgatc agcctcgcat tgctgcttgt tgcgctgatc gcggtggtgg gtctggccga     34800
     actgcagtcg tcggcaatcg agcacttggt gacagcagtt ggcttccccc agccattcgt     34860
     tggtgtagtg atcgccacgc tggtgctgtt accagaaaca cttgcggcgg tgcgtgcggc     34920
     gcgtcggggt cgcatccaga ccagcctgaa cctggcctac ggctcggcta tggccagcat     34980
     tgggctgact attccggcca tcgcgctggc ttccatctgg cttaccggac cgttgatact     35040
     cggcctgggc gcgacccagt tggtgttgtt ggctctgacc gtcgtaatca gcgtgctgac     35100
     cgtggtgcct gggcgggcta ctcgcctcca gggtgaggtg catctagtat tgcttgccgc     35160
     tttcgtgttt ttggcgatca tcccgtaaaa aggcagttac tatcgacgag tttgaataat     35220
     taatcggtcg atctggcccg cagcgtctct agcgccttgt cggcatgggt gtccatgctt     35280
     aattcgctag agatcacatt gagcaacttt cggtcggtgt cgacgacaaa ggttatgcgt     35340
     ttaaccggaa ttagcttgtc aagcagaccg cgcttaacgc cgaatattgg gtggccaccc     35400
     tgctgttggt gtccgacaac aggggggaaa gaattctgga tgttggcgga cttgacttgc     35460
     ttttgaaaca atatcggtgc ttatgtcgac cctgccagcc ccgaccgcga cgaattcttt     35520
     cgccaagtcg tggcaatagc acgtttcctt ggtgtagcca ggtgtcacgc cgccggataa     35580
     aagaacggaa ctacgggccc gccggcaagc agcgcactga gtttatgcag tgttccggtt     35640
     tgatcgggaa gtgcgaagtc agtcgcggtg tcatcggtct tcatggctgc cacactcaat     35700
     cgcccgcgct tacgtctgca catcggcacg ccatgtagaa cggtagttta ggggcgttcg     35760
     gacgtgataa gcccatctgg aaggatgaat cggtgcacgc ccacctcgcc gccaccacgt     35820
     cacgtgagga ctttcgccag ctagctgtcg atcatcgggt ggttccagtg acccgcaaag     35880
     tcttggccga tagcgaaacg ccgctatcgg cataccggaa gctcgccgcc aatcgtccga     35940
     gcaccttcct gctagagtcg gccgagaacg gcagatcctg gtcgcagtgg tcgttcattg     36000
     gcgtgggtgc cccatcggct ttgacaattc gtgacggcga agcggtgtgg ctgggcactg     36060
     taccgcagga cgcgcccact ggcggtgacc ccctgcatgt cctgcaggcc acgctcgagc     36120
     tacttgccac cgcggcaatg cccgggttgc caccgctatc gagtggtatg gtgggcttct     36180
     tcgcctacga catggtgcgg cgcctggagc gcctgcctga actggcctta aatgatttgc     36240
     agttgcccga tatgctgctg ctgttggcca ccgatgtagc ggccgttgac caccacgagg     36300
     gcacgatcac cctgattgcc aacgctgtga actggaacgg taccgatgag cgggtagatc     36360
     aggcctacga cgatgcgatc gcgcgcttgg acgtaatgac cgcagcactg ggccagccgc     36420
     tgccatcgac cattgccacg ttcagccggc ccgaccctcg gcgccgagcg caatgcacca     36480
     tcgaggaata cggtgcgatc gtcgaccacc tcgtcgacca gatcgcggcc ggtgaggcct     36540
     ttcaagtggt gccctcgcag cgcttcgagg tggataccga tgtcgatccg atcgatgtct     36600
     atcgcatgct gcgggtcacc aaccctagcc cttacatgta tctgctgcat gtgcctaata     36660
     gcgatggagc aactggcttt tcgatcgttg gatccagccc ggaggcgttg gtgaccgtca     36720
     aggacggtcg ggtgacgacg catccgatcg ctggaactcg ctggcgaggc cagaccgaag     36780
     aagaggatca gctactggaa aaagagttgc tcgccgacga gaaggaacga gcagagcact     36840
     taatgcttgt cgatctcggt cgcaacgacc ttggtcgggt ctgtacgcca ggcaccgtac     36900
     gtgtcgagga ttacagccac gttgaacgtt acagtcacgt aatgcacatg gtgtctacgg     36960
     tgaccgggct gctcggtgaa ggccgcaccg ccctggacgc ggtgaccgcc tgcttccctg     37020
     ccggcacgct gtcgggtgcc ccgaaggtgc ggtccatgga gctaatcgag gaagtggaga     37080
     agacgcgccg cggcctttat ggtggcgtgg tcggttacct cgacttcgct ggtaacgctg     37140
     acttcgcgat agcaatccgt acagcgctga tgcgtgacgg catcgcgtat gtccaagccg     37200
     gcggtggggt agtggccgac tccaacgggc cctacgaata catcgaggcg agtaacaagg     37260
     ctcgagcggt gttgaacgcg atcgccgccg ccgagacgtt gacctctttg gacttcggtg     37320
     ttgccctcgc ccctggccgc gtggcggcca ggggcgaggc gggcaatcaa gggaggctgt     37380
     gatggctcct gatatcaaga gcgcccgggc cggccggctg acgattcaaa tagcgcagtt     37440
     gttgctggtg gttgctgctg gcgcattgtg gatggcggcc cggctgccgt gggtcgttat     37500
     tcggtcgttc gacgggttgg gtcctccaaa ggaggtggct ctttccggag cgtcatggtc     37560
     ggcggtgctg ctgccgttag cgctgctcat gctggccgcg actgtcgcag cgatcgcagt     37620
     gcgcggttgg ccgttgcggg tgttggcagg gctgctggca gtggctagct tcctggtcgg     37680
     ctatttgggt gtcagcctgt gggtgctgcc ggatgtaacg gtgcgcggag ctgtcctggc     37740
     gcacgtctcg ttgttgtcgc tggtaggtag ccaacggcac catctgggcg caggggctgc     37800
     ggtagcagct tctgggtgca ccctcattgc tgctgtttta ttgatgcggt cggcttccgt     37860
     catcggatcc gcccgccagg gcacatcgaa atacgtggtt ccagcgcaac gtcgttcgat     37920
     cgcgcggcgc gacggtgccg cgacggcgat ctcgcagatg tcagagcgga tgatctggga     37980
     tgcgcttgac gaagaccgag acccgacgga ccgactccgc gagccggaca ccgaggggcg     38040
     gtggtggacc gcgtgtcgac ggtcgctacc cttcatgaac gtcgtcgaaa tcggcgggtg     38100
     taccggttcg gtagccggtc ggtgggtgac aagcgggaaa ggaaacgaca cgcatgtgtc     38160
     cggcaactgt gcttgactcc attctaaagg gagtccgggc cgacgttgcc gcacgcgaag     38220
     cctgcatcag cctgtccgaa atcaaagccg ccgccgcggc cgcaccagca ccgctggacg     38280
     ctatggccgc tttacgtgag cccggtattg gtgttatcgc cgaggtcaag cgcgccagtc     38340
     catcggtggg ctcattggcc accatcgctg atccggcaaa gctggcccag gcctacgagg     38400
     acggcggtgc tcggatcatc agcgttttga ccgaggaacg tcgctttaac ggttcactcg     38460
     acgatctcga tgcagtgcgc gccgcggttt cagtaccggt gttgcgcaaa gactttgtgg     38520
     tgcagccata tcagatccat gaggcgcgcg cgcacggtgc cgatatgttg ttgttgatcg     38580
     ttgctgcgtt ggaccaatcg gcgttgatgt ctatgctgga ccgcaccgaa tcacttggta     38640
     tgattgctct cgttgaagtc cgcacggaac aggaagcgga ccgggcccta aaggcagggg     38700
     ccaaggtgat aggtgtaaac gctcgcgatt taatgacgtt ggaagtggac cgggattgct     38760
     tctcgcgaat tgctcctggg ttgccgagta acgtgatcag gatcgccgaa tccggtgtac     38820
     gtggccccgc ggatctgttg gcttacgctg gtgcgggagc ggacgccgtg ttggtcggcg     38880
     aaggcctggt gaaaagtggt gacccgcggg cagcggttgc cgatctggtc accgcgggta     38940
     cccacccgtc atgcccaaaa ccggctcgct agtcattcga caagatgcag tggtgaaaca     39000
     ttgaacgcta tcagccgact gcccgttgag ttccggtgat gccgaatcta tcctgtttta     39060
     gcgtggccat ctccgaaccc acttgccacg accccgattc gggtgggcat tttggtggcc     39120
     ccgatggtta tggtggtcgc tacgtccccg aagcgctgat ggcggtgatc gaggaggtca     39180
     ccgccgctta cgagaaggaa cgcgttaatc aggacttcct ggacctgctg gacaagttgc     39240
     aagcaaacta cgctggccgg ccgtcgccgc tatatgaagc tactcggctt agcgaatacg     39300
     ctggctcggt gcgcatcttt ctcaagcggg aagacttgaa ccacactggt tctcacaaga     39360
     tcaacaatgt ccttggtcag acattgctgg cccagcgaat gggcaagacg cgagtgattg     39420
     ctgagacggg tgccggtcag catggagtgg ccacagcgac ggcgtgtgcg ctgttcggcc     39480
     tggattgtgt gatctacatg ggtgcccttg acactgctag gcaggcactc aacgtggcgc     39540
     ggatgcggtt gctgggtgcc gaggtcgtat cggtcgagac tggctcgcga acgctcaagg     39600
     acgccatcaa cgacgcgttc cgggattggg ttactaatgc agataacact tactactgct     39660
     tcgggactgc ttcgggaccg catccttttc cgactatggt gcgtgactta cagcgcgtca     39720
     tcggtctcga gacacgtcgg caaatccagt atcaggcggg caggctaccg gacgcggtta     39780
     cggcgtgcat cggcggaggg tccaacgcta tcgggatctt ccacgcgttc ctcgatgatc     39840
     cgggtgtacg gctggtcggg ttcgaggcag ctggcgatgg cgtcgagact ggcaggcatg     39900
     cagcgacatt gaccggcggc ttgcccggag cgttccaggg aacgttctcc tacttgctgc     39960
     aggacgagga cggtcagacc atcgagtccc attcaatcgc agcaggtctg gactatccgg     40020
     gggttggccc cgaacacgcg tggctcaggg aaacc                                40055
//


  