![]() |
EBI DbfetchID BX538351; SV 1; linear; genomic DNA; STD; INV; 307050 BP. XX AC BX538351; XX DT 30-JUN-2003 (Rel. 76, Created) DT 17-APR-2005 (Rel. 83, Last updated, Version 2) XX DE Cryptosporidium parvum chromosome 6, complete sequence; segment 2/4 XX KW . XX OS Cryptosporidium parvum OC Eukaryota; Alveolata; Apicomplexa; Coccidia; Eucoccidiorida; Eimeriorina; OC Cryptosporidiidae; Cryptosporidium. XX RN [1] RP 1-307050 RA Dear P.H.; RT ; RL Submitted (27-MAY-2003) to the EMBL/GenBank/DDBJ databases. RL MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UNITED RL KINGDOM. XX RN [2] RX PUBMED; 12869580. RA Bankier A.T., Spriggs H.F., Fartmann B., Konfortov B.A., Madera M., RA Vogel C., Teichmann S.A., Ivens A., Dear P.H.; RT "Integrated mapping, chromosomal sequencing and sequence analysis of RT Cryptosporidium parvum"; RL Genome Res. 13(8):1787-1799(2003). XX DR EMBL-CON; BX526834. DR RFAM; RF00005; tRNA. XX FH Key Location/Qualifiers FH FT source 1..307050 FT /organism="Cryptosporidium parvum" FT /chromosome="6" FT /isolate="Iowa" FT /serotype="Type 2" FT /mol_type="genomic DNA" FT /db_xref="taxon:5807" FT repeat_region 178..185 FT /note="(ca)4" FT CDS join(327..409,456..516,561..703,1461..1899) FT /locus_tag="1MB.173" FT /product="SRP19-domain protein" FT /note="1MB.173, predicted protein, len = 242 aa, unknown; FT predicted pI = 9.4688; contains Pfam match to entry PF03256 FT APC10, Anaphase-promoting complex, subunit 10 (APC10); FT contains Pfam match to entry PF01922 SRP19, SRP19 protein; FT contains no predicted TM helices; some similarity to FT SR19_ORYSA, signal recognition particle 19 Kd protein (136 FT aa, Oryza sativa, EMBL: U19030, AAB65810); Fasta scores: FT E():2.3e-05, 36.364% identity (38.095% ungapped) in 110 aa FT overlap, (aa 77-184 of 1MB.173, aa 6-112 of SR19_ORYSA)" FT /db_xref="GOA:Q7YYK3" FT /db_xref="HSSP:1JHJ" FT /db_xref="InterPro:IPR002778" FT /db_xref="InterPro:IPR004939" FT /db_xref="InterPro:IPR008979" FT /db_xref="UniProtKB/TrEMBL:Q7YYK3" FT /protein_id="CAD98478.1" FT /translation="MQLTEPDGWVRIPLSPREIADNFFKDAMPIQIKTMCDSQNYISAF FT CIQIAILANHQTGRDTHVRYVKCVYLNYLICLVRQIRVWGPREVNDNVVGNNNTKSSGR FT LSSLIHCVEDPTIAEIAEVCIQLGIPCKVESKRYSKDCRTLGRVRFQLFDESGRAFNDR FT ILTKKILLNQIGIMIPKLKNRQNTTSSVIDHANSRNNKANIDDIHNENNCKIKETDSIT FT YSGVNIHSSNAKSNSKKKK" FT misc_feature 459..527 FT /note="Pfam match to entry PF03256 APC10, FT Anaphase-promoting complex, subunit 10 (APC10), score 26.7, FT E-value 2.9e-08" FT misc_feature 657..872 FT /note="Pfam match to entry PF01922 SRP19, SRP19 protein, FT score 25.3, E-value 1.4e-06" FT repeat_region 836..843 FT /note="(a)8" FT repeat_region 1671..1678 FT /note="(a)8" FT repeat_region 1884..1892 FT /note="(a)9" FT repeat_region 1977..1984 FT /note="(a)8" FT CDS complement(2017..2871) FT /locus_tag="1MB.176" FT /product="conserved hypothetical protein" FT /note="1MB.176, predicted protein, len = 285 aa, conserved FT hypothetical protein; predicted pI = 4.5843; contains no FT predicted TM helices; reasonable similarity to Q94AZ3, (326 FT aa, ??, EMBL: Q94AZ3, ); Fasta scores: E():3.8e-10, 30.208% FT identity (32.768% ungapped) in 192 aa overlap, (aa 35-218 FT of 1MB.176, aa 61-245 of Q94AZ3)" FT /db_xref="UniProtKB/TrEMBL:Q7YYK2" FT /protein_id="CAD98479.1" FT /translation="MSSKKHIRVDDESNQLNFNNESIDGEIFLSATESPSYGKKAKRTD FT SDSSDNESVIDGEFEFNDPNENDYHSIKNILLMSQYSRIKGIQFHEFVDLICNQGNIGT FT TVSISDSIIAFSTILNFRQYKETLNKIVDYLSGVISKSRNKEFENLFNSVIYEKNVGLM FT VNERFPNTPLEIVPALCNCLKDDIQWTIDNLYTDIPNKEREYYKWDYIILLTTRYISPD FT SNTIIYQKYEEEKLVINSLHTIIWQGDKKQLCGTGKKDETDLMNQQFLVSILSYDQFQK FT NYN" FT CDS 3168..3476 FT /locus_tag="1MB.177" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.177, predicted protein, len = 103 aa, unknown; FT predicted pI = 10.9276; contains no predicted TM helices; FT some similarity to O96146, hypothetical 61.3 Kd protein FT (508 aa, Plasmodium falciparum, EMBL: AE001381, AAC71834); FT Fasta scores: E():5.8, 29.412% identity (29.762% ungapped) FT in 85 aa overlap, (aa 4-87 of 1MB.177, aa 380-464 of FT O96146)" FT /db_xref="UniProtKB/TrEMBL:Q7YYK1" FT /protein_id="CAD98480.1" FT /translation="MDSKGKKNLRSSNGDRKKNAVARATGEMNKIFETILNKKATLKAN FT SRAQKEKNKLAKRTKKDSVIRDNTNKSLDINYGYIKPIRYLNDGLPVYRLEDINLGK" FT repeat_region 3182..3190 FT /note="(a)9" FT CDS join(3516..3575,4042..6090) FT /locus_tag="1MB.178" FT /product="kinesin heavy chain, possible" FT /note="1MB.178, predicted protein, len = 703 aa, possibly FT kinesin heavy chain; predicted pI = 9.4396; contains Pfam FT match to entry PF00225 kinesin, Kinesin motor domain; FT contains no predicted TM helices; reasonable similarity to FT Q93XF7, kinesin heavy chain (1079 aa, Zea mays, EMBL: FT AF272755, AAK91818); Fasta scores: E():9.8e-17, 23.511% FT identity (26.370% ungapped) in 655 aa overlap, (aa 35-671 FT of 1MB.178, aa 36-637 of Q93XF7)" FT /db_xref="GOA:Q7YYK0" FT /db_xref="HSSP:1I6I" FT /db_xref="InterPro:IPR001752" FT /db_xref="UniProtKB/TrEMBL:Q7YYK0" FT /protein_id="CAD98481.1" FT /translation="MEEARLTVHLIATAVSNSLRGEINIGVNSKRASKVMDAVGKISNS FT NKIHLNRENVRVIVRVRPIQDCNESSSSCVSIINEQQTQAKQIVLEDPRHRGLPKKYEF FT DEIYGTESTTEDIYSKEIKDYVNPLLSECSCINIFAFGSSGTGKTFTMHGDFNNEIGIV FT GLTIKQLIEINDKQAEPGAFSFSFFEVYCEQIQDLLTGAERLEHLDKPLKSTKSNISIR FT TDICGRIRIVGANSSEFKTWDEFNSAYSSALKKRASGKTAVNSNSSRSHACIQINYIPP FT ASNITQFEAETRNEDTIDNRRKRGSISFHVKPKVNLSYPRTIVNLIDLSGFENNKITNN FT TGKRMAESTFINSSLLSLSKVINALKKNAGTQSTQSCIPYRESKLTRLLQEYLGGGADP FT PYSPYCLRCIMVCTISPSVTFFQQTYATLNTPSYGNNSIMRKYIGIASATTNIDLKNSL FT EVNRATLKSQKINSSKSLNKIVTAGKNSNSNTIHLAAKATSEKKLYDRADELFSRKNSY FT KNIESRVAQMIKGKVPLNVGEPKKSRDINYGEIPKGSVPGSSSQKNIVKQPYSIGSTKI FT KCIEAKGPEERLISDLTKCDKACCKSIRCENVMETVSEALGRDDCSADSNTPEHREGPQ FT SNQNNVKDLQITDTSENKSICANLDVSRDEKVNVELNKSNSRSSAKDKGSLGMIIRRPV FT TRSQTKKD" FT repeat_region 3763..3771 FT /note="(c)9" FT misc_feature 3813..4832 FT /note="Pfam match to entry PF00225 kinesin, Kinesin motor FT domain, score 176.4, E-value 3e-50" FT misc_feature 4794..4801 FT /note="tgcatgca" FT CDS complement(6243..6779) FT /locus_tag="1MB.180" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.180, predicted protein, len = 179 aa, unknown; FT predicted pI = 5.9468; contains a predicted TM helix FT region; signal peptide predicted; some similarity to FT AAM05012, predicted protein (180 aa, Methanosarcina FT acetivorans, EMBL: AAM05012, AE010831); Fasta scores: FT E():0.00015, 30.075% identity (32.520% ungapped) in 133 aa FT overlap, (aa 24-152 of 1MB.180, aa 38-164 of AAM05012)" FT /db_xref="InterPro:IPR018990" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ9" FT /protein_id="CAD98482.1" FT /translation="MNKTIFRLLFFFAIYIMIGISNASDMTSSGSLKASNNLEKVKLVN FT LDLCNSKEAIINVQDISSSDSIIYFITVKPGTEITVNIKGNPTTGYSQQMIIKPNDSIV FT KVIDAEPSYVPDPHPEGMVGYGGKYTFRFSAVGSGSTVSTIEYARYFERPPKCIFKTEI FT QFKVIDLPCEEIIKE" FT misc_feature complement(6708..6764) FT /note="1 probable transmembrane helix predicted for 1MB.180 FT by TMHMM2.0 at aa 5-24" FT misc_feature complement(6711..6779) FT /note="Signal peptide predicted for 1MB.180 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.919, signal anchor FT probability 0.076) with cleavage site probability 0.885 FT between residues 23 and 24" FT repeat_region 6745..6752 FT /note="(a)8" FT CDS complement(6839..9616) FT /locus_tag="1MB.181" FT /product="putative kinesin heavy chain, possible" FT /note="1MB.181, predicted protein, len = 926 aa, possibly FT putative kinesin heavy chain; predicted pI = 5.6003; FT contains Pfam match to entry PF00225 kinesin, Kinesin motor FT domain; contains no predicted TM helices; reasonable FT similarity to Q9ZUS4, putative kinesin heavy chain (1022 FT aa, Arabidopsis thaliana, EMBL: AC005896, AAC98061); Fasta FT scores: E():1.2e-21, 27.304% identity (30.940% ungapped) in FT 868 aa overlap, (aa 68-892 of 1MB.181, aa 69-877 of FT Q9ZUS4)" FT /db_xref="GOA:Q7YYJ8" FT /db_xref="HSSP:1N6M" FT /db_xref="InterPro:IPR001752" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ8" FT /protein_id="CAD98483.1" FT /translation="MLGNSVMKEDSLLCKIDNTKIDSYRSSLQQSKKELVSNNHFSSFI FT NDETNKNVSSHSKSAFSVAIRFRPAVCSEINEIENNSIWELSHKSIYDVSKKTNYYFDH FT VFDDKSTNYLIYDKLIKDAVKSCLSGINVTIFAYGQTSSGKTHTMFGDNKGSYDGIIPL FT SINEIFNPEYTNCNQSIGTNSKITVSYLEVYNEKLFDLLAPQSNLNNSSNDFNSKQIKV FT IDGVDGTVDFINLTSKTVSSPEDIHGIIKTGMKSRRVAETAMNERSSRSHAILRIKIES FT HINSNDVCVGILNFVDLAGSESIKRTQLEGDRRKEGMSINRSLLALSQVISQLSESTIV FT EPHANICPNQTNFADGSFHSKNKYINYRDSKITRILSDSLGGNSRTIIICNCSPDRVNY FT YETLSTIDFARRAKKIQNKVRVNILKSKEEKYEIAELKEKIVNLNRKVEDIDLLKQEIE FT LLKSQKAELKSELNRNKVKSSDFEWNDSLSVINKEQFSHLLAEYAEIIDLKDQNIKNLN FT EEINTMQAKVRNTEKIINDNYEMKVDLDQKNDKIIKQESEIEHLSKLLSESNEKLNIFQ FT TQLDNKDLVIESLICKNEKTSLYCVELKKKLEASLLDIYSFISWYIYDYNNEITQKTEY FT NKSDLNENLDNFFSNLKSYLNFISIYNQISKQGSINYTLELLNFKNEILQLESEIKHYE FT EEKNKIDSETIICQNQAIQNLNSIILNMFELLCNLITERNDCYQINNSLQNLSISDSYE FT FMRKIKDSEKLIELLNLELNDKKDLEVKSKILQIEIENINNENKILNDSLEFQKKENNE FT LRLKLNQLKHELNFKFISSSNNLNDQEIFNNSDFITNKDEILKSNQPNDAKQSIKTENK FT ECKEKKLQRIYDLKENRSKENFIKLESNISKNKEYSEESTFNFKENHITECSTQ" FT repeat_region 7193..7200 FT /note="(t)8" FT repeat_region 7800..7807 FT /note="(t)8" FT misc_feature complement(8369..9325) FT /note="Pfam match to entry PF00225 kinesin, Kinesin motor FT domain, score 411.8, E-value 4.2e-121" FT repeat_region 8374..8381 FT /note="(t)8" FT repeat_region 9325..9332 FT /note="(t)8" FT repeat_region 9620..9627 FT /note="(ta)4" FT CDS join(10386..11652,12289..12362,12409..12480) FT /locus_tag="1MB.183" FT /product="conserved hypothetical protein" FT /note="1MB.183, predicted protein, len = 471 aa, possibly FT hypothetical 96.8 Kd protein; predicted pI = 6.0957; FT contains no predicted TM helices; reasonable similarity to FT Q95K11, hypothetical 96.8 Kd protein (836 aa, Macaca FT fascicularis, EMBL: AB070016, BAB62961); Fasta scores: FT E():0.005, 24.377% identity (29.333% ungapped) in 361 aa FT overlap, (aa 129-450 of 1MB.183, aa 394-732 of Q95K11)" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ7" FT /protein_id="CAD98484.1" FT /translation="MFLEFIDINNSILQWLKEYLNNFSQTSICSTNTIVTRLLSIKEET FT NHSYGDDDMNNDSEKVISSYKSKQMEEKAEFEIHCDLLNENEMSELYIDSFESLPWKGQ FT TMFSNFRELQISLQSILERVILHSDEGRNNLFLKRDEIIIRKISHMITELEDYLYNQFD FT ILNKERNKSEKMEIENKNLKKQLKTAIEESEEYKFEIQNSINQKNIYEKELEHLISQKN FT RLSNDLISFKEENKSLERKYEGLLYNYNELKEKHDILCEENQQLINEINKYKEENKIES FT IERKKNENNSYLELDQNIIMSVGHRANDLIYHSMEIKNVDCENNIKENSNYTKKGNLYK FT YPAIIKTIPDFLESSIQTPISCRSYREKVGYFSEKTTDKELRIVNVVSPVAENKFENFL FT IQKRKKDTHKQKKNLSKTKNEMGFFSSLFFESDLLQYLFELSIICLQNCKFYKLLKSHQ FT KYFHYHKFGQAH" FT repeat_region 11008..11015 FT /note="(at)4" FT repeat_region 11191..11202 FT /note="(aaat)3" FT repeat_region 11237..11245 FT /note="(a)9" FT repeat_region 11614..11623 FT /note="(a)10" FT CDS complement(join(11684..13440,13528..14521)) FT /locus_tag="1MB.184" FT /product="hypothetical predicted ATP cone domain protein, FT unknown function" FT /note="1MB.184, predicted protein, len = 917 aa, unknown; FT predicted pI = 9.1257; contains Pfam match to entry PF03477 FT ATP-cone, ATP cone domain; contains no predicted TM FT helices; some similarity to Q9PQJ8, hypothetical protein FT uu293 (1447 aa, Ureaplasma parvum, EMBL: AE002126, FT AAF30702); Fasta scores: E():0.0033, 22.158% identity FT (25.768% ungapped) in 871 aa overlap, (aa 113-915 of FT 1MB.184, aa 17-833 of Q9PQJ8)" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ6" FT /protein_id="CAD98485.1" FT /translation="MQDKPFDISVIEYRKENIKFSNSKKTDKSNIFTPFGAVRPFTYCK FT NDFNLSFNKSWNLFFSKFLNGFRLEMSDGMVKILESIDSLVDHDVNYELTVIGALSGSN FT NCDHLITFNVLEELLNSSGQPYTIFLDHEKIGIEGLNVSKLLETIWNSIKDKFLDLLIE FT ENNFIDSQKGYDYENDSLAGKSTYLNKLKENDEKLADRVKRNKRKCNIEHSGIKETHFD FT LNKKIVFSDSSKVESKNKWLPIQTIPFNFDSSKFSLSNGNSNQKYINIIKGLKFIGSIF FT KMRKKNNLNIKPIIILIPSSECLAPGQLGQLLEIISQIKETYKITFTIILDSKMAFFQS FT IINPLLHLDDFTSRIFLKGNFVKFKELIPGNNLELNDPSNAITDSESELFSFPLLSSAC FT IRHIKDNFFQYDYSITSSLKTIFIIFQLHFTDNHFDFLFQRKEEISIQDLSKQIIDQLK FT LKYQINEESFLSGIKLREAAIVEISLLKLNLKSIALGLCIINTILSDMLNIFDINERYN FT LIIEWLEVLERNRRKELTKKIGNLTREIKGINNVALDKNIVFTKINNITEIFLTRNKSF FT YEIKNQAFIELPWNIKTFECNISKLFIHSFDESQLSEHISDNVFRIFIYSNIKTFSNQM FT DYFLEEILTPNFIPKIINELDQTCDNENIFDDFSAIYKIYTLNNGNKINLFKLFSTFCK FT QIMDSSNKYCNKSDSKNKLEKTNKVRLEINCLPDQYKDLFSRFIRVMNTLQFIGLLYLP FT QKNTKFEQDVFQINFDLNDSEEKHQSSKKYLDKYFRFTLGNLYAHKLFWGNNIPISCMI FT TNDKVNIDDTKTYLSNDEKFKPPNLSDNINKINSKTKYNRKLISPDANDKSVNKNIKKN FT RLEVLKLRAAESNQRKIIEFKGIKIQRAINSANRIKNSRNSINEK" FT repeat_region 12618..12629 FT /note="(aatg)3" FT misc_feature 13237..13244 FT /note="tgcatgca" FT repeat_region 14446..14453 FT /note="(t)8" FT repeat_region 15187..15194 FT /note="(a)8" FT CDS 15508..17919 FT /locus_tag="1MB.186" FT /product="ribonucleoside-diphosphate reductase large chain" FT /note="1MB.186, predicted protein, len = 804 aa, FT ribonucleoside-diphosphate reductase large chain; predicted FT pI = 7.1072; contains Pfam match to entry PF00317 FT ribonuc_red_lg, Ribonucleotide reductase, all-alpha domain; FT contains Pfam match to entry PF02867 ribonuc_red_lgC, FT Ribonucleotide reductase, barrel domain; contains no FT predicted TM helices; high similarity to RIR1_CRYPV, FT ribonucleoside-diphosphate reductase large chain (EC FT 1.17.4.1) (803 aa, Cryptosporidium parvum, EMBL: AF043243, FT AAC12280); Fasta scores: E():0, 100.000% identity (100.000% FT ungapped) in 803 aa overlap, (aa 1-803 of 1MB.186, aa 1-803 FT of RIR1_CRYPV)" FT /db_xref="GOA:Q7JQE4" FT /db_xref="InterPro:IPR000788" FT /db_xref="InterPro:IPR005144" FT /db_xref="InterPro:IPR008926" FT /db_xref="InterPro:IPR013346" FT /db_xref="InterPro:IPR013509" FT /db_xref="UniProtKB/TrEMBL:Q7JQE4" FT /protein_id="CAD98486.1" FT /translation="MYVVNRKGEEEPVSFDQILSRITKLSYGLHPLVDPARVTQAVING FT LYSGIKTSELDELASQTCAYMAATHNDFSKLAARISTSNLHKNTSSDIGDVASQLYNFK FT DNQGCPAPLISKPVYDFIMENRERINSKIDFSKDFEYDYFAFKTLERSYLLKIDNKVVE FT RPQHLLMRVSCGIHCGDIEAALETYELLSQKYFTHATPTLFNSGTPRPQMSSCFLLRIP FT EDSINGIFDTLTKCANISKTAGGLGVAVSNIRGTGSYIRGTNGRSNGLIPMLRVYNDTA FT RYIDQGGGKRKGAIAIYLEPWHVDVVEFIEIRKNHGKEEMRCRDLFPALWVPDLFMERV FT EKDQDWTLMCPDECRGLQDVWGDDFKKLYEEYEKQGRGRKTMKAQKLWFLILQAQIETG FT TPFICYKDAANSKSNQKNLGTIVSSNLCTEIIEYTSTDEVAVCNLASIGLPKFVDKNNK FT TFDFDKLKEVTKVITRNLNKLIDVGYYSLKECKKSNLRHRPLGIGIQGLADCFMMLRMP FT YESEGAKKLNKQIFEVIYYAALDASCELAEKYGPYETYSGSPASKGILQFDMWGVTPDS FT GLCDWDLLKDRISKHGIRNSLLISPMPTASTSQILGNNESFEPFTSNIYHRRVLSGEFF FT VVNPHLLNDLLELGLWDDRLKQNIIANNGSIQNILTIPEDIRELYKTVWEIKQKTVIDM FT AADRGPYVCQSQSLNIHMENANFAKLSSMHFYGWKKGLKTGIYYLRTQSATRPIQFTVD FT QQLLKSETKEKDSLETNKRQALEPEAQKLIACPLRPTNMKDDEECMMCSG" FT misc_feature 15508..15777 FT /note="Pfam match to entry PF03477 ATP-cone, ATP cone FT domain, score 80.3, E-value 2.6e-21" FT misc_feature 15922..16140 FT /note="Pfam match to entry PF00317 ribonuc_red_lg, FT Ribonucleotide reductase, all-alpha domain, score 127.1, FT E-value 2.2e-35" FT misc_feature 16144..17736 FT /note="Pfam match to entry PF02867 ribonuc_red_lgC, FT Ribonucleotide reductase, barrel domain, score 1046.2, FT E-value 0" FT CDS 18083..20512 FT /locus_tag="1MB.187" FT /product="glycosyl transferase, possible" FT /note="1MB.187, predicted protein, len = 810 aa, possibly FT hypothetical protein kiaa1918; predicted pI = 8.9651; FT contains Pfam match to entry PF00535 Glycos_transf_2, FT Glycosyl transferase; contains 3 predicted TM helix FT regions; signal anchor predicted; reasonable similarity to FT Q96PX0, hypothetical protein kiaa1918 (516 aa, Homo FT sapiens, EMBL: AB067505, BAB67811); Fasta scores: FT E():5.6e-22, 33.257% identity (38.220% ungapped) in 439 aa FT overlap, (aa 269-673 of 1MB.187, aa 72-487 of Q96PX0)" FT /db_xref="GOA:Q7YYJ5" FT /db_xref="InterPro:IPR000772" FT /db_xref="InterPro:IPR001173" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ5" FT /protein_id="CAD98487.1" FT /translation="MKSRIIPSKIIKLGKNRVFCRTFLITILIFLIFFFKRRLVIDLLL FT IVELILIDILFSKFLKKKKAIYSSISSTNLNLEKRNSIELKKLSTDSCNNSLKQQKTIK FT SLSDDRKSRKIGRKKNKGYGWTLTIIASLILIFTSFYESFIANIIYKEYSNEKEITSLI FT DQYSEERKYYSNVFKYNPPINLPTLEFSNWLLTNWVFRTISNLPILFSDYSNIKKKGIE FT YIETEFDTFINLKKGSISNITNNELENSKFDGQETNIDFVSDSSEFNENDRISIVIPAH FT NEDEFISKTIIFTIESTPTELLREIIIVDDFSEKPVFEILEEELPENYKKYVKIIRLKK FT CEGLIRSKIIGADAALGPNIFFLDGHCKPKKGWSEALVKSIRENYKRVVCPIVQSISNI FT DWSDIGTAGAKMMIEWNFAFHWYDDGLPEIPIASGGILMITKRWWEESGKYDPGMLYWG FT GENIEQSFRVWLCGGEIHVVRNSLVGHIFERNNSNRRNQDFQYKKMLIDNMNSNHQRTA FT FVWLSEQFYETYFKNYHVLGYLPISYTKGLSERLSLKHILKCKPFEWYIGKFRPAFERQ FT GELYYNFHHIQHVKSKLCLSIANKQNDRIGVGKAEIEIPMTVVPNDVSRYSIKTTTDYD FT ILALKTCNYLDESQKWSFILGNRMLYNFKSKKCLDKASSVNLFKKMKTKDFIYSPNNSS FT ESNTELELPLLYECDWNLVMRARNYNQFWAWKDIGDKSGKIVNWSGDEHRSNVTGGAEE FT FIVPINKGVDSESYCLYSNTALGSYEETKMFYSNCKTENNSEEISFKKIWRQNLFV" FT misc_feature 18083..18196 FT /note="Signal anchor predicted for 1MB.187 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.003, signal anchor FT probability 0.996) with cleavage site probability 0.001 FT between residues 38 and 39" FT misc_feature join(18119..18187,18197..18250,18449..18517) FT /note="3 probable transmembrane helices predicted for FT 1MB.187 by TMHMM2.0 at aa 13-35, 39-56 and 123-145" FT repeat_region 18266..18273 FT /note="(a)8" FT repeat_region 18433..18441 FT /note="(a)9" FT repeat_region 18731..18739 FT /note="(a)9" FT misc_feature 18905..19426 FT /note="Pfam match to entry PF00535 Glycos_transf_2, FT Glycosyl transferase, score 70.6, E-value 2.2e-18" FT repeat_region 19163..19170 FT /note="(t)8" FT repeat_region 20861..20870 FT /note="(t)10" FT CDS complement(join(21245..21651,21685..25516,25568..26112, FT 26260..27925)) FT /locus_tag="1MB.188" FT /product="conserved hypothetical protein" FT /note="1MB.188, predicted protein, len = 2150 aa, possibly FT hypothetical 135.8 Kd protein; predicted pI = 6.5792; FT contains no predicted TM helices; reasonable similarity to FT O96192, hypothetical 135.8 Kd protein (1121 aa, Plasmodium FT falciparum, EMBL: AE001398, AAC71888); Fasta scores: FT E():0.0008, 23.213% identity (27.622% ungapped) in 1021 aa FT overlap, (aa 102-1049 of 1MB.188, aa 27-957 of O96192)" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ4" FT /protein_id="CAD98488.1" FT /translation="MNSSLEEENVLFKLLCKVLFYQGDFLECSIVLERILGDYSEEEIF FT NKKIIVNCEYLFDFFSSWNLFNIRNSISANKNNDHIFGDLMLKELKLQKLCSMILDEDN FT FELSDHSNYTETIKEIKYSLIKIYRELKVKGTSPKLALQEKYNKIFENILLRIECCKKY FT EKKCLEIERYITNTNFEENYIKSIGNISKSIQDDSNNIFSYFKQIISLFGINNLYISFE FT PFQLLEDYINDLSAKLNLSNYKNEFKVNSDRRTRSKNVNNDLPNVNLNKEVIERDLLLE FT LDNPLKKIIIDISSNTVTFPKYNDKQYKIWIQTIPYYCYNLILSVLKKEEMKCDNIFSL FT NLNNNFPSIEQFDEMIVHETNFITEQYSAMLETKYVSFQDIKQLFDNLRHKFLMNFDEQ FT HIFEVDNRFSFLSGNLLTSVDEFLLNSNNLLNKIQKFIFSTKNEDIIITSETISSLLFS FT ITEKLFKYTMGIITNIVDFTKFHRYNDKPTIRFQCSNHKSRIFDKNYLCSKLTSKLYYL FT LFQVNKQIQKEILLLFYSIMKTLIIFIFVTNNIISNTKMIEIPIVKIYLEDELVFFLKE FT DLMKISGYAGNINLILKLFSDLDQNNQGGIISTGFALKLLSDNSIRTIKEKFSNWISIY FT NSKCSHEFPTYDIIANYSVFLDNNYLEYMQMKKINCNLRNEIIDHKFLQHTNSIILDIN FT QWIKKFERFDLSENFCEFLISLLKNLVLIINKDIQYLYFNNIIIYELNLINISKLSKKP FT LHKLTMETILINISCIHKVFMNNLSANKLNILVMKNLINLLIALFDFFKYSSTIEEISS FT NSIINLIRYLTYLEFSFEFVKDENKEQKETKVYTKQSEEDTFKRHILNNYCILFPIIEE FT QTFDFEGIYDINEVMINEQFDKFFNSTLSTKVLLLTITIKYGLFSNLSVIENWFKEIYT FT FVQVYKTNSHLKNWNLLYHNCRKVVKNYSALFCNLNMKKLYEKIELEKYCLFPEQETEY FT LELLGIQFQLLYGLPVCPLPSMPFNKMAIYNQSQIYDYLFDLKILVSNETNFKNLFINW FT IINLIKNNEFRSSGILEYISPYSPLIILSCNIFQTLINCTSLTERQYSDSKVKFYTSII FT GLVERLFFNLNNLKEADAGNIWNIISSPLIYPLFMETLSEIMNSEKFYFDTTGKTLLQD FT DFSSILYNSQVSDQILNYYDNIMVLSPSFSNFEMDEFEVEKHGFEDLLFDEISIYIKEN FT NLVIASLEPLLYSKYYYTILLDRFMKSNLDIKYLSNEFLDETLVQIRSKVENLISFPNG FT DNIFLKINWKNNIQKFIPFRFIKDCFISCIPVSYMDRSSSLWSPGRFGVTGNNNIQLSL FT STLFKYLKLLNMNLFLMDNYDLNIVTCCNNVLKQIICELDDYDTFQRLERNFPVFIKRM FT VAQLIYLLQCEIKFFLKPERNIQIPYSQIVNNILSASFDQLIIRLINIKSVYGKTNAKY FT LLIYLNKVKVLSKHNFNNEIFPRSFEKDLLINLIQVISKLRKTLFTEFIPKISNEEKKW FT YCIYTNSFSQIQNCVWLPSLLESKARFKLAKIMISSFIYSPVDNHNHVEAAITELKNSY FT YLSIESLIICYLILSGNTDEIDTFFKNYEKHSGIEITKFFNSRLITIYILFFNIYLNKF FT PEKNPLYNKIGFMKSQINDILKNINISYSNLSTNLAGILFYFYKSFINLQSIKQFIVLF FT ITNKSFPINDNEIAEFKDLIKINSHSINNYSITEMINELIFDELNSSQTTLKKNKLTQS FT IFTPIINQSFSYFSMINYILLGENSTNLLLYYQKFLLLNKKVMNFWGLETNKLINDHYN FT GLYLRRNRKYLCYKYKFMLVPSMLILVIKNEITFDMNNILYNCINSETEIFACFNKDLT FT LDDRYSSPIFFPIYNSLLDTKCDNFANLSDISYYYLVNLNQILDHLIDLLINEFKSIRQ FT LPAVEKFEYLINDQIKNNNKKGINTNLIDIGIINWNDISESGHFNISLNIILKILNNIL FT DIFNFVDEIGLPVDGKLFSSLFVSSEHKKSQLKSEFAQCIPLSVIKQLERQLGSFLQTN FT IPFSVVHKIGEIKVSLEFNFINIFNKCFTFSLDKKVIYSRNYFLEVVKLLIQFLGISSN FT GVNFVNLYSENSEVIEGFLFEEIMPEVKKRKRYDTSLFD" FT repeat_region 22476..22484 FT /note="(t)9" FT repeat_region 26058..26065 FT /note="(a)8" FT repeat_region 27269..27276 FT /note="(ta)4" FT CDS 28564..29184 FT /locus_tag="1MB.191" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.191, predicted protein, len = 2150 aa, unknown; FT predicted pI = 10.1201; contains a predicted TM helix FT region; signal peptide predicted;" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ3" FT /protein_id="CAD98489.1" FT /translation="MKHTKYLLLLLTPTFLVYCSNITPKLENDTFIDALDKDLSENEKE FT KKMTVCFELTQKEFIEKRDVYQKIAASIKDKESITSNDALRALFHQNLITCYFNSNIKD FT INSVINRNIPKEGIAKMFTVSENTPLRFSSNQLKILERVISMSTKNTGKQISGGIAKQL FT SGFYGHLYFIFALLLIGVSFYFALDRLNKSIKGAKGKITKKNK" FT misc_feature 28564..28620 FT /note="Signal peptide predicted for 1MB.191 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.974, signal anchor FT probability 0.016) with cleavage site probability 0.714 FT between residues 19 and 20" FT repeat_region 28697..28705 FT /note="(a)9" FT misc_feature 29056..29124 FT /note="1 probable transmembrane helix predicted for 1MB.191 FT by TMHMM2.0 at aa 165-187" FT CDS complement(join(29218..32242,32461..32995,33329..34425, FT 34546..35031,35080..35583,35626..37391)) FT /locus_tag="1MB.193" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.193, predicted protein, len = 2471 aa, unknown; FT predicted pI = 6.4699; contains no predicted TM helices; FT some similarity to Q25802, rpod protein (960 aa, Plasmodium FT falciparum, EMBL: X95275, CAA64574); Fasta scores: FT E():0.0017, 20.543% identity (24.312% ungapped) in 1032 aa FT overlap, (aa 488-1462 of 1MB.193, aa 14-942 of Q25802)" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ2" FT /protein_id="CAD98490.1" FT /translation="MLLNVEQIIGYFAFSNNFYAELTNERNNCTDTYSDFDKFFGNELK FT LFDDLHIASVINNDTICVYNVSTIIFSRDRAQDKCRAPNIKSLFNINLPFNPNCNCSLF FT FDRCTSILTISVCSNEGQLYIIEISVFGECEYGFNILTTRSFKTELSNLKLLTQATKSL FT FLACNELGVFGIISNSTILKDEFLPLFIKKNIFTESLEISAFQIVKTTVTNKNTENIYL FT ITFSEKNKELCLILIRFNPAENNVCSNIIDRRTDIEIGINIDTHPVISLLEGTDHFVLL FT INSTVYLYKILNFCDYNSENINLQLSSYLNLDLQQSGDKKKLLYPQVYCNKSEIYIARA FT AFPGNIMNDYDLCLPICEIHKICLNDHTSNNFPSNANLGNSTLVWSQGLQNSFKNPLDI FT SDLKCKLISNIEKLSNFEEIINFLLCRVIGECKSNTVVAMAFNRYLRFLESIISKIVCE FT KKQCELLNILYSTSKFEDLVNLMKNVYIYLDGKALINKEFETRIKCKYDIEKIIFIITP FT LHLLINYFKNIGNYCIRIFSDKDKTFLLSPISVSVISSLSRQELDKPSSISDNCNSYLY FT ISLEIIRNYLKLKLDKHLFDLFDFYISDGINNKIIVCALIMELIEVLSKKLTFMNKLED FT TINHYTSLNEIFSPASNYINNFNLIYYDILTQNFTTDELKSLIQNIIQFFVNLSPTSND FT EVLNFVGKSIKADHNGDVKCLVLNSVDKETNKYFKASNLEIMKVKNTILILNIYSIITR FT NIFNCGVENILQFFSEDYSFLNLGKYMVTGKLNIITHNCGRINEHHFVPSVDQVYRVYL FT YYNSIYKFFSSDVFIDSGANDLSLYIFSNKIFWELFLNDFCSKKVSQECNFQLVDLVYD FT YLEQQIINGCFEHNYTRYISYIIRLLPYKSHTSDSSLSILKQLISDIKLFSKPRIKIYY FT NSIYFNLLEISINLICACFSSCSPNQKILINEIHKKYLIQYIYLSGNNIEDHSIVKLLN FT KSLSCFNSKHEKMLFLMVLWIKHYLSNKVENKSELKFSTISASVFATELEEFYSLLIQI FT FWKKTKFLNNTNSEAFLRFYSKGAFIVRSIYKFLMNEWKRGERYQEIVTLAYLNALDKY FT CDLCSYNNCTNALFPESWIEFILMNNSRYEEFDSFKIINEALENLSGIIQYFSFEKEVI FT NADSEPLNIIISSLNTCITYSSATLINQFNDRQYSDKPICIPNLFFGIELPKIIYQNNY FT SQGIHHKVTFLPSIEYISYLKWYFVGINKLLTSDLTLAETYQHFINFCKNIERYKNYLE FT LEYKNDFLTQTSGQKKKKLSYSSPDPKITVFEEILLQNAKLRNIQILKGSDNLAFAINI FT NYWSSIISRINHSPYFLQGVHIALLANERIYGAMNSDNPSDSAYTSILPIQLKKMYKDP FT EYLMKIIRSCSTSIGQHIHIFINVIENLLSFSFIDEALRFITAIISDKQIELPIYFVAK FT VRDNLILLPRVIFIFVIQCLSLIIVANSLPWIPDYEFSNPITVSERIHQNEIQNLVKRK FT LMNKYEIKFNEEHEKLRKLINDTKEKVYINQVNQMYDLLADQIRNIGFSCPTTSGIKNQ FT YCNEKRLKGFENMKERSYCENAFDPSPIAYSYNSNEYSYSDSGDIILNYENEDLVLEDS FT NCVSVIVDGDVLICGDLIVANGQLNTLTSFDALEAQNLSVGKTQPIHSKRRCIIRKGLT FT VRGSMILPFSNLIVIGKLIVGKNVYTADLQVDLSSNLFIGVLYLFSDVYQNNSGSDSWN FT PFSITRVKLFHASNSCLNTFDNNIVQNQFQDSSKSNESETKEQNLGSLWSGRKIYVGNS FT GKLWVRGDISAGEVVISDAGLVFGTCGNLKTNTTHGLGIHIRDSGSLHMMNSSIFTSRL FT SLIRSSVVIVKKGELKINSILLLHSGSKIRIGGNVMLHIASIRDSSTFYADSLKILNQS FT IENIEYYHNSYNISSIFGISSTQYTEINDKMAGFFQVVSASAATIYGEVFVQGSIEIND FT GSEFLSMGKVSIFENAVISDGSEILILGHSNDSNSNNNSNNNCERYQHCILGSILVSNS FT SNMFIANGSLKVNEHVDLIEGSFLLGESMEVGKSFISTRKSSFFINKGNLIINSVNPSF FT EVKSNDYKSFFISNGIFIINGKLIVKNGNVVLIMKSYLEVEQIYINQGSLGMSTKSLTV FT VNGEKGNEFTDLFLFRNKIPINLLINGNLTIDTFSEMLVFEGSISIESVILTSGSIINI FT KRENNAIQQYQVNINKVVVLQGNSLMNVSERYLLEVNSGIMVDEFSILNCDKTIIKGDV FT FTDDGGVFIAKSAVISYPTTLVSQDSSKIYIENLQVNYLISRKNTTNTSSEKSNIPLFI FT SENGGSIKIDKCEYICDESFCIKSLFGIRTTESMITIENSFPWFIPVKTSVQFFDLNST FT RSKELYQVIDPIALIARKNPEGNKQIEGKSKITFSKGEQVRRGRTWEIDNSKINSANES FT QPIQWQ" FT repeat_region 29678..29685 FT /note="(at)4" FT repeat_region 32268..32276 FT /note="(t)9" FT repeat_region 32657..32664 FT /note="(t)8" FT repeat_region 32944..32954 FT /note="(t)11" FT repeat_region 33028..33035 FT /note="(a)8" FT repeat_region 33925..33936 FT /note="(catt)3" FT repeat_region 34263..34272 FT /note="(at)5" FT repeat_region 34489..34496 FT /note="(t)8" FT repeat_region 36426..36434 FT /note="(t)9" FT repeat_region 36811..36818 FT /note="(t)8" FT repeat_region 37459..37466 FT /note="(a)8" FT CDS 37666..38712 FT /locus_tag="1MB.194" FT /product="conserved hypothetical protein" FT /note="1MB.194, predicted protein, len = 349 aa, possibly FT hypothetical 44.4 Kd protein c27d7.08c in chromosome I; FT predicted pI = 9.1345; contains no predicted TM helices; FT reasonable similarity to O42662, hypothetical 44.4 Kd FT protein c27d7.08c in chromosome I (385 aa, FT Schizosaccharomyces pombe, EMBL: AL009227, CAA15827); Fasta FT scores: E():1.6e-14, 29.329% identity (35.169% ungapped) in FT 283 aa overlap, (aa 49-327 of 1MB.194, aa 15-254 of FT O42662)" FT /db_xref="GOA:Q7YYJ1" FT /db_xref="InterPro:IPR010286" FT /db_xref="InterPro:IPR016909" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ1" FT /protein_id="CAD98491.1" FT /translation="MGQANNKNQQKKATQLHHRNVHANDDFLFLSERYPELKKCIKIIN FT NKVRINYNPAALHCISKVLLHYRYNINWDIPDKFLIPTIPSRANYVHFISDLLTPEHFY FT NTEKVNDEGLRNDLKTKTCIEGGTDVCFSELIPRGKQVLGFDIGIGANCIFSLLCNKIY FT SWNMIGSDISIESLSVSDTIIKKNNLCGCIKLLHQEKPEYILFGILDKTEIEDLKFSFT FT ICNPPYYDSVEDSEINMHPARFRSCQNYEIITHGGESQFILKLYFESKNFSKRVIWYTS FT QVSKLKNLKFLKSVLKKEIINNELKSLRYTTLKQGKHDKWVIAWSFFEKEERTSILKFL FT RNNKNMSS" FT repeat_region 37694..37701 FT /note="(a)8" FT repeat_region 38218..38225 FT /note="(a)8" FT repeat_region 38271..38278 FT /note="(at)4" FT misc_feature 38759..38766 FT /note="tgcatgca" FT CDS join(38854..39121,39243..39316,39361..39438,39489..40328, FT 40373..40417) FT /locus_tag="1MB.195" FT /product="BT1 family protein" FT /note="1MB.195, predicted protein, len = 434 aa, probably FT biopterin transporter; predicted pI = 7.6845; contains two FT Pfam matches to entry PF03092 BT1, BT1 family; contains 9 FT predicted TM helix regions; good similarity to Q55721, FT integral membrane protein (494 aa, Synechocystis sp, EMBL: FT D64002, BAA10362); Fasta scores: E():7.8e-37, 34.63% FT identity (36.13% ungapped) in 410 aa overlap, (aa 29-421 of FT 1MB.195, aa 27-436 of Q55721)" FT /db_xref="InterPro:IPR004324" FT /db_xref="InterPro:IPR016196" FT /db_xref="UniProtKB/TrEMBL:Q7YYJ0" FT /protein_id="CAD98492.1" FT /translation="MDGTLHQNEIISLLDEKRTISSTIKFSSLLGSITITKLVIFLVGF FT SDGLTHLATLAIYYLLKDDLRLSPPEVSVIYAIPAIPWFLKPLFGMRRKPYLIFFSILQ FT VIGFLLLATNADTVFKAAVCLLLISLSAAFCSSIAEALVVETSGINGGAETVSDYFGSK FT ALGALATAYFSGSLLDTYSKQGIFLTTSIFPLFVFIACLIMDDKKQTEDLTAKNQLFSL FT KEFLKKPIIWGPAIYIFTYTAGPDYDDAMFFYFTNRLGFSPTFMGSLRLTYGIAGIIGI FT VLYRIILKKTPFREILLWTTLFSIPIYILPLALVTGLNLNMGISNRMFALSGGFLIEAI FT AEIQLLPLLVMTAKFCPKGLEGSVYAVMMSIRSLGIGVSKVISAGLAYSLGITAFNFSN FT LGLLIWISSAFLLLPLFFLNLVVNEEEIQSTENQV" FT misc_feature join(38965..39033,39263..39331,39527..39586,39647..39706, FT 39764..39823,39860..39928,39956..40024,40091..40159, FT 40172..40240) FT /note="9 probable transmembrane helices predicted for FT 1MB.195 by TMHMM2.0 at aa 7-29, 95-114, 135-154, 174-193, FT 206-228, 238-260, 283-305 and 310-332" FT misc_feature 38995..39123 FT /note="Pfam match to entry PF03092 BT1, BT1 family, score FT 25.6, E-value 8.6e-08" FT misc_feature 39248..40276 FT /note="Pfam match to entry PF03092 BT1, BT1 family, score FT 90.6, E-value 2.1e-24" FT repeat_region 39774..39781 FT /note="(at)4" FT CDS complement(40525..42693) FT /locus_tag="1MB.197" FT /product="oligosaccharyl transferase stt3 protein, FT probable" FT /note="1MB.197, predicted protein, len = 723 aa, probably FT stt3 protein; predicted pI = 9.3317; contains Pfam match to FT entry PF02516 STT3, Oligosaccharyl transferase STT3 FT subunit; contains 12 predicted TM helix regions; signal FT anchor predicted; good similarity to O97353, stt3 protein FT (723 aa, Toxoplasma gondii, EMBL: AJ132382, CAB38944); FT Fasta scores: E():2.1e-123, 51.671% identity (55.373% FT ungapped) in 718 aa overlap, (aa 24-722 of 1MB.197, aa FT 29-717 of O97353)" FT /db_xref="GOA:Q7YYI9" FT /db_xref="InterPro:IPR003674" FT /db_xref="UniProtKB/TrEMBL:Q7YYI9" FT /protein_id="CAD98493.1" FT /translation="MKDISYKELENEYKKKSIFESAGRISPVLLFLSVVLIMGLCIFVR FT LFAVVRYEAIIHEFDPHFNYRTSKFLSKHGFYAFWNWFDSRSWYPLGRIIGQTLFPGLM FT LTAALMRYIAHQLGLLVSILHICVFTGPIISSLTALASYLLTFQITKRNETGIMAALFT FT GISPTYLSRSVAGSYDNEAVAIFALVFSFALYLRAVNDGRILSSFFAALAYNYMTMSWG FT GYVFVINTIAIYTLALVLLNRFTYKHFVVYIVFYVIGTILNLNIPFVNVGAVKSSEHLS FT SHCMAFLVVCVIALNSSKYILSTSSSRILLKAIWISCFLVSSSAFLVLTLSGKTRWAAR FT SMTLLDPTYASKHVPIIASVSEHQATTWSQYIFDLHITAIFFPLGIFVCAYSIYISQKK FT SPKNSPFQTSHSTCIPDTAIFLIVYGVLAVYFSSVMIRLMLVFGPAACCLSAVGFSFIL FT SSLIGRRQTKKTTYGDGSQGNFGILRVFFVLLMFLLCVSYVVHSVWSSAVAYSHPSVIT FT SNRLRDGTRLIQDDFREAYYWLRKNTPYNARIMSWWDYGYQCTELGDRTVIVDNNTWNN FT THIATVGLALASPEDEAYKIIEKLDVDYVFVVFGGFAKYSSDDINKFLWMVRIASGIYP FT HIQQNDYLSDSGHYTVGKDASNAMLNSLIYKLSYYRFSDLQKDGFDLVRNYHIGKTQFS FT LTHFEEAYTTDNWVIRIYKVKKQSNRSI" FT misc_feature complement(40663..42612) FT /note="Pfam match to entry PF02516 STT3, Oligosaccharyl FT transferase STT3 subunit, score 390.9, E-value 8e-115" FT misc_feature complement(join(41182..41247,41308..41373,41386..41442, FT 41506..41562,41701..41766,41788..41838,41884..41949, FT 41971..42036,42097..42147,42259..42324,42355..42420, FT 42553..42618)) FT /note="12 probable transmembrane helices predicted for FT 1MB.197 by TMHMM2.0 at aa 25-47, 91-113, 123-145, 182-199, FT 219-241, 248-270, 285-302, 309-331, 377-396, 417-436, FT 440-462 and 482-504" FT repeat_region 41233..41240 FT /note="(a)8" FT repeat_region 41286..41293 FT /note="(t)8" FT repeat_region 41497..41504 FT /note="(t)8" FT misc_feature complement(42571..42693) FT /note="Signal anchor predicted for 1MB.197 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.000, signal anchor FT probability 0.993) with cleavage site probability 0.000 FT between residues 41 and 42" FT repeat_region 42646..42654 FT /note="(t)9" FT repeat_region 42858..42869 FT /note="(tatt)3" FT CDS 42870..43724 FT /locus_tag="1MB.199" FT /product="gal83 protein, possible" FT /note="1MB.199, predicted protein, len = 285 aa, possibly FT gal83 protein; predicted pI = 6.3101; contains no predicted FT TM helices; reasonable similarity to Q9ST66, gal83 protein FT (289 aa, Solanum tuberosum, EMBL: AJ012215, CAB52141); FT Fasta scores: E():2.9e-09, 28.230% identity (31.892% FT ungapped) in 209 aa overlap, (aa 17-213 of 1MB.199, aa FT 85-281 of Q9ST66)" FT /db_xref="UniProtKB/TrEMBL:Q7YYI8" FT /protein_id="CAD98494.1" FT /translation="MGIVSSSSNLSSIDKPNEQNNFHCTNGHMNGHNLENIQCVIRWSF FT GGDEVFVTGSFNFWRKQDEYKLFKSGHDHLIAIELTRNIHFFKFIVDGEWRYSPEYPIE FT SDSEGYINNCIDLTKYKAPYYSTPCDKSRYGVQEFHQELPTEFPVDAPALPILLGKSRC FT PLETANGIHIPFHCISNHIYYDSLVQEIFGTHIVTFCVTKRWFKEKYMQIDHCMQKFTT FT ILYVSFRLIDEFYPILICKKNDYNLNCFSKSVSDSENNNPHYDAYFSDRLLTTAEMFAT FT IFR" FT CDS join(44069..45046,45083..47266,47456..52063) FT /locus_tag="1MB.201" FT /product="putative transcription regulatory protein, FT possible" FT /note="1MB.201, predicted protein, len = 2590 aa, possibly FT putative transcription regulatory protein; predicted pI = FT 6.2366; contains no predicted TM helices; reasonable FT similarity to Q94LQ9, putative transcription regulatory FT protein (2363 aa, Oryza sativa, EMBL: AC069300, AAK55455); FT Fasta scores: E():8.1e-42, 26.836% identity (31.993% FT ungapped) in 2165 aa overlap, (aa 530-2578 of 1MB.201, aa FT 419-2350 of Q94LQ9)" FT /db_xref="InterPro:IPR007196" FT /db_xref="UniProtKB/TrEMBL:Q7YYI7" FT /protein_id="CAD98495.1" FT /translation="MVEKNLPCGEDIPKRQLFEIIKEIGPSSTSNQENFDEILLGWIND FT SSNFKRDIATSFIYMLNDTSKYNKDPKYKEKGSDDFSEVLRARWNELSNNKNDQNLLVQ FT SSREDIRLGWNSSVFVEGVNKIMKIQESKNKQFNTNWNDIIDEIIDKSDLLDLTDNDGS FT FELFCSLLGESYRRDENKVYNPKCNELYYKLDPFLKEWKNPDIQALFLLKICKTSYEIN FT NRSYNLHQKDYSNDILEDTLIENAVNFDSVYTNMSNYSIILDASSRIHNSIVSMFESDE FT SILELIKDGLLGTPCDDIFKSKLPKIGIFHTKGSCGLLICHDLLRRLLELLQHLIFYEI FT PRYSPIGLLSSLGAIYDDFPQAVSVSSDLANKISESFDILVPNNKVTKVSLIKMAIYSA FT QEFILFPISKIVATSHEKEYDIGPRFGVIMQIIFASSEKHRNDNSQSQFMRDIISQIAF FT CYRISSVFLISKLNSLSRSYLWEFLDSEWKTLPQFYKMNVSDCLSQILSRDKLDLLKDS FT EYSHESIDYYHWWFLCIDIVFSGISISQNSELGLSSSISDEISSRAVNVLFTLFKEELE FT GKNIETPLFIIAIMDYIRARLVYMTYASAELQTILVNRRTSLPSFAVDKAASFKVFAAR FT VVRRNLSLGSSQLYQPKECWISSLETPSFKFSCWRISSSSILAIQEAILKFGVSIKGYE FT STPISELHDALLEFEKIKDTNNLQDKNNSLGSIIKDVSSEIMLNSSVNEKDEKTLNNIS FT QTLFTNNDSEHKNSTKINGKSELIEAENEGNGFEINHVNDFLTKCYSGEINTSELTVEL FT KKMHSLSNHPGKNVKIFNTFLQTLFDECRSYPKYPNQELKITAEILGILVKEDLLISFG FT NALVFVLRCIIEALRKGHWTKMFCFGVFAMEMFIDRFISFPQFLSAIINMSQHLKHAIE FT PYVTYCESCIAILPENLKNKLYIERSVMESLNLDIPPKPESLISKVHPEMINFEFRQKE FT TSESPCKPNMSIDKLGSFITLNLSERKVLPTGITIDQLQGFGLGSLEKLMNDPEILNAL FT VTPSENLDKLMPKNSDLLLEEKTESDIPLTHKGNEESKINIIEITTLASYDCIKALLRY FT ASILNEVSSFLNVLRHLGYWLGQITIGINRPIIHKYLNPRQLLIDSYSRGCIASVLPFI FT CKILENVKGGYYYPPNPWTNNILYALAEIHSLANNSNSHMFEVELLFKQLELNLDDYVG FT KSNYLGLSSHTDYTEHKALGEKQRGHNIYPKTQTEHNSHITLGSSFERPNITNNVINSS FT LNQSAGLYQLSANIGDAQLASTFMPPNHSSQMMHQQTPQQIPSSDIQFWANKVLISPSI FT VLFQIQPSLRPLVPLALDRSIREILQVVIPRSVRIAAITTKEIIGKEFAFEADENIYKR FT AAHLMVAALSGSMAIAACREPLRVAFTAQLRQVLHPTPSRDGEDHVLIEQVVQVICSDN FT IDLGCQIIEQAVVEKAIEELDEVISPGIIARRKSRETGHQFVDTDFYGGPNTQNSATFW FT SSLPENLKYRHNSMRHLQLYKDFLQFTLMRNLERRDSVTQYELQNSLQSNQITSLYQHG FT SNDQFNSQTQQWNNSNAIQFSHQAESIQNFNTVRSDNTSSQMPSSNHSQMNTSPSTIVQ FT PPEPVRVPLVFELAYLPLMMRVDECLGQIKDVIREIALYPPIFSKQLIPPVSNNLSEGM FT SVNQNIYSKPLGSNIFTYTPKSTAHPVLSVLSSLQSDHILFYLCRVLYSIGKSASQRED FT VLIGISQKLFKTLFDAGAAFQQSTTGILPSSRCIASSLGFDAALLHIEVFLALCNQISY FT YSSKFWLKLRKEAIGWFIYTIEDPKYSVDIVIGALRYDLISSDELDVSLSNILETAIST FT LNDSNPAIGGNSRCLRIVEFIYKLFFRSIEDWHYPITKKLPSATKNLNRLSNNSVAFQN FT SNFSAIPIVLPGLYYKPYSYTTNLGELKNKVESILLELESNKSIKFHEMWIGDQCPEIL FT NFYSVIQCNLDTILNPRYIALPTPIKPPPDISKGINTIFDEWILLLRITIFNGVGGSER FT NNPYRNLFLQRLSRQGLLRMDDTTEKLFTACIERAIYLSLNHNSSDSDALNNSISENAN FT DSRNNMDPFPIDSLVRLITTMARYVDPQQMAAVVITHKFLSILTRVIHKDAESHGFNQR FT PYYRIFYSLLQEYESIGFNTEMIHFTCILSVVHHLQYLNPNRVPGFAYSWIQIISSNRF FT FPYLLRHVKGWQPYQALLLQIFIFISPFLRSVQLSSNIKTIYGALLRILLVLLHDFPEF FT LCDYSCSFCDVLPVNCIQIRNLILSAFPRNMKLPDPFLPTLKIGNLPEMKLIPRMIANY FT GAYILYKDLKVNIDKFWITRDASILPLITETIKMPRDEALKCGTKYSFPIITGLLLYIG FT IYLPNGNESNSSIDGSHNGIFNIFNSDPSTNSIESASKLDQTPNIKSDQLETFEDPSLS FT IILFLCKDLDMEGRFVLISAMTNFLGYPNSYTYYFSSLILWLFSKSNDSIVQEQITRIL FT LERLIVHRPHPWGLLITFIELIKNPKYAFWSCSFVHLAPEVEKLFQSVAQTCLGQAPNK FT TNLVNHT" FT repeat_region 45057..45064 FT /note="(at)4" FT repeat_region 46540..46547 FT /note="(a)8" FT CDS complement(52083..55373) FT /locus_tag="1MB.203" FT /product="adapter-related protein complex 2 alpha 1 FT subunit, possible" FT /note="1MB.203, predicted protein, len = 1097 aa, possibly FT adapter-related protein complex 2 alpha 1 subunit; FT predicted pI = 7.5425; contains Pfam match to entry PF01602 FT Adaptin_N, Adaptin N terminal region; contains no predicted FT TM helices; contains predicted helix-turn-helix motif; FT reasonable similarity to A2A1_HUMAN, adapter-related FT protein complex 2 alpha 1 subunit (977 aa, Homo sapiens, FT EMBL: AC006942, AAD15564); Fasta scores: E():2.4e-06, FT 30.435% identity (32.984% ungapped) in 207 aa overlap, (aa FT 5-205 of 1MB.203, aa 27-223 of A2A1_HUMAN)" FT /db_xref="GOA:Q7YYI6" FT /db_xref="InterPro:IPR002553" FT /db_xref="InterPro:IPR011989" FT /db_xref="InterPro:IPR016024" FT /db_xref="UniProtKB/TrEMBL:Q7YYI6" FT /protein_id="CAD98496.1" FT /translation="MELVEYEISNLKSILEKTDDSSKKGPQLSLKDKEKIIWRLAYISV FT MGYEIDFGWLEILELVSSNIFEFKQCGYIAASLIYRGNLELLRLLINTIKNDLNNCFEV FT LVDKEKGSDKSRNFRQFSVGIGKNHSGKNIDIRKQKIKNCLLALNFIGNSPTLDFADNL FT FIDIKKLAEIPVEKQLNDTNIIRSKAICCLTKLFQCCPDRLRANEWGERLLSYFQFERN FT ADCLISQCVFIKNSLKYYLNTEYLEPTENAYKNLNEQETDHSVLRDEREIINSNLNIKL FT VKIWEFIVPNLIFTLSRIRLYKEIKNWRFHKIPMFWLQVKLIEILELFPEVINDYFVNY FT KMNKIMEDVFFNAIHAIKSVNECNLSFSSTFDEIEFLCIIGIAIEMTKYFNKICNQELS FT ANIGHLVGKFIEGLLYADNKDYISISISLIFESKTNKAIQELVKKNLVTLLRFTCSFDE FT DTTLNLLNIFSYICNKNNWVFITKEILNGVLYKYILNPYKLNIEEIHNIQKGSSDCFFP FT EKTKTLLEEIILSVCYTIRRFSNKNEVSYKTINVLFQTLEKSFCNPNPGFEITENTLFQ FT ITDILFDSKLKIECTDLMEKVNESLGNDISTQKYVAIKSYKLLNKLKNRRDNFNPRSGI FT RWLCFILGEYGKLISSKVSIIKQVEILLIIHDILTIDSEDDHIPLLKSTILLSFTKLYC FT NSDHETQNRIYEILKIGVGNRGNCIGSPEFLNAIVANTQYSVTPINGNITPVNIPSENN FT SALRYRDQLLDGVFNHKKNSLLIPNQSNKYGDDGYCSRNTWLSLCLSNKGSLYNFSLLS FT IGFSNGSFKYSSGESTIIVKLGKNEKNRIFNIERISTFCEEKKLSVKSSNNSFPNIFDS FT TSQNLHVEVFSETNKGFDNEGKFEQRINLICNGPYLNPPMISLLIRAFSNTKDEVSEIN FT EQMEEINTDSNYSKLICINFRLPVILTNFMAPTKKMGKKEFGNFWEKLTQSSVKGILGI FT SSLEIPIFLQLLNFCVYSVMDSIDDPSVSTPMIYGGTSTLYLSNRKRIPCMFKIIPCNE FT HSESFREDKIHDNGKNEEPVEIRVRSSSLIVAKILKQIISTYILSNSN" FT repeat_region 52100..52107 FT /note="(at)4" FT repeat_region 53059..53066 FT /note="(t)8" FT misc_feature complement(53175..55367) FT /note="Pfam match to entry PF01602 Adaptin_N, Adaptin N FT terminal region, score -81.3, E-value 9.7e-05" FT repeat_region 54040..54048 FT /note="(t)9" FT repeat_region 54094..54111 FT /note="(gaaata)3" FT repeat_region 55404..55415 FT /note="(ttat)3" FT repeat_region 55473..55487 FT /note="(t)15" FT repeat_region 55966..55973 FT /note="(ta)4" FT CDS complement(join(55978..56888,56937..60744)) FT /locus_tag="1MB.205" FT /product="exonuclease ii, possible" FT /note="1MB.205, predicted protein, len = 1573 aa, possibly FT exonuclease ii; predicted pI = 7.9289; contains Pfam match FT to entry PF03159 DUF251, Putative 5'-3' exonuclease domain FT contains no predicted TM helices; reasonable similarity to FT EXO2_SCHPO, exonuclease ii (1328 aa, Schizosaccharomyces FT pombe, EMBL: Z98849, CAB11514); Fasta scores: E():3.2e-58, FT 32.101% identity (42.047% ungapped) in 947 aa overlap, (aa FT 1-873 of 1MB.205, aa 1-797 of EXO2_SCHPO)" FT /db_xref="GOA:Q7YYI5" FT /db_xref="InterPro:IPR004859" FT /db_xref="UniProtKB/TrEMBL:Q7YYI5" FT /protein_id="CAD98497.1" FT /translation="MGISRFYRWISERYPQINEEISDGIIPPFDNLYLDVNGIAHNSVN FT SSEMRTPSNGINLSEKGSPEIWAAIFRYINKLVYIAKPRKLLYIAVDGVAPRAKMNQQR FT SRRFRSARDSEFLKKMEKNNSIDTTQNNVFDSNCITPGTTFMHELRRQLEFFIYHQIHT FT DPLWKHLEVVLSGADVPGEGEHKIMDFIRCIKSQRDYNNNTTHCLYGLDADLIMLSLAS FT HEPHFSLLREEIKFSSSRNYESRTVCTKERFQFLHISILRDYIINDLNPRNLSVEELNL FT CFGYKFGLKNDSQDQISIFDNVLIDGERLIDDFIILGFIVGNDFLPHIPFHTVDQGLAR FT IISSYRKYLAHFFLMKLSSSPWILEDCGRINYVNLLRFLIWHIISEKEEAEKRISNPDY FT WKKVIPDQNKISSDSIGVPRFMKSDSIEQDKFRAKWQETRPENFEVMRWRYYFVKMAIN FT IDKFFPENSNVQNQANIKSNTNSNRITTNSIEDIVFCYLEGLQWVSYYYFRGVPSWRWY FT YPYRYAPFACDIAIILSCWLQNKSISTLSRDDIVKYGSSKLMFVHTMYSEEIELNGLAF FT RFIKGNPLQPFEQLMGVLPSNSKDLLPKPFRKLFTNPNSPLLSFYPANFEVDMEGVKVP FT WGGVTLIPFIDEFLLLDSITYVLSKCGFYDSDFRNFEIKIKTNFEYELKNRYKFYLSTS FT ELENLIDANTNTHSSIINSSEFLDTEDKARNQEGRPFIFYLDNYAFPTPKIESTVKLYL FT PSITNSKVVSKSFYHPAFPDGVSNFPNYLLDNTILPFDWFPNLCRIPYRIYYSSGISVF FT GTESSNESIFSSISSSHISKEFLNNLFLPSINTRNPGIRILVDYPRMKVAELVSIKTLR FT YILRPKLLTKPIIEPNNNPKEFIRLLFNENSLLKKRGIVLSPSNSMNIDEELLSISKLV FT NCKEVLDLKDFDAELKDISDPVNCLVEARVAESSYITESGKVEFNFSKISKHYLLPLIL FT LYSDLKESEAERFKQISKINLGEKLLCINKQNPNYGYIGTIVDEAQRLVLFRIPKNKNK FT VEQLQKRIFEYALEDLSSIQWFSIEDVAKIIYKPLLNCLNTKCGLDATKSLEYMRFSTL FT FLKKLIIGPIQVKCNDSTLQEISMNLFKFENKLKNKLYLPLYSKFVDENSFITSSNTNV FT ELGTPIVSNEAIKSILEYLDLFPCFVLAIFKSLVLLDQGELDEHIINSEGKNICSNTFN FT IDNNNEKVENVFNLKKIHVKDIFSDLIDSKDQDFKFNVMIKIIKATMLPSTISRLERLI FT LNSQIETNDKYIDLSKLCRNEHFYQSENQFNSGINKLISRFGYPLGTRVVYINTNDGLL FT PFGTVGTVVSVCCGVKVNKEYYPSTIIDILLDETQINANNLNGMCSKLRGISVKSSECI FT PLLPCIQNLKDENHLEIISKQILNFDLQFNNKAEKSKKKCDNYSYYWLKNQNSSLKQMG FT KKEISITDKQNSSEVNNNNNNNCNDSDLRCRKTKNKTEIRHINSNEHPLAVELKSLLSI FT NINKKKNGLDYLNSRNIENKIKKNDEISNVPECEKKIEDSMEFKLKQILKIT" FT repeat_region 56028..56036 FT /note="(t)9" FT repeat_region 56248..56268 FT /note="(att)7" FT repeat_region 56304..56315 FT /note="(tttc)3" FT repeat_region 56374..56382 FT /note="(t)9" FT repeat_region 59684..59692 FT /note="(a)9" FT misc_feature complement(60043..60744) FT /note="Pfam match to entry PF03159 DUF251, Putative 5'-3' FT exonuclease domain, score 416.2, E-value 2e-122" FT repeat_region 60822..60829 FT /note="(t)8" FT tRNA 61057..61130 FT /note="tRNA Leu anticodon GAG, Cove score 52.07" FT CDS complement(61269..66140) FT /locus_tag="1MB.208" FT /product="oocyst EB module wall protein" FT /note="1MB.208, predicted protein, len = 1624 aa, oocyst FT wall protein; predicted pI = 6.5447; contains six Pfam FT matches to entry PF01683 EB, EB module; contains no FT predicted TM helices; signal peptide predicted; high FT similarity to Q9GQD0, oocyst wall protein (328 aa, FT Cryptosporidium parvum, EMBL: AF266273, AAG39054); Fasta FT scores: E():1.5e-119, 100.000% identity (100.000% ungapped) FT in 328 aa overlap, (aa 29-356 of 1MB.208, aa 1-328 of FT Q9GQD0)" FT /db_xref="HSSP:1EMN" FT /db_xref="InterPro:IPR006150" FT /db_xref="InterPro:IPR009030" FT /db_xref="UniProtKB/TrEMBL:Q7YYI4" FT /protein_id="CAD98498.1" FT /translation="MKRILLLSFIIGAFGAPQVKPNIPGVASPLPHTKGQVPTYVETPL FT ESCPPGYLMENGVCVQRIQVPPMPFCQEPAIYHEGHCLIVTAPLKQCPPGYEISGKQCT FT ATKTASQQPSCPPGTTLHGTECISKHMIDTVCPPGFVDNGRDCVAFTMPEKSCPPGFVF FT SGKQCVQSDTAPPNPECPPGTILENGTCKLIQQIDTVCPSGFVEEGNRCVQYLPANKIC FT PPGFNLSGQQCMAPESAELESTCPPNSIFENGKCKVIKNIDMVCPPGYTDSGDDCVLYV FT APAKECPPNFILQGLQCIQTSSAPTQPVCPPGTVLQDNACISVQAIDAICPPEFLDNGK FT DCVKYSPVTKECPPGFTLSGNRCVQMVNAPMEFECPPGTILKDDQCQSIERVDTICPPG FT FVDNGEDCVQFSAPEKICPQGFSLSGKQCVKTESAPRLTECPPGTTLENNSCISYELED FT AICPPGYLDNGSDCVQFSQPEKECPTGFVLIGKQCTQTTQAPPQPECPPGTNLVNGQCQ FT KVERINMVCPTGFIDNGTNCASFSAPNRECPPGYTLSGSQCEQIKEAPPVSECPPGYKL FT QGNQCTALKMIDAICPDGFLPNGDDCIQFSPASTVCPTGFTLQNQQCVQTTTSPKTPEC FT PPGSALDGDSCTRLVPGALQYVCPVGTREGDVCVERSISSPVLECPPGYSLETGKQCVR FT RSQYDCSVTTYVTECKTPDVKALRRLAAAKETSTVYETSEIQNPGHHHGHSHGHSHSQV FT IPIQTQNIHTQHHKEAPRPICEDVPKITPKTCTKADSVPAVPICENNAELVGKECVLTN FT YYPLEAICQDGTRSKECAKFVKTPPTLKCPPGSVDVGSQCQVNKYSPYDLACPAGYALV FT GDKCATTREKVCPNESCQRVVTAPVSLTCPPGYHQIDEVMNISAHPHHRHLAGVQSTSQ FT KGYSHGHKYTPVISQPPQPVPVVAPIQQMKCIHADHAPYNLICPVGSRLVADKCVTYSD FT KICPNGNCERIYNEPAELVCPPGFSSSKPIQPISHSHINHPNVSVPVQPQTINQPQVIQ FT QRQVNYQPQVIHQTQEILTTYPTPVYQTGTIYQGHHHHHHHHHRNLASPECIKTISVPY FT ILKCESPFILDGDKCIEKTEKICLQGDCRKQVVVPPTLSCPQGYRNANGIQTAISSKHT FT AGTHHYSTPSAECVRTIFEEYSLVCSSGFVLLGDRCALFTNKICPNGNCERLISKPANM FT VCPPGFTRPQSNHHSDHAGHGHGHGNQLLQECTKQIYTPYDLSCPDNYSIIGDKCAIHT FT VKVCPDGNCEQLIRSPPTMECPPGYYRPQAGVAIRSHGHKASGASQCMRNVYEPYALQC FT PDGFRLLGDMCQQSTAKVCPNNNCERISYIPPILSCPQNFERSGQRCIANEYADYELAC FT PPGLIVISDKCAKYADKVCPNGDCERIRTFPPELVCPPGYTMEAGVAQGTRRSLGTASN FT HPHHSSGHHHALGHHHHHHAVTQEVSIVRTTVCSREVFAPYSLSCTADSQLLGDRCARF FT APKVCPSGGCEKIESTPVVSSCPSGYVTDQDGTCYIVEYAPFSLTCNDPYTLFDNKECV FT LVTKPDSRCPDNTEKTSTGCVKKVITTPIVSYETTCIGPTCNAA" FT repeat_region 61701..61718 FT /note="(atg)6" FT repeat_region 62391..62408 FT /note="(tccatg)3" FT repeat_region 62869..62892 FT /note="(tga)8" FT repeat_region 63137..63144 FT /note="(ta)4" FT misc_feature complement(64272..64502) FT /note="Pfam match to entry PF01683 EB, EB module, score FT 6.2, E-value 0.012" FT misc_feature complement(64530..64697) FT /note="Pfam match to entry PF01683 EB, EB module, score FT 8.2, E-value 0.0082" FT misc_feature complement(65052..65282) FT /note="Pfam match to entry PF01683 EB, EB module, score FT 6.5, E-value 0.012" FT misc_feature complement(65310..65477) FT /note="Pfam match to entry PF01683 EB, EB module, score FT 3.9, E-value 0.02" FT misc_feature complement(65505..65735) FT /note="Pfam match to entry PF01683 EB, EB module, score FT 3.5, E-value 0.022" FT misc_feature complement(65763..65930) FT /note="Pfam match to entry PF01683 EB, EB module, score FT 0.6, E-value 0.04" FT repeat_region 66042..66049 FT /note="(tg)4" FT misc_feature complement(66096..66140) FT /note="Signal peptide predicted for 1MB.208 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.997, signal anchor FT probability 0.000) with cleavage site probability 0.680 FT between residues 15 and 16" FT CDS join(66506..67137,67315..67745,67791..69421,69521..71036, FT 71071..71099) FT /locus_tag="1MB.209" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.209, predicted protein, len = 1413 aa, unknown; FT predicted pI = 8.8044; contains Pfam match to entry PF00560 FT LRR, Leucine Rich Repeat; contains no predicted TM helices; FT some similarity to O51588, hypothetical protein bb0643 (279 FT aa, Borrelia burgdorferi, EMBL: AE001166, AAC67000); Fasta FT scores: E():0.46, 22.008% identity (24.783% ungapped) in FT 259 aa overlap, (aa 734-980 of 1MB.209, aa 16-257 of FT O51588)" FT /db_xref="UniProtKB/TrEMBL:Q7YYI3" FT /protein_id="CAD98499.1" FT /translation="MNTDILESRNNRRPRYVSISLNNDELTHFCPFLSDNDEKVLFTLT FT CNYLSLGQFELARSSILQLSSLNFRKVSELLYSIIYYGPPPDWQLSVTISTSAHFILAC FT IREYESFFRNNTKIENYIIKRTEFDLLIGQMIADISDVKISFEIIKKLRNFYSVFLING FT LDVKVPELKILPKIVGLPSYLSRLPSFSTNLNKLSENLLIDLANNQGNLNWIGLKNCII FT DLIFFKENCGIISLNSLGDMISKNISIIKLEDIFENHDLELTNFTYEKDVSSKNNFTKN FT KKPEDVISIVFMIFCIINLFCKYKLNKEIFLKFLADQIEILKSCEDICNTCIFDSRIIS FT NVKSLFSENLLKQLIKFFENDTYSDHIGTLVNSNSDQNLLFKIVLEFDNILFNINFGIS FT NSLGIFSLPRYIENKNKVTLNPILTYDFSINCKFDEKPFFWCEYLRYLNLSRNKFEELP FT IIFITNLFDKIYKTKRNINSFESANKIILCFPHLRAICVHLGMKNDPIFNWELLKNIWL FT PFRLISKNEESLVESKSLELDLDEISRRHAISMFISEELNSNFCSEFRYSSNKNIRKIF FT EEITLKKSIINYYFDNFNSSKKYTTINWVSLEAFISNIISLPLTIGSDISLLTKERDCI FT KSYLIQKEVFETIDGKGSSSFGLNLQNTSIEIGNIQTEYFHNELMYCFFCRIFIYLKKI FT KINNSIIHTKNKKLFNITQLLYYLFIWSDRLSRNFQIEMLPVIEKTFEKTILMIELILF FT SFPFNIPNLPLPKNLEIFSWWYRFILSTPSRVFNTGKTIIRSSKSSEYFQNLFKVRWVI FT RGIYENNPWYSNLFSFESFCVLNYYNNSTRNNYSEILLFPSILILKLLSIQNYEKSIEL FT ILSSKFHVNITLIVLDGFIFHSLFEEEISLFSAMEVFSIYIQTKFIDNENEIGWIFYAF FT PKLYQAFVLIDLYICFGNKNSSNLFEDIAKARDMIEQYTSNLNIKYKSVVQTTLDRLFI FT FYQSSSTISLTRYIMEIDTLPNQSSQIKTHLSRIHDRKLITTTLTQSLNYLSGFSLIKP FT QDESLININQILEQSLFSNFNNSSYLFSVIHYIKSIISELFLNLDKHKAFPILALTPNE FT IISYSFFERKLKGGSKRLSRIMNADLISVIIKALNCFHCSTSMFRNSEISSHSDMEFLI FT KQLDPFKGTFAMLSHNILQSKDVQYSYFIWKFAVKNRINFSLEKKQIILISLEYRIWKY FT DIHKQLESKIFNANIDKLSGFFTSKSLEYYNYLLKWLSIIITFFSLNCNDLHLIMNKFD FT KTIGLINYLFPIYSISRTNTVDKILFQLSREMLRNQNIINLHLNNFKRIIKLNNPYYLY FT RLITRAQGISDPYFILSIIEVTSKSIKNSYSGIKNSKSNNFKSKFIEENIYVILNVQRL FT FSIYGMSGSF" FT misc_feature 66811..66818 FT /note="tgcatgca" FT misc_feature 67832..67903 FT /note="Pfam match to entry PF00560 LRR, Leucine Rich FT Repeat, score 16.1, E-value 0.055" FT repeat_region 68038..68045 FT /note="(t)8" FT repeat_region 68773..68780 FT /note="(t)8" FT repeat_region 68789..68800 FT /note="(ttta)3" FT repeat_region 69553..69560 FT /note="(ta)4" FT repeat_region 70452..70459 FT /note="(a)8" FT CDS join(71135..72812,73242..74488) FT /locus_tag="1MB.210" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.210, predicted protein, len = 975 aa, unknown; FT predicted pI = 9.8666; contains no predicted TM helices; FT some similarity to Q9EMG8, amv238 (448 aa, Amsacta moorei FT entomopoxvirus, EMBL: AF250284, AAG02944); Fasta scores: FT E():0.17, 22.362% identity (25.000% ungapped) in 398 aa FT overlap, (aa 1-378 of 1MB.210, aa 66-441 of Q9EMG8)" FT /db_xref="UniProtKB/TrEMBL:Q7YYI2" FT /protein_id="CAD98500.1" FT /translation="MLKYSIEMKMFSIAKIISQQLLFKVIKHCGDVQYTEKELKHFFSG FT DNFFEEIGKDNNFNNFSLVSIVNYIITNIRDEQIFQKYNFIKLSKAVVSKWIGSFESII FT GFLNSNIGIDEKYIILDAYINSQNSSLDANRINILKTILTSIKLVKDLRMSNIQASNLF FT SPYLIIRMAIMSRNSRLIEILKRDIDIFLKSGDVLLNLIEESFGIIDDKQILKEKIKKS FT YIFGQADINSCIKLFPMIENNIKLCLNIEDIPSYNNDIYFSIKLISIIPKKKTMSDLLF FT SISNKISNQLYRIFRNSISDNSLIRTRFECSNSQLWVAEEGSLKIIPNSLHPFSKFSKA FT KRNSYYIKNRCLNKYKSGLNINLEKSYNNRNEINIENISKSRILFRNLFVLLTWASDYI FT GKSEFSKSIKSLTHLFEIWQRNPALSFSLKDFVYISDYVILVLVFSDQIKILKDLSENN FT ILETNMKPLDKNHEPKNLIICNIVANSITISRMNQYYNGKSDHSNPTTLNASKSRNEMI FT MDIFCNKWKINRGKVTNIEILKGNTVRIIENLLLLKPDILNLMNNKKFFQLDFINHLLI FT QSTPNHYFLINTFVKLNLWWELMVYILRWSFELGPSKSRSLLIDKRREIRANSEKTFEE FT IFFDLVIKKAHSKGQLNSLINAMNEAVPLGGPRAAIVVKKCKNAIQKYLECSGALEMLY FT RSFLSLYISIEIPHALMGAIAIHLSLLDHLNIDSRTGYLESALYHFKNANLVLKKKVKS FT GVIKKNKHASPTNCIDCERTAEVPFLSQNLNIGKLNDVLIPFIADIPIYKINISPWGGL FT SLPIIQRIIRLVELQNSIIKLLIKENSSMSILSPNYKDRRNVIVVLFLIREYPLAFKTS FT NMLEIPLMEILVHATKELIVSYPNSNSLYCFLDSMKLWLSEYDSDALISNAINMWISEK FT KINLNNIPELEKNSIMELVNRLSNPLSRSEAFKMINPLERVKLS" FT repeat_region 71257..71264 FT /note="(t)8" FT repeat_region 71910..71917 FT /note="(at)4" FT repeat_region 71948..71957 FT /note="(a)10" FT repeat_region 73109..73116 FT /note="(t)8" FT repeat_region 73139..73146 FT /note="(a)8" FT repeat_region 73233..73240 FT /note="(a)8" FT repeat_region 73805..73813 FT /note="(a)9" FT repeat_region 74349..74357 FT /note="(a)9" FT CDS complement(74502..75146) FT /locus_tag="1MB.211" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.211, predicted protein, len = 215 aa, unknown; FT predicted pI = 9.5342; contains a predicted TM helix FT region; some similarity to Q9PQE5, hypothetical protein FT uu346 (364 aa, Ureaplasma parvum, EMBL: AE002131, FT AAF30755); Fasta scores: E():1.5, 30.058% identity (33.766% FT ungapped) in 173 aa overlap, (aa 29-187 of 1MB.211, aa FT 7-174 of Q9PQE5)" FT /db_xref="UniProtKB/TrEMBL:Q7YYI1" FT /protein_id="CAD98501.1" FT /translation="MSTLFLPETPKEKERDDLKLINTFSKGTNEKNLSLFEETDLLIKI FT QNSKSENYEEHLDYLGSDDRLLVNDCLNRTPVKKRYNKEYSFTQSNKLLTTPKKYFGED FT ILNTVPKNMVMFTNYVDKSLSNKRCNTKLNNTWQKSKFRRSLIRMKSACNKQCKFCNPE FT FSCTNISIKSVKKNLKEWFIYLFNALYDFIIYQLPLALIFYSFAIYLLSCL" FT misc_feature complement(74514..74570) FT /note="1 probable transmembrane helix predicted for 1MB.211 FT by TMHMM2.0 at aa 192-211" FT repeat_region 74617..74625 FT /note="(t)9" FT CDS complement(join(75433..75700,75745..76508)) FT /locus_tag="1MB.212" FT /product="conserved hypothetical protein" FT /note="1MB.212, predicted protein, len = 344 aa, possibly FT hypothetical protein pa5071; predicted pI = 8.4502; FT contains no predicted TM helices; reasonable similarity to FT Q9HUB2, hypothetical protein pa5071 (235 aa, Pseudomonas FT aeruginosa, EMBL: AE004920, AAG08456); Fasta scores: FT E():1.5e-12, 34.225% identity (36.364% ungapped) in 187 aa FT overlap, (aa 154-335 of 1MB.212, aa 54-234 of Q9HUB2)" FT /db_xref="GOA:Q7YYI0" FT /db_xref="InterPro:IPR006700" FT /db_xref="UniProtKB/TrEMBL:Q7YYI0" FT /protein_id="CAD98502.1" FT /translation="MIFQYETQSFVSLVLLCDRIFLMHHARRSSLNADHTAIMVIFPSI FT LMNLIIIDKKDINEDLTVELSSRQSNHCISVLEINIGSKVNVGVKNSGKGTATVTKIMR FT KETNKDGITKSESIINLSSIEEIGNKVNNRSRRSISEINNLNNFQYAVTIKLDTEIHKE FT EYKHDYPLIDLLIALPRPKVFEKVLQNAVTIGVGRIIFVCTDKSEKSYLNSSKLKKESI FT DEIVQLGLEQASKTLCPDVYVYASWTFFLQHLRNYFFCKESMIGIVADVLGNSKITEIG FT LQSHTGPIILAIGPEGGWTKEELTDLITMGFKVVNMGDRILKVETALISLYSKVSFGRT FT QRN" FT repeat_region 75687..75694 FT /note="(a)8" FT CDS join(76526..76596,76641..76744,76892..77175,77216..77396, FT 77439..77603,77984..78312,78361..78652,78698..80544, FT 80594..80836) FT /locus_tag="1MB.214" FT /product="conserved hypothetical multi-pass transmembrane FT protein, possible ion transporter" FT /note="1MB.214, predicted protein, len = 1172 aa, possibly FT hypothetical protein all4633; predicted pI = 8.8337; FT contains 4 predicted TM helix regions; signal anchor FT predicted; reasonable similarity to Q8YND3, hypothetical FT protein all4633 (263 aa, Anabaena sp, EMBL: AP003597, FT BAB76332); Fasta scores: E():6.1e-05, 35.644% identity FT (37.500% ungapped) in 101 aa overlap, (aa 246-341 of FT 1MB.214, aa 114-214 of Q8YND3)" FT /db_xref="GOA:Q7YYH9" FT /db_xref="InterPro:IPR013099" FT /db_xref="UniProtKB/TrEMBL:Q7YYH9" FT /protein_id="CAD98503.1" FT /translation="MPLYFQDDLFSITKDQKKYINIKSLIRTTFTFVYSIFIFTTAVLV FT LNKYYSFPGIVAIENSINHLQEFFKKKSADRKFIMFFRKLLMVFRYLKNSNLKKNKNEL FT EECIEIDCKSGSSESFNETGLYDYAFDHSIISKAASDDWVAAIPRIHKQVMSNEFYSNK FT LFTLKGVELIHWGIHTGYPSFSQLLKNTSEILSSFIEMCFRQTQTPAYQTGHIVKSMLW FT SGVYIWATTKMYRKNNNDPWNFKRIPEIYYFFESTFLWMLSYDYVIGIFAGAITIILTH FT AGAIFTIESPSEKEYNSLFDYLYYSVVTIGTVGYGDFSPRKREGRLATIILITLTLVLL FT PHEFQRLKEALNTPPDSIGSFIRKNDTYLCIIGPIPPKLLFITKSLSLQKQRRFKSIVL FT VTPIQILEYQNIVKISQQRGYIRLSIKQGFLGSAINNLVQYSSIVLIYGSEKPILHDVI FT CNEGTQKSDFDALITVMCLTNLLGLKDKFFLIFYSSQVASLSKITGVLGSLSLENLRIK FT LLSKCISNCPGFLPIMLHLIIPKNENMLKKLQEPPKNTFFTSNIDRSKITPYEWKCRWR FT GLHFKVYTLRFPNSFFGYPIQIFTKYIYKHLGIFLIGVYCPDTNRTYLNPHQYIIGDNS FT NIYYEGSNYLGIVLSSSISLVKKAELLESPKNYTEEFIKKVGRRSCSPSNVFLNRSKKF FT KSKEYIENKDQLNIYDCTIISEIHKHCDSNDTDKNIIKNLININETGKNIQITQHNREF FT TNLINYSSSYDDQCLFDQVKNMVLPNHFEETASNMTNINSKFDICKINSNTVSFSQSNL FT NFLPETNTGIPVVSNYLEAISKVFTNREYPIVLIIGWCEQIDMLVKLTFSLKPSNFIVL FT CEKIIDSSFINEEFFGRIAQISGVGTSEHDLKQAGVLVASRIIIFDSTGNPSNIYKEEL FT INVRYGKHAIGTWIVVCYLFSKYKKKDNCLVGSQKMPPLIIDIKETDIGLLLFQYSDDW FT PTRIDGSTYSIPYKNELDFFYSRQFLSGQFFVDNIIDSLIPFLVPILDNNPINKSFINQ FT IIYGNPTSKPFGNVRYYNEKNCSLQMEEIPFVFVNSTFYNLFSQLLKHGRLAIGIYRPI FT RVDDHQFNKKQDIKTQKFSPSLGDLESVINENSVKECFNCETHLIISCPPPKFKINLGD FT CVYFL" FT misc_feature join(76526..76596,76641..76710) FT /note="Signal anchor predicted for 1MB.214 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.000, signal anchor FT probability 0.758) with cleavage site probability 0.000 FT between residues 47 and 48" FT repeat_region 76572..76579 FT /note="(a)8" FT misc_feature join(76595..76663,77321..77389,77417..77485,77504..77557) FT /note="4 probable transmembrane helices predicted for FT 1MB.214 by TMHMM2.0 at aa 24-46, 266-288, 298-320 and FT 327-344" FT repeat_region 76617..76625 FT /note="(t)9" FT repeat_region 76807..76816 FT /note="(t)10" FT repeat_region 76819..76826 FT /note="(ta)4" FT repeat_region 76924..76931 FT /note="(a)8" FT repeat_region 77008..77015 FT /note="(a)8" FT repeat_region 80042..80049 FT /note="(at)4" FT repeat_region 80131..80139 FT /note="(a)9" FT repeat_region 80292..80299 FT /note="(t)8" FT repeat_region 80887..80897 FT /note="(t)11" FT CDS join(80971..81727,81775..81823,81869..82103) FT /locus_tag="1MB.220" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.220, predicted protein, len = 347 aa, unknown; FT predicted pI = 6.7714; contains a predicted TM helix FT region; signal peptide predicted; some similarity to FT O96188, predicted multiple transmembrane domain protein FT (269 aa, Plasmodium falciparum, EMBL: AE001397, AAC71884); FT Fasta scores: E():0.022, 25.833% identity (26.957% FT ungapped) in 120 aa overlap, (aa 119-234 of 1MB.220, aa FT 2-120 of O96188)" FT /db_xref="UniProtKB/TrEMBL:Q7YYH8" FT /protein_id="CAD98504.1" FT /translation="MWGFIEQLRTKAGGFIDLSSGIICCAAFGSLLLIFRKYIPHPSEF FT FPFGWFFLKFGIHKHDNFNLLLDILEGVDIPETGKYSIRIKCGRETDESSTTICLEDVN FT KFTNNLTKSFKCCWKEKRIILVRQRENYLFIELISHGTFTSSVIGECRISIMDIIDAKF FT PKKVTYNIQKDRKVVAKLLLSFYRISANIIVEDTNPVLFQAMINLQTDADISGNKSIYA FT DFDKMSEKEQLIFFSKALQGNLYYLENGDKDKLRMFYFRAVEVTINRWEWCYWATEGDY FT LKGSSKLGGYPFLAMSVVIPDKTDRDQIYIKYHDLYGIHDLFFKAIEIDRDVWSEAIYE FT FIERVS" FT misc_feature 80971..81078 FT /note="Signal anchor predicted for 1MB.220 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.011, signal anchor FT probability 0.904) with cleavage site probability 0.004 FT between residues 36 and 37" FT misc_feature 81007..81075 FT /note="1 probable transmembrane helix predicted for 1MB.220 FT by TMHMM2.0 at aa 13-35" FT repeat_region 81121..81128 FT /note="(t)8" FT repeat_region 81364..81375 FT /note="(tatt)3" FT repeat_region 81761..81771 FT /note="(t)11" FT repeat_region 81834..81841 FT /note="(a)8" FT CDS 82336..82491 FT /locus_tag="1MB.221" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.221, predicted protein, len = 52 aa, unknown; FT predicted pI = 10.7141; contains no predicted TM helices;" FT /db_xref="UniProtKB/TrEMBL:Q7YYH7" FT /protein_id="CAD98505.1" FT /translation="MIESKVKNSRTVKIHTNGGSEKHSKNINSPNSIKRMKMEVNPSRY FT ENEPLI" FT repeat_region 82915..82924 FT /note="(t)10" FT CDS complement(83071..83874) FT /locus_tag="1MB.222" FT /product="U1 snrnp, possible" FT /note="1MB.222, predicted protein, len = 268 aa, possibly FT u1snrnp; predicted pI = 9.7386; contains Pfam match to FT entry PF00076 rrm, RNA recognition motif. (a.k.a. RRM, RBD, FT or RNP domain); contains no predicted TM helices; FT reasonable similarity to AAM13340, u1snrnp-specific protein FT (250 aa, Arabidopsis thaliana, EMBL: AAM13340, Z49991); FT Fasta scores: E():4.4e-06, 35.955% identity (35.955% FT ungapped) in 89 aa overlap, (aa 20-108 of 1MB.222, aa FT 12-100 of AAM13340)" FT /db_xref="GOA:Q7YSX7" FT /db_xref="HSSP:1CX0" FT /db_xref="InterPro:IPR000504" FT /db_xref="InterPro:IPR012677" FT /db_xref="UniProtKB/TrEMBL:Q7YSX7" FT /protein_id="CAD98506.1" FT /translation="MNNSLMDRKNGIKVLKNSLSDDDENKTIYINNLNDSISIPKQKLS FT LEEIFAKYGKIESIKMFNSYFRKGQAWITFSNAVSAVNAVNNEIGTQIFGKHVNVSFAA FT KESERRNIQVRNSSSNIRMVPKSINARIELYKQYLAQWLKNAENNGLFQTLDDPEKNQI FT KLVNNQVLYDNNYVNLMRSNKNKKLNVHPFINNELTQNSLGRPEMQIDGTTSFLHRLPN FT NASPLEVSNTVFVQMEMGICREEELYTLFSHINGFKELRFIPVSF" FT misc_feature complement(83575..83793) FT /note="Pfam match to entry PF00076 rrm, RNA recognition FT motif. (a.k.a. RRM, RBD, or RNP domain), score 35.1, FT E-value 1.1e-07" FT repeat_region 83804..83815 FT /note="(tca)4" FT CDS 83952..84608 FT /locus_tag="1MB.223" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.223, predicted protein, len = 219 aa, unknown; FT predicted pI = 4.8806; contains no predicted TM helices; FT some similarity to Q9ZB56, nrpg (249 aa, Proteus mirabilis, FT EMBL: U46488, AAD10395); Fasta scores: E():4.8, 27.607% FT identity (29.605% ungapped) in 163 aa overlap, (aa 1-155 of FT 1MB.223, aa 35-194 of Q9ZB56)" FT /db_xref="UniProtKB/TrEMBL:Q7YYH6" FT /protein_id="CAD98507.1" FT /translation="MNYDEIRSLILDTAEEYKHSLGSDYRLVLKEINQLGDISVSDSFQ FT KNKVEFSFSLKKGKEKYKISIYVNRKLLESNESLIGNKHLKKSVEISALAFWEFSDKKK FT ELFEYFTELFEEDGISPFKNSESHDIFYSNCEGLRDELIPSCDIISFLDIFLLSLFRSK FT IPQLIRHLGWDESIPLRNSLIEHIASDECFKDLDESSTDSESEESTVKKRVNRRY" FT repeat_region 84255..84262 FT /note="(a)8" FT CDS complement(84704..85222) FT /locus_tag="1MB.224" FT /product="ribosomal protein L11, probable" FT /note="1MB.224, predicted protein, len = 173 aa, probably FT 60S ribosomal protein L11; predicted pI = 10.4676; contains FT Pfam match to entry PF00673 Ribosomal_L5_C, ribosomal L5P FT family C-terminus; contains Pfam match to entry PF00281 FT Ribosomal_L5, Ribosomal protein L5; contains no predicted FT TM helices; good similarity to RL11_MEDSA, 60S ribosomal FT protein l11 (181 aa, Medicago sativa, EMBL: X78284, FT CAA55090); Fasta scores: E():5.8e-50, 74.850% identity FT (74.850% ungapped) in 167 aa overlap, (aa 6-172 of 1MB.224, FT aa 9-175 of RL11_MEDSA)" FT /db_xref="GOA:Q7YYH5" FT /db_xref="HSSP:1IQ4" FT /db_xref="InterPro:IPR002132" FT /db_xref="UniProtKB/TrEMBL:Q7YYH5" FT /protein_id="CAD98508.1" FT /translation="MTAEVNPMKNIKIEKLVINISVGQSGDRLTRAAKVLEQLTDQKPV FT FGQARFTIRSFSIRRAEKISCYVTVRGDKAEEILEKGLKVKEYELRKRNFSATGNFGFG FT IDEHIDLGIKYDPSTGIYGMDFFVQLTRPGNRVSLRRKCRSKVGKHGRVTKDEAMQWFQ FT SKYDGIILN" FT misc_feature complement(84737..85036) FT /note="Pfam match to entry PF00673 Ribosomal_L5_C, FT ribosomal L5P family C-terminus, score 190.5, E-value FT 1.7e-54" FT misc_feature complement(85046..85207) FT /note="Pfam match to entry PF00281 Ribosomal_L5, Ribosomal FT protein L5, score 80.9, E-value 1.7e-21" FT repeat_region 85436..85444 FT /note="(a)9" FT repeat_region 85446..85460 FT /note="(atata)3" FT CDS join(85502..85512,85708..88207) FT /locus_tag="1MB.225" FT /product="putative translation initiation factor if-2, FT 73082-68138, probable" FT /note="1MB.225, predicted protein, len = 837 aa, probably FT translation initiation factor if-2; predicted pI = 7.0300; FT contains Pfam match to entry PF00009 GTP_EFTU,Elongation FT factor Tu GTP binding domain; contains two Pfam matches to FT entry PF03144 GTP_EFTU_D2, Elongation factor Tu domain 2; FT contains no predicted TM helices; good similarity to FT Q9SRD2, putative translation initiation factor if-2, FT 73082-68138 (1280 aa, Arabidopsis thaliana, EMBL: AC010718, FT AAF04442); Fasta scores: E():2.7e-96, 45.494% identity FT (51.897% ungapped) in 932 aa overlap, (aa 6-835 of 1MB.225, FT aa 361-1279 of Q9SRD2)" FT /db_xref="GOA:Q7YYH4" FT /db_xref="HSSP:1G7S" FT /db_xref="InterPro:IPR000795" FT /db_xref="InterPro:IPR004161" FT /db_xref="InterPro:IPR005225" FT /db_xref="InterPro:IPR009000" FT /db_xref="InterPro:IPR015760" FT /db_xref="UniProtKB/TrEMBL:Q7YYH4" FT /protein_id="CAD98509.1" FT /translation="MLTRLKKQKKKQAKSHALATEDVGSEAITKPKPISAAAKAAAERL FT RQIQENEQELKRREEEEKKKEEERRRLEEEEALRIQEEKLQKQKQRKERREQLKAEGKL FT LSAKEKAVKQKREQFVEYLKQQGVVSTSEGSLANSSFSSGLATRKKKNNKSNLKEDILS FT TDIIENSERNQDMETKTVIQEFVLDSWEKAVDYEAGSKSPNVSTKNIRDLVPPKKVDGI FT ADTNCIEHIAEESCEDLGFRSPVCCILGHVDTGKTKLLDKMRKTNVQDNEAGGITQQIG FT ATYFPPEMLSEQVKKVEADFELQIPGLLFIDTPGHESFNNLRSRGSSLCDIAVLVVDIM FT HGLEPQTRESIGLLRSRKCPFIIALNKIDRLYGWIEQNWSSSRSTLSIQNESTRDEFDT FT RLNRVLLELSEEGLNCDIYWKNDDFRGNVSIVPTSAVTGEGVPDLIYLIAQLTQNYMGL FT HCLQLNTRELSCTILEVKAIDGLGVTIDVILVSGILREGDTIIVCGLSAPIVTTIRALL FT TPQPMHEMRVKGEYIHHRFIKASMGVKICANGLDDAVAGTQLLVQSKNSTPEEIESLKE FT EVMKDMGDIFSSVDRTGNGVYVMASTLGSLEALLVFLKSSNIPVVALNIGTVHKSDVRR FT ASIMHERGFPEMAVILAFDIKVDAEAEVEAKKLNVRIMKANIIYHLCDMFTKYYSDVQE FT EKKKEKSQKVVFPCILKIIPQYIFNARDPIICGVYVEEGILKPGTPLCIPEKDNLMIGR FT VTSVEFNKKPVNEGKKGQEVAVKIQPFASDTNITYGRHFDHNDKLVSRITRDSIDILKQ FT HFRDDLSKDDWKLVIQLKKTFGIP" FT repeat_region 85721..85729 FT /note="(a)9" FT repeat_region 85913..85924 FT /note="(gaa)4" FT misc_feature 86222..86869 FT /note="Pfam match to entry PF00009 GTP_EFTU, Elongation FT factor Tu GTP binding domain, score 114.5, E-value 1.3e-31" FT misc_feature 86909..87181 FT /note="Pfam match to entry PF03144 GTP_EFTU_D2, Elongation FT factor Tu domain 2, score 29.3, E-value 5.6e-06" FT repeat_region 86943..86950 FT /note="(at)4" FT misc_feature 87644..87835 FT /note="Pfam match to entry PF03144 GTP_EFTU_D2, Elongation FT factor Tu domain 2, score 12.1, E-value 0.0045" FT CDS 88443..89669 FT /locus_tag="1MB.226" FT /product="GTPase activator protein, possible" FT /note="1MB.226, predicted protein, len = 409 aa, possibly FT putative plant adhesion molecule; predicted pI = 7.9357; FT contains Pfam match to entry PF00566 TBC, TBC domain; FT contains no predicted TM helices; reasonable similarity to FT Q9M894, putative plant adhesion molecule (304 aa, FT Arabidopsis thaliana, EMBL: AC021640, AAF32453); Fasta FT scores: E():3.7e-17, 30.081% identity (31.356% ungapped) in FT 246 aa overlap, (aa 122-367 of 1MB.226, aa 69-304 of FT Q9M894)" FT /db_xref="GOA:Q7YYH3" FT /db_xref="InterPro:IPR000195" FT /db_xref="UniProtKB/TrEMBL:Q7YYH3" FT /protein_id="CAD98510.1" FT /translation="MDVDLPIFEFTLKNLEKKELNNLADNKIGVTENNPDECDISVRLE FT NICWKSQCYVDILPLDFCDVNHSLLNETNSRKTVGMGWELQRKLTKLGYNNFNKYLISN FT YINLFPIGISKSKAKLCNWANYKDFTQKKLLKLCINGIPSDIRGEVWCYLLGSDRMLRN FT NSNVYFNELNGSIDKNIENQIILDLHRTFPNSKYYSNSSNFNKVGTLSRVLYAFASYDK FT AIGYCQSMNFIAAILLINMKEEAAFWSLVQLVSSNRNKEFMVCSWGDLETYYGERMDGV FT IRDIAILETLCRQFIPKVSQKLENIGVNFQWFALEWFLCFFVTSLPLKSIMEILDLIFC FT FGSDVLFNISIALLDINKKKLLSSVNMEECMEILKNITRNITDSTKIIRKAMKYNISSN FT HIRKLREEN" FT misc_feature 88854..89534 FT /note="Pfam match to entry PF00566 TBC, TBC domain, score FT 110.5, E-value 2.1e-30" FT repeat_region 89514..89522 FT /note="(a)9" FT repeat_region 89678..89689 FT /note="(aatt)3" FT CDS complement(89684..91549) FT /locus_tag="1MB.228" FT /product="F-box domain protein" FT /note="1MB.228, predicted protein, len = 622 aa, possibly FT hypothetical protein; predicted pI = 9.2549; contains two FT Pfam matches to entry PF00646 F-box, F-box domain; contains FT Pfam match to entry PF00560 LRR, Leucine Rich Repeat; FT contains no predicted TM helices; reasonable similarity to FT Q8WUG0, hypothetical protein (448 aa, Homo sapiens, EMBL: FT BC020572, AAH20572); Fasta scores: E():7e-07, 22.616% FT identity (26.266% ungapped) in 367 aa overlap, (aa 275-605 FT of 1MB.228, aa 93-444 of Q8WUG0)" FT /db_xref="UniProtKB/TrEMBL:Q7YYH2" FT /protein_id="CAD98511.1" FT /translation="MLSESEWLFFASNVEVHVSGKRIMQNLGANGTGIYYKLFADDIKV FT CIHGFNREFNYSFKKDILSIFWKFYLDGKFTMVSKKYIYFFFSNATPNSIFCFIEKVSG FT IAPQFLSPRFSSRNDIIYKSSLTFKNKFPDNNDKMLITKKQKITKFEQMNHQKMKNCAS FT ESTFEGERQNTFCKTNSGNYPIISTNCISSLPEDLLFYVLSMLVTPVVYEENEKSYINY FT NSFYKDDLIKSTNLAFVNSRLLKFFQGNVWEILINHRLRKKSNNIINFKNLGSLIISSK FT NIKDSDIKEIASEKYLPASINKLTINGNNNITDSALSLFLLRLRNLEFLEIIDCNSITG FT SSSFRVIGMKSSIKFLKIGSRIKQNFSINDNSLKDLFNSNYQKKLIKTDVSVCTKPSIK FT HLELQNCVGITKIPKTVKEYCYNIEYLDLRGCKNILNIELEHFFGYSKNLKVLVLSNTN FT ISDQLLDIIFENCPGIEILDVSYCANISEKIFDKIPSKLKMISGLKLSYCLNFKANALI FT QILSNCKFLEIIDLAGCCNLECSLINYRNMTFTSNLRKMGVFRLSFDPRRLNEWILNNM FT EVEPNNKKINIEICYDVELPISEFTYKFEKIRDKYLERNKPIWVN" FT misc_feature complement(89891..90034) FT /note="Pfam match to entry PF00646 F-box, F-box domain, FT score 8.1, E-value 0.57" FT misc_feature complement(90047..90190) FT /note="Pfam match to entry PF00646 F-box, F-box domain, FT score 5.5, E-value 1.2" FT misc_feature complement(90134..90208) FT /note="Pfam match to entry PF00560 LRR, Leucine Rich FT Repeat, score 13.2, E-value 0.41" FT repeat_region 90398..90405 FT /note="(t)8" FT repeat_region 91290..91298 FT /note="(a)9" FT CDS complement(91784..92647) FT /locus_tag="1MB.230" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.230, predicted protein, len = 288 aa, unknown; FT predicted pI = 7.7798; contains 6 predicted TM helix FT regions; signal anchor predicted; some similarity to FT Q9PPX4, hypothetical ferrichrome ABC transporter (351 aa, FT Ureaplasma parvum, EMBL: AE002150, AAF30928); Fasta scores: FT E():0.64, 23.134% identity (26.050% ungapped) in 268 aa FT overlap, (aa 28-277 of 1MB.230, aa 85-340 of Q9PPX4)" FT /db_xref="UniProtKB/TrEMBL:Q7YYH1" FT /protein_id="CAD98512.1" FT /translation="MYFTYVVRPGDAPEGRGPSQPLFFENFLVNNLKLGFGSMLLGFIG FT LTICSIFSGFGLHITKFLSPFNEATNSTGTALFFMIFMIFGVICLNMSTILMDDDGSIS FT QTRGFRSGCKAFGQGSLVILLGLILIVVTLYSNVGYYEGNALSGLDLNHVIKCMYNAGL FT AITSVGLTILGFAAFLVEVYSSDGTREILGFASILLCKVSGIFMFLTIIFPDCKTIGSL FT ATLVTIIALTHVTLWAGIFESIALKSRIKMTQSAVRNEYYKSRNALAYFGPPVMAEGNY FT TQQPTM" FT misc_feature complement(join(91925..91990,92006..92071,92108..92173, FT 92234..92299,92360..92425,92471..92536)) FT /note="6 probable transmembrane helices predicted for FT 1MB.230 by TMHMM2.0 at aa 37-59, 74-96, 116-138, 158-180, FT 192-214 and 219-241" FT misc_feature complement(92480..92647) FT /note="Signal anchor predicted for 1MB.230 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.000, signal anchor FT probability 0.699) with cleavage site probability 0.000 FT between residues 56 and 57" FT repeat_region 92886..92898 FT /note="(t)13" FT repeat_region 93558..93572 FT /note="(t)15" FT variation 93572 FT /note="(t)15 in clone 45 shown vs (t)14 in clone 24 not FT shown" FT CDS 93641..94672 FT /locus_tag="1MB.232" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.232, predicted protein, len = 344 aa, unknown; FT predicted pI = 4.0632; contains 6 predicted TM helix FT regions; some similarity to Q91TK4, t100 (351 aa, Tupaia FT herpesvirus, EMBL: AF281817, AAK57143); Fasta scores: FT E():2.1, 25.322% identity (28.502% ungapped) in 233 aa FT overlap, (aa 71-291 of 1MB.232, aa 82-300 of Q91TK4)" FT /db_xref="UniProtKB/TrEMBL:Q7YYH0" FT /protein_id="CAD98513.1" FT /translation="MYSQPYSGEEPMLYQSRGPNKSSTNQFNMSATNGMQVINDGIMDD FT EALNERGEEITPLISNFSCITLRTGTLLQCASLLVLIILYNVFGNKGLFTFDLYGNGGT FT LAEDLSYYAGVVSMLCVYFIGVLFLAGFQEFIADNSKAPLGFRAGSRLLNTANIIQIIV FT VALRVTQFSFTYSYFNQKWYGKFSQTKGDWCLYNFGIVCEAVSLIMYGISFFYVESYAD FT VGVGEQYAYWMLTLFTLAGISELMMLFTGFGSFFILFFAAALIVTCTWAFQFEPLLESN FT SPLLFSRDINADVLPKETPLDDMSIEKNQTVNNYQPNFTYQLYAGASGVSSVPNPYGVE FT MQQ" FT misc_feature join(93848..93907,93968..94036,94094..94162,94220..94288, FT 94331..94399,94403..94462) FT /note="6 probable transmembrane helices predicted for FT 1MB.232 by TMHMM2.0 at aa 70-89, 110-132, 152-174, 194-216, FT 231-253 and 255-274" FT CDS 95011..96504 FT /locus_tag="1MB.233" FT /product="diphthamide synthesis protein, possible" FT /note="1MB.233, predicted protein, len = 498 aa, possibly FT diphthamide synthesis protein; predicted pI = 5.6382; FT contains Pfam match to entry PF01866 Diphthamide_syn, FT Putative diphthamide synthesis protein; contains no FT predicted TM helices; reasonable similarity to Q9BTW7, FT diptheria toxin resistance protein required for FT diphthamidebiosynthesis (363 aa, Homo sapiens, EMBL: FT BC003099, AAH03099); Fasta scores: E():7.6e-20, 38.438% FT identity (43.243% ungapped) in 333 aa overlap, (aa 76-408 FT of 1MB.233, aa 1-296 of Q9BTW7)" FT /db_xref="InterPro:IPR002728" FT /db_xref="InterPro:IPR016435" FT /db_xref="UniProtKB/TrEMBL:Q7YYG9" FT /protein_id="CAD98514.1" FT /translation="MDSTIANPNISLSNDINYKRFDPGYSDIENIELLKEAIRVGLPSH FT YDFEIIKTIKKINEIKSISDERENFTVYLQMPEGLLVFSQSISLILQHFARTSVVILGD FT ITYGGCCIEDQLMSILESSHSKSSNQSLLVHYAHSCLIPFEELAMSKDINIANILYIFV FT EINLLSDHFIQTIKHNFKKEDKIALLSTIQYHSTIVGSQKCLNDYFLNPVKIPVCDPLA FT WGETLGCTSAIIDGDVEKCIFLSDGRFHLESAMIQNPSKVFYLYEPFSKKITRETYSHQ FT LLHEIRKSSIVNSFKIVQAGGSNLINDSVVIGIFFSTLGRQGSFAIVERLEKLINKYNT FT NTNNKTKIVACTIFASDLSAECINDNILEGVDYSIQLACPRLSTDWGAYYKKPILNSYE FT AFVLLGNLSNNNFQFFCDMENNLDLSNSEFSYLQTYPMNYWASNGNIWCNYYSDESRNG FT SFGASKDQIDEIKYNIRQNKLKNLTKRKINLQYERNELA" FT misc_feature 95236..96267 FT /note="Pfam match to entry PF01866 Diphthamide_syn, FT Putative diphthamide synthesis protein, score 188.0, FT E-value 9.7e-54" FT repeat_region 95820..95827 FT /note="(a)8" FT CDS join(96794..98987,99105..101224) FT /locus_tag="1MB.234" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.234, predicted protein, len = 1438 aa, unknown; FT predicted pI = 7.2160; contains no predicted TM helices;" FT /db_xref="GOA:Q7YYG8" FT /db_xref="InterPro:IPR001841" FT /db_xref="InterPro:IPR001965" FT /db_xref="InterPro:IPR011011" FT /db_xref="InterPro:IPR019786" FT /db_xref="InterPro:IPR019787" FT /db_xref="UniProtKB/TrEMBL:Q7YYG8" FT /protein_id="CAD98515.1" FT /translation="MEEVNINQVLLERGISYTCTQILDAHAKKLKNEINTRCREFQSVD FT ELINFSNSDQNRNDSNGLLSNFDWRSSYLPHIYSATLGLKNGFIDPKLQGELYQSILKC FT INQGFVSGNIGIVRISDHSHPVRLATPIDKTCYSLVWRKKDLSETHDIPIVLGEYTGDV FT KYVGDEEISDESYSFQLTFKSSAFKFTGAVVSGSHQQFGDFSANNTDKLSDQPTYFPDE FT FEYTEANDNSYSPNKSSFFSKPFSQTRNELLSTDGNIKKIGARLLLPNENEYILSADKT FT CNEMAFLNHYECVFNNNFNFRINVQWHSVYVDGWPHIILTSIPGVGIKLGDELAADFGE FT NWFSRIRRISENNIKNELILFRLRFNSGNNQGSVNFSDLDENLSLKQTDSKDSLSNCGI FT SGLINFYEICEVCQNSITSQARKSYYKPKNEDSVLFLKSKFDYLLCNGCNRAFHWYCVN FT RPEFRDLKTSSKNWFCFQCLSLTLRILPSLIYRIPNLTPGLINSIQNLYNSTDDLIPGV FT IDAIGNVVLRSTDEVNTEDPKSIVFRPPEYFDYNLRHFNEYGGLEGLTKLKPCYLCYSS FT STISNQMIGIATICRIHRLHIAPDYDEFFNSVQYTYNSDNNQLVSNSSLHLSLVADDYG FT KRKYTASTQSSTEDIIRPHIKKLSGSRYIIERSRRVICSLRQSLYQSEKHRVELLYKLE FT KLRKIQDEYFRNISEGNGMIPVLCIKLGSTILNKEFSANDIENHRINIDDLLITDNHVK FT STKKTSENLIPYRNSKKHTIFGISEEFYKSENFEFNIIKMKNEKLLTETYKNFNIDSIY FT DKVKLIHKFIIQECKMNQDNEDAILIRETPKYEYIDQPEWNIICGSLNIESNLIKYFVG FT VNISRISHKTTTFNITSKICIGNVFLIKTIPITIYNEFSINNDELAELNEKSHSEILNT FT IRFSAIEILAWTQSRLSEIFIKETSLNNNDGIALDKVDGCENTNIEQNYKRDLGESLNS FT DFNSALVKENWLQKANLIAQALQPYPTGIKWINETHSWKVVYSNEFGQKISKFFDTIPF FT DENATVESLYLKACKFVSVSQTNNKVIKMLSTSRKLNKITNEISSPLDKFIFNEWGNTI FT EDFLARYNLNNPDRAVYSINDFNNEINSLRPFPDNLDWCDSTFSFVVVLPSNEKRSFQL FT DKFEFNIINCYSAAYSLAIGNVNIDKEFYNRKIRKYNSRVKNNTNPTQLKKQIYKKHLI FT NNRHESPSEEKKENFSLNKSINTKQKNSNETGKFTSLPSCQVYSYQDKETTDTNTSSEE FT SLLGNICESKIEPKELINEDLKGNSFNKLPSSDEDKLSIRAKSNKRKTIVRALPAPSDP FT LSRARIIEYSKQADELRPFPHGITWCYRSARFKVRYRRSSDGCWTATSFTPTKFNSVKE FT AFENAVDKLAAHINDYKNSNEIPIIPKKRHLEIEP" FT repeat_region 97612..97619 FT /note="(at)4" FT repeat_region 101268..101277 FT /note="(t)10" FT CDS complement(101309..102505) FT /locus_tag="1MB.236" FT /product="myb proto-oncogene protein, possible" FT /note="1MB.236, predicted protein, len = 399 aa, possibly FT myb proto-oncogene protein; predicted pI = 6.0801; contains FT two Pfam matches to entry PF00249 myb_DNA-binding, Myb-like FT DNA-binding domain; contains no predicted TM helices; FT reasonable similarity to MYB_BOVIN, myb proto-oncogene FT protein (640 aa, Bos taurus, EMBL: D26147, BAA05135); Fasta FT scores: E():1.3e-15, 25.428% identity (28.650% ungapped) in FT 409 aa overlap, (aa 5-395 of 1MB.236, aa 94-474 of FT MYB_BOVIN)" FT /db_xref="GOA:Q7YYG7" FT /db_xref="HSSP:1GVD" FT /db_xref="InterPro:IPR001005" FT /db_xref="InterPro:IPR009057" FT /db_xref="InterPro:IPR012287" FT /db_xref="InterPro:IPR014778" FT /db_xref="InterPro:IPR015495" FT /db_xref="InterPro:IPR017930" FT /db_xref="UniProtKB/TrEMBL:Q7YYG7" FT /protein_id="CAD98516.1" FT /translation="MPQEPWNAEEDELLYQLVQRLGPSSWSETARLLNSTLNVNRLAKQ FT CRERWISHVDPRIRRGDWSLSEDRFILNQQNLWGNRWADIARHLPGRTTHAVKNRYHQL FT QRLLNQGKTKLETILNAPLIHVVPHFLQQTNFFKDPPEKESEVGNQIRVKKAIIKKRRE FT SISNYRCMNEKESIDQFKKNKPIYSENEVSFIQSELEVDKHYSKSNRKPVENYSRFEGF FT EQRQENVIKNDPDLYKQRIQTIESTCEVVPSNSEIEFSPRNFRRACLEFQQSPPGVLLA FT SDVGIYPVTPTFTENCNNYKTNNAIEPSTCETNQDNEYISKSEFLTPSSELSECLPSKC FT IQPICIFNEEPSIYNEMDYRDPNWYVWNENSNVWDNNPIHRLDTGSIADPFLHSFPEI" FT repeat_region 101958..101965 FT /note="(t)8" FT misc_feature complement(102191..102328) FT /note="Pfam match to entry PF00249 myb_DNA-binding, FT Myb-like DNA-binding domain, score 56.4, E-value 3.9e-14" FT misc_feature complement(102344..102499) FT /note="Pfam match to entry PF00249 myb_DNA-binding, FT Myb-like DNA-binding domain, score 47.2, E-value 2.3e-11" FT repeat_region 102540..102547 FT /note="(at)4" FT repeat_region 102822..102835 FT /note="(t)14" FT CDS join(102947..103317,103351..104276,104340..105259) FT /locus_tag="1MB.237" FT /product="DEAD/DEAH box helicase" FT /note="1MB.237, predicted protein, len = 739 aa, probably FT DEAD/DEAH box helicase; predicted pI = 9.2535; contains FT Pfam match to entry PF00270 DEAD, DEAD/DEAH box helicase; FT contains Pfam match to entry PF00271 helicase_C, Helicase FT conserved C-terminal domain; contains no predicted TM FT helices; good similarity to Q9D2M2, 4632415a01rik protein FT (568 aa, Mus musculus, EMBL: AK019495, BAB31760); Fasta FT scores: E():2.9e-35, 41.152% identity (47.619% ungapped) in FT 486 aa overlap, (aa 60-513 of 1MB.237, aa 61-512 of FT Q9D2M2)" FT /db_xref="GOA:Q7YYG6" FT /db_xref="HSSP:1HV8" FT /db_xref="InterPro:IPR001650" FT /db_xref="InterPro:IPR011545" FT /db_xref="InterPro:IPR014001" FT /db_xref="InterPro:IPR014014" FT /db_xref="InterPro:IPR014021" FT /db_xref="UniProtKB/TrEMBL:Q7YYG6" FT /protein_id="CAD98517.1" FT /translation="MSEVNKKKNKTNTRYNDEQEIKNLKNRIVIELPCRGKSWNNPNLL FT NHGVINNKNAELPVKRIKIEDIMSPDLFSDLPISRRTLEGLRAEGYYQMTLIQRDTLPH FT SLQGRDIIGQARTGSGKTLADNYCSIDGLLSLILTPTRELASQVFDVIKEIGKFHSTLS FT AGCIVGGKDIKSESSRINMLNILVATPGRLIQHMDESPLWDANNLKILVIDEVDRMLDM FT GFLNDIKIILDGIPSSSSGRQTMLFSATVYSSELSIKKIENLFRPNQLESFSLDNIGAL FT PKNLQQLYIKVAIHEKIDTLFNFLRTHSNKKIIVFVSCCKQVRFLSTVFTKLKIGCKVL FT ELYGKQSLQKRLEVVHNFYTHESLVTSNEKLKLKNIGRNSKSSYDGAVLFCTDIASRGL FT DFPKIDWVIQLDIPENADTYVHRIGRTARYISKGINTIKKVTPNEYEMRYTIHSSLQSI FT CASDQNIKEMAERAFSAYIKSLFILTPNDKREELKKLDFSAFALSLGLAIPPKIKINNT FT ESEKRLISKHSSKLQKFKEKIRQKKLSKNLDDNEDINSNRLLQKDDSEPIDILSDELIL FT FSNNESAINQEKLSAIPLNKKVATDKLRFRSDYSGKIRGHGNFDLDKKHIFFSDDEGPG FT TINEDNKCENLDIDCQRKYIEQVKNRLKCQTKNDKERDRERVHEMHVKKRRISRQFRSE FT KSNNANDIELIESREEEESRICDEDNLHNLTTAALEKLGIQISN" FT misc_feature 103196..103810 FT /note="Pfam match to entry PF00270 DEAD, DEAD/DEAH box FT helicase, score 176.7, E-value 2.5e-50" FT misc_feature 103952..104236 FT /note="Pfam match to entry PF00271 helicase_C, Helicase FT conserved C-terminal domain, score 82.3, E-value 6.5e-22" FT CDS 105336..107411 FT /locus_tag="1MB.240" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.240, predicted protein, len = 692 aa, unknown; FT predicted pI = 9.4213; contains 9 predicted TM helix FT regions; some similarity to Q55044, form i operon orf FT protein genes, insertion sequence is630protein (426 aa, FT Shigella sonnei, EMBL: U34305, AAA84871); Fasta scores: FT E():0.36, 20.482% identity (22.368% ungapped) in 332 aa FT overlap, (aa 361-675 of 1MB.240, aa 84-404 of Q55044)" FT /db_xref="UniProtKB/TrEMBL:Q7YYG5" FT /protein_id="CAD98518.1" FT /translation="MQYGFGNNLKLYSGIDSIFNLRKVNKTRHIVIHAFVRETYEIFLF FT LSAISLLYFILSGYDGSNTMVIANALRLIFTIGCYLVSHRMREGDLSLLLLWLVLLMNI FT CIIPIFSIDIFYFRMLDILSYFFLFWFTGFHPIFIFINAIIGLGSLSWANFRMLSAIDI FT NLYERNVLQALSFTLLFTRLSVGLIVLNNIYTNCSIIRDITVNKIKSLYDIKVIIENES FT ELIKLSKSLGISLKQLSKQSEDLDNSNNTELLRVDSDNYSSSMSLINPSSIHSANISRA FT LSFADNSNESSSIGSSTFLNTPIQSIDDFESVPHSSTLQGLYNRLLIKKKNDVFERFSP FT SMTFIDKYRQRMKKLFRSLMYKCESFILTIRFAEFPDSTLPISTKAVSQVFLDQEIEGM FT YTQWLQFYSCNTIEHSLHYILLSCIIVPLMSSAEWLLLFSNKSKQLLCIYSPYVCLIQT FT SNKRKIIFFIFREGIQVIVSMILIGMIYILCKVNKIIRKREKSKSDQEHLLKGGSIKER FT IINAFSRKIVLYPKFSESIHSCIQWLSIIIGSWTIFCNIIDCILMGNISLRFLTLSSSL FT LIAGTYLNLRVSSTIIVYLIWGLFSFIFTVVISGSIEKYIPCIISISIPAASIIFLQTI FT PLDRVRRILFCRYVLPYILYIQHTAFVLKDCGEDIKKKLSNKYVSSSMGIRTVYSK" FT misc_feature join(105432..105500,105528..105581,105615..105683, FT 105711..105779,106581..106649,106731..106799, FT 106968..107036,107094..107159,107178..107246) FT /note="9 probable transmembrane helices predicted for FT 1MB.240 by TMHMM2.0 at aa 33-55, 65-82, 94-116, 126-148, FT 416-438, 466-488, 545-567, 587-608 and 615-637" FT repeat_region 105710..105718 FT /note="(t)9" FT repeat_region 106320..106330 FT /note="(a)11" FT repeat_region 106788..106795 FT /note="(at)4" FT CDS 107520..108809 FT /locus_tag="1MB.241" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.241, predicted protein, len = 430 aa, unknown; FT predicted pI = 7.2487; contains 3 predicted TM helix FT regions; some similarity to Q9NAT3, merozoite surface FT protein 1 (360 aa, Plasmodium falciparum, EMBL: AF286876, FT AAF87595); Fasta scores: E():4.1, 27.586% identity (30.573% FT ungapped) in 174 aa overlap, (aa 121-280 of 1MB.241, aa FT 17-187 of Q9NAT3)" FT /db_xref="UniProtKB/TrEMBL:Q7YYG4" FT /protein_id="CAD98519.1" FT /translation="MKKINFFQYYIGLFVFFIALFSELDAYSGSFTKLEDFGSKGYRVT FT TETFNVTANAFYTNVEKEINECVDWYSVIFTSRSFGQLIIPSLPNGLFIFFSKKRAIKC FT NKDEKIKEYFLSIPNINLGFDISVINNFTKLYSKEEITYELNKVNKQFYLDILSKSNPF FT TPNHPEFFSLEGEELTYIHFPHLEVESKILKKLLQGMFELDLINSQTLSLLRFSNKNHS FT NKWTTKITSFTINKNNTGKNCESGNLEVVRKLSGSGFHWNLETELINFENTPILLDSFN FT REIFVDIDEISQRNEKKVDGILVPLHPVHIERPSEESDSSMLLSIGKSSDIPIHARYQS FT ACNGCNFKTVKIGFPSIVIYEKESVDKHCLIPKLRFEIATNQNVKTGSSQEIVINIPVG FT NSKDINYVFIATIFTIIITTVTTGISIKRY" FT misc_feature join(107538..107606,107739..107807,108729..108797) FT /note="3 probable transmembrane helices predicted for FT 1MB.241 by TMHMM2.0 at aa 7-29, 74-96 and 404-426" FT CDS complement(108839..110368) FT /locus_tag="1MB.242" FT /product="endonuclease/exonuclease/phosphatase, possible" FT /note="1MB.242, predicted protein, len = 510 aa, unknown; FT predicted pI = 5.3401; contains Pfam match to entry PF03372 FT Exo_endo_phos, Endonuclease/Exonuclease/phosphatase family; FT contains no predicted TM helices; some similarity to FT AAL87740, (648 aa,, EMBL:); Fasta scores: E():0.023, FT 24.540% identity (27.211% ungapped) in 163 aa overlap, (aa FT 53-210 of 1MB.242, aa 217-368 of AAL87740)" FT /db_xref="GOA:Q7YYG3" FT /db_xref="InterPro:IPR005135" FT /db_xref="UniProtKB/TrEMBL:Q7YYG3" FT /protein_id="CAD98520.1" FT /translation="MWADEDELPDRISFLTFNAGLLEYRICGVKLYQNPPYTSHRLLQI FT PSALRGINADIIALQEVFDEKHSDYIIESLQPVYPYFARETKQSQNQKSMRWQPISVIH FT NQLALHNGLLVLSKYPILNARFTCFSDVTLIEEWFVSKGMLEVTIQLPGMDKSPLTLFN FT IHMASGAVNPESETIETLRNKEIEQLLGACDHAIRRGEVPVIIGDLNAAPNCCGSNYNY FT FIDRGWRDCFECFKHENRYSTSASSNISSQINLKSRQNDFDYKGKQTYIDMDDLYDDLS FT CYSKENFAHREEQIEEEEHATPSSNDSFSTRIEIDTTPMINMSIGSSSINKGNDDVAVV FT NKLVSSFAVAKAAVEVAKRGFSSCSPESTSIEVVSTTKSSNCSNNINEFQENYPIKQYF FT PGNKDYCSSKLFSSDFTWDPNNPLNVIGPHSNCHGQRCDHIFLPPTHLSKKLAQYEPYK FT SSVILREPRVLVDGWCFGCVGSTVLVTLSDHYGVYVELKRKLDNNYRNSQY" FT misc_feature complement(108881..110335) FT /note="Pfam match to entry PF03372 Exo_endo_phos, FT Endonuclease/Exonuclease/phosphatase family, score 61.2, FT E-value 1.5e-15" FT repeat_region 110542..110550 FT /note="(a)9" FT CDS join(111121..111510,111553..112606,112796..115687, FT 115874..116832) FT /locus_tag="1MB.243" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.243, predicted protein, len = 1765 aa, unknown; FT predicted pI = 8.4116; contains no predicted TM helices; FT some similarity to Q9H8D0, hypothetical protein flj13745 FT (1424 aa, Homo sapiens, EMBL: AK023807, BAB14687); Fasta FT scores: E():0.24, 21.315% identity (27.006% ungapped) in FT 821 aa overlap, (aa 675-1484 of 1MB.243, aa 489-1147 of FT Q9H8D0)" FT /db_xref="UniProtKB/TrEMBL:Q7YYG2" FT /protein_id="CAD98521.1" FT /translation="MIKPISWYETYFSKFGRCNMFINQNGELYFVNDRKNQRVMTEKIK FT IVVSLIPLKEGVIIVGYDRIETKLMINMAILGHPLEHPTKIVLKSSRNLFDKSNESKFN FT KVVIWGSVDFPLIMVYDQIEKNHTLYSIWEDSSKYSKCAETIFISRTIFDNNFMISYHI FT PSASELKLVIFSESEFNDVCYGKKSIIEDKELTLQTIHNVKDVVPITFGHQQEKHELFG FT IHYLFGFNFVKSNQLFKQEKSQELPVFSLILIDTGMLLLYSGSELIINVNITSEQIGDF FT ELLGISSGISNNFDALVKLNGNHSNKDHLLGIRCSINLLPEDFLVQIITKLLIRISNKE FT FSVSLIREILLNRSNKHWDILRKILLGETKDINTEQELKFSNMSSNGSIESPMSNGVYK FT KIKLFSDKDVCSNNKSKTWQILSKFWASYLEKNKKSPFDRIEFSNLISKGIIPHEFLEG FT KKYEYTNYSNSGYQNLEFKASPQYDSNFSDWSFPPVLLDKLYFFTRFSKNNKKDIYFKV FT IEKHFPLQCFTIKLYEQLLNRSKNGEIIDLISKVGITNTILPILSPIISFPIEKFLNSY FT SNNVPNLSFDDNKYFLLGRLDMLSYAKNINSSYSSVNLKIASGSDNIAASVLPHGRNTV FT DDQKSMLENLQAYYKNKVDPFRITSAEKRTIIEIWKNDLTEYSDIGSFELSYKWCFQVF FT SYDNRYCDALELLSLKNPPQILLQRSSGAYPIDEESWDEFQRQRVVDAVQLVYTSLLGR FT AACTFGGSSFDISINKINRPVFVNKIYEVASNSLISVDIERFKEVCFDISVWNEFYMGV FT TQSLNYCYPYSKLPQDGFNKNKKLWIIEQMDTFSSQEDIPFISGFLYGISLSGFFKADK FT SDSITTFPLILEPKEIYKILENDGQSIKTCSMLLATAISALKSQNTTLLRLYLMHIPSI FT LPSIYTKSLQISSISQYSAITSIGLLFSQSRSHQVIEILFSEFLRTVSDTDDQASINPN FT IYSISVAVSLGMVLQPNEESINTSFNTMIEDDITNALLSCISGNKLPKFLSSLASGPNL FT EHYTEFSGLERFKYQNENMDVGNNSKSNSSLSSGTNKFSNSSKNYCSKIVDSTLLAIPA FT ALSLSIMHIRSKNKSISSSISIPFSRPEELVNYRPEVLIFMSLAKVVIEWEESSIPNKD FT FICSQIPSYLWYLPSDKIFPFPITSNKCSEIEPQVNSKLMHCISMGTLDWIHCIQARIA FT ILSGIIWGLGIVFAGKRNIELKYTTTTILEYLDRIPLIQMPLSIASTIRDKSICSTHIT FT IDRWSKELCIRVCLTSASLCFSGSGDKQILMQIKYFRAQLLESAQLLWTSSTAISPFSI FT FSIPPIEHVHSQLMAYNNALGFLFLSAGHFSFSNKDKLGSTFLILATHPIYPKDSSDIS FT TPGIIFQPLRYLFISAMNYGRRVVIPKLVTCSPEPLNDSCDFSDKDTNCYQRLIKGELW FT VQTKNRNNGYCWNEANHIINQVKNYQFHNFQIYNFYDNQLNNKQMLYKYYGPNNKILSA FT NSNVFSRCDEISDLYKNFSHTLVLEENKSFYRKMNNKEMHNKKAGCAPWRGAILNNYID FT QILSTWNKINQDGNGCEILLSPEIIKILRITLNIDNDKKLQTILLNIHRRLQAQMSVLI FT RCVRYYYGAHNGTRIPSCDEKQSLRLFLSLNGMPLASHFNFIMKKKIESNTKIDTLLRK FT DYKKYEEIIAPILIMELPTLSSLGLQILKVFIYERYFKAQSASSQSIEAQQQIKLMLTR FT IFSSKNYF" FT repeat_region 116581..116590 FT /note="(a)10" FT repeat_region 116845..116874 FT /note="(at)15" FT CDS 117044..118120 FT /locus_tag="1MB.246" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.246, predicted protein, len = 359 aa, unknown; FT predicted pI = 4.5012; contains a predicted TM helix FT region; signal peptide predicted; some similarity to FT AAL93030, (1289 aa,, EMBL:); Fasta scores: E():0.068, FT 22.656% identity (25.778% ungapped) in 256 aa overlap, (aa FT 10-246 of 1MB.246, aa 252-495 of AAL93030)" FT /db_xref="UniProtKB/TrEMBL:Q86PQ4" FT /protein_id="CAD98522.1" FT /translation="MNVLLLLLYLSLIYCSYCSIQRAGVKQINTLTPQSIERKEGTIID FT QNSRNFCQYTDFRWSSCICDRNIMIGVRSLIKEQTENCSPEVTIIELPCDKSSCNQKLL FT TTSDGVACPSLANIDGVCDHSKNFLTDVMAKSFEMCRSFCSVMVNCTHFIMDTKNSRCK FT LYSGNKICGKEAPGITTGLAGFDTNPCSECTVGIWGGLSECKKPVDFDPNLLGCGEAVR FT RRISNGDVDADCPYRTETWTCSLPGQTCDMRTEISDIYLKSKPENDDKSESYKLLEISL FT IFAGIVTGILIFVPTSLIFPKIGQFFYGKRLYSLLSGNELDSSRTPVNETINNPDDYEE FT WEEDMNVEYETEFTIDNR" FT misc_feature 117044..117097 FT /note="Signal peptide predicted for 1MB.246 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.999, signal anchor FT probability 0.000) with cleavage site probability 0.403 FT between residues 18 and 19" FT misc_feature 117878..117946 FT /note="1 probable transmembrane helix predicted for 1MB.246 FT by TMHMM2.0 at aa 279-301" FT CDS complement(join(118220..119965,119999..123184)) FT /locus_tag="1MB.247" FT /product="hypothetical predicted WD40-repeat protein, FT unknown function" FT /note="1MB.247, predicted protein, len = 1644 aa, unknown; FT predicted pI = 6.8646; contains three Pfam matches to entry FT PF00400 WD40, WD domain, G-beta repeat; contains no FT predicted TM helices; some similarity to AAM14817, FT hypothetical 60.5 kda protein (fragment)(544 aa, FT Arabidopsis thaliana, EMBL: AC002337; AAM14817); Fasta FT scores: E():0.0024, 33.010% identity (34.000% ungapped) in FT 103 aa overlap, (aa 175-276 of 1MB.247, aa 234-334 of FT AAM14817)" FT /db_xref="GOA:Q7YYG1" FT /db_xref="InterPro:IPR001487" FT /db_xref="InterPro:IPR001680" FT /db_xref="InterPro:IPR001841" FT /db_xref="InterPro:IPR001965" FT /db_xref="InterPro:IPR011046" FT /db_xref="InterPro:IPR015943" FT /db_xref="InterPro:IPR017986" FT /db_xref="InterPro:IPR019775" FT /db_xref="InterPro:IPR019781" FT /db_xref="InterPro:IPR019782" FT /db_xref="InterPro:IPR019786" FT /db_xref="UniProtKB/TrEMBL:Q7YYG1" FT /protein_id="CAD98523.1" FT /translation="MEPYYICYILKEYGFDDVSNRVFEKFKSISKIPTKINYDGTETQI FT SYEELTNGLQDNIFGFIKLLSSFIELCKFNKNNIVGTENLIFNQDSFLNTIQYLKNEVI FT GHSTFIENIKDICSRSTYLNLRSLKHGVSDKLMLSNMKYSHLLTNFCGRFILSNFIYAH FT QHIEDSEFGVGGEPASVYVIKRDKTGSVLISGGDDGMIRLWNTFTGSLLASVRKHTGDI FT IDLDVNSCNALLASCCGKGQLLLTLVYGNDWVPMAIIENTDRLTFVRFAYSKHSIKLNN FT DDIFQESEMLISAGDNAIITVYKVSDLFIHGSLYHLALMKRREFPEVLLNSFQSIFKRY FT GSLSKSPGSFEREQSLFYKAPPVYKIDIFPLSPKAFDICLNPIANHGSSSDENLKNYVF FT FSVGASLQSLSSTEEIQINISGKNISNEVISIQKKNDMQGYSLVFMIPLEHSHSKSGIN FT LINMPQKHNEHPDVSFANNSHDLVTASDDGTIIMWIFSNLSIYNQTNLYTYISDRINSD FT SYSKKKNNSIKKATEKSPTESLSDDDSDYVGSGSMDENSLLRRNPVRTSRFVVKDSSPN FT KENDKSIGTDSSSNFVQFIDSIQWSCDDSIILIAQSVTSKGMLRRTRNSTLVSCIECCI FT SYFSRNYGERFLDVVLPGVCSRIACLIPHPVSPGIVLSLTYNGVIFVISNSKLNQLNDF FT KFKNNSNILFEHKNTKNPYLNGVWIKNGLSFVVSQKYGSFEIYDICNKGDLNQQVLNQT FT YRYSFPEQFFLNDFSEILRDDTFGWIDQNLRRPIYTIPRSVIVDRNKRMYPEDIQPPIP FT MFSSLGNKTSKNNTRNELLNSRIPFYPQLDKTLSTYKILNLNSEIYLEKAKKRSIYRKK FT MLEIDKNPLNGSPTVSNNSNAFQSRNTISNINEELTWVSSQTNNDETSSVEAQSYSSDF FT SNDEDFDINNSSTTNTNRRLRMISRRSPSISDTGSSSSSDDSSLGENDYDSDLTSEDSL FT YNEIRGSKSITRSRRKIKNRLINVQDSDSEVVPEIEFRYYAFSNLDTIWMPDSEEQEVN FT NLLVCKLCKKSSTTIEFNSHNIRPILHGVNGSIDLGPLIGPFDYFYSTHRPERNAKFIQ FT ENDNFGEGSFYLHSRCLITIPFLQWELINDQVYTNLFDIMRRVYNPSQGTPPLPKDGLS FT STFFDSKIVKFPKLIGTCSYCHDYGASILCQGNRCNRQFHYHCTALAYHSFPESVNSSF FT ISPRESNMYWCDIMQFYLFFCPKCIYLKQSNVPYCPKRENMINNTGHNHCNRSWLLADE FT VSAGYVPQVNDYLYFFPNAYICSGLDDIFFKNMLEIATENKRLPRRSNRKFEFIKCKLI FT NISYAFPSDTEHSIKAILTFLTELTSGKHVYWQIRCAPNDGPDYLVLEDEVEKGIHNLE FT HRLRVGQENHIFIDNQWHEILIRNIKQNFIWESIEVTWKQEESNNNSLMVSPWEIYETV FT PDKKSLPNNLEASNDLIKIFCWMTTQNGQNNPLSVVEFFKYPIPFFSKKNISKNGFSNQ FT EWVMHYWKEIPLPFSFTSIINRIRNNYYRRLESLVFDIFLVRSNCEHFNINNNNLVQGI FT RSIEYELLRLIFSKRLPRYIIQTNTNILLQEIGLEDRIKVIDSSNIDSDEEIINRIGKR FT TRRRL" FT repeat_region 118430..118441 FT /note="(att)4" FT repeat_region 118620..118628 FT /note="(t)9" FT misc_feature complement(121703..121816) FT /note="Pfam match to entry PF00400 WD40, WD domain, G-beta FT repeat, score 10.8, E-value 0.45" FT repeat_region 121881..121890 FT /note="(t)10" FT misc_feature complement(122444..122560) FT /note="Pfam match to entry PF00400 WD40, WD domain, G-beta FT repeat, score 3.8, E-value 3.8" FT misc_feature complement(122570..122686) FT /note="Pfam match to entry PF00400 WD40, WD domain, G-beta FT repeat, score 20.7, E-value 0.0022" FT repeat_region 123474..123482 FT /note="(a)9" FT CDS complement(123707..124615) FT /locus_tag="1MB.250" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.250, predicted protein, len = 1644 aa, unknown; FT predicted pI = 6.4854; contains a predicted TM helix FT region; signal peptide predicted;" FT /db_xref="UniProtKB/TrEMBL:Q7YYG0" FT /protein_id="CAD98524.1" FT /translation="MSGARSSIFIACILYFISLVSANGVCILKKGASCLNASYSFIEGG FT MTPGDAIRVSVVLLPSASLELTNSSFQNIATLRFGVDSCSIESGIASLRAYSLYKNGQY FT NTGSTISFDLVALEKSLAIFIAGSKVLEIPLLVAAPFMLTSDSNGSAPTLQFLEYSTTE FT TGFCSIASSGSCNNSSLVGMSNGVTKNLTIRAELPSSIPDDYFSFYISTYSNIYKLPVL FT EVAFTKQEWLATSSGSFVGRGKIPAGINLGSSINFVLVPSSGGSVELKVNEKSLGVVKV FT DASSIKFIYQSVSQGTMNVSY" FT misc_feature complement(124532..124594) FT /note="1 probable transmembrane helix predicted for 1MB.250 FT by TMHMM2.0 at aa 7-28" FT misc_feature complement(124550..124615) FT /note="Signal peptide predicted for 1MB.250 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.973, signal anchor FT probability 0.026) with cleavage site probability 0.862 FT between residues 22 and 23" FT CDS join(125044..125538,125833..126843) FT /locus_tag="1MB.252" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.252, predicted protein, len = 502 aa, unknown; FT predicted pI = 8.0827; contains 8 predicted TM helix FT regions; signal anchor predicted; some similarity to FT O77343, hypothetical 67.8 Kd protein (579 aa, Plasmodium FT falciparum, EMBL: AL008970, CAA15598); Fasta scores: FT E():0.034, 22.772% identity (25.000% ungapped) in 202 aa FT overlap, (aa 302-499 of 1MB.252, aa 379-566 of O77343)" FT /db_xref="InterPro:IPR016196" FT /db_xref="UniProtKB/TrEMBL:Q7YYF9" FT /protein_id="CAD98525.1" FT /translation="MEDKPKCNRWFLLLLYFLISFSVCGLVGSFTAILPLLRKACMFSN FT LCRCDPSTKNYRGDDCLNPELKDLILLEPLLSHECDPIGCNLQNVLHVDAWKFGLSAPL FT IASPIAGTIADIIGPRALGTIGTIFVCCGLLIWLLLTNIFEVLWIAKYLLSLSWLFLGI FT GRKLSKIKQFITCQEEMANGNQVRKDFGELEKSTTEEDILITCPKTVQLRARSEVTNVK FT IKIKCCSSEGVECESSAMCNLKASSAIMGGYSVVCSLAKCRYMLKIDKIRSFSTPSNIF FT NVFDLNAKNSLNTIIDRPIFEQLFSYEYFTFMALFIYNFWRCSILVSKSEYIVTSVLYI FT NKSVGIEQEMLQIMNIYNIIMSLGSIFSVIWGIIATKYGVNFMIGLTAILGCLIHVLLI FT FIDSLPFWCIYLYFCAFSALRSFIFGSLNCFIGDTFGFSNFARLAGIQAFTCFIFFQIM FT NFLTSQYFEVISWAKVNQYLLIPNIFLLSVPFILNLLKAKRNN" FT misc_feature 125044..125127 FT /note="Signal anchor predicted for 1MB.252 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.083, signal anchor FT probability 0.917) with cleavage site probability 0.022 FT between residues 28 and 29" FT misc_feature join(125077..125145,125404..125463,125482..125535, FT 126100..126168,126187..126246,126259..126327, FT 126361..126429,126472..126528) FT /note="8 probable transmembrane helices predicted for FT 1MB.252 by TMHMM2.0 at aa 12-34, 121-140, 147-164, 353-375, FT 382-401, 406-428, 440-462 and 477-495" FT repeat_region 125710..125721 FT /note="(acaa)3" FT repeat_region 125734..125745 FT /note="(ttat)3" FT repeat_region 126285..126292 FT /note="(ta)4" FT repeat_region 126695..126702 FT /note="(t)8" FT repeat_region 126893..126901 FT /note="(a)9" FT CDS 126912..127775 FT /locus_tag="1MB.254" FT /product="mitochondrial carrier protein, possible" FT /note="1MB.254, predicted protein, len = 288 aa, possibly FT CG4743 protein; predicted pI = 10.1482; contains two Pfam FT matches to entry PF00153 mito_carr, Mitochondrial carrier FT protein; contains 3 predicted TM helix regions; reasonable FT similarity to Q9VBN7, CG4743 protein (297 aa, Drosophila FT melanogaster, EMBL: AE003753, AAF56493); Fasta scores: FT E():1.4e-17, 31.295% identity (35.223% ungapped) in 278 aa FT overlap, (aa 13-275 of 1MB.254, aa 29-290 of Q9VBN7)" FT /db_xref="GOA:Q7YYF8" FT /db_xref="InterPro:IPR001993" FT /db_xref="InterPro:IPR018108" FT /db_xref="UniProtKB/TrEMBL:Q7YYF8" FT /protein_id="CAD98526.1" FT /translation="MENGKKSFQIVVNSLISGGIAGLFVETILYPVDAIKTKMQYRSLC FT KNSVLFFSTRNYYRHIYSGFKYSAFGSFISSSIFFGTFHFLSNYSPQVKYKSLKTMQIS FT LISELLSSLFRAPFEVIKQNIQVRNKSFLGHFYTNYSLFRHCLNFNKITASLIRDIPFS FT IIQFSLWEKLNRTGDGVLGEKCNFKDLLSGISGGISGAIAALITSPADNIRTYIFTKAK FT TRSSNVSILSAIKTIYGLNGIKSFYIGSFLRALWLTIGGIIYFGCYQLCNSALEKMYHS FT FETLPQ" FT misc_feature 126939..127199 FT /note="Pfam match to entry PF00153 mito_carr, Mitochondrial FT carrier protein, score 24.2, E-value 2.3e-05" FT misc_feature join(126939..127007,127101..127169,127647..127715) FT /note="3 probable transmembrane helices predicted for FT 1MB.254 by TMHMM2.0 at aa 10-32, 64-86 and 246-268" FT misc_feature 127473..127754 FT /note="Pfam match to entry PF00153 mito_carr, Mitochondrial FT carrier protein, score 43.4, E-value 3.3e-10" FT CDS complement(127777..128766) FT /locus_tag="1MB.255" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.255, predicted protein, len = 330 aa, unknown; FT predicted pI = 9.4173; contains no predicted TM helices; FT some similarity to YIA6_YEAST, putative mitochondrial FT carrier yil006w (373 aa, Saccharomyces cerevisiae, EMBL: FT Z38113, CAA86245); Fasta scores: E():0.29, 24.000% identity FT (28.272% ungapped) in 225 aa overlap, (aa 24-228 of FT 1MB.255, aa 151-361 of YIA6_YEAST)" FT /db_xref="GOA:Q7YYF7" FT /db_xref="InterPro:IPR001993" FT /db_xref="InterPro:IPR018108" FT /db_xref="UniProtKB/TrEMBL:Q7YYF7" FT /protein_id="CAD98527.1" FT /translation="MNFPKIKLNEQWQLLRLYGVLNSPQLVLIYPVYTIQLKSMCLPYF FT ESYLENSNSIESAHSTTSSSRISRGIIYNLSGIKLIFDSLRCIYNEEGYWGLYRGFIPM FT TLHSYFTRILHDQLLKVYKRHSKVHDSFIKPVYVKSSAKYISEIITYPLLVISTQQAIF FT DVKRYNLGLQDDKDKIEESVIGIYSLIQLSVNSEEGITSLWKGVTAHIIFRITEDLIKN FT SLYSYLSINYQRAKDTEDKLLATKTKPHNKKHLKHGVSNFTTSLLSPLALISTVKRCQS FT SDHIGLCRGDVGIKEILGNVNWGIYFGQMAVNSIFIAIYLFDRENSLF" FT repeat_region 128767..128774 FT /note="(ta)4" FT repeat_region 128954..128961 FT /note="(ta)4" FT repeat_region 129115..129122 FT /note="(a)8" FT CDS join(129126..129377,129504..131234) FT /locus_tag="1MB.256" FT /product="Sec1-family protein, possible" FT /note="1MB.256, predicted protein, len = 661 aa, possibly FT car protein; predicted pI = 5.0416; contains Pfam match to FT entry PF00995 Sec1, Sec1 family; contains no predicted TM FT helices; reasonable similarity to Q9Y1I2, car protein (617 FT aa, Drosophila melanogaster, EMBL: AY069498, AAL39643); FT Fasta scores: E():1.6e-14, 26.364% identity (30.366% FT ungapped) in 440 aa overlap, (aa 239-644 of 1MB.256, aa FT 198-613 of Q9Y1I2)" FT /db_xref="GOA:Q7YYF6" FT /db_xref="InterPro:IPR001619" FT /db_xref="UniProtKB/TrEMBL:Q7YYF6" FT /protein_id="CAD98528.1" FT /translation="MDLIRLNCRKTLLSFLELVSEKDNENRLAPQIFIETSISSFLSLI FT TDIVDYNNFGFKRIQPLIISEDVEERLESIICDFEFDSNQKIDVCFYIGFVPFYSKVIL FT RLLHQQLASLKVTPECYSVNLFWAPIDETSLSMEFRNIFFDFHVYKEESSLQLVAYSLY FT WLLKFTNSTNVPIYSIGSAGVSVLEHLIRCFKENNSLSNILKPFDWNTLFETNDDRNYK FT YPEFKRSDKDLLTNTLEFTQYYLQNSNIKNFPEDDLTNEFPMGFDQVIIIDRRCDLVTP FT FSTPFSYHALLDFLFGVQKTYVDIPTKKVASDVYEESPHWKLPLFGDPLFAILKDLKLK FT DVGIYLHQKANELQSLYQEKEKLKDISAIGDFIRKLKGKQREQGTLAKHVNIATYLNEY FT FTKDCQTLRRLELEDSIMSDSHQSVTGVVKELSTKFSEAISFRGAEDSPLEDLLDQEDI FT QIEEIYRLLCLSCIIENGFKNKKVYEQIKKHILSVFGFEELYRMNILERVGLFKFDPNK FT KSYWQLIKRLLNLFVDESESENDISCVYSGYAPISTRLIEILCREMNNSKSGGYDSLKE FT ALNYVWGPSVELTPSMQSMIKHNSCLVVFVGGVTLGEIATLRKLQEIINKEIIVATTEV FT INHKSFFESCKKPESMSHLRNENSK" FT misc_feature 129219..131072 FT /note="Pfam match to entry PF00995 Sec1, Sec1 family, score FT -87.2, E-value 5e-08" FT repeat_region 129485..129493 FT /note="(a)9" FT CDS join(131286..131334,131387..132684) FT /locus_tag="1MB.258" FT /product="mRNA capping enzyme alpha subunit, possible" FT /note="1MB.258, predicted protein, len = 449 aa, possibly FT mRNA capping enzyme alpha subunit; predicted pI = 7.7862; FT contains Pfam match to entry PF01331 mRNA_cap_enzyme, mRNA FT capping enzyme, catalytic domain; contains Pfam match to FT entry PF03919 mRNA_cap_C, mRNA capping enzyme, C-terminal FT domain; contains no predicted TM helices; reasonable FT similarity to MCE1_CANAL, mRNA capping enzyme alpha subunit FT (EC 2.7.7.50) (449 aa, Candida albicans, EMBL: D83180, FT BAA11833); Fasta scores: E():7.1e-13, 29.017% identity FT (32.880% ungapped) in 417 aa overlap, (aa 9-421 of 1MB.258, FT aa 12-383 of MCE1_CANAL)" FT /db_xref="GOA:Q7YSX6" FT /db_xref="HSSP:1CKM" FT /db_xref="InterPro:IPR001339" FT /db_xref="InterPro:IPR013846" FT /db_xref="InterPro:IPR016027" FT /db_xref="UniProtKB/TrEMBL:Q7YSX6" FT /protein_id="CAD98529.1" FT /translation="MDIALNIEIPGVVLNDDSKRNEILKKVRSFCGWRQDTFPGSQPVS FT LNRQKLESCIGKNLYVACEKTDGIRLLLYAASRRVFLIDRNQKINMVKMTLPSSFWDTV FT YEVKSSNKNIENLETQKIFSGRNSELLNLDPTRDEHAQYFQQNTLLDGELVKDTIEVDG FT QKRYILRYLIYDCICIERDDTVKSLPLLERLKLAYLKVVIPKCKYDQNRSTISIDPTPF FT ELYLKDFFEVDEVPAILNFSRRLPHPSDGIIFTPVHLPYVPGTCPQLLKWKPPHLNTAD FT FAAIFYAESESYDSRVFLELLVGIRGVRASVNCFCVPKGSVYNQLVDQFKLYRTSGQIL FT ECYYDENAIYSKPTKSEDGNILWNKPFTTVQGGWIVERIRSDKNSPNDINTVNRVFESI FT RDGINSEVLINTIKLYQKSGKKSVVEYCNVPEFVTNCRKGQIEDKRSEI" FT misc_feature 131409..132104 FT /note="Pfam match to entry PF01331 mRNA_cap_enzyme, mRNA FT capping enzyme, catalytic domain, score 113.6, E-value FT 2.5e-31" FT misc_feature 132111..132515 FT /note="Pfam match to entry PF03919 mRNA_cap_C, mRNA capping FT enzyme, C-terminal domain, score 1.8, E-value 5.5e-05" FT CDS complement(join(132711..134159,134207..135265)) FT /locus_tag="1MB.260" FT /product="DNA topoisomerase III beta-1, probable" FT /note="1MB.260, predicted protein, len = 836 aa, probably FT DNA topoisomerase iii beta-1; predicted pI = 9.0072; FT contains Pfam match to entry PF01131 Topoisom_bac, DNA FT topoisomerase; contains Pfam match to entry PF01751 Toprim, FT Toprim domain; contains no predicted TM helices; good FT similarity to TP3B_HUMAN, DNA topoisomerase iii beta-1 (EC FT 5.99.1.2) (862 aa, Homo sapiens, EMBL: BC002432, AAH02432); FT Fasta scores: E():5.2e-144, 44.628% identity (46.154% FT ungapped) in 847 aa overlap, (aa 6-834 of 1MB.260, aa 4-840 FT of TP3B_HUMAN)" FT /db_xref="GOA:Q7YYF5" FT /db_xref="HSSP:1CY0" FT /db_xref="InterPro:IPR000380" FT /db_xref="InterPro:IPR003601" FT /db_xref="InterPro:IPR003602" FT /db_xref="InterPro:IPR006154" FT /db_xref="InterPro:IPR006171" FT /db_xref="InterPro:IPR013497" FT /db_xref="InterPro:IPR013824" FT /db_xref="InterPro:IPR013826" FT /db_xref="UniProtKB/TrEMBL:Q7YYF5" FT /protein_id="CAD98530.1" FT /translation="MSPIKVLMVAEKPSISDTISRILSNGKLDTRRGKTPVHEFNGTFM FT GMNVQFRVTSVAGHVFEIDFPQSYSNWEKTDPVSLFDAPIIKNESSSKMVSHLEKESKG FT CSYLVLWLDCDREGENICFEVINVVSNNMSKGRDQRDWIFRAKFSSISPGDIFFAMKNL FT TFPNKNESDAVDVRQELDLKIGVAFSRLQTKYLKSKFGDFNKSSIISYGPCQTPTLFFT FT VQRRDSINSFIPEKYYTISATLSKDSQDINLIWSRSRVFELQVANCFLQLIQNKKPLSA FT RVVDITSKNSRRIRPLPLNTVSMLKLSSTILGIGPFQTLNIAEKLYLSGFTTYPRTETS FT RYPKNFDIKSTIAMFKNNSVWGSYSSDLLQKGFNLPRKDGLDLGDHPPITPVRSATQND FT LDGDSWRLYDLITRHFFATISNDIKMINRTIKFDLNGEIFIISGNQVIDHGFSVIQGRA FT ACHESNIIPDFTLNEVVPIKNIEILSKETKPPPILSESDLLSLMEKHGIGTDASMPTHI FT QKIQEREYVKLASGRRLEPTKLGIALVHGIMNIDHELVLPKIRSEMEKYVDLIAKGKYN FT HKAVLKHSLSVFKLKFLYFAENISLFESLFRLGFTNITASCSRISRCGQCKRYLTYISN FT ILPQRLYCSFCEIYLDIPQRGTIKIYKELKCPIDDFELLLFIDQKGKKSIFCPRCYNDP FT PFLDAKENMLCKFCPHQTCNFSLKSTFFMACPSQDCRDGIITMDSSSTSDWRFDCSRCS FT ISFSIKKNICERLSLGERCEKCGSRKLKICKEKTSQEHFVGCPNCDEEILSLIEVISLP FT KLAYRSDVAKNSFRGGGKRGKRH" FT repeat_region 132937..132945 FT /note="(t)9" FT misc_feature complement(133556..134779) FT /note="Pfam match to entry PF01131 Topoisom_bac, DNA FT topoisomerase, score 202.7, E-value 3.6e-58" FT repeat_region 134603..134610 FT /note="(a)8" FT repeat_region 134792..134799 FT /note="(a)8" FT misc_feature complement(134819..135253) FT /note="Pfam match to entry PF01751 Toprim, Toprim domain, FT score 69.7, E-value 4e-18" FT CDS 135861..136199 FT /locus_tag="1MB.261" FT /product="divalent cation tolerance protein, probable" FT /note="1MB.261, predicted protein, len = 113 aa, probably FT divalent cation tolerance protein; predicted pI = 4.6751; FT contains Pfam match to entry PF03091 CutA1, CutA1 divalent FT ion tolerance protein; contains no predicted TM helices; FT good similarity to O27553, divalent cation tolerance FT protein (105 aa, Methanobacterium thermoautotrophicum, FT EMBL: AE000911, AAB85984); Fasta scores: E():2.8e-14, FT 43.000% identity (43.000% ungapped) in 100 aa overlap, (aa FT 12-111 of 1MB.261, aa 4-103 of O27553)" FT /db_xref="GOA:Q7YYF4" FT /db_xref="InterPro:IPR004323" FT /db_xref="InterPro:IPR011322" FT /db_xref="UniProtKB/TrEMBL:Q7YYF4" FT /protein_id="CAD98531.1" FT /translation="MTETKIESNIILIYISAPNQDEATSIAKTLVDEELCACVSIIPSV FT RSIYKFKGQVHDENEVMLLVKTTSQLFTTLKEKVTEIHSYELPEIIATKVVYGNENYIN FT WVNQTVRS" FT misc_feature 135888..136193 FT /note="Pfam match to entry PF03091 CutA1, CutA1 divalent FT ion tolerance protein, score 132.0, E-value 7.2e-37" FT CDS complement(136228..137037) FT /locus_tag="1MB.262" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.262, predicted protein, len = 270 aa, unknown; FT predicted pI = 7.6244; contains no predicted TM helices; FT some similarity to NAOX_ENTFA, NADH oxidase (EC 1.6.99.3) FT (446 aa, Enterococcus faecalis, EMBL: X68847, CAA48728); FT Fasta scores: E():2.4, 28.750% identity (31.507% ungapped) FT in 80 aa overlap, (aa 106-178 of 1MB.262, aa 351-430 of FT NAOX_ENTFA)" FT /db_xref="UniProtKB/TrEMBL:Q7YYF3" FT /protein_id="CAD98532.1" FT /translation="MKSSIDKNINTNVNIRSLKAKVLECKRKLLKSEFWGTIVEHLETE FT IFENWIKGRKCSSSRCILRGRCICYNNVFFVIGLGSLESTINSNLSSFFQLAAVIALNE FT RINLEGIYFCDPEFTQADRELIFELFMEFKTHIEVFTSHDLSIPLNTVNHQLKNKLYGR FT DSQINILFFMPHCDRCVFGMIINYFNSGEGKTLYSDSARMIVWGNNLETFQIDSNENFN FT KRCEYCKVLSTSLSKMNFSTINFPVNDYKTIFCSSFSDLSIYHLPVS" FT repeat_region 137304..137314 FT /note="(t)11" FT CDS complement(137721..138281) FT /locus_tag="1MB.263" FT /product="conserved NAC domain protein" FT /note="1MB.263, predicted protein, len = 187 aa, probably FT predicted pI = 9.8212; contains Pfam match to entry PF01849 FT NAC, NAC domain; contains no predicted TM helices; good FT similarity to AAH22371, (158 aa, Homo sapiens, EMBL: FT AAH2237, AK027750); Fasta scores: E():6.5e-13, 40.580% FT identity (42.424% ungapped) in 138 aa overlap, (aa 52-183 FT of 1MB.263, aa 14-151 of AAH22371)" FT /db_xref="InterPro:IPR002715" FT /db_xref="UniProtKB/TrEMBL:Q7YYF2" FT /protein_id="CAD98533.1" FT /translation="MEVVARKKFWRSLEFIRTDKYLYKMKPDIKVDDTIEQARQKLRDR FT FGVGTTQVGGKGTARRKKRAQKPTGVDVKKLQAVTSRFRCQTFPAIGEVTMMKKDGTCL FT HFSNPKLQASVATNTYILTGNPQEKLIKDLPQQINPMDLSAFLNDPKFQKLLEESQANK FT LKMASGEDDDIPDLVENFEDVEE" FT misc_feature complement(137892..138071) FT /note="Pfam match to entry PF01849 NAC, NAC domain, score FT 42.0, E-value 8.5e-10" FT CDS complement(138491..139282) FT /locus_tag="1MB.266" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.266, predicted protein, len = 264 aa, unknown; FT predicted pI = 9.9965; contains 6 predicted TM helix FT regions; some similarity to Y243_MYCPN, hypothetical FT protein mg243 homolog (224 aa, Mycoplasma pneumoniae, EMBL: FT AE000049, AAB96145); Fasta scores: E():0.58, 24.352% FT identity (29.193% ungapped) in 193 aa overlap, (aa 63-248 FT of 1MB.266, aa 28-195 of Y243_MYCPN)" FT /db_xref="UniProtKB/TrEMBL:Q7YYF1" FT /protein_id="CAD98534.1" FT /translation="MEEKKTVKNKVRIVEGEKKVTSNAKLRLVYDALSVTYFSVLWMMM FT FLDCGPNYFSFFYYLTNWGCTSTLIFYFIATLVDYERLVKVNVSTRLIHSCTFVRELSI FT SLQSVIVPFFWIIVYPKEKWRSITWEVQMHGMGLIFICIDYLIRTSNFSSLSSKNLFMV FT VLSYLTLNYFVVNKLETMIYPGITYNTVESWIVVTLAIALVLIAHQLASIITICLVVKN FT KIWRKYEKDDFVSKSIKIVQKAQKLRHKDGVRKRLGSIFVG" FT misc_feature complement(join(138626..138691,138737..138802, FT 138842..138892,138923..138988,139049..139114, FT 139145..139198)) FT /note="6 probable transmembrane helices predicted for FT 1MB.266 by TMHMM2.0 at aa 28-46, 56-78, 98-120, 130-147, FT 160-182 and 197-219" FT repeat_region 139111..139118 FT /note="(a)8" FT repeat_region 139290..139297 FT /note="(t)8" FT CDS complement(139663..142368) FT /locus_tag="1MB.267" FT /product="glycogen phosphorylase 1, probable" FT /note="1MB.267, predicted protein, len = 902 aa, probably FT glycogen phosphorylase 1; predicted pI = 6.6500; contains FT Pfam match to entry PF00343 phosphorylase, Carbohydrate FT phosphorylase; contains no predicted TM helices; good FT similarity to PHS1_DICDI, glycogen phosphorylase 1 (EC FT 2.4.1.1) (853 aa, Dictyostelium discoideum, EMBL: X62142, FT CAA44069); Fasta scores: E():5.5e-174, 53.563% identity FT (54.733% ungapped) in 842 aa overlap, (aa 42-882 of FT 1MB.267, aa 20-844 of PHS1_DICDI)" FT /db_xref="GOA:Q7YYF0" FT /db_xref="HSSP:1BX3" FT /db_xref="InterPro:IPR000811" FT /db_xref="InterPro:IPR011833" FT /db_xref="UniProtKB/TrEMBL:Q7YYF0" FT /protein_id="CAD98535.1" FT /translation="MGDSVFTRDNYNFEMRRKASFSKLTGAVPRGMTGMYLDDFDPTAD FT KRREKLWYLMESYLPTDIESIQRSIVNHVEYTLARTRFNFDDNAAYRATAYSIRDRLIE FT NLNDTNEYFNERDCKRCYYLSLEFLLGRAMQNALVNLDIEENYRKSLFDLGYNLEALYD FT NEHDAALGNGGLGRLAACFLDSLATKNYAGWGYGIRYTYGIFEQKIVQGRQFEHPDYWL FT VQSNPWEIERQDVTYGVRFYGHVREFEEHGKKKFRWVDGEVIQAVAYDNPIPGFDTYNC FT INLRLWKATPSREFDFNAFNEGKYVDAVCARQRAEYITSVLYPNDNTEQGKELRLKQQY FT FFVCATIQDILRRFKKSGKVDWSELPKKVSCQLNDTHPTIAVAEMMRILIDVEELDWDF FT AWNITSECFNYTNHTVLPEALEKWSSSLFSKLLPRHLMIINEINYRFLNDVRAVLGDGP FT WISKMSIYEEGWDKKIRMANLAVIGCRKVNGVAVIHSEIVKKDLFSDFVEYYRRKGIND FT KFINVTNGVTPRRWVNCANPKLSHLISNWLGSDSWLTNFDMIRSLQNNIDDLSLQKEWA FT EVKLSNKERLAKWVEINTGYKVSTSMLFDIQVKRIHEYKRQLLNLFYIIHRYLTLKHIS FT PEERKKFVPRCCFFGGKAAPGYATAKTAIKMMNNLSVIINNDPDTKDYLMCVFLPNYNV FT SNAQIIIPASDISQHISTAGTEASGTSNMKFVMNGGLIIGTLDGANVEIREECGNETMF FT IFGALEQEVEHIRNRAREGNYPIDQRLHDVFNFIRTGGIMLGDGKAQGEFCEIVNKICS FT NGEGQIGDFYLVCHDFPLYCDAQMRVDQAYRDQTTWVKTCIKAASSMGKFSTDRTIEEY FT ATAIWELEQCERPAPEACKKLSGYSPNKSK" FT misc_feature complement(139729..141921) FT /note="Pfam match to entry PF00343 phosphorylase, FT Carbohydrate phosphorylase, score 1357.6, E-value 0" FT repeat_region 140446..140457 FT /note="(tttc)3" FT repeat_region 142607..142615 FT /note="(a)9" FT repeat_region 142763..142772 FT /note="(t)10" FT CDS 142791..143270 FT /locus_tag="1MB.268" FT /product="ribosomal protein L21, probable" FT /note="1MB.268, predicted protein, len = 160 aa, probably FT 60S ribosomal protein l21; predicted pI = 11.0834; contains FT Pfam match to entry PF01157 Ribosomal_L21e, Ribosomal FT protein L21e; contains no predicted TM helices; good FT similarity to RL21_CYAPA, 60S ribosomal protein l21 (161 FT aa, Cyanophora paradoxa, EMBL: AF092950, AAC64142); Fasta FT scores: E():1.1e-30, 55.634% identity (55.634% ungapped) in FT 142 aa overlap, (aa 1-142 of 1MB.268, aa 1-142 of FT RL21_CYAPA)" FT /db_xref="GOA:Q7YYE9" FT /db_xref="InterPro:IPR001147" FT /db_xref="InterPro:IPR008991" FT /db_xref="InterPro:IPR018259" FT /db_xref="UniProtKB/TrEMBL:Q7YYE9" FT /protein_id="CAD98536.1" FT /translation="MPHSFGKRARTRSKFSKGFRQKGVPMLSRYLKPIKVGDYVDIVVD FT SSIHKGMPYHFYHGRTGVVYNVAPRALGVIVNKVVGNRQIAKRINVRIEHVRLSRCNED FT FLKRVKANDAARHEAHVAGLPSPVTKRVPQLPREGGFVDCSNMEVLTPHITVSIC" FT misc_feature 142794..143090 FT /note="Pfam match to entry PF01157 Ribosomal_L21e, FT Ribosomal protein L21e, score 162.2, E-value 5.9e-46" FT repeat_region 143306..143314 FT /note="(t)9" FT CDS complement(143470..145437) FT /locus_tag="1MB.269" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.269, predicted protein, len = 656 aa, unknown; FT predicted pI = 7.9847; contains 2 predicted TM helix FT regions; signal anchor predicted; some similarity to FT O13704, putative protein disulfide isomerase c13f5.05 FT precursor (EC 5.3.4.1) (363 aa, Schizosaccharomyces pombe, FT EMBL: Z99091, CAB11768); Fasta scores: E():0.057, 24.845% FT identity (27.586% ungapped) in 161 aa overlap, (aa 86-239 FT of 1MB.269, aa 49-200 of O13704)" FT /db_xref="GOA:Q7YYE8" FT /db_xref="InterPro:IPR012335" FT /db_xref="InterPro:IPR012336" FT /db_xref="InterPro:IPR017905" FT /db_xref="UniProtKB/TrEMBL:Q7YYE8" FT /protein_id="CAD98537.1" FT /translation="MNRFSWKYKRENHLIDKSNGILSRMLWSRYIFCCLLFFIWYICTW FT FVGAHLLRIETKEGIYTKSAFIKELSSVGELRDIVLNDLHGPKVIIFYSSFCAYCHMAS FT NPLKKVAESLTPTGVKFYAFECGKGYSECSIWGIDGLPNLRLIGPEDKTINLESLINYT FT EFSDINCTRKSEKHIVFPEKHIPKLMEVPYLKHLKSKSISMPVINEETFLMCSIIRAFD FT LSNVFKPLNNSVLSTSAISQTKTGISNHFGRWSEESMQISPSHAIVDAITTKFYILHNW FT VFFGNNVVNTSQFLEKRRLNALYRFVETSWVLIPSKRTRAKLEEILVFLKNYMDNRDNS FT IYSKLSLESWQSFIKTVVVEGISTTQNGSDPTFYICKKSLFCGIWLLFHSWSISLLKGV FT QQQGKGCPLYNGPSLTPGQVVNRIAETVKYFMVCQSCKEHFETMINNNTCDRTSYIPPM FT NNNKFPVLLYEAEGLVFWLFRVHNLVTLRVATESSYEHLKQKRSSSISYVGTGVSFPPI FT GSCFDCYRPNQTPAEVTNQMLSSINDLTDDDYDKDIFEQGPVVAFLEAYYWKEGWILPK FT TTLDLQKSELASAYSFPNSNQIDPFDSLNSFKRSAEIQVGSVFDIFPVLYPILTFFIFS FT LTVFYLVETGPFLQEQKIAI" FT misc_feature complement(join(143509..143574,145294..145359)) FT /note="2 probable transmembrane helices predicted for FT 1MB.269 by TMHMM2.0 at aa 26-48 and 621-643" FT repeat_region 144953..144964 FT /note="(taat)3" FT misc_feature complement(145291..145437) FT /note="Signal anchor predicted for 1MB.269 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.001, signal anchor FT probability 0.999) with cleavage site probability 0.000 FT between residues 49 and 50" FT CDS 145663..146649 FT /locus_tag="1MB.270" FT /product="f24b9.20, possible" FT /note="1MB.270, predicted protein, len = 329 aa, possibly FT f24b9.20; predicted pI = 4.2920; contains no predicted TM FT helices; reasonable similarity to Q9LQP6, f24b9.20 (595 aa, FT Arabidopsis thaliana, EMBL: AC007583, AAF75084); Fasta FT scores: E():1.3e-12, 32.906% identity (36.321% ungapped) in FT 234 aa overlap, (aa 5-230 of 1MB.270, aa 373-592 of FT Q9LQP6)" FT /db_xref="GOA:Q7YYE7" FT /db_xref="InterPro:IPR007282" FT /db_xref="UniProtKB/TrEMBL:Q7YYE7" FT /protein_id="CAD98538.1" FT /translation="MDAKNQVQKSSSSSCDQEDTSPQNNTINSQNQCDQQTSTQNNVDK FT DLSLEDILYETNQQAISYNRTNYGLLGILNVIRMTDSDLNILALGTDLTTLGLNLNSSE FT CLYLNFDSPWSSSKPAQPESETNEIIQAFANTPNNVSQIIGLKSTYVQKFALETLFYIF FT YNMPQDLLQGFAAVELCNRGWLYYPDSLQWYSKVQNEEKQTAEWQVFDTDKWCKVPISD FT PPSSNLLSIDEIRPSVEEGVRIHSKWIQEQNQIYQIVQQQMQQQSNSRNKGSEPQINSN FT MNNISDFSSAYNQRSIQQTFSQAGNNQTINSNNYSNFRSSNRHNSSF" FT repeat_region 145692..145703 FT /note="(ttc)4" FT variation 145712..145765 FT /note="Sau3AI restriction fragment sequence absent in clone FT 5" FT repeat_region 146672..146679 FT /note="(ta)4" FT repeat_region 146817..146825 FT /note="(t)9" FT CDS join(146927..147277,147314..149320) FT /locus_tag="1MB.271" FT /product="e3 ubiquitin-protein ligase, probable" FT /note="1MB.271, predicted protein, len = 786 aa, probably FT e3 ubiquitin-protein ligase; predicted pI = 4.6641; FT contains Pfam match to entry PF00632 HECT, HECT-domain FT (ubiquitin-transferase); contains no predicted TM helices; FT good similarity to Q9XZN5, e3 ubiquitin-protein ligase (876 FT aa, Mya arenaria, EMBL: AF154109, AAD34642); Fasta scores: FT E():7.8e-62, 43.115% identity (45.368% ungapped) in 443 aa FT overlap, (aa 350-784 of 1MB.271, aa 447-875 of Q9XZN5)" FT /db_xref="GOA:Q7YYE6" FT /db_xref="HSSP:1C4Z" FT /db_xref="InterPro:IPR000569" FT /db_xref="UniProtKB/TrEMBL:Q7YYE6" FT /protein_id="CAD98539.1" FT /translation="MELCDDSSVLFKILFERCFQEYLLLCCDPTESAANAINLANELAR FT KCSTKDEKQLLIFDLEGLYMMIEKYNLCDRILGSYKRLINIKNCINNLKEINYSLEQCK FT QVLAKHNMSVIDILFLIEEGDDFEIKIDIAHLKMFTDWIELMSREYFELTILKNGINNE FT PISNIISLAQSNVIEMIRNNFNVTSEVTQHFDEKVGNRIIKPFQIRFILVLLQGNILEH FT GFGGFSALRSTLNLISLFHNNIQEVLKNWFIKLPLEYLEQAVSTLQQMISVRFYELYDD FT HNYENLDIIRFPFRNIASSSIKSLKPAFLNDLRISSNLLRILFDANKERGIITKNFDFS FT NNNRLDISLFTNEAINSAKALLQYELRQWFLSNPPSNDTEFGLLKNAYLLEPSIKAQAL FT QQDSLMQQRLELQSSISQALGSVESLFNPTQALIQPFLVLKVHRDSIVNDSMEQLVIQS FT NLKKQLKVSFVGEEGVDEGGVQKEFFQLLVQEIFNIDFGMFIYYEDTRLFWFNMASLES FT NGEFELIGIVLALAIYNGIILDVHFPLAVYKKLLGYKVDIGDLYEIQPEVANSLLSLLS FT IKSDSEMEQLCLTFSATINNFGVMAEVPIAPGEFDPSEPVTICNVHRYVELYLDWFLNK FT SIESQFRAFYNGFQSVCGGRTLELFSPEELVLVICGSSDFNIDSLIEASQYQDGYTKDS FT TTVVMFWEIVKKLDLKLQKKLLFFVTGSDRVPMKGLGELGFVIGRHGPDSDLLPTAHTC FT FNFLLIPDYQNKEKLERLLLIALEHSKGFGLK" FT misc_feature 148379..149281 FT /note="Pfam match to entry PF00632 HECT, HECT-domain FT (ubiquitin-transferase), score 234.3, E-value 1.1e-67" FT repeat_region 149097..149104 FT /note="(a)8" FT tRNA 149564..149646 FT /note="tRNA Ser anticodon TGA, Cove score 63.92" FT repeat_region 149713..149720 FT /note="(a)8" FT repeat_region 149798..149806 FT /note="(t)9" FT CDS join(149859..150392,150474..150789,150829..150973, FT 151016..152941,153035..154313) FT /locus_tag="1MB.273" FT /product="rhoptry protein, possible" FT /note="1MB.273, predicted protein, len = 1400 aa, possibly FT rhoptry protein; predicted pI = 6.2684; contains no FT predicted TM helices; signal peptide predicted; reasonable FT similarity to Q26223, rhoptry protein (2269 aa, Plasmodium FT yoelii, EMBL: L27838, AAA21304); Fasta scores: E():0.0023, FT 21.248% identity (25.398% ungapped) in 1426 aa overlap, (aa FT 128-1396 of 1MB.273, aa 82-1431 of Q26223)" FT /db_xref="UniProtKB/TrEMBL:Q7YYE5" FT /protein_id="CAD98540.1" FT /translation="MPERFKGIKINTIWAIFRVQFLVLLLFLNNLACENEILDESERNA FT ALINSSSHTNPDSIQNEVANPMNEITLLSSSGEISSFNAFELNVQDQCNQLWTYRNKLV FT HEVSNIVKLHKQFVIFLNKIDPKKNTVHLEEIFKKSLDLSLASNRNEEINKKFKEYLGL FT AVDYRFISSDKDESTYISNFDICVKKKKKVIKSGSDSLNALINLSKLLNISENALYDEN FT IGYLEQISSITNNLDSNLSFIKTLKKLQEVCDKGLAQDDQSNKKSKIKWLKKNSNKINK FT TMIHNLLNCINHFQTSNSINDNTLAMVNTIGNQIVNHSINRRNVLSVISSRLFSLWLFV FT QKRLTTIQIAIKVMKESKELTNRMNSVFSMDEAIDLLTKCSETLRKTASDWNVFFSDKN FT TLIIKKLHKETLEQLNHYKFEGDRNTLAMYMEWIWLNNHIQSVIEQGYDAFEGIIKGTL FT KSPLTNCAKNLSSLYKNLKSEFSAKENFSEKTIKKINKEYRSVLNDVSGHLLKQIDSLF FT SLKHSFLWPHNNATLISAISMYNRTSEFVSSINLPQEQNSFEKPNYSYELLKQQQIITN FT GFSTLYENIKTRFNWDLSLFQTEFGYVKNINLGLDTEETINFSKLKVYQDSSEQFGEWF FT ESTLEFLNLKENFSKAKNILEQAIQFRLQFIQQWSDIIAKIINKVGNPTALSISKISEE FT AYTYYLQYQVSKSKEFSNYFESDLSILKYVDVVDNSLSDLNRESFVSKFETTNAPNLDG FT SYHGDDEFPSVSSIKSYLTDLKNRVDHYCSYEQKRNKNDEDYLSIVRQICDKFSIIFST FT INQNLKGVDSKLQRVYSDLESQKEDISKLDTIIQDCISLDNKVGRVKCWIPNQSIELPL FT RRALQIRKVMNNWLRETIKTKVLVFSNFIEDCLKLNFKGSLIFQNNESNKVEQTQIERI FT NSFQLLLKLINDLMPNIVEELNTLETIYLKYYQNKINEITSASKFPKLLLSKDEVHNIE FT SSTFAFDSPVNNNATYAADLTDYIKNNQLSSNNHTILLGKYLKALSTLKHRIEDIPQIN FT DNNSSLLNENLKSEIEFEKYRNIEEKSLLILGESDKYLKLANIEHFRFYTIRNITGNIT FT DVISKLSEEIKLKEEKLEKREEEIANTNCYRRWKWKKRVKRRKMKKIKDEKIDKMIKIN FT KSNANLNNLKLKKKEANGKLKLFLTKTINIENFDTSCNPVEDLESISYDDKSDLESFQF FT DDLYKFDERPGRYNPKNQEEQEIEYDDDTGGTTEYGKKPNSLISKISTAVDMAVTINNS FT IRQYKEIKNELEQSGIEFGQENSGVNLMAGFDVLNALTGGPTEDLSAMLGSINEQDQVF FT NEADILGELQTGEMPNGVEKEVNSRPFEGLDISYDDFKRKVEFELKNSDLNFDFDD" FT misc_feature 149859..149957 FT /note="Signal peptide predicted for 1MB.273 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.926, signal anchor FT probability 0.003) with cleavage site probability 0.538 FT between residues 33 and 34" FT repeat_region 150237..150244 FT /note="(a)8" FT repeat_region 150501..150515 FT /note="(a)15" FT repeat_region 150755..150763 FT /note="(a)9" FT repeat_region 151198..151206 FT /note="(t)9" FT repeat_region 154323..154330 FT /note="(t)8" FT repeat_region 154350..154366 FT /note="(a)17" FT repeat_region 154408..154419 FT /note="(agtc)3" FT repeat_region 154718..154726 FT /note="(t)9" FT repeat_region 155032..155046 FT /note="(a)15" FT CDS 155183..156163 FT /locus_tag="1MB.276" FT /product="farnesyltransferase, possible" FT /note="1MB.276, predicted protein, len = 327 aa, possibly FT similar to farnesyltransferase, caax box, alpha; predicted FT pI = 5.3063; contains five Pfam matches to entry PF01239 FT PPTA, Protein prenyltransferase alpha subunit repeat; FT contains no predicted TM helices; reasonable similarity to FT Q921F7, similar to farnesyltransferase, caax box, alpha FT (377 aa, Mus musculus, EMBL: BC012711, AAH12711); Fasta FT scores: E():1.9e-20, 27.742% identity (30.605% ungapped) in FT 310 aa overlap, (aa 22-326 of 1MB.276, aa 80-365 of FT Q921F7)" FT /db_xref="GOA:Q7YYE4" FT /db_xref="InterPro:IPR002088" FT /db_xref="InterPro:IPR008940" FT /db_xref="UniProtKB/TrEMBL:Q7YYE4" FT /protein_id="CAD98541.1" FT /translation="MQELDSLENTNIETIEYSVDFQNEGVCKFLFKPDHYALFSKLKSL FT LDNECFDLENLDISTQVIDLNPQHYTAWYFRRKIIRENYVEHENKTEFLREELRFVRGI FT CERAPKCYQSWWHMRVIRELLGFDIEELNFISKQLEFDAKNMYVWNHRTWFIRKYNSVE FT NDLLISELDFISKLISEDCRNNSAWCYRHFIFTNLKKMNALKESDLLEEVDYIVNWLMF FT APHNDSIWNYIISFFSKIMVNGNVNKETLIKNLSLENAPKSFIDAIDEIYTNHYDSCHQ FT VVYIKACMEYEKGDTDFALKAFKLLQSVDPIRKFYWKWRADNLKT" FT misc_feature 155342..155434 FT /note="Pfam match to entry PF01239 PPTA, Protein FT prenyltransferase alpha subunit repeat, score 21.6, E-value FT 0.0012" FT misc_feature 155468..155560 FT /note="Pfam match to entry PF01239 PPTA, Protein FT prenyltransferase alpha subunit repeat, score 11.2, E-value FT 0.044" FT misc_feature 155570..155662 FT /note="Pfam match to entry PF01239 PPTA, Protein FT prenyltransferase alpha subunit repeat, score 31.4, E-value FT 1.4e-06" FT misc_feature 155687..155779 FT /note="Pfam match to entry PF01239 PPTA, Protein FT prenyltransferase alpha subunit repeat, score 33.5, E-value FT 3.2e-07" FT repeat_region 155776..155783 FT /note="(a)8" FT misc_feature 155813..155905 FT /note="Pfam match to entry PF01239 PPTA, Protein FT prenyltransferase alpha subunit repeat, score 6.5, E-value FT 0.18" FT CDS complement(156510..156977) FT /locus_tag="1MB.277" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.277, predicted protein, len = 156 aa, unknown; FT predicted pI = 9.8784; contains 3 predicted TM helix FT regions; some similarity to YM02_PARTE, hypothetical 23.3 FT Kd protein (196 aa, Paramecium tetraurelia, EMBL: X15917, FT CAA34037); Fasta scores: E():0.27, 28.462% identity FT (32.174% ungapped) in 130 aa overlap, (aa 39-155 of FT 1MB.277, aa 37-164 of YM02_PARTE)" FT /db_xref="UniProtKB/TrEMBL:Q7YYE3" FT /protein_id="CAD98542.1" FT /translation="MDMYRYPGGLTSTDFHNFEINGSSRDQGFSFWASNLTCEERAITI FT VKRLYYSEFSKYLYLGIFLLNILVLFSGLFRAQSGSRFSIFLETIITLTLTFEVIIKLF FT LMKKRFFNKVNNVFDFVVALTCLSLLFLNGDIRRLFATKKIADLRNNQIVS" FT repeat_region 156545..156552 FT /note="(t)8" FT misc_feature complement(join(156576..156626,156663..156728, FT 156741..156806)) FT /note="3 probable transmembrane helices predicted for FT 1MB.277 by TMHMM2.0 at aa 57-79, 83-105 and 117-134" FT repeat_region 157009..157017 FT /note="(a)9" FT CDS 157178..158035 FT /locus_tag="1MB.278" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.278, predicted protein, len = 286 aa, unknown; FT predicted pI = 10.1026; contains no predicted TM helices; FT some similarity to O96171, phosphatase (2010 aa, Plasmodium FT falciparum, EMBL: AE001391, AAC71865); Fasta scores: FT E():0.04, 20.714% identity (22.925% ungapped) in 280 aa FT overlap, (aa 18-284 of 1MB.278, aa 177-442 of O96171)" FT /db_xref="InterPro:IPR007175" FT /db_xref="UniProtKB/TrEMBL:Q7YYE2" FT /protein_id="CAD98543.1" FT /translation="MQQIQSKLSIISKSINSKSLNSFEFNNNYNSQNPKFQIVEEQKSG FT NNSELEFLIDASKFYSLICPSLSSRLICKVANLTSEISQTKSYSDYFSGKICPKCFSVY FT LIGINCKLEVKPIKGIIRKRLKKKLLKLSKISGQNFQNYSFKNILISCKLCKFSFKKPY FT WEKKIKNTNRHEPLNDAQNNKNKVEDILNYSNFILSNADTSIQDQCRQNSNKSGNSQNG FT NPNTNKNHRFFNKTQKAGSMLQEKKNSYVDITNLTELNKDIKSEQSSQSTQGNSFYDIL FT SMLE" FT repeat_region 157911..157920 FT /note="(a)10" FT CDS complement(159451..160578) FT /locus_tag="1MB.280" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.280, predicted protein, len = 376 aa, unknown; FT predicted pI = 8.2587; contains Pfam match to entry PF03194 FT DUF259, Protein of unknown function, DUF259; contains no FT predicted TM helices; reasonable similarity to Q921Z3, FT unknown (491 aa, Mus musculus, EMBL: BC009092, AAH09092); FT Fasta scores: E():7e-22, 28.493% identity (31.231% FT ungapped) in 365 aa overlap, (aa 8-351 of 1MB.280, aa 8-361 FT of Q921Z3)" FT /db_xref="InterPro:IPR004882" FT /db_xref="UniProtKB/TrEMBL:Q7YYE1" FT /protein_id="CAD98544.1" FT /translation="MDEIRRQIEDLMGGIEAPIEKKNPHDNDVCKFYLCGLCPHELFEN FT TKLYMGPCKNIHSEVLREKYLSERESKGNTRIKYETDSLRVFQGMVDDCNKKIERNRVR FT AELSGSSKLEDENIRALDLEIKEIMKEIDDLGANGDIDGSLKRMEDLTRLNQQKMKISA FT TKEDVENGMYRQKLHPCEICAAFLSETDNDQRLNDHFNGKIHVGYLKIRKQAKDLKEWI FT KAHSPNRDSNYRHENIAHKGDFRYKSHRQSYSGEGRAYYYSNNYRDRRNREHMDYRDNY FT GNHSSRRENNQTRYEELGNNYKSTVQSDYNSFGPRNKHINTRSNPYSTVELNRISSEKH FT KKHIKEDTKIYPPSDGEISPESDSSRSLSRSISPY" FT misc_feature complement(159844..160578) FT /note="Pfam match to entry PF03194 DUF259, Protein of FT unknown function, DUF259, score 180.0, E-value 2.5e-51" FT repeat_region 160652..160667 FT /note="(attt)4" FT repeat_region 160719..160727 FT /note="(a)9" FT repeat_region 160866..160880 FT /note="(a)15" FT CDS 161107..164511 FT /locus_tag="1MB.281" FT /product="spm1 protein, possible" FT /note="1MB.281, predicted protein, len = 1135 aa, possibly FT spm1 protein; predicted pI = 5.3995; contains no predicted FT TM helices; reasonable similarity to O43949, spm1 protein FT (771 aa, Theileria annulata, EMBL: Y15794, CAA75786); Fasta FT scores: E():8.9e-07, 22.899% identity (26.333% ungapped) in FT 690 aa overlap, (aa 483-1107 of 1MB.281, aa 9-673 of FT O43949)" FT /db_xref="UniProtKB/TrEMBL:Q7YYE0" FT /protein_id="CAD98545.1" FT /translation="MRNINFSSSSSENLTRSSVLRRQLHDQSHFLPSNSLVNSNNTVQD FT YRFLPKPTFRIEDNNLYSNLNQSNYNSNWKQTSLGLLKSPINGLSRSSQDLRNSLIEKT FT TEKSSKYVKTPYPHVYKPDSYLKNEISIPSHGSHLPLLPPSRFPFQSEARRTINRLSDS FT TTKYNLGSNDYDLQRDTQTCRKRNLQQVYKQKNENEHEEEEFHDAISDDEYISNESMNN FT PNKRPNIPPQKKTYNQTPSTAAQTGLLGTQNNTPNKPTNTYGLPLSMIDDDYKPDPTPA FT YNFSDEKLRDVQKQANPLGLTQYSHPYNFPKASFLPPNTEPSSNIIHSSVQRIRTPLRY FT RSSLLGALSNRTNSSCRILSAAEMKKVFCKSKKPIGSLKDFVHTIELRNETLNEPKTNS FT ELPKVPEVHSEIEHAIEDKSSKNDKDSSPEKDSKNNKEDIISSKSECLNQSTLEVSNIL FT SKIADKSISPEKSDGDKELVEKVNNVDNLQVSSPNIFVNDSKLPVLSEQDNKSNITTID FT IEATVEKRDDIKETKEVVSEDFSLKEKDSEVTEAKKDEPVPWWLANVDKPNLVEVDNEG FT VFMPDEDDEKKDNEEKPTISAAGLFSLPSTEGSSDKNIATSLFSFGNSSTNACVSTGTS FT LFGSSLFQINKTSENSAAPEEPKPEEKALPLLNSQSSTFSFGQGIDNSASSNISLFGIT FT SKTVEEKTTIGEIEKPISETVVDTEKETTDSSLKPQLSIFGRPSPVDEEKKQAENSEKK FT VGIFSTDNPGSLFSSTNNISGLFSQQSSSMNQLFGSSSASSSTQNAFSTKDIFSLSSSG FT VSTKINDTEKREVSAPIFGSSDMESKNPASDVSLSASLGLGNLNSSSNMSFTGGSTASQ FT TTLQTPGLFDAKLINFTGSTNSSTAAASSSLFGGSNPSNNTLGLGNSLFSTPTNNNSMA FT SGLFPNPNTSTSNNIPSTTAVNSIFGAQQVPVSDVNPTQNTLNFDSNIAGSSNSVPGSN FT NINIFSQTPSIFGSSNSSNPFVFGSNNKPNDQTVTQTPGQNLEQSSGSNIFGSPQGIFN FT STNKAETSIFGGMPSNPVATQFDPKPQVSGQPPINTSIFGSNPTNLLGASPLVGNNSNP FT NPFVGHSSNPAGGTSHRRRIARAKRSH" FT repeat_region 161801..161809 FT /note="(a)9" FT CDS complement(164674..165771) FT /locus_tag="1MB.286" FT /product="putative phosphatidylinositol-4-phosphate FT 5-kinase, 11335-7537, possible" FT /note="1MB.286, predicted protein, len = 366 aa, possibly FT putative phosphatidylinositol-4-phosphate 5-kinase, FT 11335-7537; predicted pI = 5.7779; contains fourteen Pfam FT matches to entry PF02493 MORN, MORN repeat; contains no FT predicted TM helices; reasonable similarity to Q9C962, FT putative phosphatidylinositol-4-phosphate 5-kinase, FT 11335-7537 (769 aa, Arabidopsis thaliana, EMBL: AC018908, FT AAG51639); Fasta scores: E():6.3e-25, 37.624% identity FT (38.776% ungapped) in 202 aa overlap, (aa 117-318 of FT 1MB.286, aa 14-209 of Q9C962)" FT /db_xref="GOA:Q7YYD9" FT /db_xref="InterPro:IPR003409" FT /db_xref="UniProtKB/TrEMBL:Q7YYD9" FT /protein_id="CAD98546.1" FT /translation="METSSHSYSGDIKGGLFHGRGVLIYSKNEKYEGDFVMGKREGFGK FT FTYADGASYEGEWVDDKIHGQGKASFSSGNTYEGQWENGKINGYGKLTFSNGDVYEGEW FT VDGKMHGRGVYKYVDGDIYSGEWRDDKRHGKGTVTYVSSTGDQIIEKYEGDWVNGKMHG FT HGKYVYVDSAVYEGDWFEGSMHGKGTYIFPCGNVYEGEWVNDVKEGYGVLTYQNGEKYE FT GYWKDGKVNGKGTLTYSRGDKYVGDWLDAKKHGEGELFYSNNDRFKGNWVADKACGFGV FT YTYANGNRYEGYWENDRRHGKGIFYCAEDNNVYEGEWANGRKDGKGILRFAMGHSIQGV FT WKDGVLSQFHSLQFPPESQWSNPNF" FT misc_feature complement(164764..164832) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 33.8, E-value 2.6e-07" FT misc_feature complement(164836..164904) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 26.8, E-value 3.3e-05" FT misc_feature complement(164905..164973) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 33.0, E-value 4.5e-07" FT misc_feature complement(164974..165042) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 24.4, E-value 0.00017" FT misc_feature complement(165043..165111) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 30.9, E-value 1.9e-06" FT misc_feature complement(165112..165180) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 36.4, E-value 4.4e-08" FT misc_feature complement(165181..165249) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 31.2, E-value 1.6e-06" FT misc_feature complement(165250..165318) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 40.9, E-value 1.9e-09" FT misc_feature complement(165337..165405) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 29.6, E-value 4.7e-06" FT misc_feature complement(165406..165474) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 44.3, E-value 1.8e-10" FT misc_feature complement(165475..165543) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 41.8, E-value 1e-09" FT misc_feature complement(165544..165612) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 35.4, E-value 8.6e-08" FT misc_feature complement(165613..165681) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 36.3, E-value 4.6e-08" FT misc_feature complement(165682..165750) FT /note="Pfam match to entry PF02493 MORN, MORN repeat, score FT 21.1, E-value 0.0018" FT repeat_region 165831..165842 FT /note="(aata)3" FT repeat_region 165970..165977 FT /note="(t)8" FT misc_feature 166034..166041 FT /note="tgcatgca" FT repeat_region 166242..166250 FT /note="(t)9" FT CDS 166554..169337 FT /locus_tag="1MB.288" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.288, predicted protein, len = 928 aa, unknown; FT predicted pI = 5.5167; contains no predicted TM helices;" FT /db_xref="HSSP:1FO8" FT /db_xref="UniProtKB/TrEMBL:Q7YYD8" FT /protein_id="CAD98547.1" FT /translation="MYSQIPISPSLRSKYNSGNQGVQRKINADDMNTAVSSYSNNMFAT FT NLDGYDVFDNVNKDSELNMPKIQKSSFPHGSYRPSSSNISINNGINIDSNTNNRPNIGR FT FSVNPGNLNVNGTTNQIQTSSAPMSTPLPAIPQPPVPFQNNSSISGGIQYQSPKNTIPT FT QPPNNLQPVNQISKPFLQNTNTNPNFQNMPIPYPPKAPLSDCNNSDSESSFSKKMQDNK FT ERLEQLNISRTMLASMGSPSSDSRHSYSQDKSQEADRTQNPFQSGKNSISHFGIQEPNS FT NKNTSRSNLRLPPKVPGISSNNAFKFNISQPAQNNQINLEMPEVSMKSVDSNAERNIVG FT LKDTVRPPTAPTSFPSNKLNLDNNSNLANNLVKPPFKSHNDLSASQDSFFSSDTQDTSK FT ISQIPRQTPSVVRPPTAPNSISKSSSLPKVDLLINLMDKTVENKRREFEKLNKLKSALS FT ELGDFNKKLLEENTRLRSGNNSTNLQEVSLLSRDLQNESTEFTNSYQTPLSDAVSPFIP FT SVHDPETQINSLKTIINKKNQRISELEKKLSNRESNNLSATEFDVCHSLLQKIKTRDED FT LSSTLEEIQETYINSITDQVNSCLSILTDKSYGKLKTIANDVLNQIEFNYKALSVTLEH FT LFEQQSSTLENIKHEGRVDTQLDSSNLQSKLSSREFNAHEVNLNNIQFDTVVSNGSKSF FT DNTFVQENAQDTFKFKKSQGTGDYSNNGDEFLVAQTKKMSLNGNVELDNEVNVPQCIIN FT SSIEVNHDESAMQNQHQELNLYQNSTFNHETQDHRNDYVQNQAFNTMNVAQGAFVNDHA FT NNHIYELHKDEDHSSSTQRVEQENSSHPAFGFEQEKHYNSHFGQYEGNNFSPEKENAQQ FT GADYTNSVPPPVFAAYYEGGDNENNNLFTMDTMVNADGLYFDNHSHYHVPTNYSNI" FT repeat_region 168156..168163 FT /note="(a)8" FT repeat_region 168743..168750 FT /note="(a)8" FT repeat_region 168997..169004 FT /note="(at)4" FT CDS complement(join(169445..170142,170176..171094)) FT /locus_tag="1MB.291" FT /product="histone acetyltransferase, possible" FT /note="1MB.291, predicted protein, len = 539 aa, possibly FT histone acetyltransferase; predicted pI = 6.9424; contains FT Pfam match to entry PF01853 MOZ_SAS, MOZ/SAS family; FT contains Pfam match to entry PF00096 zf-C2H2, Zinc finger, FT C2H2 type; contains no predicted TM helices; reasonable FT similarity to Q8WYB4, histone acetyltransferase Myst1 (430 FT aa, Homo sapiens, EMBL: Q8WYB4, AF217501); Fasta scores: FT E():3.2e-31, 37.685% identity (42.857% ungapped) in 406 aa FT overlap, (aa 133-523 of 1MB.291, aa 48-419 of Q8WYB4)" FT /db_xref="GOA:Q7YYD7" FT /db_xref="HSSP:1FY7" FT /db_xref="InterPro:IPR000953" FT /db_xref="InterPro:IPR002717" FT /db_xref="InterPro:IPR011991" FT /db_xref="InterPro:IPR016181" FT /db_xref="UniProtKB/TrEMBL:Q7YYD7" FT /protein_id="CAD98548.1" FT /translation="MALRSRSSNIQKQQANNTPSNTSNNNEAVSSELISSENQRKKTNA FT RIYSSIEEHSDKTLDECKEDEAVPTIEKGPTADFPSVNQWVLALFEEEGKQMLARVVGW FT KLYGTQVTHLSHLVNKKTGGNKQNSLIKVSSSSNESQIKEDYEFYVHFRGLNRRLDRWV FT KGKDIKLSFDVEELNDPVLIERFQKQGIKFISSLVVSNSANKSGNKSKKRNVGVLDISD FT GEDPDEHEGMDHSAILDHEETTRLRTIGRVRIGKFILDTWYFSPLPDEYQNVDTLHFCE FT YCLDFFCFEDELIRHLSRCQLRHPPVFEIDGALTRGYAENLCYLAKLFLDHKTLQYDVE FT PFLFYIVTEVDEEGCHIVGYFSKEKVSLLHYNLACILTLPCYQRKGYGKLLVDLSYKLS FT LKEGKWGHPERPLSDLGRAIYNNWWAHRISEYLLEYFKQNKICERGGSKQPLQVSNYSK FT FIDNVVRSTGIRREDVIRILEENGIMRNIKDQHYIFCNQEFLKGIVKRSGRPGITLIDK FT NFNWVPFSRAPPSEVESLPQE" FT misc_feature complement(169607..170191) FT /note="Pfam match to entry PF01853 MOZ_SAS, MOZ/SAS family FT score 271.3, E-value 8.4e-79" FT misc_feature complement(170195..170263) FT /note="Pfam match to entry PF00096 zf-C2H2, Zinc finger, FT C2H2 type, score 10.7, E-value 0.47" FT repeat_region 171115..171123 FT /note="(a)9" FT CDS 172295..174373 FT /locus_tag="1MB.294" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.294, predicted protein, len = 693 aa, unknown; FT predicted pI = 4.7495; contains no predicted TM helices; FT some similarity to NASP_RABIT, nuclear autoantigenic sperm FT protein (680 aa, Oryctolagus cuniculus, EMBL: M37893, FT AAA31423); Fasta scores: E():0.018, 23.827% identity FT (26.295% ungapped) in 277 aa overlap, (aa 7-268 of 1MB.294, FT aa 261-526 of NASP_RABIT)" FT /db_xref="UniProtKB/TrEMBL:Q7YYD6" FT /protein_id="CAD98549.1" FT /translation="MKDNIMSENDSGHISTENLLNNNNQEKDLSIDDNNVANTPSDDFG FT HKEEVTQDSSPVSLNSSPMQEHKSEVEDIKGEIAQTEKFDESVSTETNDAKQENLDEVK FT EDNTTEGNPMQDETNDSNIELQCANNSLAKGESAPSNSEEHSGSINMPELSLPYKLWID FT AKSRDWILEWQTKNGRWSVRKFSCKRWGKGKAYSHAMNFLASLTSCGIIKDSNYNQQLG FT DSNEYSAESVNHVNRSVEDELVLQFLSQRDAVANAEKQKNGFGSNSGLNTLAAIAAAAA FT AAAIASTPTKSQAPTKDHASSQSTATPQPQVRGAVRKSGVPGVYWSQKPQGWRVVYYTG FT KDREFEYFKVPANASEEIISEILEVAKRFRSQVTAEGRHLPNGAVGSSSKRARMAERKA FT AAAAAAAAAAAASAAPDAPRHLLQSRTNHFPEEGLTDYLRNPSDITSKSLLETDPLSQF FT SSKFSPANIAGLQNPTGAMSPGPYEWLYNPLLMNLYGNYSAASQAAAMQQWAMFQNSFS FT QNGGMLPSAIPGQVDTSTPQANPMLNSFLSNPFMNPFGLQVPPILGQNANLIAQMPQVS FT TMNPASQMNMNMFGRYYGMTLPQGQTNHSPAVSPIGQNANPASSAPSQPDSAMWFNQLN FT SAQQLMPNHDQLVQQPATSDSFNTSRADEDTSLISETTSKQTENLLEVSSSSNKTSS" FT repeat_region 173123..173140 FT /note="(gcagct)3" FT repeat_region 173510..173524 FT /note="(gct)5" FT repeat_region 174462..174469 FT /note="(t)8" FT repeat_region 174512..174519 FT /note="(a)8" FT CDS complement(174820..175494) FT /locus_tag="1MB.296" FT /product="excision repair cross-complementing rodent repair FT deficiency,complementation group 1, possible" FT /note="1MB.296, predicted protein, len = 225 aa, possibly FT excision repair cross-complementing rodent repair FT deficiency,complementation group 1; predicted pI = 6.5159; FT contains Pfam match to entry PF03834 Rad10, DNA repair FT protein rad10; contains no predicted TM helices; reasonable FT similarity to AAH08930, excision repair cross-complementing FT rodent repair deficiency,complementation group 1 (297 aa, FT Homo sapiens, EMBL: BC008930, AAH08930); Fasta scores: FT E():7.9e-28, 39.796% identity (40.415% ungapped) in 196 aa FT overlap, (aa 26-221 of 1MB.296, aa 101-293 of AAH08930)" FT /db_xref="GOA:Q7YYD5" FT /db_xref="InterPro:IPR000445" FT /db_xref="InterPro:IPR003583" FT /db_xref="InterPro:IPR004579" FT /db_xref="InterPro:IPR010994" FT /db_xref="UniProtKB/TrEMBL:Q7YYD5" FT /protein_id="CAD98550.1" FT /translation="MSSQEEDIVKKESQPKFFDDKAGEMIIASTRQRGNPILAHVCNVP FT YDFQNIVPDFLVGKYDAVVFISIKYHKLHNQYLRKRIESLQKNYKVRILLCLVDIPPSG FT AIDAAILEVTDICFDLNMTLFLAWSPKEAGHILETLKSHENSSSEIIRGGLSLDLFSRI FT RDALSSLPRINKTDSENLLKHFGSISKVVNASEEELSKIQGIGPIKAKVISEIFSTEFS FT DS" FT misc_feature complement(175222..175425) FT /note="Pfam match to entry PF03834 Rad10, DNA repair FT protein rad10, score 113.1, E-value 3.4e-31" FT repeat_region 175586..175593 FT /note="(at)4" FT repeat_region 175784..175797 FT /note="(a)14" FT repeat_region 176218..176225 FT /note="(ta)4" FT CDS 176253..177005 FT /locus_tag="1MB.297" FT /product="conserverd hypothetical MSP-domain transmembrane FT protein" FT /note="1MB.297, predicted protein, len = 251 aa, possibly FT hypothetical 26.9 Kd protein; predicted pI = 6.1357; FT contains Pfam match to entry PF00635 MSP_domain, MSP (Major FT sperm protein) domain; contains a predicted TM helix FT region; reasonable similarity to O44782, hypothetical 26.9 FT Kd protein (245 aa, Caenorhabditis elegans, EMBL: AF039720, FT AAB96705); Fasta scores: E():5.2e-13, 29.675% identity FT (32.018% ungapped) in 246 aa overlap, (aa 4-244 of 1MB.297, FT aa 3-235 of O44782)" FT /db_xref="GOA:Q7YYD4" FT /db_xref="InterPro:IPR000535" FT /db_xref="InterPro:IPR008962" FT /db_xref="InterPro:IPR016763" FT /db_xref="UniProtKB/TrEMBL:Q7YYD4" FT /protein_id="CAD98551.1" FT /translation="MSMEGAKLVRVHPEKALEFPLVLYSSVTTPLILENITSSTVAFKI FT KTTAPRGYLVRPSSGLIQAGQSKEIQVILQPLQSVEQASPSHRFLIQTTACDSSVEQLT FT KDFWQDLSKEQLFEHRLSVIFKQENMGVNEPLSSSAVGTSSNASQGVPTSLISSAAGSS FT SSNTAQAGSMDSEFKNKYDELVQYCLALEKQSNELKEEVVSLREKLDKSESKLKSSNQG FT NIAQGFEFWHIIAMIIVAIVALKLINYF" FT misc_feature 176274..176630 FT /note="Pfam match to entry PF00635 MSP_domain, MSP (Major FT sperm protein) domain, score 80.5, E-value 2.2e-21" FT misc_feature 176940..176999 FT /note="1 probable transmembrane helix predicted for 1MB.297 FT by TMHMM2.0 at aa 230-249" FT repeat_region 177140..177147 FT /note="(t)8" FT CDS 177416..182245 FT /locus_tag="1MB.298" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.298, predicted protein, len = 1610 aa, unknown; FT predicted pI = 6.7195; contains no predicted TM helices; FT some similarity to Q8YRF6, hypothetical protein alr3492 FT (414 aa, Anabaena sp, EMBL: AP003593, BAB75191); Fasta FT scores: E():0.68, 25.442% identity (28.346% ungapped) in FT 283 aa overlap, (aa 817-1088 of 1MB.298, aa 145-409 of FT Q8YRF6)" FT /db_xref="UniProtKB/TrEMBL:Q7YYD3" FT /protein_id="CAD98552.1" FT /translation="MKKKDEKEFVPSCGHCQNYFLYKRIRSIIGKGEKINLTNLFEFNS FT SSSQPNESNHNIGGRLVYTKNGNFLFLQENIMNKSSNPRFVLIPCEISGKFRSNIMLKC FT CETEFSFVSCWIIIRNWAIVKMKTGEPSAPLRSILLNRHLNPSKFQDESTTFKVLRKRL FT LCFSASTAFWIPYTFSNEEHIVNGLKGIQITGIITHISTVVMHKGNNNMSSTSMIVNDE FT KASMISGSETSDSEEIQLNSQINKFGIDEILRPSENSVCELSFSIVIKSISNNQEFVIF FT FPGVNLSINRFVLEWRNIYRFENVIAGSVSIHNFGESSREKKCLIANKCTSIVAEPVTE FT DISRTISETPKLIQIEKVLGFGVYEIKTDKIPIKLFLQNSNIHETSLGCSISKGSILWV FT RNFRVITKEFGDRSLPIGISIEPSSDWGLERHSKFALEKVTALKKTIWENLPLHIQTIG FT HEYLEFNDFNSEIHNNPEIVFHHPWDIRNLCYLHYSLYVDLFNLVRTSADKSQVLPILL FT VQIISNQKSGSPISVSKDFEIKYPDGTTTPQNSCSLICSNCLHFQNNITHPSQTVELKS FT KDSQQCPISCTDTLFGLWKIASIEEWVDLIKSLEVSQKLSLGNSFSSNITLYIEYLYYI FT RGDVRKLPIYQKNQKNDKTSDVSNNVNSLRCSFIAEESFPVYIGSFGHFLAEVIMEKAF FT ESNAWISNVYFIKTPSLYDSKNNYYPLFIDCNSEVNFLFNSNCSKYSFEKKFVFIKKAM FT LRLYNQRISFFVKISDIVCINFENCFLKPPISQGILSGNQIRGILLFICDKEIQSSVEG FT CKQYLHCLPFFKTYSKLSKLGIQFDKNEIIESQIEKFQVPPNWYSFTSVSRFVLILDRP FT ILQNSEVLLSEIQEKEFFEAINTFDIEKIEYLIIKELVFPSQFSSDAMNFDKTKSYIEI FT KLLGNNNESILWPECLKSPFSYKFETNIQANQGSYTVFDIKHWTEYLFEWQVLSRRSSI FT IYRSTKVARNEIINFFTSFYQQISLEYDNDSEIQISLESVKILDIIHFVGLNPNNKYGR FT WSIIRVTTDLSTKFKPGTTNLQSIYLIADPPWNSGWIDIWFSEEDALTHDIKLLLPGET FT ASFTNIRVEKLVNSFPPLSPVSDIMISVLDESFEFPGQDKSIYFNMYDQSQTKLEKTTF FT SDFNQAQTERKKTGNTLSIFNNGAIIFRTSIQNAMIKRDKAQIYSSSQNFNNPGKDLFF FT SYLYFSPTSISDVVSYYHFDPGYQEYIQKVHWKTCDPWIDKFLILNGPKKIEKEPNDNK FT KDESNKLVNSFSQSKNLNNSHQIFNIHKIPNPYILAPDGAPITRNFLSLMKTNHTCFIA FT FNRAYCIDTEISTDISQINFNPSFSPILNFDEFKSIDPDLVFSLKSSILHIQKISLSWF FT CLNCLQTIVGDSVCLCGVDIGKSSLWKSLAIYLIGALEIETTDSMPKILPFTMSNWNVI FT KLLAYVDKIGDQEIESKILYNRIIELADIIWNHMEGLNNSLGIQTIYSFGKAPTNCEFL FT SSMEDNSEECKPVGIIGQVQLFDRDKALNIPTDPISNLEMEVCLRCVQSSNKNTLSYIY FT LHVIRWKVRNTKLELEEKLKNINFQFDLEI" FT repeat_region 177419..177427 FT /note="(a)9" FT repeat_region 179442..179449 FT /note="(ta)4" FT repeat_region 179643..179650 FT /note="(a)8" FT CDS complement(join(182242..183595,183806..183951)) FT /locus_tag="1MB.299" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.299, predicted protein, len = 500 aa, unknown; FT predicted pI = 8.9920; contains 8 predicted TM helix FT regions; some similarity to Q9HIJ4, hypothetical membrane FT protein (362 aa, Thermoplasma acidophilum, EMBL: AL445067, FT CAC12466); Fasta scores: E():0.075, 24.710% identity FT (28.700% ungapped) in 259 aa overlap, (aa 240-481 of FT 1MB.299, aa 42-281 of Q9HIJ4)" FT /db_xref="UniProtKB/TrEMBL:Q7YYD2" FT /protein_id="CAD98553.1" FT /translation="MIREETTSSDFTSDYSEESQNVTISKSRPISHSFKFTLFMCLLQG FT FSESSYITLILCRYLLSFSYGALEPSVQVISVNYKLNSPETAKLFGAVSCYKSFGSFLS FT IAVSTMVYLKGNDEVVTTYSRIIWILTGGVSTLISLVLVISIYKDFKSSNNNSALEMVL FT QDNYYYKKLESSISNNSDESKDKIKIIIIYFVIFLVGLFTNIVSEIYSFYQITSSFSFE FT RYEDLGFNGDRSDAYLLSLNLTYYTLNCVVFLLGSAIGSLIFAILYKNLFGLTKKFKER FT FEIDSITPIGRMSSISILCLSSISVVIFFMLNLILIHFIFEIPNISKFSIIDIVANSFN FT LYWIIPHMVAVFFMGSFLTTLMEVIPRFQLFNLNKSSKSVISYGLYLMITGIFSDPSLY FT KYIRLFVPEKNYTISIAGNSSFNIIPSIFHIPIKKLFPIQLINKLTKMGFPNNYIYYQE FT LQMSSSIYSILFYALISFIILILLLIKLVLCKRSSKRMYLN" FT misc_feature complement(join(182491..182556,182656..182721, FT 182857..182922,182992..183057,183151..183216, FT 183319..183384,183511..183573,183619..183684)) FT /note="8 probable transmembrane helices predicted for FT 1MB.299 by TMHMM2.0 at aa 89-111, 126-147, 189-211, FT 245-267, 298-320, 343-365, 410-432 and 465-487" FT repeat_region 184086..184097 FT /note="(aat)4" FT CDS complement(184433..185254) FT /locus_tag="1MB.301" FT /product="heat shock protein DNAJ homologue pfj4, probable" FT /note="1MB.301, predicted protein, len = 274 aa, probably FT heat shock protein DNAJ homologue pfj4; predicted pI = FT 8.8142; contains Pfam match to entry PF00226 DnaJ, DnaJ FT domain; contains no predicted TM helices; good similarity FT to Q9GUX2, heat shock protein DNAJ homologue pfj4 (244 aa, FT Plasmodium falciparum, EMBL: AB050739, BAB17689); Fasta FT scores: E():2.2e-28, 46.121% identity (48.198% ungapped) in FT 232 aa overlap, (aa 1-226 of 1MB.301, aa 5-232 of Q9GUX2)" FT /db_xref="GOA:Q7YYD1" FT /db_xref="HSSP:1HDJ" FT /db_xref="InterPro:IPR001623" FT /db_xref="InterPro:IPR015609" FT /db_xref="InterPro:IPR018253" FT /db_xref="UniProtKB/TrEMBL:Q7YYD1" FT /protein_id="CAD98554.1" FT /translation="MDYYEILEVKRDASTSEIKKSYRKLALKWHPDKNPDNREEAEEMF FT KKIAEAYEVLSDPEKRNRYDTYGADGVSADFSSDFHGFDRHFSMGHASRIFEEFFGTNN FT IFDIFSSFGEFPGFNEPSRSSRGFSRSRGSRLSPFDDLHSQIFSNFGLSGSGFGNMQSF FT SSSSFSSGMGFQGGVSKSVSTSTSVINGRVITRTKTTERLADGTVRETVQEIEEDGRGN FT RVIRNSDNSSGSSRNRMLRQGSSQEFNDGFSLSTGNLHRTRSHRQSRTQNR" FT misc_feature complement(185051..185251) FT /note="Pfam match to entry PF00226 DnaJ, DnaJ domain, score FT 152.4, E-value 5e-43" FT CDS 185419..188721 FT /locus_tag="1MB.303" FT /product="similar to helicase-like protein nhl, possible" FT /note="1MB.303, predicted protein, len = 1101 aa, possibly FT similar to helicase-like protein nhl; predicted pI = FT 8.4601; contains no predicted TM helices; reasonable FT similarity to Q9BW37, similar to helicase-like protein nhl FT (401 aa, Homo sapiens, EMBL: BC000673, AAH00673); Fasta FT scores: E():1.1e-28, 36.919% identity (41.234% ungapped) in FT 344 aa overlap, (aa 9-330 of 1MB.303, aa 5-334 of Q9BW37)" FT /db_xref="GOA:Q7YYD0" FT /db_xref="InterPro:IPR006554" FT /db_xref="InterPro:IPR006555" FT /db_xref="InterPro:IPR010614" FT /db_xref="InterPro:IPR013020" FT /db_xref="InterPro:IPR014013" FT /db_xref="UniProtKB/TrEMBL:Q7YYD0" FT /protein_id="CAD98555.1" FT /translation="MNPIKNEYLIEGYSVPFPYDAYKCQINYMQKILYSLKYKKHALLE FT SPTGTGKTLCLLASTLAFQKHFLISHGSLKKPIENTSSAGVIKSEGGSIELMGVPRTVL FT ENNKETKMDMLIPRIIYSSRTHSQLSQVMRELKSSGISDGFTIELFDTDAENKAKIEKS FT TSKKRPVIKGGKKLFKATILGSRDQLCVHPKISKFRGNALIKNCRKITKEGKCKYHNNL FT KQANTSGVAADIQDIEDLKNIASSSDSGYFCPFYATREIESVCNVVLLPYNYLLDSITR FT QNLKIDLNNTVLILDEAHNVESVSEEAYSFDLRDIDLALSQKAIQNILEATKLGLLQEK FT ESDEQDSEVDISFDIEVAVALATGIHLLSRNLKEIPCPVPTNNNGSKFKSFPKMGEVQG FT TTYPGSHIYSLFASSGFGIDNFQAIDECLTNMINFGQNLVGPGGNVSSQLDVNSIQINA FT RIGALERFQRCLRLTFNETVMKNPQWFKLYIHYEPDSYKEINGFDENGGQNSFVDPETT FT DQGLSLYLSFWCFSAAAALSSLVSAGVRSMIITSGTLSPLDTLAQQFSSSNVTFDVFLE FT NDHVIDSESQLWAATLERGNSTNNTHLIGSYEARNNPSYFSSLGSVVFDCVKRIPDGIL FT LFFGSYSLMDQAVKHWTDQGLIERIKAFKSVFIEPRNSFELGSVLDSYMDCIKKGADSS FT SQNDGYFKDKKAKSGLSDFVLKSKRISSSGSLLIAVCRGKVSEGINFSDNACRGVIIAG FT LPFPSIADARVCLKKQYMDESKMDGRQWYNQQAIRAVNQAIGRVVRHRNDYGAIILADK FT RFNQPNIYTRLSKWIRTNTKHLPQLDSRQLDNISDFFEKKLSITVNGTNCQSSEGNNNK FT EIPTKTLPSNITGNKTTLYGRNPSVVSFPNLSNEINTLLKRVENKNTTKEESKGEENNT FT NNKIIIPKPFPGLISKVSVPWKRVKTSSPCSESRNSTPWTFSSNKNESSIHENQKRTNF FT NIIPLEKMNYQEILNNSKNILKEDEFNRLKPYIQNLSQVNSNSLRSIANILFPSYIMDE FT KELTERKSLALELLKLLSNKYREEFRTMIEKMTSNIDMMKDKALVDALESEL" FT repeat_region 187076..187083 FT /note="(ct)4" FT CDS complement(188746..190218) FT /locus_tag="1MB.307" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.307, predicted protein, len = 491 aa, unknown; FT predicted pI = 5.8396; contains no predicted TM helices;" FT /db_xref="UniProtKB/TrEMBL:Q7YYC9" FT /protein_id="CAD98556.1" FT /translation="MLLASENSFLEKYNNCPDLQLEFNGGKDLVIEEKTGKETEIIIPN FT NKEFEESTNTVINQAECEIPNCKGENIVNCHHSFGDGIDKSGLKELNPYYVLQGDSFSS FT SKICNIGEIIGSESLSGVVSDSNIHEELFSIGCQESDDCRELKVFTNEIGGLKSDHSQH FT KALSNFHFHHNFSMFGEVIQDNGQRIVHSQCYQHQIRSSGLDKFTAVDRNGKRSILNEV FT ETQAQLMPSIPGVYFDRRQIGYRVRYHNSYVGWVALSRHSSIKDAYEYAKQLWLKARNK FT SKLNKCQTDQIEAGNRGMGARKRARFTVYDDNQELNQTILCKSGPSSEDNIYFDSSHIE FT NNQIFQTKCYNSDYSSGNDGYGVHSIIENGYTFSENAMEKQVLEKESSNELSKMSQYSA FT LETMRKLYYATQEYMEKWPSEDGHYTIHWSNTSDLSSYLYGEENEQNHKKDLNNKKSKN FT QRKIYINSEKQNINFDNLNLNKKQLESFNYIW" FT repeat_region 190490..190498 FT /note="(t)9" FT repeat_region 190536..190544 FT /note="(a)9" FT CDS complement(join(190601..192155,192669..193198)) FT /locus_tag="1MB.308" FT /product="hypothetical predicted AT hook motif protein, FT unknown function" FT /note="1MB.308, predicted protein, len = 695 aa, unknown; FT predicted pI = 4.4794; contains Pfam match to entry PF02178 FT AT_hook, AT hook motif; contains no predicted TM helices; FT some similarity to Q9T069, hypothetical 59.1 Kd protein FT (532 aa, Arabidopsis thaliana, EMBL: AL161592, CAB80447); FT Fasta scores: E():1.3, 26.857% identity (27.811% ungapped) FT in 175 aa overlap, (aa 234-405 of 1MB.308, aa 175-346 of FT Q9T069)" FT /db_xref="UniProtKB/TrEMBL:Q7YYC8" FT /protein_id="CAD98557.1" FT /translation="MIPNSIYNKVKEGRNLICTEGNSEEEKVISENIPCLINNSIGMLG FT WSENIIHDDDLSHSNDRNGIFSYNFYNNNADDMFQSEEKIGCQEMQGQIISSKFLLDSE FT NKANERNQEISNIQTQCCSNFPSEKINLNRQISNYSSSTSSVSIYNVTDTVFKEMRSEG FT TSEPKDISRYMDNELLKVWCMEMCPEEYLDAGYPLGSLLWVSTLDIESHIKKWRYRGKK FT SNGQFEYITPKIHGFEERALKKNPVPEVSLLTNNTERNMNEENLQINSNQVLEDLNLQK FT RDFIEDDGCIDNSGINNVDCVRSSSSSECNDSAAEMQALNQGSIFCKSTKKSRGRGRPK FT SKPKQIFFEEEEHEFPRKRMFLSEEGENNSSIITNEKFSERSQDESIEELELELGMEFE FT QASSQGYSSNFIWDLVRSIPIESLNRLYVSWLDGNIPGVTKLIVKSSVVQEYLNIQKNV FT ENSRLEESSDRLYNCKETNSDILNKKQTSEVINSKRTILYHKALYLAIQQHLLMYSSEL FT QALLKALEGNNTISERDTDHLDPFHKNLDVVLNSISPLQHLSMSFHCNSPLRIVEEGQI FT DDEEHISREAGIRSCSEEKLIHGCDSHNDSNISRDGLVDDNGESLIESQNLTETPRLGN FT NMHDSSSVFSIITSAQYQDFVELPDIASSFSDQLLSDHTELSKHCANMGEPIDLFFAIS FT " FT repeat_region 191630..191641 FT /note="(ctc)4" FT repeat_region 191952..191960 FT /note="(t)9" FT misc_feature complement(192164..192202) FT /note="Pfam match to entry PF02178 AT_hook, AT hook motif FT score 8.8, E-value 0.4" FT repeat_region 193177..193184 FT /note="(ta)4" FT repeat_region 193447..193456 FT /note="(at)5" FT repeat_region 194444..194453 FT /note="(a)10" FT CDS complement(195009..195986) FT /locus_tag="1MB.312" FT /product="peptidyl-prolyl isomerase/macrophage infectivity FT potentiator, possible" FT /note="1MB.312, predicted protein, len = 326 aa, possibly FT macrophage infectivity potentiator; predicted pI = 4.8284; FT contains Pfam match to entry PF00254 FKBP, FKBP-type FT peptidyl-prolyl cis-trans isomerase; contains Pfam match to FT entry PF01346 FKBP_N, Domain amino terminal to FKBP-type FT peptidyl-prolyl isomerase; contains no predicted TM FT helices; signal peptide predicted; reasonable similarity to FT O32828, macrophage infectivity potentiator (234 aa, FT Legionella oakridgensis, EMBL: U92214, AAC45700); Fasta FT scores: E():3e-18, 39.773% identity (41.667% ungapped) in FT 176 aa overlap, (aa 135-310 of 1MB.312, aa 66-233 of FT O32828)" FT /db_xref="GOA:Q7YYC7" FT /db_xref="HSSP:1FD9" FT /db_xref="InterPro:IPR000774" FT /db_xref="InterPro:IPR001179" FT /db_xref="UniProtKB/TrEMBL:Q7YYC7" FT /protein_id="CAD98558.1" FT /translation="MKLFLKVILFLSFVSTISLCSRLEWTGDEFEVKRLTDFIPGTVPQ FT VQRTLMCESCQLITFALKEYISKKLSIYKSSQPPKGFTDIVVSNFLESHVACSNHIWQP FT LADNSLEFTIEDFISACRSNLSTWEPELELISTSKLSFDQEISRICLGTKSCKDKSELW FT TEQEYPENRESKKSLLKKRSDEFIAKNKDRKGVITTSSGLQYIVVKEGTGQEKPKPEDE FT VEVFYRGKTLGGIEFDSSYNRGEEPSKLQISQLIPAWIEALTMMTEGQEVILFAPYDLA FT YGEQGAGELIGPNEVIIFKLKLGKIIKNENKKPDNEISSASDEL" FT misc_feature complement(195075..195356) FT /note="Pfam match to entry PF00254 FKBP, FKBP-type FT peptidyl-prolyl cis-trans isomerase, score 91.0, E-value FT 1.5e-24" FT misc_feature complement(195366..195449) FT /note="Pfam match to entry PF01346 FKBP_N, Domain amino FT terminal to FKBP-type peptidyl-prolyl isomerase, score FT 33.0, E-value 4.6e-08" FT repeat_region 195875..195886 FT /note="(agtc)3" FT misc_feature complement(195921..195986) FT /note="Signal peptide predicted for 1MB.312 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.983, signal anchor FT probability 0.015) with cleavage site probability 0.580 FT between residues 22 and 23" FT repeat_region 195986..195994 FT /note="(t)9" FT repeat_region 196051..196058 FT /note="(c)8" FT repeat_region 196184..196193 FT /note="(ta)5" FT repeat_region 196195..196204 FT /note="(at)5" FT repeat_region 196251..196260 FT /note="(t)10" FT CDS join(196727..197491,197534..199300) FT /locus_tag="1MB.313" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.313, predicted protein, len = 844 aa, unknown; FT predicted pI = 6.9369; contains no predicted TM helices; FT some similarity to Q95JQ2, hypothetical 36.7 Kd protein FT (308 aa, Macaca fascicularis, EMBL: AB070128, BAB63073); FT Fasta scores: E():3.8, 20.526% identity (21.547% ungapped) FT in 190 aa overlap, (aa 12-195 of 1MB.313, aa 35-221 of FT Q95JQ2)" FT /db_xref="UniProtKB/TrEMBL:Q7YYC6" FT /protein_id="CAD98559.1" FT /translation="MKTASASSSGRRVEYEAKEVNASGNSKKGEQVGYNNGIQETKFSN FT KKSSNNLPNRKLKLDHCVLNGPENHSSIEICKDIHESNSNCSINSESAPKSNPKRVFEM FT IDRTLNTLLEKIVPSAKKKKEKMIILQCVELLVRETFGDSAKLFLTGSAAAEVDSEMSD FT VDLVVFTPLDSRLALTKIASQFKSIKKKHKEDCLRCRKHSGHQKFKADTDLSSECIEGH FT LCEMEVNVIATAKVPVMMIKSRYSKFKTECDVSCVLYSYPELQPVLRLLKYWLHIRRLP FT VAKDGGLPCIVWLLLAIVHCSVNGNKSSRSQREHLESICIPAKHILLSLQVHSSGTSFQ FT STEFGSPLNSFQNGSISTCEYHDCDSRTFDTELGNPSSIPNFKMDILTHNRQIERNRYL FT SHSNSNNINSKVPDTIDFGETGTTFMALVSFFTSLWNRSSLTCSVSVINKNVQPKDVKT FT VSQIIFNNGGIWDEILTLDDPSASLVRQLELLKCELDDFGFKNNLLAQIFDPEDPNKLE FT KEHFYFESKEAEKRHELEISLASILPPSDCLMVSNLAARITCGTWLVYLYELKRCHNIL FT ETYLQQIAISTNEVSEQELVNELFHPVNDEIYRIPAKLSTKTPVCMYPCMDLDLIIENN FT IIALVQVPYFEQNHKFCLVLIYGHLVIMRIENICVDWEGGWWSKEFLSRRDVRSVLHGS FT LFSPIPLSFSHLGNGGSACCILQPLDSRTINCLPKNDQSDFNSKDIVIDEIMVNPAVFI FT TLLNDVEWYSVDDMNGFYIMPVKEYYRFLDMESIAKESPYWHENYFGKNNYLQPHVPSC FT RYCKKTSNSTGMEKILQYKNQFWAIKRKSIS" FT repeat_region 197181..197192 FT /note="(cag)4" FT CDS complement(199732..201282) FT /locus_tag="1MB.315" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.315, predicted protein, len = 517 aa, unknown; FT predicted pI = 10.5904; contains no predicted TM helices; FT some similarity to O15738, zipa (924 aa, Dictyostelium FT discoideum, EMBL: AF019980, AAB70839); Fasta scores: FT E():0.01, 22.063% identity (24.367% ungapped) in 349 aa FT overlap, (aa 52-382 of 1MB.315, aa 497-830 of O15738)" FT /db_xref="UniProtKB/TrEMBL:Q7YYC5" FT /protein_id="CAD98560.1" FT /translation="MSERTKKLAIPSHLRLDSLSLSCISTGLTPKSEINNSSISKGLTS FT TRSVNGQQSEVQKRLSVQLPSCRSLRTSIRINRSSTLSHLGKNNTKPDVNQEIGLEKQP FT RSFSNSSRASRTNVTTPKGFEFATSERAEQRLKHNRSSSSGSNIYKAELGLNLHQLYNG FT RIPAAPMSARSTSSIGAESRVSTFSKATTVPKPFSFATDARAASKSKELYEKLGSLKDK FT FNEEVSTKNKEFSEDRRIKTILESIKKFNLSNSDPSLDTITTEMNVTKEVVGSKYGIPR FT RNTTVPKTPKFATQMRSESKKAQLRGVLDSMLLSEKKEGNSNWSKSDTTSSISKSNILN FT NNRTQENHDNSSLAFFEQLKKQDSEPREAQSQHTDQKPELHARMQPQKLPKLRGFLSMN FT SLAHLGRQSFGTLEGSKENFKNSEENSPILNCKSTISGMSSIRKINESNTSIPNNWKFQ FT ATVARKQLRQFTGLEETDFNLKELDYYRHTVKPEFKEIDSNHHLTDDDLKTPLPRTYR" FT misc_feature 200137..200144 FT /note="tgcatgca" FT CDS complement(201883..203148) FT /locus_tag="1MB.320" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.320, predicted protein, len = 422 aa, unknown; FT predicted pI = 9.0181; contains a predicted TM helix FT region; signal peptide predicted; contains predicted FT helix-turn-helix motif; some similarity to CBPA_DICDI, FT calcium-binding protein (467 aa, Dictyostelium discoideum, FT EMBL: U03413, AAA03471); Fasta scores: E():0.072, 29.412% FT identity (32.468% ungapped) in 170 aa overlap, (aa 254-411 FT of 1MB.320, aa 99-264 of CBPA_DICDI)" FT /db_xref="UniProtKB/TrEMBL:Q7YYC4" FT /protein_id="CAD98561.1" FT /translation="MVSLRALLPFIFLGLFQSIFLFCSGEEILQFCKKHNITLSELEDE FT AKQAGISISTLISMMNAEYPSPIEETAKSKVDSDSIKDPYTDSPTGNEDVLGFVPSSSV FT IYSSDIPRNTTIEEPATKIDSEQKVSASLDTETLKLSENTLFVNNADTRASSDKNSKTS FT ESCSCSKHNHSSKTGDCRVIIINESKKNRVKCTKDRNARRSEIGKRYYIINKSGEREEY FT KVVSHHQKKVRKNKDSKANKLKRVEVVNRHIQPVIIAQPSIAPLYAPMQVPQMPGHPGY FT VTPPQIPRFPTQGPPQVPPQVPPQAPQMPPQTRDNGDFEYMNKNYNGSDNKNSFIGPLT FT ISVIVIILIISCIIGGFCFCGATGANRNHGNMPLIPPPGTPNAPQNPPYNSRAPIIPQN FT QPPMQPPPGRQVFGYAVVPNIR" FT misc_feature complement(202063..202128) FT /note="1 probable transmembrane helix predicted for 1MB.320 FT by TMHMM2.0 at aa 340-362" FT misc_feature complement(203074..203148) FT /note="Signal peptide predicted for 1MB.320 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.945, signal anchor FT probability 0.053) with cleavage site probability 0.695 FT between residues 25 and 26" FT CDS 203917..204690 FT /locus_tag="1MB.323" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.323, predicted protein, len = 258 aa, unknown; FT predicted pI = 4.9666; contains a predicted TM helix FT region; signal peptide predicted; some similarity to FT CYL1_HUMAN, cylicin i (598 aa, Homo sapiens, EMBL: Z22780, FT CAA80457); Fasta scores: E():0.19, 31.538% identity FT (36.607% ungapped) in 130 aa overlap, (aa 25-137 of FT 1MB.323, aa 328-456 of CYL1_HUMAN)" FT /db_xref="UniProtKB/TrEMBL:Q7YYC3" FT /protein_id="CAD98562.1" FT /translation="MRLEVLFALILLIFNINYSQGLGLRRNYNSYSESAYTDIKSSEAK FT DSDSDYSDVKDSSEILNKSIPLNKSEKSNYTSQVESEDEDDEPEALSNKVKSDKHFSEK FT DDGEDDYNSNIESSEIESSRNHSQSKKISKEEVNMPQVQYMARQQRFNLASTVNPPSDV FT SNINYVFLSPAAQRRIDRKISKLQKGGRYQGSMRETKAIGEWQKRYYSRPETEYSEAMS FT SNQGNNSIFMWVILGFIVLLLTLICICGMRFITKE" FT misc_feature 203917..203979 FT /note="Signal peptide predicted for 1MB.323 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.997, signal anchor FT probability 0.000) with cleavage site probability 0.524 FT between residues 21 and 22" FT misc_feature 204601..204669 FT /note="1 probable transmembrane helix predicted for 1MB.323 FT by TMHMM2.0 at aa 229-251" FT repeat_region 204904..204911 FT /note="(a)8" FT CDS 205037..206029 FT /locus_tag="1MB.324" FT /product="splicing factor, possible" FT /note="1MB.324, predicted protein, len = 331 aa, possibly FT tls-associated protein tasr-2; predicted pI = 10.7959; FT contains Pfam match to entry PF00076 rrm, RNA recognition FT motif. (a.k.a. RRM, RBD, or RNP domain); contains no FT predicted TM helices; reasonable similarity to Q96P17, FT tls-associated protein tasr-2 (261 aa, Homo sapiens, EMBL: FT AF419332, AAL16666); Fasta scores: E():5.5e-17, 32.770% FT identity (38.800% ungapped) in 296 aa overlap, (aa 9-301 of FT 1MB.324, aa 6-258 of Q96P17)" FT /db_xref="GOA:Q7YYC2" FT /db_xref="HSSP:1HD1" FT /db_xref="InterPro:IPR000504" FT /db_xref="InterPro:IPR012677" FT /db_xref="UniProtKB/TrEMBL:Q7YYC2" FT /protein_id="CAD98563.1" FT /translation="MANPVRGNRDPRRSLLIRSLRFDTPTSLVRREFERFGAIRDVYLP FT LDYRSRRPRGFGFVEYVEEEDARAALEKMDGATLDGVTINVTFAQEGRKSPESMRHREY FT ESFHGNGGRHLSGHRYGPHNHYKSDPYRRYPSPRDYREGSRSYGGNRHPPYRSRSRSPP FT PSRGYERYKEQYNYRRDEDFRREDYPRSHNIRGRSYSNGNEGGNYRSEIYERNYPQRDS FT ARSYSRGRSRSRSGSIGRSRSRSRSRDRIRSRSRSMSEFGSRNVSCSVSRNRSTSLNQS FT SCENPERGRNNHRKIETPQNESNFFQKEFMNEHNEKNYDRSDSDRQMSN" FT misc_feature 205079..205294 FT /note="Pfam match to entry PF00076 rrm, RNA recognition FT motif. (a.k.a. RRM, RBD, or RNP domain), score 68.0, FT E-value 1.3e-17" FT repeat_region 206433..206440 FT /note="(a)8" FT CDS 207193..208392 FT /locus_tag="1MB.325" FT /product="cysteine protease, possible" FT /note="1MB.325, predicted protein, len = 400 aa, probably FT predicted pI = 9.6864; contains Pfam match to entry PF02338 FT OTU, OTU-like cysteine protease; contains no predicted TM FT helices; good similarity to Q93Z76, (505 aa, EMBL:); Fasta FT scores: E():3.3e-09, 40.659% identity (42.529% ungapped) in FT 91 aa overlap, (aa 9-99 of 1MB.325, aa 216-302 of Q93Z76)" FT /db_xref="GOA:Q7YYC1" FT /db_xref="InterPro:IPR003323" FT /db_xref="UniProtKB/TrEMBL:Q7YYC1" FT /protein_id="CAD98564.1" FT /translation="MREECLLPYEVRVKNIEGDGNCLFRSIGSQLYGESEHHEIIRSAC FT MDYVDLNKESFSGFVHEYSSIEKYIQEKRKLGVWADNIEVQALSDLYRIPIYIFEKVRN FT SKLNASLLEKHRLEGFSGTMSENIFYENKEFVYKLLCKIEPRHSSFLDQIKNYYSNSRP FT IRLLYYNDLHYDSLFYRREHQTPIINKDIGIIEAESIKNLKVYRAIAKQKEKSSKFPVR FT GSKIKTDQVGHFNHTSYALLRKKAIKKFPVNYGSSSESDNYVAKSPFFLSVKNKYRDEP FT ESFFEGNYLDKISEGETHFEKLHPKSKEPHSKFISGPKSFSERAIGKPACVSKIKSLNN FT THLGVVDRYARDVQLSSHQKEFKKLLINDSPRLSSTPNPRKKCVAIYCPQILKQSAFRN FT " FT misc_feature 207238..207585 FT /note="Pfam match to entry PF02338 OTU, OTU-like cysteine FT protease, score 59.6, E-value 4.4e-15" FT repeat_region 207478..207487 FT /note="(at)5" FT CDS 208401..209486 FT /locus_tag="1MB.326" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.326, predicted protein, len = 362 aa, unknown; FT predicted pI = 6.1077; contains no predicted TM helices; FT some similarity to Q9H8G7, hypothetical protein flj13645 FT (435 aa, Homo sapiens, EMBL: AK023707, BAB14650); Fasta FT scores: E():0.078, 21.498% identity (24.627% ungapped) in FT 307 aa overlap, (aa 61-350 of 1MB.326, aa 87-371 of FT Q9H8G7)" FT /db_xref="UniProtKB/TrEMBL:Q7YYC0" FT /protein_id="CAD98565.1" FT /translation="MDSRTWFEGVSDCSKKIIEALKSDNHLYGRDSNHFAKEGLSIFYS FT YDLCSSSCSVPTVVTESNTLSSETKDNRMLFSSVDEETLLLLQEALKYHYSGGEGTFIN FT LGTPVNEINGLNSAHKIVHSPEITVPSDDKETKVNTETEEDIFEGVGSIFDHKEHKRTR FT TRSLERSLTPDLDEFIENVYDFKEEALHNTNVQSSIQSQVIQTRKDQNSAQRSFGLASR FT TKNRKRNIDYSLLSRKGPMSAPDKSFESLKLPQFEDSSSSYAECYLEVDGVSHYETHHE FT ELESSSIRSQNKEMEDSSSARDKSKRNIDKYKQRRKDKSVHQEWKSIQKIMNKQNFRSL FT ETFKSLISDNNGSIKNNFGQI" FT repeat_region 209174..209185 FT /note="(ttc)4" FT CDS complement(209570..211033) FT /locus_tag="1MB.327" FT /product="uvb-resistance protein uvr8, possible" FT /note="1MB.327, predicted protein, len = 488 aa, possibly FT uvb-resistance protein uvr8; predicted pI = 7.4037; FT contains four Pfam matches to entry PF00415 RCC1, Regulator FT of chromosome condensation (RCC1); contains no predicted TM FT helices; reasonable similarity to Q9FN03, uvb-resistance FT protein uvr8 (440 aa, Arabidopsis thaliana, EMBL: AB007646, FT BAB11034); Fasta scores: E():1.9e-35, 34.000% identity FT (35.312% ungapped) in 350 aa overlap, (aa 5-350 of 1MB.327, FT aa 31-371 of Q9FN03)" FT /db_xref="InterPro:IPR000408" FT /db_xref="InterPro:IPR009091" FT /db_xref="UniProtKB/TrEMBL:Q7YYB9" FT /protein_id="CAD98566.1" FT /translation="MGQNLSKSGVVVWGSTEYGQHGSKGEEVSPGPHLVDGLRHLSSIS FT KVSCGSNYSAAITNSGDLILWGYGGCGQLGFGNLEDCLVPRVNLSLKNVIQVACSDRHT FT AAILSNGELYTWGCSKNGKLGHGQFELSISNNVVSQPMKVKALEGEKVIQVSCGSYHTG FT CLTDDKKALTWGLGLQGRLGHGDTQDIFTPKLIESLAGLPIKEISCGGHHTAILLVTGK FT LYMFGGGAFGKLGFGSTDDVLIPRLLEGPLEDIQITKVSLGSQHSAAVTKCGKVYTWGQ FT GGRLGHIFNGPEHDFLSPKRLSNLEKAFIVDISCGNSHSVALSDVGDIYTWGMTKNIGH FT GIQGIHPNMPSKHPILQNKNIVQVVCSSSHSIALSDIGALVQKSSETRKPQSLAEDQEP FT KKADQSIEEFKTGLDSLIRRKILQGIKDKLKAGGDREKIEYLMDELEKSEEQNAVLVSL FT LDVSVRKLEILRKENEELRSKLELTRTTN" FT repeat_region 210043..210050 FT /note="(ta)4" FT misc_feature complement(210065..210214) FT /note="Pfam match to entry PF00415 RCC1, Regulator of FT chromosome condensation (RCC1), score 23.0, E-value FT 0.00026" FT misc_feature complement(210383..210532) FT /note="Pfam match to entry PF00415 RCC1, Regulator of FT chromosome condensation (RCC1), score 20.1, E-value FT 0.00059" FT misc_feature complement(210539..210706) FT /note="Pfam match to entry PF00415 RCC1, Regulator of FT chromosome condensation (RCC1), score 35.6, E-value FT 7.2e-08" FT misc_feature complement(210713..210856) FT /note="Pfam match to entry PF00415 RCC1, Regulator of FT chromosome condensation (RCC1), score 21.7, E-value FT 0.00037" FT repeat_region 211316..211323 FT /note="(t)8" FT CDS 211490..213688 FT /locus_tag="1MB.328" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.328, predicted protein, len = 733 aa, unknown; FT predicted pI = 4.7157; contains no predicted TM helices; FT some similarity to NUP2_YEAST, nucleoporin nup2 (720 aa, FT Saccharomyces cerevisiae, EMBL: X69964, CAA49587); Fasta FT scores: E():2.4, 22.717% identity (26.357% ungapped) in 449 FT aa overlap, (aa 245-672 of 1MB.328, aa 218-625 of FT NUP2_YEAST)" FT /db_xref="UniProtKB/TrEMBL:Q7YYB8" FT /protein_id="CAD98567.1" FT /translation="MYEIESHFFENKLQSWFSKTCISNFPEIQKLILEFLFSFKTSQKL FT SLVNKEGLSFYRSIQKATLGCIKELNTLEHQTWMNSNPDLWLSWLLMGRNNSKENDTST FT LIPLDPRIVMRILFNFRKLYPVSYKTRIIKDTNGEPFLNVELSAISLGNLENDLEDPKK FT PFSSIYDMFSSKRMVFGVDLVYGQRIYQFDSERDSGSGEIIISVQPEHIFDVLSHKEFS FT EHLYSTLLGLERKFCLGDWSGYFSNNAGKEMYGKFMKSVLLPLKTFEKTCRRPTFPFVS FT ILATSHLCKGREAGRGSENIKSSLYESSIDLLEEMSHQVLDDSSCSRMVEDAILYKSPS FT FAFFLSSSCLTESTFQLIIDSVSQKEKSFFLGKNLEPLEELYETNQLDLVWENFDENSP FT LFKNSYKSKHRPPWRTSIDNIINAMQEKQSQIISQQPSIAFNHQTLRNDAGHISRVNRP FT SFSYSNENSSTERNFEHDDNDEDRTYLDEGNSFDISPSSEQNHVESPSLVASGFVMGVR FT CLPASTAITSLGSIIGNGNPGRSGSINRVRGTNLLGNQSQRLSIPRSDITGRLSSCPTF FT SSIATSWGTPVPITRTIPIDDILGRRRESIHSTLDPEKNDQNELILSEIRVKYNLDSDE FT DEVQNGKNLINGKDKDEESENFKDKQKLDIYHLEEKSNSELEQALIEEEVLGSFTNMQI FT KDSDASILAKETLKSWKYVNYDFDEELEMESEEYENEE" FT CDS complement(213822..216095) FT /locus_tag="1MB.330" FT /product="kinesin heavy chain, possible" FT /note="1MB.330, predicted protein, len = 758 aa, possibly FT kinesin heavy chain; predicted pI = 5.8496; contains Pfam FT match to entry PF00225 kinesin, Kinesin motor domain; FT contains no predicted TM helices; reasonable similarity to FT KINH_SYNRA, kinesin heavy chain (935 aa, Syncephalastrum FT racemosum, EMBL: AJ225894, CAA12647); Fasta scores: FT E():1e-52, 37.255% identity (39.701% ungapped) in 714 aa FT overlap, (aa 26-712 of 1MB.330, aa 2-698 of KINH_SYNRA)" FT /db_xref="GOA:Q7YYB7" FT /db_xref="HSSP:1F9T" FT /db_xref="InterPro:IPR001752" FT /db_xref="InterPro:IPR019821" FT /db_xref="UniProtKB/TrEMBL:Q7YYB7" FT /protein_id="CAD98568.1" FT /translation="MGENENLGNIENVSGTSGTTGNEHGSGSGVHVYCRVRPPNEAEKT FT HGNGLLCVNVRSEQCIEISSSESKSDSETKERTFYLDHIFPMDTNQSYVYKTAAKPIVD FT QLFKGINGTVLAYGQTSSGKTFTMEGIIGDNEKMGVIPRMVHDVFETISNAEEHIEFQL FT KVSICEVYMERIRDLLDTSGTKSNLRIHEDKIHGIYVKDLSEYFVTSPEEVFELMALGH FT KHRAVASTNMNSYSSRSHLIFMLQLQQKNVFDSSIKVGKLFLVDLAGSEKISKTGAEGL FT TLDEAKTINKSLSCLGNVINALTDNTKNFIPYRDSKLTRILQNSLGGNSLTALIVTCSP FT SIVNESETIGTLRFGIRAKMVKNAPKVNQQYSVEQLQVLLNSAQRKLAERNNYIQTLEE FT LVKKLGGELPENKPSGDKGGSIILNSSLNIGERKELQEGIIPGSEKTTLNMRTLDNDEL FT DELEEAKQQLKENSEKITQLKQEISEKENNLKLMSEEKENLNIKLSDLIQELSQTKYQQ FT QDQAETVEHLQLKNKSLIGELEQSQIHIHDLEARIEEYKSEESQRRNEEKENESNKAFQ FT SLSSEIQQLREYLLAIRSNTDPKEDAAWSSEHKALLETIEDNVARIANLELQLKESNKG FT AKKLDINDQDTKSMLERMSQLDTNMEQLGKLYQKMVEQNSNLKSQSQLNERRLLRKEER FT IEQLERSLINAKTKYTKLLMQCNSLTKTIENISKLKPIFAKLAPPNIVKGIQGGGGKSS FT LVKA" FT repeat_region 213846..213857 FT /note="(tcc)4" FT misc_feature complement(215010..215867) FT /note="Pfam match to entry PF00225 kinesin, Kinesin motor FT domain, score 491.2, E-value 5.1e-145" FT repeat_region 216322..216330 FT /note="(t)9" FT repeat_region 216466..216476 FT /note="(a)11" FT repeat_region 217131..217141 FT /note="(t)11" FT repeat_region 217414..217421 FT /note="(a)8" FT CDS complement(join(217961..218791,218846..219037)) FT /locus_tag="1MB.332" FT /product="t24f1.1 protein, probable" FT /note="1MB.332, predicted protein, len = 341 aa, probably FT t24f1.1 protein; predicted pI = 9.0452; contains no FT predicted TM helices; good similarity to Q22743, t24f1.1 FT protein (312 aa, Caenorhabditis elegans, EMBL: Z49912, FT CAA90136); Fasta scores: E():2.6e-48, 45.455% identity FT (49.270% ungapped) in 297 aa overlap, (aa 3-288 of 1MB.332, FT aa 2-286 of Q22743)" FT /db_xref="GOA:Q7YYB6" FT /db_xref="InterPro:IPR006762" FT /db_xref="UniProtKB/TrEMBL:Q7YYB6" FT /protein_id="CAD98569.1" FT /translation="MNSDRKKVLLMGRAGAGKTSMRSIIFANYLPKDTSRLTATNNIEH FT SHLRFFGNMVLSLWDCGGQDIFMENYFESQREHIFRSTEVLIYVLEVRKDYSSKHATKD FT IEQDFAYFKSTVENLKLLSPKSHLFCLVHKMDKLSAIERESAINYYEREIGRVASNMNY FT RVFPTTIWDETLFAAWSEIVYALIPNVGLLEKNLKILAESCNAVELVLFEKSTFLVISH FT AENSNTLDSKHHRSRFERISNICKQFKLTCAKSQTNFVGINLETPNFSSIIKRFTQNSY FT ILVVINDKSKFINNQVTIKYHLINVKLGVTSASALYNIEHARDHFETIIASHLNSEINK FT " FT repeat_region 218137..218144 FT /note="(at)4" FT CDS join(219309..221223,221467..221957) FT /locus_tag="1MB.333" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.333, predicted protein, len = 802 aa, unknown; FT predicted pI = 5.4073; contains no predicted TM helices; FT some similarity to Q25823, clp (766 aa, Plasmodium FT falciparum, EMBL: X95276, CAA64596); Fasta scores: FT E():0.16, 21.888% identity (25.000% ungapped) in 498 aa FT overlap, (aa 181-645 of 1MB.333, aa 28-496 of Q25823)" FT /db_xref="UniProtKB/TrEMBL:Q7YYB5" FT /protein_id="CAD98570.1" FT /translation="MIQIPEKTQKDSSIYGLIGEFDFFGASDSDFGELVHTIVDQIGGE FT NTNNEKLFDVLKKKLTILDNGTLNLFAPTKSSKPDIKDIVLPDESYFDCKSSLKKELKS FT ICQNAQIDSQIFHSKDLSILRFVFSYHKFFNPQNETRYPQDYDFYVIKHPNTDFNGCRT FT FAIKKKHGKSSKQDESDNEQQLIPISYVSSVDNITTKWEKAGKRIINLMQAIVNIVPSD FT IKLFITTMQDIYPIAFRNSLDSYILYSKLLFHGIKLIPSTMSILLRFLINKLISLESEI FT HSKNPNNYTKERFEKWRQEELQTVAIKIRRGEYADINSAKSYIQDLDGLKAQFSQRFTE FT EDIDRNAQVIDNVMKGLFDFISEVYENGFQYTKIFVNSTETSYSTQTKNLKVPSNVSNI FT LSSSDLESSAVEDIVVSGYGKKEELLYKQFVDLESTILNIFETQILAIESCQFVNYIPI FT YLVCHCDSWCEKFLQIIFKKLFNPHETLIIRESSVDYIVFFVTNYQIVCNFKFYTPCIK FT YLMQFLHDFVAHWSIEKSDQNLSSQEILHSNKRSGFKRSFGGDSQKHSHLTLRNLFGLY FT CHVVFSLCKFVSVIINAILSENIQDSSFEDLCFLIDSLLNMNRGFIPFILCGDLSPINN FT IDPKDQDISCHILELLKMIEDEMEIHSEEEPEPPNQEIDIPEKKYGEMESSILPEQSLW FT EYVWGDVPVSHEELEQELYVTKNKMYKEDLDKQLDYSREELNGQDEDKEPKRGLCSKKP FT SSSGMSLLDTLLSSDAFKRGEAILHSYQRKHSPSGKKMNKFSRKSRLF" FT repeat_region 221581..221588 FT /note="(a)8" FT repeat_region 222082..222093 FT /note="(att)4" FT CDS 222113..224206 FT /locus_tag="1MB.334" FT /product="CG2614 protein, possible" FT /note="1MB.334, predicted protein, len = 698 aa, possibly FT CG2614 protein; predicted pI = 4.8573; contains no FT predicted TM helices; reasonable similarity to Q9VIK9, FT CG2614 protein (673 aa, Drosophila melanogaster, EMBL: FT AY061160, AAL28708); Fasta scores: E():3.3e-31, 28.723% FT identity (32.088% ungapped) in 658 aa overlap, (aa 1-628 of FT 1MB.334, aa 1-619 of Q9VIK9)" FT /db_xref="UniProtKB/TrEMBL:Q7YYB4" FT /protein_id="CAD98571.1" FT /translation="MELLPNSVEDFTSSEYWSEFFKKYGGESNRAFEWYGDFEVLRDLL FT IQSLRNSGRSELDNKRILHVGCGNSTLPAKLYDEGFTDITNIDFSSQIIELMREKNKSR FT EGLKWVCMDIEKDFGDYVEKAENLGKFDTIIDKGFLDAYLSDSTSENGLSSRKKSTDFL FT NSSINLLAPNGRYILITLGQEYVAKALTMGLYNKGLEVIVEPLVGIKDSKFLPYYIEII FT NKSDFQNFEKSFFRFRGSGTEEICSAEGQCIWTLAKRLKELSAMFWNNKYIGDFMPGEI FT KEYQLNIKESKNSLFITVYDTMSKEKKRKLTVGLLVPLGEEQDWLYSTRKGFEEICSQA FT KCKRLIVISRFYSDSEEALKVSEQEILDEISNNISPLALKGSNRFPILTVGGDKNLDKK FT CIYSCDSKYSKEILVYDIQESGIEKRQMIFRSSPRLIQSEVVIRRNDSKTIEIDYLSGF FT SNYYVGVILVSSLILDTKNQDKTRNALILGLGGGILASILRKFYSKPKLHISAVEIDEN FT VMNVAKNYFGFSESETKVIIGDALDYVNNNYLEIKDSLDYIIVDINSGNVNDSLMCPGV FT EFLSKGFIEKLIVSLTKDGCIVYNVSCRDSNRREELFNEFRDLLNKMEEKTNSKRMILQ FT AVETGDDEINELWIIKRETNDNIEKVRNFIIENELFIGSQENTSLETYDKKDLWIKRFS FT NLK" FT repeat_region 223025..223039 FT /note="(aaaga)3" FT CDS complement(224226..224876) FT /locus_tag="1MB.335" FT /product="f11a10.2 protein, probable" FT /note="1MB.335, predicted protein, len = 217 aa, probably FT f11a10.2 protein; predicted pI = 9.7606; contains no FT predicted TM helices; good similarity to Q19335, f11a10.2 FT protein (222 aa, Caenorhabditis elegans, EMBL: Z68297, FT CAA92593); Fasta scores: E():1.2e-36, 47.867% identity FT (48.792% ungapped) in 211 aa overlap, (aa 1-209 of 1MB.335, FT aa 1-209 of Q19335)" FT /db_xref="GOA:Q7YYB3" FT /db_xref="InterPro:IPR000690" FT /db_xref="InterPro:IPR003604" FT /db_xref="InterPro:IPR015880" FT /db_xref="UniProtKB/TrEMBL:Q7YYB3" FT /protein_id="CAD98572.1" FT /translation="MDYENRGGHKTGSGALASSQDIAIERRERLRRLALESIDLSKDPY FT YMKNHLGQVECRLCSTIHTNEGSYLSHTQGRKHQTNLAYRASKEKNLKAVVKPQAENPE FT QAKPRAPRIGQPKYKVSKHREGSTGTNCVYCKFYFQEILEDHIPGYRIMSCWEQKVEKP FT NPKYQYLFVGAEPYNTIGIRIPNIELIKQRTQTYWDEQRKIYHIQLYLSSSKQ" FT repeat_region 224790..224799 FT /note="(tc)5" FT repeat_region 224909..224916 FT /note="(t)8" FT repeat_region 224992..224999 FT /note="(a)8" FT CDS 225633..226598 FT /locus_tag="1MB.336" FT /product="sucrose-phosphatase, possible" FT /note="1MB.336, predicted protein, len = 322 aa, possibly FT hypothetical 47.8 Kd protein; predicted pI = 7.8353; FT contains no predicted TM helices; reasonable similarity to FT Q9C8J4, hypothetical 47.8 Kd protein (423 aa, Arabidopsis FT thaliana, EMBL: AC024261, AAG52615); Fasta scores: FT E():3.2e-06, 31.973% identity (35.338% ungapped) in 147 aa FT overlap, (aa 124-265 of 1MB.336, aa 91-228 of Q9C8J4)" FT /db_xref="InterPro:IPR006380" FT /db_xref="UniProtKB/TrEMBL:Q7YYB2" FT /protein_id="CAD98573.1" FT /translation="MFNEIWIRQHMFNNSKLIYSTGRNLKDFLLAAKQFNLLRPDYAIC FT GVGTEIYEFPNKEMNLETFCQRLSNTIGRNVTKEELFSLLRFQDMGERVETKEGDEEKV FT TKDVNPMFPRWCKSRLFAWPVDKWLEIIRKTFNRDELKKEIQENLNKIGLEYYINGNNF FT HDPFRLSVSIKTEYALKVYEEIQINKKSYRFAISGQGAWKYLDVLPDKGGKHLSIIFLQ FT DEILGNSIPLERFLVCGDSGNDAHMFTIETCKNCCVGNAQQDLKDFLLGGCLVNSDEPE FT SRKVLSSQSELLCRIMACQNLKPPKKVSFTPKMSFCISLN" FT repeat_region 226883..226890 FT /note="(tc)4" FT CDS complement(227018..227800) FT /locus_tag="1MB.337" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.337, predicted protein, len = 261 aa, unknown; FT predicted pI = 4.0180; contains no predicted TM helices; FT some similarity to Q943W8, putative bzip (762 aa, Oryza FT sativa, EMBL: AP003203, BAB64061); Fasta scores: E():1.4, FT 34.783% identity (36.782% ungapped) in 92 aa overlap, (aa FT 56-145 of 1MB.337, aa 83-171 of Q943W8)" FT /db_xref="UniProtKB/TrEMBL:Q7YYB1" FT /protein_id="CAD98574.1" FT /translation="MESEPVINQVLEEIKQSIILINESLGSEKYDISSLSNLSLSTILD FT ICDVRIVDELPPIQTRTEDLLVIPYENSQENLSISQIHNRSSSYIIVTSPNPVSTQNEA FT IGGASWTYYDDCESTSEGSFLTEDEDDDGTSDSEVDMSDLDRSFKNSSGINLEKSDDLL FT HKGGNCNSFIEKFGRNHLNDSSNNEHFKEVFPISFRSNIPLICESFVDNQENSMSIPID FT STNILKRQLVKEINEEERNFKMVKYDSQLPIGRDQVFY" FT repeat_region 227678..227685 FT /note="(ag)4" FT repeat_region 228106..228114 FT /note="(t)9" FT repeat_region 228610..228617 FT /note="(a)8" FT CDS 228619..230712 FT /locus_tag="1MB.338" FT /product="repeat organellar protein, possible" FT /note="1MB.338, predicted protein, len = 698 aa, possibly FT repeat organellar protein; predicted pI = 5.1277; contains FT no predicted TM helices; reasonable similarity to Q25662, FT repeat organellar protein (1939 aa, Plasmodium chabaudi, FT EMBL: U43145, AAC63403); Fasta scores: E():7.4e-08, 24.094% FT identity (27.717% ungapped) in 635 aa overlap, (aa 113-690 FT of 1MB.338, aa 39-647 of Q25662)" FT /db_xref="UniProtKB/TrEMBL:Q7YYB0" FT /protein_id="CAD98575.1" FT /translation="MEIKKITQMQQDEIFQENLELFLRNCELNREVGWREVENTDLLVR FT LNDYKENQKSLELFIKKLKEDEIDSCLGEISAKSQLKESQKIVRSLEKYNEYLKAMLRK FT YMNAGCHINKILNGEKITSEGSLENESEDEFRENLKKCLNILELLSMDIEEQVKYKDKY FT KALKTENKDLKGRIEKLTRVKPSKKVRMEVSLTIIEPFEFDKELIKDTQIVEDVFEQIK FT NDPVDKSLEEEDIDNKEIDISYEVECLNYLIKQHKKENFELKQKIFEYEFKDNKLDVEA FT LIESKLMDKQRELDELEEEYSNWVIRLQGENSELRKVIEIQEKKQSEFIDNFTKKSQEK FT AKEMQNMQLKLVENYEERLREKNEELRKLRGENIQGEKDSIEIESLNKIIKSLEEKLVS FT ISSERHRLSLELEKTAKDLKDTDKRLVLSEEEIKKREKEFDSLKAEQEKILEEYYHEKK FT LHQKFEQDAHSLKKELDNVLSDMYKNQKKTRSGIVGDSIDSDKISSIEAVIKHCKELEG FT KVEELGCELDQLKSGKSVTYTNSSHNNAEIETLFDQVQSLRKEREQLKKDIRQRDWAKV FT EVEILHKRITEEIDQLKRDNTRLMLENLRLRDSGGIQISQRQNFLNSNNENIQANSTRS FT SISGDKQELLNKGTETNPSNLEETNHNNSNVSVSRNSIQSILNLKAPRRPSLLVQQRNY FT SEK" FT repeat_region 231069..231076 FT /note="(a)8" FT CDS 231138..231890 FT /locus_tag="1MB.340" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.340, predicted protein, len = 251 aa, unknown; FT predicted pI = 8.4246; contains no predicted TM helices; FT some similarity to O24300, ptxa protein precursor (352 aa, FT Pisum sativum, EMBL: X67427, CAA47812); Fasta scores: FT E():2.1, 27.451% identity (32.184% ungapped) in 102 aa FT overlap, (aa 136-227 of 1MB.340, aa 60-156 of O24300)" FT /db_xref="UniProtKB/TrEMBL:Q7YYA9" FT /protein_id="CAD98576.1" FT /translation="MEIIRNIINIYLKKFLQLEDLKWDINQGLQVDHVRANSQSINKQF FT EEKGIPIQIYNGTLDSIKITYSPTNGTFQIHIKEINAQIKPRVLSTVGKKIQQGIVNII FT LDEDPIEFIDSYSYIRDLPISYLEQASKKSESSIADPDLIPDPPRFPTAALKIKQHTDY FT PPKYYPPLHFSIYKPKVTKMPATSTPLIFQNRCQEHHLPSSPLVYGPRNQSVPILQDFH FT SQFQPSYITNNNCENGHVRALRESGFCI" FT CDS complement(231936..232958) FT /locus_tag="1MB.341" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.341, predicted protein, len = 341 aa, unknown; FT predicted pI = 8.9518; contains no predicted TM helices; FT some similarity to Q94174, hypothetical 99.7 Kd protein FT (869 aa, Caenorhabditis elegans, EMBL: U70848, AAB09108); FT Fasta scores: E():1.5, 21.849% identity (23.744% ungapped) FT in 238 aa overlap, (aa 1-235 of 1MB.341, aa 91-312 of FT Q94174)" FT /db_xref="UniProtKB/TrEMBL:Q7YYA8" FT /protein_id="CAD98577.1" FT /translation="MEDNSGLINEKDFILKSRLEVKLNDLEKRHLETKEREAQTVGFLE FT RKLRQVELEHKTELKLRRQLRYEVNNLRKKLLETYQLYSNTLEHNELMRSEIDRLKGDS FT LKSKCRQVDLAVSRQVEIEINELFQNDCYFNNENSRFEKKNRLNLRGLNEFDLKESRLS FT MSFDARNGRHSFNRGLPSRKNSFFGINRTAERIKIPSDGRTLGKAISGNESSKDLNLKS FT ERAKDWLNQIEELYKPLEYPKLCNRMYKNELDSFSGIGIVDNSLNMGEDLLSDEEYYSG FT EYDVSENTEDVITNTGFIIDQINQQDKELSTLLNKTFISANKNRTASKNKLRNSRSNRK FT " FT repeat_region 233084..233091 FT /note="(t)8" FT CDS 233321..234541 FT /locus_tag="1MB.344" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.344, predicted protein, len = 407 aa, unknown; FT predicted pI = 8.0532; contains no predicted TM helices; FT some similarity to Q94LV7, hypothetical 24.7 Kd protein FT (217 aa, Oryza sativa, EMBL: AC020666, AAK43498); Fasta FT scores: E():2.2, 25.175% identity (27.068% ungapped) in 143 FT aa overlap, (aa 134-272 of 1MB.344, aa 5-141 of Q94LV7)" FT /db_xref="UniProtKB/TrEMBL:Q7YYA7" FT /protein_id="CAD98578.1" FT /translation="MSHIRPVIKLKGIEPIWKNSTDILSFEETSIIQKSLKHMSNEIKE FT LKECKQSLSYLEVVDIYKRFGDLINCFVTHFDFNRVKIFSQRRSIAKYSYHILICIINI FT IEDKSFQLDKESLVKSCQICLNYYFPSKAIKQMESNSNKSEFDLNSSSQRFDLNRSQDL FT CSTPPRSSVSMDISNCLPNPSPFNLSLISPGSSSISIYQQPITRYNSSENCLSEYTNLH FT IQHNQEEPFSCESEFFPNEFQLPMPIEEAKRRLREKLLRKEISDMKKQKNKELDYQSDY FT SNLSSPLKSKSLLHSWGSEQSDKSELHIEDLAGASSENPYKSNKVVNIKHNLQTQSDFW FT ESTKVAAQLAETLPEPHRTRRIRKEIYQQSIERSVELLKSLGICRNFRPSVLTKTMIHT FT VECIIDY" FT CDS complement(join(234589..235287,235355..235912)) FT /locus_tag="1MB.345" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.345, predicted protein, len = 419 aa, unknown; FT predicted pI = 8.4094; contains a predicted TM helix FT region; signal anchor predicted; some similarity to FT BAB89670, (417 aa,, EMBL:); Fasta scores: E():0.04, 36.986% FT identity (38.571% ungapped) in 73 aa overlap, (aa 298-368 FT of 1MB.345, aa 277-348 of BAB89670)" FT /db_xref="GOA:Q7YYA6" FT /db_xref="InterPro:IPR004263" FT /db_xref="UniProtKB/TrEMBL:Q7YYA6" FT /protein_id="CAD98579.1" FT /translation="MKLSCNFLILLEFACMLIITGGIIIYLYIQEFPSNNNPRSFFLEN FT YPQFDYLDCNLNFDGNLDQINLQNITKQNRDNTTEICLSDEQIRIFNLTKNLRPRLNTN FT YSGYMGPWIEDGVFCNWITQYSTKQIKVCNESEPIPPVYIPIFWTSIHRNKVELDLKKE FT WKKEAQDVLNSLKNETMYFTVLQDAEGFKKSKLKFMSMSNLIVFNAGGATTGFKQVPIP FT LIKGELQYEGLKAKKDIWVSSTIVKKHFPVRKKLFETFSYYNVTDEMLDKITPFKVLEN FT VTNQFIHYQGDQFKQVIQRSTFHLCPRGFGRTSFRLYESVQLGTIPIYIWDDVNWIPYG FT NLMERLGIVIHISQIEDLFDILNSLSEDELKFKFEQIKKFKHWFTYLGITNYILKVIKR FT LPPDIILKQNIESKYLALY" FT repeat_region 234857..234864 FT /note="(at)4" FT misc_feature complement(235826..235891) FT /note="1 probable transmembrane helix predicted for 1MB.345 FT by TMHMM2.0 at aa 7-29" FT misc_feature complement(235847..235912) FT /note="Signal anchor predicted for 1MB.345 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.281, signal anchor FT probability 0.695) with cleavage site probability 0.137 FT between residues 22 and 23" FT misc_feature 235867..235874 FT /note="tgcatgca" FT repeat_region 235943..235952 FT /note="(t)10" FT repeat_region 235956..235966 FT /note="(a)11" FT repeat_region 236198..236206 FT /note="(t)9" FT repeat_region 236321..236330 FT /note="(a)10" FT repeat_region 236337..236344 FT /note="(a)8" FT repeat_region 236359..236370 FT /note="(taaa)3" FT misc_feature 236651..236658 FT /note="tgcatgca" FT repeat_region 236766..236773 FT /note="(t)8" FT repeat_region 237018..237027 FT /note="(t)10" FT repeat_region 237232..237239 FT /note="(a)8" FT repeat_region 237402..237411 FT /note="(at)5" FT repeat_region 237414..237423 FT /note="(t)10" FT CDS 237471..238736 FT /locus_tag="1MB.347" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.347, predicted protein, len = 422 aa, unknown; FT predicted pI = 7.0484; contains a predicted TM helix FT region; signal anchor predicted; some similarity to Q8YJ74, FT hypothetical protein bmei0212 (299 aa, Brucella melitensis, FT EMBL: AE009464, AAL51394); Fasta scores: E():0.23, 26.708% FT identity (28.477% ungapped) in 161 aa overlap, (aa 140-298 FT of 1MB.347, aa 4-156 of Q8YJ74)" FT /db_xref="UniProtKB/TrEMBL:Q7YYA5" FT /protein_id="CAD98580.1" FT /translation="MNLHNDLTSGKAVKFQLLAFGLSILIFMGGTFYMLINSSRNLENG FT KQRIFLELEEKDITNQKSLDELDKLNDSMLRISLKLSESKDMFDTIGASINKIAKYVEA FT NEKNMTAKEFIIKEESKNGFVLDSVLFPCKYSFDGPIYILLTTTPKRIDNLGKYLDLLH FT KQTYGIKEVILSIPYIFERTGEEYPPIPGYLQDKNRFPLLRILRGKDYGPATKFLLPIE FT IGNIPEDSGLVILDDDTRYSRHLVCDYIYIHEKFPEAALGRRGQAFHDKCDPTYRTDRT FT HRVSHDQKSANFVIRSVDLLSGVGTYFIQKKFISKDILTLKNNCPTETINHMFFTDDIL FT ISGYLAYKNISRIAFSDELSIEDLNRPYLLGSGPGALWDINKETFHNDNSTAAFGLYWG FT CRNKDTIVSKHGEILCRWRNEV" FT misc_feature 237471..237566 FT /note="Signal anchor predicted for 1MB.347 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.065, signal anchor FT probability 0.805) with cleavage site probability 0.015 FT between residues 32 and 33" FT misc_feature 237513..237581 FT /note="1 probable transmembrane helix predicted for 1MB.347 FT by TMHMM2.0 at aa 15-37" FT repeat_region 237891..237900 FT /note="(at)5" FT repeat_region 238792..238804 FT /note="(t)13" FT repeat_region 238882..238889 FT /note="(a)8" FT CDS complement(join(238897..239654,239691..243111)) FT /locus_tag="1MB.350" FT /product="related to nuclear protein sa-1, possible" FT /note="1MB.350, predicted protein, len = 1393 aa, possibly FT related to nuclear protein sa-1; predicted pI = 4.9344; FT contains no predicted TM helices; reasonable similarity to FT Q9C2K7, related to nuclear protein sa-1 (1226 aa, FT Neurospora crassa, EMBL: AL513442, CAC28644); Fasta scores: FT E():1.1e-05, 23.184% identity (25.857% ungapped) in 358 aa FT overlap, (aa 2-344 of 1MB.350, aa 94-429 of Q9C2K7)" FT /db_xref="UniProtKB/TrEMBL:Q7YYA4" FT /protein_id="CAD98581.1" FT /translation="MPKRTKALNRDPEGFAIPEKLRNTPSPDADPNHKENWTISNYQLS FT KRNFSKTQLQTRCRLFKTIRSAVENPKQSKTITNSVFNQIRKGLLERPNHSSFTSSLKE FT IIRLILEGGGIPSVFLQSISDWTGSDSAEKCLENLIIDLQDNNTSILQYYPIRPKKPSD FT LYFCEIFTDFFLGIGDVLCKSIVKDINLESIAITCKWIKSTANASIRPLRHSGCVALNG FT ILRSLILTKLELLEKLSRFKIQFENESPGTGAKESLEEELSKTSKLCERLDSIIKDIVF FT DSWKSRVQDVSPEIRNNCLCTLSESILQPHMASIVCEAEIPSMLISLLPEELNSNRIQI FT LRSILICIETKDILNKCSVILTDSPFLRSLRNLVALASKLPSDPEVLSCGELAIRVLIG FT LLKNELLQEEFVDEIVDLLWLGPSSPAISSLLAEFVDSALFEGGISHENIEARELIELA FT CHQIENKKVKSLINLEELKTLEPGRIRSDLQTFLEFIHEFGRDLVVLTHRCVNAFWSKA FT PCVRDSKFLVEMLLATECSSSSQLEPLGEEMRKVLLLVLHANVSRIEDLLFIEPEGTSS FT LLYSKAESILTRFRAFVAISVILEYMEPLILLHEQNTDYLTILLSTFSTCISLYCKIIQ FT DKENSNFIPSGMRHFPIKKLIALIQSHSDSRVIDGICMTFSPIASLDCKNLQFNENSLI FT DIQSDVESLNKKLLDTFFASGREFLANTDGLLYSNVAENTSNSKKSKKSSKKSENAQNS FT ALNTMCNFRKAFGVVKYLTTSNIYLSSQYRDSLSTLKSSEMSPEQGSGVPITFEILFEV FT LERTERIIQLNLDQILSHQCTQLYSLTLDMLTTGYAHLVQDLLQGVDVDSIENNEIGNE FT DDQVSENNYLTLSNLKESTIEIYKAVRHKLSSILLESLRNNIKHKSTDNQCLNLISLLS FT LCSFLVITGLQGTIENNLRDPDADCELKWALSDSELALITKELIYWTCTNRSKNVNEDF FT KISSLSSSLIEGFNGILNFSDKDKPYYLLYPSCRIFEPLDNLKDEAFVTTPETLGIYKN FT KISDFLSHPFSPGCILSIISISSQYKTTLQVFTPILINYLTDIEDVMINNYYLESLHSN FT LGSETSKEKLAFANNFIESALLKEDQECLGVFTYNHARAQRLSKKLFPTFSSRFGWRRV FT EEIIKSNTSDLSRSILECLRFSLFGEISIPVFRDIISRDELIEKGIKTKGIIKLFLDFL FT MTQNAPGGGKTVINSLSAPEIQRILEQAKSLCGIDEDNSSGGIRLSVMLNKELLDNDIM FT NFLDIISGRSEKKISNKNQISSKKSTKDSENHSISTSIENSIVDPSKSKNNRSKRKRPS FT RASTSKVMYLEDDDDDEEDSIDFDEEDDIVEDSNLSENEAEIEILNDQE" FT repeat_region 239000..239011 FT /note="(tca)4" FT repeat_region 240781..240788 FT /note="(at)4" FT repeat_region 243158..243165 FT /note="(t)8" FT CDS 243347..247750 FT /locus_tag="1MB.354" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.354, predicted protein, len = 1468 aa, unknown; FT predicted pI = 4.5016; contains no predicted TM helices; FT some similarity to Q9W1E5, CG3060 protein (802 aa, FT Drosophila melanogaster, EMBL: AE003462, AAF47123); Fasta FT scores: E():0.36, 21.497% identity (26.107% ungapped) in FT 521 aa overlap, (aa 592-1100 of 1MB.354, aa 307-747 of FT Q9W1E5)" FT /db_xref="GOA:Q7YYA3" FT /db_xref="InterPro:IPR016158" FT /db_xref="UniProtKB/TrEMBL:Q7YYA3" FT /protein_id="CAD98582.1" FT /translation="MIIENLLADGLCESTTPDNLSELSEQVLEAWAMVSISAKNFVSDI FT NKVLDEYLYNSKVYEKVETYQDNIFNGFQSVPKSPESKLQLERLVSTLETPEITEALEC FT IYRNGLSIQILESFLGYSLDYIISKEITFFWTTLLLLGETDTSDSRMTSNDTNSVDGSF FT VENKDYSKDGKTLIPMDRDILSFHSCLIFGLVRLLWNIVFILYGATSLVNIPNMERSSF FT VYDCLSQKDQVFPNNLGECPLRRVIYGFMTKIRLLLIESIPSDFDLVLERYIFGILTFL FT SEKILNNDEFSIKNDISPIIMREYLLKVLINDSCGNTKKRMEICKSRLENLQHIYEHKA FT SRVLSLNGSFPSNLDFETLEVLLSLIGCSNPNEKCIDQLNSSFSDNRYIGEEIVRHRNT FT GDQDNWDNKKIQEILRVIAQGPIIEDLYKNNGENSLEELSRIHTTLLDELIWLVFCSIE FT QEKEYVPNKSSQITLMTRIGDLPVLMKLVGFEHLWKKRVVYLLKMQSVNAVLFLEGRKT FT EESNLRIYSKYVHEFMGPILMGLLDGGFGYIKSQDFELVDQENDKSESSGMANFQVNNN FT ISMIGKECEPMNGDLAFQMIFEEFYMEYNWSLKRRVIDLIMEFPRSKSSILDLYITMNY FT FPSSNILRDVWFSEISREILNYVNGKLLHLHVDTSIIVGFYVKSIIFLLMLDFPNDRMD FT KTLISFSDALRQRGDTTLCIVSWMPLMLENCSPALSDSEIIMPISCSDEGVYPQFSISN FT PNYRNECRGLFEKSFDQIQEYSHCFHIPQIKLVLSWISRIYGSNLTLLYDYIYNLASRV FT IGSGQELGFESNSVDGTEKCSLIDENTWEIDERRFQKDESVYEMIKLTIGGRSGENLLK FT GGSGSSKFGSKDEQQLLTNCSIILQDINSSILDNKEYSKSREGSLLNAESKPTVTGFTI FT SRNYWSEGVINMEIRDDTFPLAPVLEDEIKEYRQFFEREHPGRTFNCFCGYGIGLVDLT FT AIDGTVKSNIALNFLQISIYDYISSKHPENDLDYPEGESSKDGFLDSDKLLSSNSILNW FT FTISPKATDRSSDRISKPLTDKEEVKVTFMDILNHFRLDEQTIRWSIENMLSRGIIQIV FT TKEEGLECFDIPQTFSNIKLTREAGGEDEKEQGREMSVAQKDESFSLIKTTEEGYGSNI FT NDMTVMLDFNNMLNNRNSNASLIGLNRQSSSLKKNFTLSSVMETPKRTPHENLGETSSE FT RASRREEEGTAFDSRYLSTRRSEIENQDNMQEGEYDEGEEEDEDLNLNFPTGIITTTIS FT FKPKDLAQDTRHQGPTASRDGPSYSKKGENNPGSSAGGGSLPPMIAFSETYLEKYCYFT FT TPSVLDQQDNGLDSSSPSNSRSNKDREHANYDIIKECELLIRATLQLNGAMAPAVLFAR FT VRAAIASQGEDKFSQKDSDDHQGSKASTNTDTQYTLTWPQHVQAINNMVDRGEVYNKGG FT RLFLEK" FT repeat_region 243658..243665 FT /note="(ta)4" FT repeat_region 244162..244169 FT /note="(at)4" FT repeat_region 244300..244307 FT /note="(a)8" FT repeat_region 244347..244354 FT /note="(at)4" FT repeat_region 246941..246948 FT /note="(a)8" FT repeat_region 247886..247895 FT /note="(at)5" FT repeat_region 248029..248038 FT /note="(ta)5" FT CDS 248735..251551 FT /locus_tag="1MB.355" FT /product="aob567, aof1001, aoe110, aoe264 and aoe130 genes, FT possible" FT /note="1MB.355, predicted protein, len = 939 aa, possibly FT aob567, aof1001, aoe110, aoe264 and aoe130 genes; predicted FT pI = 5.6038; contains no predicted TM helices; reasonable FT similarity to Q05164, aob567, aof1001, aoe110, aoe264 and FT aoe130 genes (1001 aa, Saccharomyces cerevisiae, EMBL: FT X89715, CAA61860); Fasta scores: E():0.00065, 20.629% FT identity (22.391% ungapped) in 572 aa overlap, (aa 376-926 FT of 1MB.355, aa 125-672 of Q05164)" FT /db_xref="UniProtKB/TrEMBL:Q7YYA2" FT /protein_id="CAD98583.1" FT /translation="MSFEERISSENGGIEILGGANLVCGVTDSSRGLVSVSDLSPNLSG FT IKVQFGDPSPICGITGQDALGGFQSINQKTKSSEIGTTLNESFLVSISELRYIVLICIR FT WRLQNSSELSSNVSTILGIGTGSSIAPVDSPSTCGEESITPSKEYLVSNIFLYRSCISD FT SHRVELEFQVVRDILRLIDPTVESGGRHFSGRLVNSANRNIIKELAGGLDSQEANEMRN FT AESVLGSQFLLSKLPFPNYGTFLGKETAEYTIGDIINTTEHPTGLNLANVASTPSILGI FT LLKYFDVSWPSHIFESLLKSDVVGGGLQQQEDNISSTVKDSGDVSLSLGSFTHVLGLCK FT PCVFVNKTNKKCRNGVHCCFCHFQHKERKRGKRYKSSANCSSTSNSCNNSGAGTGVCLT FT SSSGQPSHSGANNSGGIASNYEELGGGIGNTSGFGVTSNGAGILNVEGFSKQDISALAE FT DCLRVSSISPASIRGPHNIFGSNRSELSQGAVGLSSIQPAGMTAASGSQCLPDVLFSHH FT GSQQLQRYPLKANQGANNPLTSFQSSARSNSVFGKSSCPTGSEQQQQMPVRYLVPPPPP FT PPSKTSGALQIFQNEKDVNFRDNSNPVATSTSAGFNQLGTNYQKGSSDKHSMRIPGLVL FT DSCPFQSSQGASQSFNDSFFGDFIGYNKLINEASEMNNWISANNPENSSNAINQNSKVY FT GCFQWLDQPEIGESIVDAWGVGNTFDSFIKGNGASTSPQNLNQCLYDSHSYSKAAQDWT FT FASSIIKNSSINASNSASESSFTVNNHSNLVNPNSASSLQISTSQPSSQKLRESIIGSP FT LNLEASSINFHDQAQTQSQAQVQSSQALPELAEGVPNSHNDYFLSSPSVFNSNATTSSL FT PSSLSDNNNINNGSFIMPSGSVVPPFLRGNQGAGVSSEYSDCSPLSFLTGGGATNFGMD FT FCFLNSWK" FT CDS complement(251925..252665) FT /locus_tag="1MB.358" FT /product="bax inhibitor-1, possible" FT /note="1MB.358, predicted protein, len = 247 aa, possibly FT bax inhibitor-1; predicted pI = 9.9006; contains Pfam match FT to entry PF01027, Uncharacterized protein family UPF0005; FT contains 6 predicted TM helix regions; reasonable FT similarity to BI1_HUMAN, bax inhibitor-1 (237 aa, Homo FT sapiens, EMBL: BC000916, AAH00916); Fasta scores: FT E():2e-14, 29.461% identity (31.278% ungapped) in 241 aa FT overlap, (aa 10-245 of 1MB.358, aa 6-237 of BI1_HUMAN)" FT /db_xref="InterPro:IPR006214" FT /db_xref="UniProtKB/TrEMBL:Q7YYA1" FT /protein_id="CAD98584.1" FT /translation="MESFFATNSRKSQFSGNFFNSSDLTSIQQTHLLKMYSSIIAGSFM FT TVFGVTAFINGMLRINSFVGLLAGIGVTFYLTASSSNKSSISIKRLAAYLLLCFVIGNG FT LGPLILFSNFVNPVIIPTALATTCIIFISLSFGVLFTKKRLSLYTTSFIFTTIAYLGLV FT SFFNIFTRSKFVDSLLSYAFVMVYSFYIYYDTQKTLEAIAYGERDFLLHSIQLYLDAVN FT LFTKIVVILIRKQQEEEEKRRKKE" FT misc_feature complement(251961..252317) FT /note="Pfam match to entry PF01027 UPF0005, Uncharacterized FT protein family UPF0005, score 30.2, E-value 8.7e-09" FT misc_feature complement(join(252087..252143,252159..252224, FT 252246..252311,252327..252392,252432..252488, FT 252504..252569)) FT /note="6 probable transmembrane helices predicted for FT 1MB.358 by TMHMM2.0 at aa 32-54, 59-78, 91-113, 118-140, FT 147-169 and 174-193" FT repeat_region 252665..252673 FT /note="(t)9" FT repeat_region 252849..252860 FT /note="(tctt)3" FT misc_feature 253124..253131 FT /note="tgcatgca" FT repeat_region 253228..253261 FT /note="(ag)17" FT CDS complement(253336..254466) FT /locus_tag="1MB.360" FT /product="tyrosyl-tRNA synthetase, probable" FT /note="1MB.360, predicted protein, len = 377 aa, probably FT tyrosyl-tRNA synthetase; predicted pI = 7.3964; contains FT Pfam match to entry PF00579 tRNA-synt_1b, tRNA synthetases FT class I (W and Y); contains no predicted TM helices; good FT similarity to AAL77671, (385 aa,, EMBL:); Fasta scores: FT E():3.5e-92, 58.511% identity (58.824% ungapped) in 376 aa FT overlap, (aa 1-376 of 1MB.360, aa 12-385 of AAL77671)" FT /db_xref="GOA:Q7YYA0" FT /db_xref="HSSP:1Q11" FT /db_xref="InterPro:IPR002305" FT /db_xref="InterPro:IPR002307" FT /db_xref="InterPro:IPR014729" FT /db_xref="InterPro:IPR015624" FT /db_xref="InterPro:IPR016485" FT /db_xref="UniProtKB/TrEMBL:Q7YYA0" FT /protein_id="CAD98585.1" FT /translation="MSSNSCTETIPKYILKGSEEPLKRSKLTLEERHKLCLSVGEECIQ FT EAELLELLKRKEHPICYDGFEPSGRMHIAQCILKTINVNKLTECGCVFVFYVADWFALL FT NNKMGGDLEKIKIVGEYFVHIWKAAGMDMTNVRFVWASDFINGEDSNEYWLRVFDISRK FT FNITRIKRCCQIMGRQENDEQPCASVFYPCMQCADIFQLKADICQLGMDQRKVNMLARE FT YCDAAGIKHKPVILSHKMLPGLLEGQEKMSKSDTSSAIFVEDTPEAVVKKIKKAFCPPG FT IIEGNPCIEYINTLVFPKFGHFHVSRKEEYGGDITFTNKEDFHKAYLSGDLHPGDLKKG FT LSDALNLMLQPIRDHFNTNPRAKELLQLVQSFKVTK" FT misc_feature complement(253417..254301) FT /note="Pfam match to entry PF00579 tRNA-synt_1b, tRNA FT synthetases class I (W and Y), score 213.9, E-value FT 1.5e-61" FT misc_feature 253883..253890 FT /note="tgcatgca" FT CDS 255148..256035 FT /locus_tag="1MB.361" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.361, predicted protein, len = 296 aa, unknown; FT predicted pI = 8.3091; contains 2 predicted TM helix FT regions; signal peptide predicted; some similarity to FT Q94BQ5, hypothetical 33.1 Kd protein (300 aa, Arabidopsis FT thaliana, EMBL: AY039962, AAK64139); Fasta scores: FT E():0.0072, 22.297% identity (25.191% ungapped) in 148 aa FT overlap, (aa 60-206 of 1MB.361, aa 64-195 of Q94BQ5)" FT /db_xref="InterPro:IPR004269" FT /db_xref="InterPro:IPR018143" FT /db_xref="UniProtKB/TrEMBL:Q7YY99" FT /protein_id="CAD98586.1" FT /translation="MQKSLSIHKHKMVLKLLLTSLLGLLNIIVWANGEEREPFCLEIYD FT NKNDEKHLPYYFLNEFPICKEHERRTCCKKSHSEAISRLFSTLVARSSLSTRCSNFYQK FT SLCSYCDADIGVGKKVIQKSPILCQSYCNLWYDACYEDYFDNIQNSYIRNIEDISFIRL FT NLIPCTDSSAICSPLHAITLDPTEFCSLNGFSTHQDFHSSSGPASLTEYNTECFNGIPA FT ASVLKPGIRQKTQSYKYKRSQYSKKPKAKNKFYQIIQDHINVFLENVKVPLPVIVFISI FT ISIWIINQVINFFI" FT misc_feature 255148..255246 FT /note="Signal peptide predicted for 1MB.361 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.974, signal anchor FT probability 0.017) with cleavage site probability 0.463 FT between residues 33 and 34" FT misc_feature join(255181..255240,255961..256029) FT /note="2 probable transmembrane helices predicted for FT 1MB.361 by TMHMM2.0 at aa 12-31 and 272-294" FT repeat_region 256104..256115 FT /note="(attt)3" FT repeat_region 256504..256511 FT /note="(a)8" FT CDS 256684..257466 FT /locus_tag="1MB.362" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.362, predicted protein, len = 296 aa, unknown; FT predicted pI = 5.9740; contains no predicted TM helices;" FT /db_xref="UniProtKB/TrEMBL:Q7YY98" FT /protein_id="CAD98587.1" FT /translation="MKDIKKVAETIKLLVAYREKSVRIFGDGLDLRAPIQPLFPTKAEN FT ISGTENSISRSSLESRTITTDAADLLNELAAMHYEWALDNYAKVRTRDHQEQQKKCSAI FT QKEEIAAPSPYLSYLSSQYDGIDSPIVGEILENKHGYIVMECILLLLNQNNSIEYGTEH FT FKKYASLMLLLSDILRCMKRQKSLHCDTMDFHVLVDILFREMVIMRDSGFALFEYVSCL FT NIVIKCIASTKEKYRWGEATSLLSVTESLIEGFGVSSQ" FT repeat_region 257502..257509 FT /note="(t)8" FT CDS join(257734..257760,257946..258056,258351..258542) FT /locus_tag="1MB.363" FT /product="iron-sulfur electron transfer carrier, probable" FT /note="1MB.363, predicted protein, len = 110 aa, probably FT iron-sulfur electron transfer carrier; predicted pI = FT 4.2297; contains Pfam match to entry PF00111 fer2, 2Fe-2S FT iron-sulfur cluster binding domain; contains no predicted FT TM helices; good similarity to AAM14263, hypothetical 21.8 FT kDa protein (197 aa, Arabidopsis thaliana, EMBL: AAM14263, FT AL161503); Fasta scores: E():5.6e-19, 55.000% identity FT (55.556% ungapped) in 100 aa overlap, (aa 11-109 of FT 1MB.363, aa 98-197 of AAM14263)" FT /db_xref="GOA:Q7YY97" FT /db_xref="HSSP:1I7H" FT /db_xref="InterPro:IPR001041" FT /db_xref="InterPro:IPR001055" FT /db_xref="InterPro:IPR012675" FT /db_xref="InterPro:IPR018298" FT /db_xref="UniProtKB/TrEMBL:Q7YY97" FT /protein_id="CAD98588.1" FT /translation="MDQRCSSEDAPKNISLLEAAQHEELDIEGACEASLACSTCHVILD FT KEIYDELEPPSEREEDMLDMAPQVCETSRLACQIKVDERLTKGNIHLPNMTRNFYVDGF FT KPSPH" FT misc_feature 257734..257982 FT /note="Pfam match to entry PF00111 fer2, 2Fe-2S iron-sulfur FT cluster binding domain, score 35.2, E-value 1e-07" FT repeat_region 258091..258099 FT /note="(a)9" FT repeat_region 258380..258387 FT /note="(ag)4" FT repeat_region 259086..259100 FT /note="(a)15" FT repeat_region 259102..259109 FT /note="(a)8" FT variation 259109 FT /note="(a)8 in clone 29 shown vs (a)7 in clone 12 not FT shown" FT variation 259142 FT /note="T in clone 29 (shown) vs C in clone 12" FT CDS 259203..261443 FT /locus_tag="1MB.364" FT /product="putative poly(a)-binding protein fabm, possible" FT /note="1MB.364, predicted protein, len = 747 aa, possibly FT putative poly(a)-binding protein fabm; predicted pI = FT 8.4175; contains four Pfam matches to entry PF00076 rrm, FT RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); FT contains Pfam match to entry PF00658 PABP, Poly-adenylate FT binding protein, unique domain; contains no predicted TM FT helices; reasonable similarity to Q92227, putative FT poly(a)-binding protein fabm (705 aa, Emericella nidulans, FT EMBL: U70731, AAB16848); Fasta scores: E():2.7e-41, 38.205% FT identity (51.468% ungapped) in 780 aa overlap, (aa 9-720 of FT 1MB.364, aa 39-685 of Q92227)" FT /db_xref="GOA:Q7YY96" FT /db_xref="HSSP:1CVJ" FT /db_xref="InterPro:IPR000504" FT /db_xref="InterPro:IPR002004" FT /db_xref="InterPro:IPR006515" FT /db_xref="InterPro:IPR012677" FT /db_xref="UniProtKB/TrEMBL:Q7YY96" FT /protein_id="CAD98589.1" FT /translation="MTSNNNVVPVSASLYVGDLDADVTETMLYEIFNSVAVVSSVRICR FT DALTRRSLGYAYVNYNSVADAERALDTLNFTCIRGRPCRIMWCLRDPASRRNNDGNVFV FT KNLDKSIDNKTLFDTFSLFGNIMSCKIATDVEGKSLGYGFIHFEHADSAKEAISRLNGA FT VLGDRPIYVGKFQKKAERFSEKDKTFTNVYVKHIPKSWTEDLLYKIFGVYGKISSLVLQ FT SDSKGRPFGFVNFENPDSAKAAVAALHNALVTPVGVELDSTAETPVDNEAGADSETSSK FT QESGEASNKKQTASGEASKDSSGTSNEESAQNEDGSADKNVSADVQPNRLYVSRAQKKN FT ERQVVLKSQHEAVKESHQRYQGVNLYVKNLADSINEEDLRSMFEPFGTVSSVSIKTDES FT GVSRGFGFVSFLSPDEATKAITEMHLKLVRGKPLYVGLHERKEQRALRLQQRIRGGAVP FT PVLRPGAIPPGPPGVHGAPMQFGVPPQMYFIPGNPNVAATAMPHGRAMVTGGFPNQNAM FT NNPWRPNPTRMPYTAGGVPPQMTGGPQMTAYNGNVIQQNGVSPNGAANATGSVQNGVTG FT NAVTGVQGAQNNRTGGNNQRIHNRHVQGNGQGGRPGSHGHQVQQMQKQGFKFPQNVKGS FT EMQRVDMMQNRQMDSSNGALVQNPLIPQPDVPLTAATLAAASPSMQKQLLGERLFPIIA FT QFQPELAGKITGMMLEMDNNELLELLSSDIEIKNKVDEAMVVLERAQQQIST" FT misc_feature 259242..259457 FT /note="Pfam match to entry PF00076 rrm, RNA recognition FT motif. (a.k.a. RRM, RBD, or RNP domain), score 83.5, FT E-value 2.8e-22" FT misc_feature 259506..259718 FT /note="Pfam match to entry PF00076 rrm, RNA recognition FT motif. (a.k.a. RRM, RBD, or RNP domain), score 84.6, FT E-value 1.3e-22" FT misc_feature 259776..259976 FT /note="Pfam match to entry PF00076 rrm, RNA recognition FT motif. (a.k.a. RRM, RBD, or RNP domain), score 47.8, FT E-value 1.5e-11" FT misc_feature 260295..260507 FT /note="Pfam match to entry PF00076 rrm, RNA recognition FT motif. (a.k.a. RRM, RBD, or RNP domain), score 91.3, FT E-value 1.3e-24" FT misc_feature 261198..261413 FT /note="Pfam match to entry PF00658 PABP, Poly-adenylate FT binding protein, unique domain, score 112.7, E-value FT 4.5e-31" FT repeat_region 261259..261266 FT /note="(ga)4" FT repeat_region 261462..261470 FT /note="(t)9" FT repeat_region 261497..261505 FT /note="(t)9" FT repeat_region 261654..261668 FT /note="(act)5" FT repeat_region 261738..261745 FT /note="(ta)4" FT misc_feature 261777..261784 FT /note="tgcatgca" FT repeat_region 261913..261931 FT /note="(t)19" FT repeat_region 262184..262193 FT /note="(t)10" FT repeat_region 262317..262327 FT /note="(a)11" FT CDS join(262364..262809,262915..263647) FT /locus_tag="1MB.368" FT /product="hypothetical predicted multi-pass transmembrane FT protein, unknown function" FT /note="1MB.368, predicted protein, len = 393 aa, unknown; FT predicted pI = 4.2207; contains 6 predicted TM helix FT regions; some similarity to Q9RG47, cps1h (388 aa, FT Streptococcus suis, EMBL: AF155804, AAF18943); Fasta FT scores: E():0.79, 25.604% identity (30.814% ungapped) in FT 207 aa overlap, (aa 44-236 of 1MB.368, aa 174-359 of FT Q9RG47)" FT /db_xref="UniProtKB/TrEMBL:Q7YY95" FT /protein_id="CAD98590.1" FT /translation="MTQGVILEMIPRVEQTENVIEEKPVENNDTDLAKLSNKIFGNEMA FT NVPLVFDEFPVTKLLLSFFVILFICFGVIDMILGFIRLGLMSFIISMFVLFFRCGERNR FT QHSSIFVLFIVALTSSLSWNSISNYILKNEGTLFEEKIYPLLILETNTNLRFTALSAFN FT WYLISMGFEFSVLVLYLLEGFYIVSIIPFISLLVSMFYVISKSKRILQYSEIFLPFVML FT AYVILRMIYEWVAPDDYSSMIGAYLVTIMVFKQVVGLSILFIPSINYYDMINDNGSYSV FT TVVVSFKIHQNKQENAVKSADIIDISPTNRVIAENPESENREEEAITVIQVSSSEDPEG FT LIKSVQNDQIELNKESFDLETQKSHSAVEDDAEANIPYEDESLKIEINSKLN" FT misc_feature join(262523..262591,262601..262654,262688..262756, FT 262886..262954,262991..263059,263087..263155) FT /note="6 probable transmembrane helices predicted for FT 1MB.368 by TMHMM2.0 at aa 54-76, 80-97, 109-131, 175-197, FT 210-232 and 242-264" FT repeat_region 263635..263649 FT /note="(taaat)3" FT repeat_region 263785..263792 FT /note="(t)8" FT CDS join(263964..266375,266463..268298) FT /locus_tag="1MB.370" FT /product="uvb-resistance protein, possible" FT /note="1MB.370, predicted protein, len = 1416 aa, possibly FT uvb-resistance protein; predicted pI = 7.8374; contains FT Pfam match to entry PF00415 RCC1, Regulator of chromosome FT condensation (RCC1); contains no predicted TM helices; FT reasonable similarity to CAD25339, regulator of chromosome FT condensation (440 aa, Encephalitozoon cuniculi, EMBL: FT CAD25339, AL590444); Fasta scores: E():5.5e-08, 30.000% FT identity (37.714% ungapped) in 220 aa overlap, (aa FT 1019-1203 of 1MB.370, aa 99-308 of CAD25339)" FT /db_xref="InterPro:IPR000408" FT /db_xref="InterPro:IPR009091" FT /db_xref="UniProtKB/TrEMBL:Q7YY94" FT /protein_id="CAD98591.1" FT /translation="MNRIRNAVRLRRKESQETTGKKSEGEINFQSENSKGLNIVGKENS FT GEIGIPEILSYFPDLYVDSSNGFSANGGGVGQMICGTISSFFEDTWNRAMGEMKQVKSR FT NICNLSSDLSLGIHLKREWHRLWLVRSFLAFAWESYLEDLMIENDKFPKEYLNLPSYGT FT NLEGIGKISLGEILCIVSCSKLTYSEKSTISIPFSTIQGKMIIKKSLSFNYNLHWKTPT FT PNTVAKFRVLQDQKKLPKTNTSNVEGGDSVFMSLDIDNPQFLWKSSFSNQIPKEFEDLQ FT TLASSVRVLRKDETAKFTIDGKKSIVIKLLNWYRETQDIITYENKNVNCKYFIEQEDFY FT FDNDLNNSQIQGNVTSVEYSITSDDNRKRKLFTGLTYISNGSYFSPMLGTVMRGNSKAV FT IYITKELANLELGILKKHSDNSKFSNCLRSIVTGMDSGKRYFLHLEIPSIEKELQDLQI FT KEKNKTLMFKHVIQNKKQFESRIQLLQDLKAFGHLVLSKFSIGNGESAIIENVYIREIG FT NDPILLETLRVKLNQHLLENNLFEVPFVFFEIENHSKSKYQAFIVCPCIRNYNELLFYS FT HFRWLKGEYIKKKRFEISPNEAYSRHIFGDLRQTKEQNHQRNDQIQMGMSHYNKGVTLL FT VELGRFNFHLEEGILEDLRDNRSLYSLEWEFEWLNNNSEGGLSLFFHVLRTSFFILYES FT YGGAISDLNQVLLDTLNFTNIQKWKLSLRFEKMMEFLILETKKRAGSIYFNIIGLLSQL FT LMEIHLEGKTSKRLVEEILNKVRESMKHASVQIILLDYLVTWILINVSVFKDRKDQVQI FT QLEVLCERMTFTGIINSILPVTTSNMKGENGKEAFFKYIMRFYIFEKLSQFLVNPMDAE FT TQYNNEKISYFNLLREEPTLDILSLGDNRSGVLGLGSPRIQLFSDLECNLYRGLSKINE FT TFLLDRVEEEKTKILQDVMISDESFLVKGISSIAYGTDHIVILGKEGDILIWGSNSSGQ FT CCIEKKPISKKINAEFNNEKDSKELLKRIEDYENTIFYPTRISCFSNICDKITISKIDC FT GAFFTLALDHNGDLFSWGQGRDGSLGTGSYEDSFVPQKIKLENKVKSFSAGMFHCGAID FT EKNQLYVWGSNEFGQLGTKLYMDNKNLNTPFKILVKFSRSSVTSSKITLVALDKQDSGN FT TEEIVEWKGVSFGEAHSIALDTNGLVWVWGQNNLKQLTGIPEILETEIMISNNIISSDY FT YRKLCSQVIFPTPLVSTDKVVVFASRKINMIFSGSTTCCAIDEQGKPWLWGLSFSGIDH FT HNVNSIFSRKNTSGSTEHKNLIEFEFPVRVFRNITPDFDSIRTVRFGKGNNNISMVSRL FT GRCYLWLENKDLVTDQMKYLNNHNCFILEDLQSISFRNNHLKNSRHHSILDVALLSDSI FT IFITKKTNKVKNN" FT repeat_region 265091..265098 FT /note="(at)4" FT misc_feature 267138..267281 FT /note="Pfam match to entry PF00415 RCC1, Regulator of FT chromosome condensation (RCC1), score 31.0, E-value FT 1.7e-06" FT repeat_region 268321..268329 FT /note="(a)9" FT CDS 268507..269811 FT /locus_tag="1MB.372" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.372, predicted protein, len = 435 aa, unknown; FT predicted pI = 4.8257; contains no predicted TM helices; FT some similarity to RA50_SULTO, DNA double-strand break FT repair rad50 ATPase (879 aa, Sulfolobus tokodaii, EMBL: FT AP000988, BAB67212); Fasta scores: E():0.17, 25.463% FT identity (28.278% ungapped) in 432 aa overlap, (aa 2-420 of FT 1MB.372, aa 274-675 of RA50_SULTO)" FT /db_xref="GOA:Q7YY93" FT /db_xref="InterPro:IPR011990" FT /db_xref="UniProtKB/TrEMBL:Q7YY93" FT /protein_id="CAD98592.1" FT /translation="MEEFHRIPIVIVEDEDELERIVNEGKNKLEVQDEDEDKKSRIDPK FT DLANRMKEKGGQCYKEKKFDEALGFYLKALGILELEQSLKENGEDFELLMGEIVLRSNC FT IACYVEKKDFERAVSESRKLLKYINEEGDFYLKKNGELPTLWVNIENKTRYRLSVSLYN FT LFGCGDVFSDEISKSSINESFEMIQTVLDYYQHWLKVSPPHEITVLYSKVKRAIESQNY FT ETKKVNSSLENLDFREKEVQSQTEKQDEEEISTKQNESHYLTSYITCFLEEYFEESGSC FT PKSIKSPIQIVDKMNCNNSIEFLRIWQNIYQNNQFIDYYLFFSHIFTNLDKIYNKTEIE FT VNILEKILERISVVLDELLKNSFQSKPKEKLISHIFKIIQNLEKTKRFDFVVLMLNENF FT QILNKMVQFEKYLSNNLLQEIKHFQENQTSKGIQI" FT repeat_region 270091..270101 FT /note="(t)11" FT CDS 270262..273300 FT /locus_tag="1MB.373" FT /product="putative arabinogalactan protein, possible" FT /note="1MB.373, predicted protein, len = 1013 aa, possibly FT putative arabinogalactan protein; predicted pI = 7.2004; FT contains no predicted TM helices; signal peptide predicted; FT reasonable similarity to Q9LM00, putative arabinogalactan FT protein (236 aa, Pinus taeda, EMBL: AF101785, AAF75821); FT Fasta scores: E():2e-05, 31.220% identity (34.225% FT ungapped) in 205 aa overlap, (aa 130-330 of 1MB.373, aa FT 33-223 of Q9LM00)" FT /db_xref="UniProtKB/TrEMBL:Q7YY92" FT /protein_id="CAD98593.1" FT /translation="MKVNRLFSPIGIFIAILLVNSFGPSEYIGLVDAAAAAAAAASKES FT VSLLSVLSKLVALKNYWKRLHKNRMKQPNNKGLISEMVRVEGEIKDLEAEAGNEFGGSR FT ADQVFNQAELRLSGMDPSGLLLEPKTLPKDPTVPASAPAKEARFVVSPPTTQTVTSVAV FT PTETPAVSVPAVPVPAPTVPTPAQATKVVIPPPVIQTATSVPVSGPASAMPSRVQATKV FT VVSPPTTQTATSVAVPTPEILTSAPAVPVPAPVVPTPAQTPDIEIGSQDQEGGVVTTPE FT YPGHLDIPGETGPEDFPQYETPGSPYSPTQTPEQFPLGSFSDNWKSQFVGTKQSDFLKF FT GVDGSEYESDPAAKAAFSRYIQATSNGVSDFSSSLRSLSECKIIAERHIIFLQAFIHHF FT SALSNIQLQPELYCLLFTNSMKPNIFSFVSNLKNYLNNNKYTFEEEHLARSVNIGLNYL FT ATKTFTRKDLTVTGVKKLVKENRELALECDADNVIFATSLLLLTNIYSDNLGMSSQINL FT ADICSILKGYQDLESRVREIVSLFYRAGDVSRHSLFDLSILVKESLESAYRALSIPFSD FT YFLDTDKKTLKSLTPMSTTINVDEIIKPIDHDDVIRTSDRYISGKVPIPEIPGPPSRRG FT SGPISDRVIPTSTDEIFKPLRTSKDYQPFKRGSLLHPPTPENLPDKASGRGFVKSLQVH FT MKPREYETEFIYGSSGFVSPQLTYFKRVSEREYNHDKKIKQAKEEQKRLEKKREQRMTR FT GYKDLRGVKELDGADESVSELPDISEEPSPLPEFEEPERIIPSKFETKLHKLVPKSSSI FT YTQPSQRFHSTPLAYTTLKPMKGMLSTESRLNSDLSSELELDEASEGTIGLPSQKGDDA FT AEKSHEGDVSVSDEFSNLSISKSEKTLTTEVAARELLIKCTDLSRSQKKRALSLFRVLK FT YLYKGHTTGWYITAFCRAVYFAEKDVCISKGVKSLNIKEMASECFNALNSSPFIKKNPE FT LSNLSRQVCYSYYKKRAATVCLG" FT misc_feature 270262..270381 FT /note="Signal peptide predicted for 1MB.373 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.949, signal anchor FT probability 0.029) with cleavage site probability 0.183 FT between residues 40 and 41" FT repeat_region 270364..270384 FT /note="(gct)7" FT repeat_region 273214..273221 FT /note="(a)8" FT repeat_region 273566..273574 FT /note="(t)9" FT repeat_region 273722..273729 FT /note="(t)8" FT misc_feature 273773..273780 FT /note="tgcatgca" FT repeat_region 274356..274363 FT /note="(a)8" FT CDS 274785..275207 FT /locus_tag="1MB.375" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.375, predicted protein, len = 141 aa, unknown; FT predicted pI = 4.1970; contains a predicted TM helix FT region; some similarity to ENP1_YEAST, enp1 protein (483 FT aa, Saccharomyces cerevisiae, EMBL: Z36116, CAA85210); FT Fasta scores: E():5.5, 27.407% identity (29.839% ungapped) FT in 135 aa overlap, (aa 5-132 of 1MB.375, aa 36-166 of FT ENP1_YEAST)" FT /db_xref="UniProtKB/TrEMBL:Q7YY91" FT /protein_id="CAD98594.1" FT /translation="MASSLLENDIISQNIGNGLISKKKSKKKLNKFSKSSNSELEKQQN FT DKKEYYPTKNAMMESEDDDLTDEEFEEYEEYEEDMSNFDYLKIFKIPLILFILSFISVL FT FVDLFGKDYLGNELENGIYNMIDPNYGSFIWNLSGK" FT repeat_region 274860..274868 FT /note="(a)9" FT misc_feature 275046..275114 FT /note="1 probable transmembrane helix predicted for 1MB.375 FT by TMHMM2.0 at aa 88-110" FT repeat_region 275230..275237 FT /note="(t)8" FT repeat_region 275248..275259 FT /note="(aact)3" FT repeat_region 275619..275626 FT /note="(t)8" FT repeat_region 275792..275803 FT /note="(a)12" FT repeat_region 275815..275826 FT /note="(a)12" FT CDS 276075..277874 FT /locus_tag="1MB.376" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.376, predicted protein, len = 600 aa, unknown; FT predicted pI = 6.1619; contains no predicted TM helices; FT signal peptide predicted; some similarity to Q98QH9, FT hypothetical protein mypu_3820 (1183 aa, Mycoplasma FT pulmonis, EMBL: AL445564, CAC13555); Fasta scores: E():2.3, FT 22.015% identity (24.082% ungapped) in 536 aa overlap, (aa FT 50-566 of 1MB.376, aa 663-1171 of Q98QH9)" FT /db_xref="UniProtKB/TrEMBL:Q7YY90" FT /protein_id="CAD98595.1" FT /translation="MIFLLTCIYLFGITLLEFKANAQNILDDCVKAGLASYKRQNVFQK FT FRLRLDVSDNISKYENFLKNYNTLDENISFMDEEYLQDISRFLFDRNEILIDYFWDSMM FT DSLPPTWHVYGIRPIQASIIRTRFMESCVSNIYSLYKEGKIEKFHTAIPNLRDARYLTS FT GKVNEICRNIKGKIENFGFEIPPVIKTIGRSNLPVYYRCSEIQSDELADVTIFILDKKI FT PKFNIPKQKICEIVNVMIREPASFHNSCFGVLSFGLRGYLPIDNETRDSLDFACKIMDN FT IRKYTRLFSKEPIQMPNKNIKDIIVENLLRIYPNIELTGIGMEIVEQVKISEYRFIESC FT TNFSETLFALQGFKIIEKQVSKQNGMKTNGSSDEQKDEEDKEIDNNIAHPFEGDPRDLR FT KKLLLACSGIHMHLFEIKPKGVSFTKLINSRIIVQEDFSKVSKSKLLYLDMCDNKNAVI FT LDKHSITQDTLAEIILSAILHSSSKSMSSIALYNGFVKEYICSISGEIISSLRNNFETF FT PKFEETGFMNSCRQIISEEVFPRQKEKSMNFMLKKSIIKLCSIVVNSLIQISQLLNSNS FT SLDNIKGQDDDELTSSEYLSEEI" FT misc_feature 276075..276140 FT /note="Signal peptide predicted for 1MB.376 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.949, signal anchor FT probability 0.000) with cleavage site probability 0.718 FT between residues 22 and 23" FT repeat_region 276478..276485 FT /note="(at)4" FT CDS 278440..280392 FT /locus_tag="1MB.378" FT /product="hypothetical predicted transmembrane protein, FT unknown function" FT /note="1MB.378, predicted protein, len = 651 aa, unknown; FT predicted pI = 6.8020; contains a predicted TM helix FT region; signal peptide predicted; some similarity to FT Q9GQR0, extracellular matrix protein papilin precursor FT (2174 aa, Drosophila melanogaster, EMBL: AF205357, FT AAG37995); Fasta scores: E():0.37, 29.452% identity FT (32.090% ungapped) in 146 aa overlap, (aa 15-156 of FT 1MB.378, aa 736-873 of Q9GQR0)" FT /db_xref="UniProtKB/TrEMBL:Q7YY89" FT /protein_id="CAD98596.1" FT /translation="MNKKTYVLGAIGCIILAVGGVLGSETDLYGSDILGSAGEKSSSMF FT PSNEESASASGEDLAGISSDSEVTSSSGSGIDGSNLDMSKSQSQGDLSASVGGIAPHSD FT FVGGDASEGLRSSLASSIQPDITVEPSTSQQRSIGFVDKSTDDTSKPEERSSRMEFVIT FT RNAEIAIENLFPSLESANILTVSDISKYPSVELNDEKVLESQLNKKINYYKVELPMTHD FT TISGLSTSDPRVEIGVKSKFELLVDENQPRMERILRFLGSNPVDLAGQDKFDLATILFS FT VVAYGECIKRLNGINSALDVVEICTQLSREDVLTIQPEDLPKNVLKEKIELFKLTGTVI FT SIPLPLFTEASNLKWRHVSNSNGIYQVYNSGAIDGFAHFDFLYKKLYDIINTIHYPGFQ FT FIDDVRVDLYYGMKGIRLQLGNGEQSRRVKKNISNIMEFYDDWYKGPKSKRLCSQNYYN FT SLLAVGPIYDAVREMTKILKSIQDEMVANPSVVEEEFNGIPRQSLFNIHFNERKFFTNN FT VVPIFTNINDAVKAGNVKPIRSEILRILAILEEIEFNSKEVIKVLQDLNKCIKTSGHNF FT NYELGYIKRDYYKLVKAVRKIHPRHPFPLPSSPLYIKGKKFGIKNLFRKNAARAKLGYY FT REINPFNKSEKTSKK" FT misc_feature 278440..278508 FT /note="Signal peptide predicted for 1MB.378 by SignalP 2.0 FT HMM (Signal peptide probabilty 0.973, signal anchor FT probability 0.025) with cleavage site probability 0.950 FT between residues 23 and 24" FT misc_feature 278458..278511 FT /note="1 probable transmembrane helix predicted for 1MB.378 FT by TMHMM2.0 at aa 7-24" FT repeat_region 280404..280411 FT /note="(t)8" FT repeat_region 280696..280704 FT /note="(a)9" FT repeat_region 280725..280733 FT /note="(t)9" FT repeat_region 280752..280761 FT /note="(ta)5" FT CDS complement(join(280853..283570,283604..283944, FT 283981..284673,284716..285349)) FT /locus_tag="1MB.383" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.383, predicted protein, len = 1462 aa, unknown; FT predicted pI = 7.8473; contains no predicted TM helices; FT some similarity to Q9EN29, amv019 (524 aa, Amsacta moorei FT entomopoxvirus, EMBL: AF250284, AAG02725); Fasta scores: FT E():0.5, 21.114% identity (24.332% ungapped) in 431 aa FT overlap, (aa 871-1257 of 1MB.383, aa 107-524 of Q9EN29)" FT /db_xref="UniProtKB/TrEMBL:Q7YY88" FT /protein_id="CAD98597.1" FT /translation="MNPNCSILGNVLYNDSSVLKTITGKFFPLNKNRQFLDISTLKYNE FT LTVFSEFIPSNNEELASITVKLEDTCVDFWRYSIQESQFDLILVLDCSQNLHLFSLKSV FT SIPHSKHDCPYISYKIPNSFIAFPTSINTIIKDHNIKENEPNIFSNQLLIKNEFSISLS FT KNKTIPKINFIRPYEYSIKFANSFPIIFEKLIPEYCIAIDNDTGYIIVLNKNGIIEQLE FT LKIQTENLFHFDSLKYKLECISFSKTCKNYTKIQPIVFCLFSDLKNQILYGKTFFFDWN FT NLLKNQINPRDNYQKPISETNLIPLINSSSTSFSPQLYQMRIFCTHIPVIHMKSIHVSV FT GRSINDKSIPVLACASACGYKSGFINSGCICRFRKLLLIFNYEKNELKLILCELDAEEF FT NEEILTVNLNEYYQDKIYIDLRDVVIIEENVNEQRIKFFISSKYNKMKEFDIIKLQINS FT PSFELSDISSVSPYLYYNEECLIVKSIIVSSTFGVIKMITFPNFKQKFIFAPKILEILK FT MEKAIMAPNCFDNKFILAARRYQCCFDQIKGGYYFQYDFISTIAQFDSLKKGVLEIFII FT PIEKNRYLVVYSKFLESQIDLLIKQESIPENKYYLKKIQINGFLISGHSTILCVKVCHN FT LMLQVTESRILLYQGISEQVKAFENQNHIDHFQPNIKDLWEYPDLIMYAKMADDNHLMI FT ITQGHKLIQLKIESFQLFHFDIKDELTYIPIISSFESKKLILNNSSHVILTLIGDPHSQ FT IFANLSSIQNPLKNIKSNFNLNHFLDSEVGMITSIEFDNNFHDNIYITTSFGFLCIFSL FT IQFTQDLFQEKFQTNQTKYKKIIINLKSHISSQEVFNLKVIGTQRRHIFNNNQIWKNRI FT ILGSMTDYLYTELLETNEKFLKILHIKRLKFPIYSIITQFTENYQDYDHLYFLSLQGHE FT NQYIKLVNVESNKLQSCDKVLPLKSTRHKVENMIYLKENSWIVVNIENWIPDSHSNKCP FT TRISSQLFLFDSDANLCGSKTYSINDNDSNFEENFRFKIFPNNGKDSFFQVFNRGENQN FT FIPNTIFQLISVHSNSTNTHKFHKDPFIVSGSYSFKGVESGLIYRSPSDHKKFFFFLVE FT KNFDTDKALELVSTRVISCLDISKTKRNYDSINTDITTYSQSIDHIIAFGNKLKLKVAD FT FSISFPNMLIHNVTFQNHARIIAIDLLESISDLSKKQLIFHRIYFDLGTYSDHKLTQVK FT YQDNFDHFYHLDFPILDNKIIKDKQIQIYKGLSNQISLLIYENSLNQDSKLFFIGLQII FT LNYMYKRRNNEIISAFPAIITNLLFGIFDIVFSNKIKVNSTKSISSNSGAFFSKKKVNK FT LDKKQFENGIINTKISLYQISYLYFQVCSKCIKWKILMVLIVWYNLITTFKNNQDLEFI FT SIRKLVCPLFQDWVLELEREVTIHFETIMDHSSLIENFFNSEYHFQQYQELLFSEFIEE FT " FT repeat_region 281219..281227 FT /note="(t)9" FT repeat_region 281927..281935 FT /note="(a)9" FT repeat_region 282840..282849 FT /note="(at)5" FT repeat_region 283399..283406 FT /note="(t)8" FT repeat_region 283588..283595 FT /note="(at)4" FT repeat_region 285424..285431 FT /note="(t)8" FT repeat_region 285453..285462 FT /note="(a)10" FT repeat_region 285990..285997 FT /note="(ta)4" FT repeat_region 286201..286209 FT /note="(t)9" FT CDS complement(join(286242..287130,288013..288034, FT 288071..288108,288143..289992)) FT /locus_tag="1MB.386" FT /product="ci-meta2, probable" FT /note="1MB.386, predicted protein, len = 933 aa, probably FT ci-meta2; predicted pI = 7.2925; contains no predicted TM FT helices; good similarity to Q9BLJ2, ci-meta2 (813 aa, Ciona FT intestinalis, EMBL: AB041856, BAB40595); Fasta scores: FT E():5e-08, 50.000% identity (54.737% ungapped) in 104 aa FT overlap, (aa 335-431 of 1MB.386, aa 156-257 of Q9BLJ2)" FT /db_xref="UniProtKB/TrEMBL:Q7YY87" FT /protein_id="CAD98598.1" FT /translation="MTTSLINKNRTPTRARGERVYKPGGKKEWRVVFWTHESTKPSRRQ FT CSFSEAKYGKEAASLLSRAVLDYIDVKGVVPDDLHDPPILDPAKKELIEYYSALHESRK FT ALQKKNKPKNSGNSNNKNISEPSIVPLYTLAQTQSPTMSNISCTSPQQPIKLTKTEKYS FT DLNNLREQFQIESQPNFGILNNSNHNSPNESRILSPLSIITNSGGNSSGNNGEVGGAGV FT GGVGAGEGIENGISITEESTTKNEPEVSISSTNSNHIKKSSSNNFLSNIQIPPIPNITS FT STDFNNESENGINNNSRLKKQFSSDIDGFNTMINRVISSINEQNPFLSGTGAAGTMGLA FT GMGGIGGMGGTGGMGGTGGIGGTGGIGGTGGIGGTGGIGGTGGIGGIAGTGLGGVGMGT FT HFTLMGSIFPNQLPPGIQNHVPTTHFNSTSSPPSSFPDLLSPNCNGIFGPDLNNFLSQT FT FHQIAFQNQFNFFNHLGYLNHLAGLTITGVGGNIITGHPNFMTGSLIHNFPLFQNQDTS FT LSANNNNTCIGNSSQRNMFMNGTFNENEGGESLNGFLISGNHESNSKTTNNLDSTGSST FT LISAALSQQEGGDSFSLSPNSKINKGMVVSVQTSSDLSTPLCILNSYLNHLSLYVRNFT FT YKTKINGSIECNSLDRVKRIFPESNRNQSSEIIGIKRKHVIIYDFEGRLTNTSDLEISQ FT REVLEYLKKCIQSIKSIYLLKNGYSSFSLEFPYICCSIDAECISSIKSIKNIPQNLKSK FT IIASIEYPLVISWGETYKIYLGNVFQSVHPQILKSLGIKTIMDFTPNKIKSASNNEVRI FT IHVNNSESKIPIEDEYLIYSNLPIEETIKSFKELEESNPDPKDIFPIMIVGTRVTNETV FT SIASVIVSYLRKMQITATLIFTLNQIGLDNNIISPTENEDIFKLLHPSNSQIAQMISFN FT FP" FT repeat_region 286856..286863 FT /note="(at)4" FT repeat_region 287380..287387 FT /note="(t)8" FT repeat_region 287391..287398 FT /note="(ta)4" FT repeat_region 287444..287455 FT /note="(atga)3" FT repeat_region 287457..287468 FT /note="(taaa)3" FT repeat_region 287482..287491 FT /note="(at)5" FT repeat_region 287532..287539 FT /note="(at)4" FT repeat_region 287554..287563 FT /note="(t)10" FT repeat_region 287687..287695 FT /note="(a)9" FT misc_feature 287802..287809 FT /note="tgcatgca" FT repeat_region 287907..287914 FT /note="(a)8" FT misc_feature 287930..287937 FT /note="tgcatgca" FT repeat_region 287937..287952 FT /note="(a)16" FT repeat_region 288210..288219 FT /note="(ga)5" FT repeat_region 288420..288431 FT /note="(tat)4" FT repeat_region 290157..290164 FT /note="(at)4" FT repeat_region 290312..290320 FT /note="(t)9" FT repeat_region 290591..290598 FT /note="(at)4" FT repeat_region 290680..290696 FT /note="(t)17" FT repeat_region 290962..290971 FT /note="(t)10" FT repeat_region 291356..291364 FT /note="(t)9" FT repeat_region 291415..291423 FT /note="(t)9" FT repeat_region 291425..291434 FT /note="(t)10" FT repeat_region 291527..291546 FT /note="(at)10" FT repeat_region 291548..291555 FT /note="(at)4" FT repeat_region 291694..291701 FT /note="(a)8" FT repeat_region 291775..291782 FT /note="(a)8" FT misc_feature 291895..291902 FT /note="tgcatgca" FT repeat_region 291909..291920 FT /note="(taa)4" FT repeat_region 292073..292082 FT /note="(at)5" FT repeat_region 292087..292094 FT /note="(t)8" FT repeat_region 292181..292188 FT /note="(at)4" FT repeat_region 292280..292291 FT /note="(ata)4" FT repeat_region 292349..292358 FT /note="(g)10" FT misc_feature 292410..292417 FT /note="tgcatgca" FT repeat_region 292436..292444 FT /note="(t)9" FT repeat_region 292556..292565 FT /note="(a)10" FT repeat_region 292570..292581 FT /note="(agaa)3" FT CDS 292643..295027 FT /locus_tag="1MB.390" FT /product="asparagine-rich protein, possible" FT /note="1MB.390, predicted protein, len = 795 aa, possibly FT asparagine-rich protein; predicted pI = 4.5602; contains no FT predicted TM helices; reasonable similarity to ARP_PLAFA, FT asparagine-rich protein (537 aa, Plasmodium falciparum, FT EMBL: M24328, AAA29491); Fasta scores: E():4.8e-12, 24.356% FT identity (26.797% ungapped) in 505 aa overlap, (aa 302-788 FT of 1MB.390, aa 31-507 of ARP_PLAFA)" FT /db_xref="UniProtKB/TrEMBL:Q7YY86" FT /protein_id="CAD98599.1" FT /translation="MLNVEGGDIDEDINYSNYQREIYSNELEKKTGLNNNNSNNNNSDD FT YYYYYGNDTIQESYYSNYDKNGMNKNNDEYELNDNEVSLEMYSKELDIISLSKGIVENI FT WYVEIESNGEVVWSSVIACDFSKQENDNEENNRQWESLSSKSGWNKKQLFGNEVNIITK FT KQLIELLIQDGITNENSIKLFTSIGMWKSLFLTLNYSTYIRRKREMLYSKQVKDYIVKF FT NNRRETIERRMRINYNEDDRTNLIKLIYQGIQDLFGSISICNSPEIFKITYKIPFYWAS FT LIANIYVFEKLRNYDIMLIIIEYISEVNPELIFEFQKYSKSSIQSNNNNMIEMLNIFNN FT ELLQNTSTCFQEYLSDLLSITEKSKLILSSYITSNNSDTEKQKSNINIEENQSIQFKDK FT NQNDENVVKINNQSDIKRKPNYMRPTRLSEIRRQNSRKSINEKVHSKFEEGVNTSTSVF FT TTANTNSNTTNIINTTSTTNNTTNITNNNNTTNNNNSNNNNSNSKNTVLGQVKANIQKD FT PEKEMIKDNEKNPSSLKNDQNKKNRFRNLSTKHLIDSKENNEMLNNSVSNDNLSKNPTI FT INTNNSINYNSYDENKFINEKNNLKTILNILEEKSKSQVVNKVNFDDELFNIISELDKI FT NFKNQNNENDISTNISEDFSQNIYSNNKSDNDKDFQYQDENINDETYLQVNNIKSHLDP FT YDWYYSSSNNENGDNENVQMTTKLNKLSPKTVHTNNEKVTQLISLSENDDNNHNNRNIT FT IEENVSSNTDINELKYIESILDKEIEELQRQEDELASNIFF" FT repeat_region 292725..292733 FT /note="(a)9" FT repeat_region 292742..292753 FT /note="(aat)4" FT repeat_region 292776..292790 FT /note="(att)5" FT repeat_region 293496..293503 FT /note="(at)4" FT repeat_region 293620..293631 FT /note="(taa)4" FT repeat_region 294257..294264 FT /note="(a)8" FT CDS complement(join(295059..296807,297678..301286)) FT /locus_tag="1MB.396" FT /product="cytohesin-like protein, possible" FT /note="1MB.396, predicted protein, len = 1786 aa, possibly FT cytohesin-like protein; predicted pI = 8.7063; contains FT Pfam match to entry PF00566 TBC, TBC domain; contains Pfam FT match to entry PF01369 Sec7, Sec7 domain; contains 2 FT predicted TM helix regions; reasonable similarity to FT AAL92328, cytohesin 3 (931 aa, Dictyostelium discoideum, FT EMBL: AAL92328, AC115599); Fasta scores: E():4.6e-18, FT 21.181% identity (24.271% ungapped) in 864 aa overlap, (aa FT 58-895 of 1MB.396, aa 25-804 of AAL92328)" FT /db_xref="GOA:Q7YY85" FT /db_xref="HSSP:1RE0" FT /db_xref="InterPro:IPR000195" FT /db_xref="InterPro:IPR000904" FT /db_xref="UniProtKB/TrEMBL:Q7YY85" FT /protein_id="CAD98600.1" FT /translation="MSPRALKKGVQIMEAENVPNISSNVCSENFNSSDFENQNKNQSTT FT QVRALSQESEFEVSVNFSENPITTFSYVPEQDSCQNHSNFSSKSFSTLEPEQIRFASNS FT SSASLPVMRISQDWSGSAINRSPDGNQPVIPSLTAKKIRQILMASRISNSFQLKSETGG FT VTSNQQSKHNSAQSIKSIDTTSENSPSFKSFGVSNETKRKTSNSSHVFQNHPNLNSDEY FT NHYNSSIDCQYHKAHQKSGSFASSNTKRRYNIHQGVSSDIFCSQPYISQGQCHMDILKE FT EIQLMEQLEQGVLLFNNSPEEGISYLIEHKLVEDDPLYIANFILHTDFLDKRKVGELLG FT GHSNLSLSILNNYVHLFNTNSLEPDIALRYFLSRFFLPGESQMVYRILERFSVSYIRDN FT PTTLYTSDQIHTLCYALVMLNTSLHNSHVRTKMSKQEFITMCMHSNLPVTSTQLENMYD FT RVAENELKPLLSPSEKVYGRLSRDPKVLRSKQVSQVSPTLLQKGTIFRRFSNKNSSHTI FT IAWISSDSKFFCWKKVRSKSSHLLNPSNFVSNMSLKLSKLVNYNKKRLSRPLNNNSINN FT NKAMDSNIFLRDENLFDNQSCILTRHSRITSRIRRAFGMDISDLSCILLDDIVDINVGV FT SSKIHLDRKSSKILRKGKTSSTLSKNNANHLQSELESKCFSLITRSGESINLCSLDSTF FT PSLLIWVRFFHQTILKNQETKEAKEAQDGNVMTVYGIPIIDKGIESDLLRVWHHGIFIQ FT WENHWSLNSFINLCNTELPNAVSLSNKEVLSSNSNSNSISTLFDLHQYNSCADNNPTGS FT ILQEIVSGQKENLNSNTNSWKSKSKMSSNPQKMAWIHNKKNKYKASVPKSTFSWKIMSI FT FSFLKLKKPKYAPPSHNLHNKYINYQAPISHLILHLWVNNIPCSYRGILWNISVGNNLQ FT IQAATFNHLLRLRSNFLTDLSESICPSACGEMQMCMHAYFEGGPLKEAFFIHLKRFHRD FT ISNVFPELHSFFIGAGAVLGYRIWDKMKSKQKAKYIEQKQMANEQSDSTFMTVPSPLGG FT LSPISQSHINRKILEDDEDSELESVLLFFNDNPSIYRIDQSTKILVECFILYRPDIGYV FT EGMSHIAMILLLFNINLMEAFKTFTNLLHNSFFLDMFMLNHRNVKMRLDFFDMLFKELM FT PSLAYHFDLLSITSDTYLISWLTSLFSTCIPIQVIPKNYIINYLNYHFFTSINIIIIII FT IQMSSSFLSASSSISRNSQILPNYFFQNGSLNEGLGNIPVEHNNESKVSNIVSSSSSNS FT NQPPLLSNSDNHVRYTPIHSLTGAPIYNSISMGSDEYSERLRMSNELRKSQQFKRTQGK FT APSPLISSSSSSPSPSPSPLSSQKNSCILSQCSSISNGSLLGVLGDYYKVERYNTLANE FT RRANNSKVVVRDEKILNSISKNIVNNIPSSHNNIVGVNESNDQLKSMFSIKSSERKESN FT NKVKENALIQEFKRNSGSLVNNKRPLFSEELSPITCMNNTCSNIKGAINILGITGKIFM FT DVLFQLVNAFDLDNREIPNYNNKNKFENMPHFPVSSIEIPPDTIILQCFDCGLIYYTDL FT PLNKQQLGTIGILDPKPVEFGKGDSDEIIYGISNRNSPYYCWGNLEPRDLSPNQIEFLR FT KIDLRKQNRFSHQDLVDLYYYRFRRDRRFDESFPSFVQSDPQLHLYKFYRETMHGAYNG FT NKTPLPELGNHFLPSPWSENRIEGNGYKKSSKYSSSSRSTDNKNNFEKDQDGFIILPKF FT DPPSEKPSEYDLVHKPDNSHVFPFILDIL" FT repeat_region 296353..296364 FT /note="(gat)4" FT repeat_region 296739..296759 FT /note="(tat)7" FT repeat_region 296979..296986 FT /note="(t)8" FT repeat_region 296989..296998 FT /note="(a)10" FT repeat_region 297003..297010 FT /note="(ca)4" FT repeat_region 297096..297103 FT /note="(a)8" FT repeat_region 297130..297137 FT /note="(a)8" FT misc_feature complement(297582..298328) FT /note="Pfam match to entry PF00566 TBC, TBC domain, score FT -21.5, E-value 5.9e-07" FT misc_feature complement(join(297606..297671,297711..297776)) FT /note="2 probable transmembrane helices predicted for FT 1MB.396 by TMHMM2.0 at aa 1170-1192 and 1205-1227" FT repeat_region 298034..298041 FT /note="(ta)4" FT repeat_region 298382..298393 FT /note="(atgc)3" FT misc_feature 298383..298390 FT /note="tgcatgca" FT misc_feature 298387..298394 FT /note="tgcatgca" FT misc_feature complement(299892..300443) FT /note="Pfam match to entry PF01369 Sec7, Sec7 domain, score FT 147.7, E-value 1.3e-41" FT repeat_region 301365..301373 FT /note="(a)9" FT repeat_region 301671..301678 FT /note="(cg)4" FT misc_feature 301689..301696 FT /note="tgcatgca" FT repeat_region 301779..301786 FT /note="(ta)4" FT repeat_region 301839..301846 FT /note="(a)8" FT CDS complement(301853..303310) FT /locus_tag="1MB.399" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.399, predicted protein, len = 486 aa, unknown; FT predicted pI = 4.7994; contains no predicted TM helices; FT some similarity to RRM1_HUMAN, putative ribosomal RNA FT methyltransferase 1 (329 aa, Homo sapiens, EMBL: AF196972, FT AAF06797); Fasta scores: E():0.34, 29.310% identity FT (34.694% ungapped) in 116 aa overlap, (aa 334-436 of FT 1MB.399, aa 193-303 of RRM1_HUMAN)" FT /db_xref="UniProtKB/TrEMBL:Q7YY84" FT /protein_id="CAD98601.1" FT /translation="MISVEGDNSIIKQLNNCLSCLLRGFKCQDVLIRITNEYLMLYSFP FT FGGFSSCFQFNFDGRLFDKFETGDIVEFERLYKVDSLIKCFNNTQWFKATRFIFEISGN FT DDDTLRVTLHGKIGVIRKHCIYPVIGHQDRILLHETIFWKRRHGLRISSGLLKDIFGYM FT EIKESKMTLSISSQESLISIKVQPIGISGYDVPITGSSIGTGNTSGNSVSNNSTYQDIQ FT IKKEQIDTLSLCESILENSNFTFNSREFKIAIALAENCQLPVFITLRAPGSPLIISIGQ FT RAIIDQMCVSDLGNDIIPFEYHWEKPLFHSKLFEIDHEFIEKTQIITGFSAVFLLSTSI FT DPESPPDLVTSFNSPIEDHDPDQDQDRDQDYHAPMVISQMSEPRNYTPVSSKNISDIKK FT DYSPTSSKAIREAPPEECSLKKEVETFDCSFSKLDQFNLDSIFQKFGIENQDSSKDPEN FT SNNYSKIDIPTKDQGVKSFDWISSLLW" FT repeat_region 303350..303357 FT /note="(ca)4" FT repeat_region 303635..303642 FT /note="(t)8" FT misc_feature 304103..304110 FT /note="tgcatgca" FT repeat_region 304127..304134 FT /note="(ga)4" FT CDS 304199..306685 FT /locus_tag="1MB.400" FT /product="hypothetical predicted protein, unknown function" FT /note="1MB.400, predicted protein, len = 829 aa, unknown; FT predicted pI = 4.8887; contains no predicted TM helices; FT some similarity to Q9SZ55, hypothetical 92.0 Kd protein FT (852 aa, Arabidopsis thaliana, EMBL: AL161579, CAB79906); FT Fasta scores: E():0.14, 20.149% identity (22.438% ungapped) FT in 402 aa overlap, (aa 108-492 of 1MB.400, aa 228-605 of FT Q9SZ55)" FT /db_xref="UniProtKB/TrEMBL:Q7YY83" FT /protein_id="CAD98602.1" FT /translation="METENKVEVIDFEIGEKGKDNADTKEMEQLLALDDFKKVIGLLER FT AKESSLRRLDRTLLLIEENFEQVSKRFDQDDLSCLDYRMCDVDLPEMEVDPTEEEINNT FT VNCFLSKLVETGKIQLRQLKRELSEVKEVTESILVSEVNEEENKQIFESPKPKKVKDES FT GFVASMKRLWENKTPKLSSIWRRNEESKNLIVNKLGVSRLDKLDIVDGFLFKDDDTEKK FT DGVSLDIEMESIVEAEADVEVDPETEFGVETEIEGTGEAEADMEQNIETETEVMTSARN FT NTSFKSISNEDVISPKPPNAILSMGGSSPLQSPPLLVLTPSTPIPIVPTTISHQAQGGE FT ARNSIAKVQVGETSAGTEILRTSVVIGSGEIESHTIEGKERVSKSTTNANEDSPLPPRP FT IRRQTSEKMQEKGENRVEEVEGGEQTPILKGGLEGTPIDKNREEVGENDFVRPERISHV FT KKSGLRSILHHSANRIPLEQTMSSSGLGTGAGGGAGATAGAGENNNTANRGSINTVRQS FT RGSTAANNLVKTPINKGFCEMETEGKDKLRIGESPCRRYHKRLKLLPPVNPDECYVLTD FT SEDEQQNCGVGGNGGGELKITKAQKKIPLWARSMNWIPKMKEQRNVDPFSIFGDSCMFM FT DLEDVFQRPWYISTIARDNRIKSWQNDRLTSLNWSEDSLTSDELRQYKMRMNLYIDKSN FT EVYVTEPCFTPSPNPLANGAWNHINKQRINSGGTTVIRKALNLSIHRLNNSISNKPNSS FT IPSSHLAPPSSSVSSSKLGEATLNHLQTDKNSSISNIENNNIANFLSKPINSSKDCPNV FT AHQHSNHIIGNSSFLQ" FT repeat_region 304569..304576 FT /note="(ag)4" FT repeat_region 305572..305579 FT /note="(a)8" FT repeat_region 306709..306716 FT /note="(cg)4" FT repeat_region 306811..306820 FT /note="(ta)5" FT repeat_region 306822..306833 FT /note="(gtat)3" FT repeat_region 306838..306849 FT /note="(ttat)3" XX SQ Sequence 307050 BP; 106085 A; 47767 C; 49909 G; 103289 T; 0 other; atgattttat gtccagctta attgaaatag gagacttggc atcctggtcc ctttcttctg 60 ctaaagtatg tttttacttg cacttgaaaa aataaaatac atttcagcca ggaaatggaa 120 tacaacagtt gagagataat aactcaagca cattttggca aagcgatggc caaagcccac 180 acacaatcac gctgagattt cctaaaaaaa caaaagtttc ggtaattgat ctataccttg 240 catacaaaat agatgaaagt tatactcccc aaattatttc tattaggtct ggaaatcaag 300 aaagcgattt ggaggagttg aaagaaatgc aacttactga accagatgga tgggtaagaa 360 ttcctctctc cccaagggaa atcgcggata atttttttaa agatgcaatg taagtaaata 420 ataggtaatt caaacaatta attgttattt ttaaggccta tacaaatcaa gacaatgtgc 480 gattctcaaa attatatttc tgcattctgc attcaggtaa gaaacatctt agaaaatatg 540 aacatactta tttattttag atagcaatac ttgcaaatca tcaaactgga agagatacac 600 atgtcaggta tgtaaagtgt gtttatttga attatttaat atgccttgtt agacaaataa 660 gagtttgggg gccaagagaa gtgaatgaca acgttgtagg aaagtaagtt aaaaatataa 720 cttaaaagat tgactaaacc tttcagagtc ccaatttcac aaccaataaa aatgggcacg 780 acgattgata caagaatgta tcagatttta agataaaatt gtaaagtgca aaattaaaaa 840 aaatttatta aattaattcg taattgacac gtaacccaac catataatct tccacattag 900 aataggcaaa aaattaaatg aacatgttat tttatgaata attctcaaat atttagatat 960 agtattaagt actaaataat aggttcttaa acgtaactag aaatgttgaa aatattaggc 1020 tttttgatgg catgcttcca gcaaagcaat aaccagtatt acaaagatga taaattcaaa 1080 gattgtagcg attcaaattc gcttgtgaag tatacatcaa ttggaataaa aataatagta 1140 tattcaattt catggataat ggtgttaata tgggatgctt atgaatatgg gatggttttt 1200 tatgggctaa tattgagtat ttatatcttg aaatattact caaaatactc atgtttaaga 1260 tgtgtatttg aagagatttt taaaaaacaa atactgaaac atatattccc aaaaagttta 1320 aacaaaatag tagaatggca atcttaatac gcgcattatt taatttgtta gtaatttgtt 1380 aataaaatga acaaggaacg aacacaagcg gatcaagata cttcaaagtg gaagattatt 1440 tacccttcat acttaaatag taataataca aagtccagtg gtagattatc ttctctaatt 1500 cattgtgtag aagatccaac tattgcagaa atagcagaag tatgcatcca attgggaata 1560 ccttgtaagg ttgagagcaa aagatattca aaggactgcc gaactcttgg aagagttagg 1620 tttcaacttt ttgatgaaag tgggagggct tttaatgata gaattttgac aaaaaaaatt 1680 ctactcaatc aaattggtat aatgatacca aaattaaaaa atagacaaaa tactacatca 1740 tctgttattg atcatgcgaa ttcaagaaat aataaggcaa atattgatga cattcataat 1800 gaaaataatt gcaaaattaa ggaaacagac tcgataactt atagcggtgt aaacatacat 1860 tccagcaatg caaagagcaa ttcaaaaaaa aagaaataag tcaagaaatt aagtagtata 1920 aaagaaatat taaatatgaa caatttttat ctataattaa aaaaattata taactgaaaa 1980 aaaattcaaa aacataggca tagaggaaat agcatcttaa ttgtagtttt tttgaaattg 2040 atcatagctt aaaattgaaa ccaagaactg ctggttcatc agatctgtct catctttctt 2100 gcctgttcca cagagttgtt ttttgtctcc ttgccaaata atagtatgca aagaattaat 2160 aacgagcttt tcttcttcgt atttttgata tattatagta ttgctatcag gcgaaatata 2220 cctagtcgta agtaaaatta tgtaatccca tttgtaatac tctctttctt tatttggaat 2280 atcagtatat aagttatcta tcgtccactg tatatcgtct tttaaacaat tacaaagggc 2340 aggaactatt tcaaggggag tatttggaaa tctttcatta accatcaatc caacattttt 2400 ctcatatata acggaattaa acagattttc aaattctttg ttgcgacttt tcgaaataac 2460 tccacttaaa tagtcaacaa ttttatttaa agtttctttg tattgacgaa agttcagaat 2520 tgtagaaaat gcaataatac tgtcagaaat agaaactgta gttcctatat taccttgatt 2580 acaaatcaaa tcaacaaatt catgaaattg aatgccctta attcgagaat attgcgacat 2640 tagtaaaatg ttctttatgc tatggtaatc attttcatta gggtcgttaa attcaaattc 2700 tccatcaata acagattcat tatcactaga gtctgaatca gttcttttag ctttctttcc 2760 gtaagatggg ctttcagttg ctgacaaaaa aatttctcca tctatagatt cattgttgaa 2820 gtttaattgg ttagattcat catcaactcg aatatgtttt tttgagctca taataaattt 2880 gataatatgt atattcaaaa atcagttatt aataaagaag agctatatct ataatttatt 2940 tcgaaataaa tttaaatcaa aaaagactaa acaactaacc gcgccttctt tgatatactt 3000 tgaaaagtaa ttttgaccaa aaacttaatt gaattatcaa aactaacgat ataaattaac 3060 attatgttgt gtagataata atttaacaca agaataattg caagttaaaa aagagcaaac 3120 gttgttggcg cgaaatcaac agatttattt agcaaaatca ttttaaaatg gattctaaag 3180 gaaaaaaaaa cctacgctca agtaatggcg accgtaagaa aaatgcggtc gcaagagcta 3240 ctggagaaat gaataaaata tttgaaacta tactcaataa gaaagcaact ctcaaagcta 3300 attcaagagc gcaaaaagag aagaataaac tggcaaaacg aacaaaaaaa gacagtgtaa 3360 ttagagataa tacaaataag agcttggata ttaattatgg ctatattaaa cctataagat 3420 atcttaatga tgggctacca gtttatagat tggaagatat taatctaggt aagtaaaacc 3480 gcataatagc ccggacgatt aattctcaat aggtgatgga ggaggcacgc ctgactgtcc 3540 atttgattgc aactgctgtt tctaattctt tgagggtaag tgttaatcac acgcttgtag 3600 ataaattgga aaatggataa gcttttttca acagttatct ttcaaaaaaa tagcaaataa 3660 ttaaaatgtt aatttattat caatttaatt gttaatttgg ctacattttt tgcatatctt 3720 ttttcaaaag tttaaggttt aaattgtatg caaatttgtt cgcccccccc cgcaaaatcc 3780 gtactcccca ctttttgatg tatgcaataa caaataactt attttaggtc atattagcaa 3840 cgtaattatc tcatattttc tttatgtact ggtgtcaact actgcttcca gcttcaaatt 3900 aatacagtag gcaaggatat ataagaaaga atagatttga ttaatatgtc catagaaaac 3960 cggcaaataa atataaactc taacataaat ataaaaacat gcgataaatt gaaggctgaa 4020 ctttctttgc aaaaatacaa gggcgagata aatataggtg taaattcgaa gagagcttcg 4080 aaagttatgg atgccgttgg gaaaatttcg aattcaaata agatacattt gaatagagag 4140 aatgtgcgag ttattgtaag agttagaccc attcaagatt gcaatgaatc tagttctagc 4200 tgcgtaagta ttattaatga acaacagact caagcaaaac aaattgtctt ggaggaccca 4260 cgccaccgag gtctaccaaa aaaatacgag tttgatgaaa tatacggtac tgaaagtact 4320 acagaagaca tatattcaaa agagattaaa gattacgtta atccattatt atcagaatgc 4380 tcatgtataa atatttttgc ctttggtagc tcaggaacag gtaagacatt tactatgcat 4440 ggcgatttta ataatgagat aggaattgtt ggattaacaa ttaagcaact tattgaaata 4500 aatgacaaac aagctgaacc tggggccttc tctttttcat tttttgaagt ttattgcgaa 4560 caaatacaag atttacttac tggagctgag cgtttagaac atttggataa acctttaaaa 4620 tcaacaaaat ctaacatttc tataagaact gatatttgtg gcagaattag aattgttgga 4680 gcaaatagtt cagaatttaa aacttgggat gaatttaatt ctgcttattc atctgcattg 4740 aaaaaacgtg cttctggtaa gactgcagtg aattcaaata gtagcagaag tcatgcatgc 4800 attcaaatta attatattcc gcctgcatct aatatcactc aatttgaagc agaaactaga 4860 aatgaggaca caatagacaa tagaagaaaa cgaggaagta tttcgttcca tgtcaaacct 4920 aaagtaaatt tgagttatcc ccgtacaatt gttaatctta tagatctatc cgggtttgaa 4980 aataataaga ttacaaacaa caccggaaaa agaatggcag aaagtacttt cataaattct 5040 tcgttattat cattgagcaa agtgatcaat gcattgaaga aaaatgctgg aacccaatca 5100 actcagagct gtatcccgta tagagaaagc aaactcacca ggctactaca ggagtattta 5160 ggtggaggtg cagatccgcc ttattctccc tattgtttga gatgtataat ggtatgcaca 5220 atttctccat cagttacatt ttttcagcag acttatgcta ccctaaatac tccaagttat 5280 ggtaataatt caatcatgag gaaatacatc ggaattgcgt cagccacgac taacatagat 5340 ttaaaaaata gtttagaggt aaatagagct acattgaaaa gtcaaaaaat taactccagt 5400 aagtccttaa ataaaatagt tactgccggc aagaactcaa attcaaatac tatacatctc 5460 gcagccaaag ctacaagtga aaaaaagcta tatgataggg cggatgagct tttctctaga 5520 aaaaactcat acaaaaatat tgaaagtcgt gtggcgcaga tgataaaagg aaaggtacca 5580 ttaaatgttg gcgaaccgaa aaaatctaga gatatcaatt atggagaaat acccaaaggc 5640 tcagtgccag gcagtagcag ccaaaagaat atagtaaaac agccatattc aataggaagt 5700 accaagatta aatgcattga ggcaaaagga ccagaagaaa gattgatttc agatttaacc 5760 aaatgcgata aggcttgctg caaatctatt agatgcgaaa atgtaatgga aacagtaagc 5820 gaagctttag gaagagatga ttgttcagcg gattcaaata caccagaaca tcgtgaaggc 5880 ccccaatcta accaaaataa tgttaaagat ttgcagataa ccgatacttc tgaaaacaag 5940 tcgatttgcg caaacttaga tgtttctaga gatgagaagg ttaacgtgga attgaataaa 6000 agtaattcaa gatcgagtgc aaaggataag ggatcactgg gaatgataat tcgtagacca 6060 gttacaagat cccaaactaa aaaagattag taagttcgtt cggtagttag tttctaaaca 6120 ttaagatgca agaaaaagct gcctgagcta agatagttcc tattgttgtt gatattcaag 6180 attagttctc ttaatgttat aacttggtat ctatattcac tgtattgtta caaatattaa 6240 gtctattcct ttattatttc ttcgcagggt agatcgataa ctttgaattg aatttctgtc 6300 ttaaaaatac attttggcgg cctttcaaag tatcttgcat attcaatagt cgaaactgtt 6360 gatccggatc caactgccga gaacctgaaa gtatacttgc caccatatcc aaccatacct 6420 tctgggtgtg gatctggaac gtaacttggc tcagcgtcga taacttttac aattgaatca 6480 ttcggtttaa taatcatctg ttggctatac cctgtcgtgg gattgccctt tatattcaca 6540 gtaatttccg ttccaggctt aacagtaatg aaatatatta tggaatcaga gctacttatg 6600 tcttgaacat tgataattgc ttctttcgag ttacacaaat ccaaatttac gagtttaact 6660 ttttcaaggt tgttggatgc cttcaagcta ccactagatg tcatatcaga ggcgtttgaa 6720 atgccaatca ttatataaat tgcgaaaaaa aataaaagtc taaaaattgt cttattcatt 6780 ttcaaaatgc ttcaactaat tagcacctct ggcggtaaaa tagaccgaaa gttattagtt 6840 attgtgtgct acattccgtt atatgatttt ccttgaaatt aaatgtagat tcctctgaat 6900 attctttatt ttttgaaata tttgactcta atttaatgaa attttctttt gatctgtttt 6960 cttttaaatc ataaattctt tgtaatttct tttccttaca ttccttgttt tcagttttaa 7020 tactttgttt ggcatcattt ggctgattag atttcaatat ttcatcttta tttgtaataa 7080 agtctgaatt attaaatatt tcctggtcat ttaaattatt gctggatgaa ataaacttaa 7140 aattcaattc atgtttcaat tgattcaatt ttaatcttaa ctcattattt tctttttttt 7200 gaaattcaag agagtcattc aaaattttat tttcattgtt aatattttca atctctatct 7260 gaagaatctt agattttact tcgagatcct ttttatcatt tagttctagg ttaagtaact 7320 ctataagttt ttctgaatct ttaatttttc tcatgaattc ataagaatca gaaattgaca 7380 aattttgaag agagttattt atttgataac aatcgttcct ctcagtaatt aaattgcata 7440 ggagctcaaa catatttaga ataatcgagt ttaaattttg aatcgcttgg ttctggcaaa 7500 taattgtttc tgaatcaatt ttattcttct cctcttcgta atgttttatt tcagattcta 7560 actgtaaaat ttcattttta aagtttagaa gttctaatgt atagttaatt gagccttgtt 7620 ttgatatttg gttatatatt gatataaaat tcaaatagct tttcaaattt gaaaagaagt 7680 tatccaaatt ctcatttaaa tctgatttgt tatattccgt tttttgtgtt atttcattat 7740 tataatcata aatataccag cttataaaac tatatatgtc taataacgaa gcctctaact 7800 tttttttcaa ttcaacgcag tataacgaag ttttctcatt tttacaaatt agactttcaa 7860 tgaccaaatc cttgttatct aactgggttt gaaaaatgtt taacttttca ttcgattctg 7920 ataacaattt tgacaaatgt tcaatctcac tttcttgttt aattatttta tcattcttct 7980 gatcaagatc aaccttcatt tcataattat catttattat tttttctgta ttacgaacct 8040 ttgcttgcat agtgttaatt tcttcattta aatttttgat attttgatct tttaaatcaa 8100 ttatttctgc gtattcagca agcaaatgag aaaattgctc tttatttatt acgctcaagc 8160 tatcattcca ctcaaaatct gacgatttta ctttatttct attcagctcg gattttaatt 8220 cagccttttg gctttttagc aactcaattt cctgcttaag aagatcaata tcttcaactt 8280 ttctattcaa attcactatt ttttctttta gttctgctat ttcatatttt tcttcttttg 8340 atttcaaaat atttacccta actttatttt gaattttttt tgccctcctt gcaaagtcaa 8400 ttgtagataa tgtctcatag tagtttaccc tgtcaggaga gcagttacat atgataatag 8460 ttctcgagtt accacccagt gagtctgata aaattctagt tatttttgag tctctgtagt 8520 taatatactt attttttgag tgaaaagagc cgtcagcaaa gtttgtctgg tttgggcaaa 8580 tatttgcatg tggttcaact attgtactct cactaagctg agaaattact tgagaaagcg 8640 ctagcaagct tctatttatg ctcattccct cttttcttct atctccttct aattgagttc 8700 ttttgatact ttcacttcct gctaagtcta caaaatttaa tatacctacg caaacatcat 8760 tagaatttat atgggactct atcttaatcc ttagtattgc atgacttctt gatgatcttt 8820 cattcatagc tgtttcagca actctccttg acttcatgcc agttttaatg attccatgaa 8880 tgtcttctgg agaagataca gttttcgagg tcaaattgat aaagtctaca gtaccatcta 8940 ctccatcgat tactttgatt tgttttgaat tgaaatcatt tgaactatta tttaagttgc 9000 tctgtggcgc aagcaagtca aatagtttct cattatatac ttcgagatat gatactgtga 9060 ttttactatt tgtaccaatg gattggttac aattagtata ttcaggatta aatatttcat 9120 ttatacttag aggtataatt ccatcatagc ttcctttatt atcaccaaac atagtgtggg 9180 ttttccctga agaggtttgc ccataagcaa aaatagttac atttatccca cttaaacatg 9240 actttactgc gtctttaatt aatttgtcat aaattaaata gtttgttgat ttatcatcaa 9300 atacatgatc aaaataataa ttcgtttttt ttgaaacatc ataaatagat ttatgtgaaa 9360 gttcccatat agaattgttt tctatttcat tgatttcgct gcatacggct gggcgaaacc 9420 taatcgcgac gctaaaggcc gatttagagt gcgaagatac atttttatta gtctcatcgt 9480 tgataaagct gctaaaatga ttattagata ctagttcttt tttagactgc tgaagagagc 9540 ttctatatga atctatcttt gtattatcaa ttttacataa cagagagtcc tctttcatta 9600 ctgaattgcc caacattttt atatataatt aattattaat gcatctgctt aacattttag 9660 tgtaattgaa aatcaagtat tcaactaata ctcttgttca gaatgcacta ttatcataac 9720 taataagaag aaaataatta gaaattatta ccgcgtaatt aataaacatg catgtgggga 9780 atctgttggt caatttgaag tctacctatt ggaatagttg tgagctgtat tactaaacac 9840 acatctttga tatgcctaat gtttctataa cgataaagga gactgcaata tattctattg 9900 aatattttga agagttgttg gataatttaa agaatgaaaa tgtagatttt gtccctatta 9960 aagacacaaa aatatttcta tctagtaaaa ttgatataga cgtcgattca tttttatcta 10020 agtctaaaca ttttttgatt agtcccaatt tcattagctt tattgagttt tggagattat 10080 atgaagagat gttaaaacat tctggagtat ataatttttt gcataatttt gatgggattt 10140 taaatgggat ggtatactta agagatagaa taattgatga atcagttgct cacaaaaggg 10200 tgagcgattt gactaagata agtatttcta atacttcaag tgaacctatt aaaggcattg 10260 gtataacaat tgctcagata catttaataa tactggaaat aatttctaaa aatatttgca 10320 acgatagctc atctatttat tggcaagatg tattaaatca aataccccaa gatgggaact 10380 caaatatgtt tttagaattt attgatatta ataattctat actacagtgg ctgaaagaat 10440 atttaaataa ctttagtcaa acatcaatat gtagtaccaa cacaattgtc acaaggttat 10500 taagtattaa agaggaaaca aatcacagtt atggagacga cgatatgaat aatgattcag 10560 aaaaagtgat atcaagttat aaaagcaaac aaatggaaga aaaagctgaa ttcgaaattc 10620 attgcgattt attaaatgaa aatgaaatga gcgagctata cattgattca tttgaaagct 10680 taccttggaa aggccaaacg atgttctcaa attttagaga gttgcagatt tctcttcaat 10740 ctatactaga acgagtaatt ttacactcag acgagggaag gaataatctt tttttaaaaa 10800 gagatgaaat aataatcaga aaaatttctc atatgattac ggaattagaa gactatctat 10860 ataaccaatt tgatattctt aataaggaaa ggaataaatc cgaaaaaatg gaaatagaaa 10920 acaaaaattt aaaaaaacaa cttaagacag cgattgaaga atcagaagaa tacaaatttg 10980 aaattcaaaa ctctataaat cagaaaaata tatatgagaa agaacttgaa catttaatat 11040 cacagaaaaa caggttatcc aatgatttaa ttagttttaa agaagaaaat aaatcccttg 11100 aaaggaaata tgaaggctta ttatacaatt ataacgaatt gaaagaaaaa catgatatat 11160 tgtgcgagga aaaccaacaa ttaattaatg aaataaataa atataaagaa gagaataaga 11220 ttgaaagtat agagagaaaa aaaaatgaaa ataactccta cttagaatta gatcaaaata 11280 ttattatgag tgtaggccat agggctaacg acttaattta tcactcgatg gaaattaaaa 11340 atgtcgattg cgaaaataat ataaaagaga attctaatta cacaaagaag ggaaatttat 11400 acaaatatcc tgcaatcata aaaacaatcc ctgactttct agaaagttca attcaaacgc 11460 ccataagttg ccgatcgtat agagaaaagg ttggatattt ttcagaaaaa acaacggata 11520 aagaattaag gatagtgaat gttgtttcac cagtagctga aaacaaattt gaaaactttc 11580 tgattcaaaa gcggaaaaaa gatactcata aacaaaaaaa aaacttatcc aaaactaaaa 11640 atgaaatggg atgtatgctt caataatata actatatagg aatttatttt tcattaatcg 11700 aatttcttga gttttttatt ctatttgcag aattgatcgc tctttgaatt tttattcctt 11760 taaattcgat tatttttctt tgattagact ctgcagcacg taatttaaga acttctagtc 11820 tgttcttttt aatattttta ttaaccgatt tgtcattagc atcgggggaa attaatttcc 11880 tattatattt agttttagaa tttatcttat ttatattatc actgagatta ggaggtttga 11940 atttttcgtc atttgataaa taagttttcg tgtcatcaat gttcactttg tcatttgtta 12000 tcatacaaga aattggtata ttgtttcccc aaaataattt atgagcatat aaattgccaa 12060 gagtaaatct gaaatattta tctaagtatt tttttgaaga ttgatgtttt tcttcagagt 12120 cattcaaatc aaaatttatt tgaaaaacat cttgttcaaa cttcgtgttt ttttgtggta 12180 aataaagtaa accaataaac tgaagtgtat tcataactcg aataaatcta ctgaataaat 12240 ctttatattg atcaggaaga caatttattt ctaaccttac tttattagtt ttttcaagtt 12300 tattttttga atcagattta ttgcaatatt tatttgagct gtccattatt tgcttacaaa 12360 atgtagaaaa taatttaaac aaatttattt tatttccatt atttaaagtg taaattttat 12420 aaattgctga aaagtcatca aaaatatttt cattatcaca agtttggtca agctcattga 12480 ttattttcgg aataaaatta ggtgtaagaa tttcttctaa aaaatagtcc atttgattag 12540 aaaaagtttt aatatttgaa tagataaata tgcgaaaaac attatcacta atatgctctg 12600 aaagctgtga ttcatcaaat gaatgaatga aaagttttga aatgttacat tcaaaagttt 12660 tgatattcca tggaagctca atgaatgctt gattttttat ttcataaaaa gatttatttc 12720 ttgttaaaaa aatctcggta atattattta tttttgtaaa aactatgttt ttatctaagg 12780 caacgttatt tataccttta atttcacgag ttagattgcc tattttttta gtcagttcct 12840 ttcttctatt cctttcgagt acctctaacc attcgataat caagttatat cgctcgttaa 12900 tatcaaatat atttaacata tccgaaagta tggtattaat aatacacaat cctagagcta 12960 tggactttag gttcaattta agcagagaaa tttcaactat tgctgcttct cttaacttaa 13020 ttccagataa aaaactttct tcatttatct ggtatttgag tttcaattga tctattattt 13080 gtttgctaag atcctgaata cttatttcct ctttacgttg aaacaaaaaa tcaaagtgat 13140 tatctgtgaa atgcaattga aatataataa aaatagtctt caaagaagaa gttatactgt 13200 aatcatactg aaagaagtta tcttttatat gccttatgca tgcagaagat aaaagtggga 13260 aggaaaaaag ttctgattca gaatcagtga ttgcgttgga aggatcattt aattcaagat 13320 tatttcctgg tattaactct ttaaatttta caaaattgcc tttaagaaaa attcttgaag 13380 taaagtcatc taaatgtaga agtggattta ttattgactg aaagaaagcc atctttgagt 13440 ctagcaattt tacggaaata aatttcaatc tattcataat ggcttgaccg acaatgcgct 13500 gagcatataa tatattagtt gaaataccca aaattattgt gaatgtgatt ttatatgttt 13560 ctttaatttg agatataatt tcgagcagtt gtcctagttg accgggagca aggcattcac 13620 tacttggaat caaaataatt atgggcttaa tatttaaatt gtttttcttt ctcattttaa 13680 atatcgaccc aataaatttg aggcctttaa taatgtttat gtatttttgg ttactgtttc 13740 catttgataa agaaaatttt gaactgtcga aattaaatgg aatggtttgg ataggtaacc 13800 acttgttttt actttcaact ttagaggaat cagaaaaaac tatttttttg tttaagtcaa 13860 agtgagtttc ttttattcca gagtgttcaa tgttacattt ccttttattc cttttaactc 13920 tgtcagctaa tttttcatca ttttctttta gtttattaag ataagttgat ttcccagcca 13980 atgagtcatt ctcgtaatca tatccttttt gagaatcaat aaaattattt tcctcaatta 14040 gcaagtctaa aaatttatct ttaatgctat tccagattgt ctctaataat tttgatacgt 14100 ttaatccttc gatacctatt ttctcgtggt ccaagaatat tgtatatggc tggcctgaag 14160 aattgagtag ttcttccaaa acattaaatg ttataagatg gtcgcaattg tttgatcctg 14220 atagcgcacc tatgacagtt agctcgtagt ttacatcatg atccacgagg gaatcaatgc 14280 tttcgagtat ttttaccatc ccatcgctca tctcaagtct aaaaccattt agaaattttg 14340 agaaaaataa attccaagat ttattaaacg ataaattgaa gtcattttta cagtaggtaa 14400 atggcctaac agctccaaat ggagtgaata tattgctttt atccgttttt tttgagttgg 14460 aaaatttgat attttctttt ctatattcta ttacagagat atcaaatggt ttatcctgca 14520 tattattaat tctaaaattc catttaggta tttggttatt cggatgttat attaattaat 14580 acaaataata aacgaattaa gaacttatat attttaatta tctttactta ttacttaaca 14640 tatttatcca ctatacatta gttcgtcctc ccattcataa tgcgctaaga cattttatta 14700 tcgaaatttt agttaaataa aaatgaaaaa taaaataaaa gctttagcga cgtctaattc 14760 ctcaatagaa aacaataata aagtattcaa acaatgcaga ttaatcggga cagtgctttt 14820 cgggacttta gcaggaattt tggggtttaa gggattttca ggtaagtcat cttatgatta 14880 ttaatattcc tattaaactt ctgttttagg cattatgttc tatatttact ccttaataac 14940 tactttaatc atcatagtac tgaagataag aattaataag tgcaatcact attttgacac 15000 gtttaatgaa gtattaggaa ttggtgatta ctttctagta agttctcaga aaataaatat 15060 aaataaatga ctaattttta gagttttatt cttttttggt cgatttcata ttgtatttgc 15120 cacatttact attgacttag tttatcaagg ctcaaaaaga tgaataggca ttctcagcat 15180 ctttagaaaa aaaatggcgg aataaatctt atttgatgga aagtaggtca ttcaagtaat 15240 aaatttacag aatttaaata cctgacagtc taataaatta gatacatttc atattagtac 15300 tggttaaatt ggccacatat ggcgggaccc cacatggaaa gctaatctat tccccactta 15360 attgtgtgga aatgtgctcg taacgcccca ccgaagtgaa tggaaaattt gttttagtat 15420 tttttgtcgc atgctcgaac actaaataag gttaaaacaa ctaattcgta atattaattg 15480 ggtcttaaat tccaataatc ataaataatg tatgtggtga acagaaaggg tgaagaggag 15540 cctgtatcat ttgaccagat tcttagcaga atcactaaat tatcatatgg acttcatcca 15600 cttgttgacc cagcgagggt tactcaagca gtgataaatg gtctatatag tggaatcaaa 15660 acatctgaat tggacgaact tgcttcgcaa acttgtgcat atatggcagc tacccacaat 15720 gatttctcta aactcgctgc tcggatatcg acttctaatc tacacaaaaa tacttcttct 15780 gatattggtg atgttgcatc gcaactatat aattttaaag acaatcaagg ttgccctgct 15840 cctttaattt caaaacctgt ttatgatttt atcatggaaa atagagaaag aattaattca 15900 aaaattgatt tttcaaagga ttttgaatac gattattttg ccttcaaaac tctagaaaga 15960 tcctatcttc ttaaaattga caacaaagtt gtggaaagac cacaacatct tctgatgcgg 16020 gtctcttgtg gtattcattg tggtgatatt gaagctgcac ttgaaacata tgagttgtta 16080 tcgcaaaaat attttactca tgcaacaccc acactcttta attcaggcac tccacgacct 16140 caaatgtcat cttgtttttt actcaggatt ccagaagatt caataaacgg tatctttgac 16200 acactaacta aatgcgcaaa cattagtaag actgcgggtg gtcttggggt ggcagtgagt 16260 aatattagag gaactggttc atatattcga ggaacaaatg ggagatctaa tggattaatt 16320 ccgatgctgc gcgtttataa tgacactgct aggtatattg atcaaggtgg agggaagcgt 16380 aaaggtgcca ttgcaattta tttggagccg tggcacgttg atgttgtaga gtttattgaa 16440 atcaggaaga atcacggaaa ggaagagatg agatgtcgcg atctctttcc tgccctttgg 16500 gttcctgatc tgttcatgga aagagttgag aaggatcaag attggactct aatgtgccca 16560 gatgaatgca ggggattgca agatgtttgg ggcgatgatt ttaagaagct atatgaagag 16620 tatgagaaac aaggccgtgg aaggaaaacg atgaaagcgc agaaactttg gttcttaatc 16680 ttacaggctc aaattgagac ggggacacca tttatttgtt ataaagatgc tgcaaatagc 16740 aaaagtaacc agaaaaattt gggaactatt gtttcaagta acttatgcac agaaatcata 16800 gaatacacta gcacggacga agttgccgtc tgtaatctgg cgtctattgg ccttccaaaa 16860 ttcgttgata aaaataacaa gacgtttgat tttgataagt taaaagaagt tacgaaggtg 16920 attacgagga atttgaataa attgattgac gttggatatt attctctcaa agagtgtaaa 16980 aaatcgaatt taagacatcg tccattgggg attggcattc aagggctcgc agactgtttt 17040 atgatgcttc gtatgccgta cgagtctgag ggagctaaga agttaaataa acaaattttt 17100 gaagttatct attatgcagc tctcgacgct agctgtgagc ttgcagaaaa atacgggcca 17160 tacgaaacat attccggttc gcctgcgagt aagggtattt tacaatttga tatgtgggga 17220 gttacaccag actctggact ttgtgattgg gatttgctta aagataggat ttccaaacat 17280 ggtattcgta attctctatt aatatctcca atgccaacag catccacttc gcaaatttta 17340 ggtaacaatg agagttttga gccatttacc tcgaacattt atcatcgtag agtactatct 17400 ggtgagttct ttgtagtaaa tccgcatttg ctaaatgatt tactagagct tgggttatgg 17460 gatgataggc ttaagcagaa tattattgca aataatggca gtattcaaaa tatacttaca 17520 attccggaag atatacgcga gctttataag actgtatggg aaattaagca aaagacagta 17580 attgatatgg ctgcagatag gggtccatat gtttgtcaat cgcaatcttt gaacattcat 17640 atggaaaatg ctaactttgc aaagttatca tctatgcatt tttatggttg gaaaaaaggt 17700 ctaaaaacag gcatttatta tttaaggacc caaagcgcga cacgccccat acaatttaca 17760 gtagatcaac aactcctgaa atcggaaact aaagaaaagg attcactgga gactaacaaa 17820 cgacaagcac tcgaaccaga ggcacaaaag ctcatcgctt gcccattaag accgacaaat 17880 atgaaggatg atgaagaatg tatgatgtgt tctggttaaa taataggctt cagagttcat 17940 ggtttttaat caagtctttt aaaatatttt ttatgcttgg ttttaagtgt taattaggtg 18000 attcggtaat ttattcttcc acacagtaca cacgcttaga ccgccaactc aaattgacat 18060 ttaaaatttc ttatccactg aaatgaaatc tagaattatt ccctcaaaga ttatcaaact 18120 cggcaaaaac cgagtttttt gtaggacttt tctaattact attctaattt ttctgatttt 18180 tttctttaag cgaagattgg tgatcgattt attgttgatt gtcgaactta tattaattga 18240 cattttattt tcaaaattcc taaagaaaaa aaaggcgatt tacagttcta taagttctac 18300 aaatttaaac ttggagaaaa gaaattctat tgagcttaaa aaattatcta ctgatagctg 18360 taataactcc cttaagcaac aaaaaactat taaaagcctg agcgatgaca gaaaatcacg 18420 aaaaattggg agaaaaaaaa ataagggata tggctggact ctcacaatta tagcctcatt 18480 aatattgata tttacgtcgt tttatgaaag ctttattgct aacattattt ataaagaata 18540 ttctaatgaa aaggaaataa caagcctaat agatcaatat tcggaagaaa ggaagtatta 18600 tagcaatgtt tttaaatata accctccaat taatttacct actctagaat tttccaattg 18660 gttactcact aattgggttt ttagaactat ttcaaattta ccaattttat tttccgacta 18720 ttcaaatatt aaaaaaaaag gaattgaata tatcgaaact gaatttgata cgttcattaa 18780 tttgaaaaaa ggctcaatat cgaatataac taataatgaa cttgaaaata gtaagtttga 18840 tgggcaagaa acaaatattg attttgtttc tgattcaagt gaattcaatg aaaacgatag 18900 aataagtatt gtaattccag ctcataacga agatgaattt attagcaaaa caataatctt 18960 tactattgag tcaactccaa ctgaattact aagagagatc ataattgttg atgattttag 19020 tgaaaaacca gtgtttgaaa tacttgaaga agaacttcca gaaaattata aaaaatatgt 19080 gaaaataatt aggttaaaga aatgtgaagg tttaataaga tcaaaaatta ttggtgcaga 19140 tgctgcttta ggccctaaca tatttttttt agatgggcat tgtaaaccaa aaaagggttg 19200 gtcagaagcg ctagttaaat caattagaga aaattacaag agagttgtat gcccaatcgt 19260 ccaaagtatt tcaaatattg attggagtga tataggaact gctggtgcga aaatgatgat 19320 agaatggaac tttgcattcc attggtatga tgatggatta ccagaaattc caatagcatc 19380 tggaggaata ttaatgatta caaaaagatg gtgggaagaa agtggtaaat atgatccagg 19440 aatgctttat tggggtgggg aaaatattga gcaaagtttt agagtatggc tttgcggcgg 19500 agaaatacat gtggtaagaa attctttagt tgggcatatt tttgaaagaa ataattcaaa 19560 tagaagaaat caagatttcc aatataaaaa aatgttaatt gataatatga atagcaatca 19620 tcaaagaaca gcatttgtgt ggttaagcga acagttttat gaaacttatt tcaaaaatta 19680 tcatgtatta ggttatttac ctattagtta tactaagggc ctaagcgaaa gactttcatt 19740 aaaacatata ttaaagtgca aaccttttga atggtatata ggaaaattta gaccggcctt 19800 tgaaaggcaa ggcgaacttt attataattt ccaccacatt cagcatgtaa aaagcaaatt 19860 atgtctctct atagcaaata aacagaatga tcggattgga gttggaaaag ctgagattga 19920 aattccaatg acagtggtgc caaacgatgt atcgagatat agcattaaaa cgacaacaga 19980 ttatgatatt ttagctttaa aaacatgtaa ctatcttgat gaatcccaaa agtggagctt 20040 tatccttgga aatcggatgt tatataactt caaatcaaag aagtgcttag ataaagcaag 20100 ttcagttaat ttattcaaaa aaatgaagac aaaggatttt atatattctc caaataacag 20160 ctcagaatct aatactgaac tagaattgcc tttactgtat gagtgcgact ggaatcttgt 20220 aatgagagca agaaattaca atcaattttg ggcatggaaa gatattggcg ataaaagtgg 20280 aaaaatcgtc aattggagcg gcgatgagca tagatcaaat gtaacagggg gcgcagaaga 20340 attcatagtc ccaataaaca aaggagtaga tagcgaaagt tactgtttgt attcaaatac 20400 tgctttaggt agttatgaag aaacgaaaat gttttacagt aattgtaaaa cagaaaacaa 20460 ctctgaagaa atttcattta agaaaatttg gaggcaaaat ctttttgttt gaaaaatgga 20520 gctgcatttt attttgaaaa agttcagaaa aatactcaac tatgtgttta tacaaagtat 20580 tacgtattaa agatagtgat taaataaagt gaatcccgta ataatacata ttactttatt 20640 gcagcctgtg atataaaatg cgaagttcta ttatattttc aacactgctt tcaagctgct 20700 atcccacaca taaggaatgc caataaaaat tatggaaact ttgttttctt tacttaaata 20760 tcaaaacatt cgaaatcccg tgatttaaag ttgcagattt atttgattga attaaatcaa 20820 aacatgcgca cacaaaatta aattgtcatt ttgggtataa tttttttttt gtaagaaatt 20880 tatcaataca gagttgtaac aattctaaca tgtttttgag tatattcggt ttgcaagtaa 20940 atagtttgaa ttatttttac aagtatttag ctatcattgt tatcttgttc acaaaatatt 21000 gtagcttata atattaagaa actctctaat cgagcaacta attaaaaaag gaaaaataac 21060 atacaataag aaatataatt aatttcggcg attttaatgt ttgtactgcc tatatttaga 21120 gctttttgta aagctcattg gaattaaaat gagccaattt gttcttatac aaattcaatt 21180 tctaattaat atattttaca attgcattag gagttagaaa caaaaatcag tttttgaatc 21240 gtagttaatc gaataatgaa gtatcatatc tctttctctt ttttacttct ggcattatct 21300 cttcaaataa aaaaccttca attacttcgc tattttctga atataaattg acaaaattta 21360 ccccatttga tgaaattccc aaaaattgta taagtaactt gacaacttca agaaaataat 21420 ttcttgaata aataactttt ttatctaaag aaaaagtaaa acatttgttg aaaatattaa 21480 taaaattgaa ctccaaggat acttttattt cacctatctt atgaacaact gagaagggga 21540 tattagtttg tagaaaactc cctaattgcc tttctaattg ctttataacg cttaaaggaa 21600 tgcattgcgc aaactcactt ttaagttgtg attttttatg ttctgagctg actatttcgt 21660 atttatagaa tataataact aaaccgaaca aacttgagaa tagtttgcca tccactggaa 21720 gacctatttc atcaacaaaa ttaaatatgt caagaatatt attcaaaatc tttaatataa 21780 tatttaatga tatattaaag tgcccagact cagaaatgtc attccagttg ataatgccaa 21840 tatcgattaa attggtatta attcctttct tattattgtt tttaatttga tcatttatta 21900 gatactcaaa cttttcaact gctggtagtt gcctaattga tttaaactca tttatcaata 21960 agtcaattaa gtgatctaat atttgattta aattcactaa ataataatat gagatatcac 22020 ttaaattagc aaaattatca cattttgtat ctaataatga attataaatt gggaaaaaaa 22080 taggagagct atagcggtca tccaatgtta gatctttgtt gaaacatgcg aaaatctctg 22140 tttcagaatt tatgcagtta tacaatatat tgttcatatc gaaagtaatc tcattcttga 22200 tcaccaaaat taacattgaa ggaactaaca taaacttata tttataacat aaatactttc 22260 tgtttcttct tagatacaaa ccattataat gatcattaat caatttatta gtttcaagcc 22320 cccaaaagtt catgactttc ttattcaata aaaggaactt ttgataatat aaaagcagat 22380 ttgttgaatt ttctcctaaa agaatatagt taatcattga gaaataggaa aatgattggt 22440 ttataatagg cgtaaatata ctttgtgtta acttattttt ttttaaagtt gtttggctag 22500 aatttaactc atcaaaaatt aattcattta tcatttctgt aatgctataa ttattaatcg 22560 aatgagagtt gattttaatt aaatccttaa actctgcaat ttcattgtca ttgattggaa 22620 atgatttatt tgtgataaaa aggacaataa attgcttaat tgattgaaga ttaatgaatg 22680 atttataaaa ataaaatagt ataccggcaa gatttgtact taaatttgaa tatgaaatat 22740 taatattttt taaaatatca tttatttggc tcttcataaa tccaatcttg ttgtataatg 22800 ggttcttctc gggaaattta ttcaaataaa tattgaaaaa taatatataa atagttatta 22860 atcttgaatt aaaaaactta gttatctcaa taccagaatg tttttcatag ttcttgaaaa 22920 atgtatctat ttcatctgta tttccactta aaatcaaata acaaataatt aatgattcga 22980 ttgaaaggta atatgaattt tttaattcag ttattgcagc ttcaacatga ttatggttgt 23040 ctacagggct gtatataaat gaagaaatca ttatttttgc aagcttgaat cttgcttttg 23100 attctaataa tgatggaagc caaacacaat tttggatttg agagaaagaa tttgtataaa 23160 tacagtacca ctttttttct tcattggaaa tttttggtat aaattctgta aacaaagttt 23220 ttcttaattt tgatattact tgaattaaat tgataagtaa atctttttcg aaacttcttg 23280 gaaatatttc attattgaag ttatgtttag atagaacttt aactttattc aagtatatta 23340 acaaatattt agcgttagtt tttccataaa cgcttttaat attaataagc ctaattatta 23400 gttgatcaaa tgaagctgaa agtatattat ttacaatctg agaatagggt atttgtatat 23460 ttctttcagg cttaaggaaa aatttgattt cgcactgaag taagtaaatt aattgggcaa 23520 ccattctttt tatgaataca ggaaaatttc tttctagtct ttgaaaagta tcataatcat 23580 ctagttcaca gataatttgt ttcaatacat tattgcagca tgtaactata tttaaatcat 23640 aattatccat taaaaaaaga ttcatattta gtaatttcaa atatttgaac aatgtgctaa 23700 ggctaagttg tatattatta tttcctgtta ctccaaatct acctggactc cataagctcg 23760 atgatctatc catatatgag acaggtatgc aggaaatgaa acaatctttt ataaatctga 23820 aaggaataaa tttttgaata ttatttttcc aattaatttt aagaaaaata ttatctccat 23880 taggaaaaga aatcaagttt tctaccttgc ttctaatttg aactaaagtt tcatccaaaa 23940 attcattgct taaatattta atatccaaat tcgatttcat aaacctatcc aaaagaattg 24000 tatagtaata tttagagtat aaaagtggct ctaatgaagc aataactaaa ttattttctt 24060 ttatgtagat tgaaatttcg tcaaacaata gatcttcaaa accatgtttt tccacctcaa 24120 attcatccat ttcaaagttt gaaaatgaag gggaaagaac cataatattg tcatagtaat 24180 ttaatatttg atcagaaact tgcgaattat agaggattga agaaaaatca tcttgaagaa 24240 gagttttccc agtagtatca aaataaaatt tttcactatt cattatctca cttaaggttt 24300 ccataaataa tggatatatt aatggagatg aaataatatt ccagatattg ccggcatctg 24360 cttcttttaa gttatttaaa ttgaaaaata gtctttcaac tagtcctatg attgatgtat 24420 aaaactttac tttactatct gagtattgtc tttccgttaa agaagtacag ttaattaatg 24480 tttgaaagat attgcaagag agaataatta atggtgaata tggagaaata tattctaaaa 24540 ttccagatga acgaaattca ttatttttta ttaaattaat tatccaatta atgaataaat 24600 tcttgaaatt tgtttcattt gaaactaata tttttagatc aaataaataa tcatatattt 24660 gagattgatt ataaattgcc atcttattaa acggcatcga gggtaatggg caaactggaa 24720 gtccataaag aagctgaaat tgtattccta gcaattctaa atattcagtt tcttgttcag 24780 gaaataaaca gtatttttct aactcaatct tttcatataa ttttttcata ttcaaattgc 24840 aaaatagtgc tgaataattt ttgactactt ttctacaatt atggtataat aaattccaat 24900 tttttaagtg actatttgtt ttatatactt gaacaaaagt ataaatttcc ttgaaccaat 24960 tttcaataac tgaaagatta gaaaataagc cgtactttat tgttatagtt aatagtagca 25020 cttttgtgct caaggttgaa ttaaaaaatt tatcaaattg ttcatttatc attacctcat 25080 taatgtcata aatgccttca aaatcaaaag tctgttcctc aataattgga aataaaatgc 25140 aataattgtt caaaatatgt cttttaaatg tatcctcttc ggattgttta gtataaactt 25200 ttgtttcttt ttgctcttta ttttcatcct taacaaattc aaatgaaaac tccaaatatg 25260 taagatatct tattaggtta attattgaat ttgaagatat ttcttctata gtgcttgaat 25320 atttgaaaaa atcgaataaa gctattagaa gattaattaa gtttttcatg actaatatat 25380 tcaatttatt tgctgataaa ttattcataa aaaccttatg aatacatgag atatttatta 25440 atatagtttc catcgttagc ttatgcaatg gttttttaga taatttactt atatttatca 25500 aatttaattc ataaatctct gtaattttta ttattaattt tgtaatattc gaaattatag 25560 aatttactat aatgttgtta aaatataaat attgtatatc tttattaata ataagtacca 25620 aattttttaa caatgaaatt aaaaattcgc aaaagttttc agatagatca aatcgttcaa 25680 attttttaat ccattgatta atatctaata ttatgctatt tgtatgctga agaaatttat 25740 gatcaattat ttcatttctc aaattgcaat ttattttttt catctgcatg tattcaagat 25800 aattattgtc taaaaatact gaataatttg caataatatc ataagtggga aattcatggc 25860 tgcattttga attataaatg ctaatccagt ttgaaaattt ttctttaatt gtcctgatac 25920 tattatctga aagaagcttt agagcaaacc cagttgaaat aattccacct tgattatttt 25980 gatctaaatc tgaaaatagc tttaatatca aattaatatt tccagcgtat ccgcttattt 26040 tcattagatc ctcttttaaa aaaaatacaa gttcatcttc caagtaaatt tttactatag 26100 gtatttcaat cactttaaat attattaata atttttttgt tgagagatcg atttcagaaa 26160 ttaaataatt atttaatttt tcttttatta tagtgaaagt gtccagtaaa tacaaaaaat 26220 ttgattttaa ctcattataa aaagatattt gtatgttact tttagtattt gaaataatat 26280 tattggtgac gaaaataaaa ataattaaag ttttcataat actgtaaaat agcaataaaa 26340 tttctttctg aatttgtttg ttaacttgaa aaagtaaata ataaagtttg cttgtaagtt 26400 ttgagcaaag gtagttttta tcaaaaatcc ttgatttgtg atttgaacat tgaaatctaa 26460 tcgttggttt atcattatat ctatgaaatt ttgtaaaatc aactatattt gttattattc 26520 ccattgtata tttgaataac ttttctgtaa tactaaataa caaactgcta attgtttcag 26580 aagtaattat tatatcttca ttctttgttg agaatataaa cttctgtatt ttgtttaata 26640 aattattaga atttaataaa aattcatcga ctgatgttag aaggttacct gaaagaaatg 26700 aaaacctatt atcaacttca aaaatatgtt gctcatcaaa attcataaga aatttgtgcc 26760 ttaaattatc aaataattgt tttatatctt gaaaactaac atattttgtc tctagcatag 26820 ctgaatattg ctctgttata aagtttgttt catgaacaat catttcatca aattgttcta 26880 tagaaggaaa attgttattt aaatttaaac taaaaatatt atcgcacttc atctcttctt 26940 tttttaaaac acttaaaata agattgtagc aataatacgg gattgtttgg atccaaatct 27000 tatattgctt atcattatat ttaggaaatg ttacagtatt acttgaaata tcaataatta 27060 tcttttttaa tggattgtct aattccaata ataaatcacg ttcaataact tctttattca 27120 aattaacatt tggtaaatca ttattaacgt ttttgcttct tgttcttcta tcagaattca 27180 ccttaaattc gtttttgtaa tttgataaat ttaatttagc acttaaatca tttatgtagt 27240 cttctagaag ttggaaaggc tcaaagctta tatataagtt atttatccca aataaagaaa 27300 taatttgttt aaagtatgaa aaaatattgt ttgaatcatc ttgaattgac ttagaaatgt 27360 ttcctattga ttttatgtaa ttttcttcaa agttggtatt tgttatatat ctttcaattt 27420 caagacactt tttttcgtat ttcttacaac attctattct taataatata ttttcgaaaa 27480 tcttattata tttctcttga agtgccaatt taggagatgt gcctttaact tttagttctc 27540 tatagatttt gataagtgaa tactttatct ccttaattgt ctcggtgtaa ttactgtgat 27600 ctgataattc aaagttatct tcatctaaaa tcatggagca taatttctgc aacttaagtt 27660 cttttaacat aagatctcca aatatatggt cattattctt atttgctgaa attgaattcc 27720 ttatattaaa taaattccaa gaactgaaaa aatcaaataa atattcacaa ttcactataa 27780 tttttttatt aaatatttct tcttccgaat aatcacccaa aatcctctcc agaacaatac 27840 tacattccaa aaaatcgcct tgatagaata aaactttgca aagtaattta aatagaacat 27900 tttcttcttc aaggcttgaa ttcatttttt ccaatttatt ttctcgttgg caaataagaa 27960 actctataca aaatctacaa agttttaaaa catttgtggt tcttccatct aatgcttcaa 28020 taaaactatt tctttttaaa acattatttt ctccagaaaa tgatatagaa ttaaatattc 28080 tataatatct atccaaaatt tcaaccccaa tgtttgcaat ggttttgcag gtatttagcc 28140 ttgcattacc atcataaata tcatctttga ataatgaaaa aagctttttt gattcatttt 28200 cgaaagacag taatatgtca tagttacata attgttcaat attcatagca aatagtctat 28260 taaaaagtat aaattgaatt tcatgtattc ccgcatcaac ttgaaaagta acatttatta 28320 ttaacaatcc ttctaaagaa agatgcaatt attaaaatta taagctttaa ctttaagaac 28380 tatttccaat gtttctaagt gatttattgt gttatattga gattaaatta atcataataa 28440 agactaaaag ccgcaaattt attcactact ttatgttata gcaatttttc ggcgcatata 28500 atacgtgtaa taataccgat tgttaaatat cgcacatttc ttatataaag acattcaata 28560 atcatgaaac atacaaaata tctactattg ttgcttactc ctacattctt agtatattgt 28620 tcaaatataa ctcccaaatt ggaaaatgat acttttatag acgctttaga caaggatttg 28680 tctgagaatg aaaaagaaaa aaaaatgaca gtttgttttg aactaacaca aaaggaattc 28740 atagagaaac gtgatgttta tcagaagata gctgcttcaa taaaagataa agagtcaata 28800 acttctaatg atgcgcttag agcgcttttc catcagaacc tcattacttg ctacttcaac 28860 tcaaatataa aagatataaa ttctgtcatt aatagaaata taccgaaaga agggatcgcg 28920 aagatgttta cagttagtga aaacacccca ctaagatttt cgtccaatca attaaagatt 28980 cttgaacgag ttatctcaat gagtacaaaa aatacaggca aacaaatttc tggaggaatt 29040 gcaaaacaat taagtgggtt ttatgggcat ttatatttca tttttgcgct acttctaatt 29100 ggagtttcat tttattttgc tctagatcgt ctgaataaaa gcattaaagg cgctaaagga 29160 aagataacta aaaagaacaa ataagctcta aaagaataat ccttgattaa attaacacta 29220 ttgccattgt atgggttggg attcattagc tgagttgatt ttactattat caatttccca 29280 ggttcttcct cttctaactt gttctccttt tgaaaaggtg attttggatt ttccttcgat 29340 ttgtttgttt ccttcaggat tttttcttgc aataagtgca ataggatcaa ttacttggta 29400 taattctttt gaccttgttg agtttaagtc aaaaaactgt acagaggttt tcactggaat 29460 gaaccaagga aagctattct ctattgtaat cattgattct gtagttcgta ttccaaaaag 29520 tgatttaatg caaaatgatt catcgcatat gtattcacat ttatcgattt ttatagaacc 29580 tccattttcg gaaataaaaa gtggaatatt ggatttttct gaagatgtgt ttgttgtatt 29640 ttttcttgaa atcaaataat taacttgaag attttcaata tatatttttg atgaatcttg 29700 cgacactaaa gttgtcggat aggaaattac agcgctttta gcaataaaaa caccaccatc 29760 gtcagtaaac acatctccct taataattgt tttatcacaa ttcagtattg aaaattcatc 29820 taccataatt ccactattta cttctaataa atatctttca gaaacattca tgagtgaatt 29880 tccttgtagt acaactactt tattaatatt tacttgatac tgctgaattg cattattttc 29940 ccttttaata tttattattg aaccggatgt taatattacg ctttcgattg aaatgcttcc 30000 ttcgaagaca agcatttcag aaaatgtgtc aattgtaaga ttgccattaa ttagaagatt 30060 gattggaatt ttatttctaa acagaaataa atccgtaaat tcatttcctt tttctccatt 30120 taccactgtt aaagacttcg tcgacatacc caagctcccc tgattaatat aaatttgttc 30180 tacttcaaga tagcttttca taattaaaac aacatttcca tttttcacaa ttaatttccc 30240 gttaattata aatatgccat tagaaatgaa aaaactctta taatcgttag attttacttc 30300 aaagcttgga tttacactat taattattaa attaccttta tttataaaaa agctagattt 30360 tcttgttgaa ataaatgatt tcccaacttc catactttct cctaatagaa aactcccttc 30420 aattaaatct acatgctcat ttaccttcaa actaccatta gcaatgaaca tattacttga 30480 atttgataca agtattgaac caagaataca gtgttgatat ctctcacaat tattatttga 30540 attattattg ctatttgagt catttgaatg acctaaaatt aggatttctg atccgtcact 30600 aatgacagca ttttcaaaaa tactaacttt gcccatggac aaaaactcag atccatcatt 30660 tatttcaatt gagccttgca caaatacttc tccataaatt gtagcagctg atgctgaaac 30720 aacttggaaa aatcctgcca ttttatcatt tatttctgta tattgtgtac tgcttatacc 30780 aaaaatactt gaaatattat agctattatg ataatattct atattttcaa tgctttggtt 30840 cagtatttta agagagtcag cgtaaaatgt agacgaatct cttattgatg caatgtgtag 30900 cattacattc ccacctattc gaattttaga accggaatgt aacaataaaa tactattaat 30960 ttttaattct ccctttttca cgataacaac tgatgatcta attaaagata gcctgcttgt 31020 aaagatagat gaattcatca tatgaaggct tccggaatcc ctaatatgta tgcctagccc 31080 atgcgtagtg ttagttttca aattaccgca agtaccaaaa acaaggcctg catcagaaat 31140 aacgacttcg ccagcactaa tatcgcctct tacccataat ttaccactat ttccgacata 31200 tatctttctc ccagaccata gagagcccaa gttctgttct tttgtttcac tttcattgga 31260 tttggaacta tcttgaaatt gattttgaac aatattatta tcaaatgtat ttaaacaact 31320 gttagaggca tgaaaaagtt taactctagt gatagaaaaa gggttccagc tatctgatcc 31380 actattattc tgatatacat cagaaaataa atatagaact ccaataaata gattagatga 31440 cagatccacc tgtaaatctg cggtgtagac attttttcca acaatcaatt tcccaatgac 31500 tattaagtta gaaaatggaa gtatcattga gcctcttaca gttaaacctt tcctaataat 31560 acaccttctt tttgagtgta ttggttgagt tttaccaact gacaaatttt gagcttcaag 31620 tgcatcaaac gaagtcagag tatttagctg accatttgca acaataagat ctccacatat 31680 taagacatcg ccatcaacaa taactgagac acaatttgaa tcttcgagaa ccaaatcttc 31740 attttcataa ttcaatatta tatccccgga atcggaatag gaatattcgt ttgaattata 31800 tgaatatgca attggagaag gatcgaaagc attttcacaa taacttcttt ctttcatatt 31860 ttcaaaacct ttcaatcttt tttcgttgca atattgattc tttataccag aagtagttgg 31920 gcagctaaag ccaatatttc tgatttgatc tgctaataga tcatacattt gatttacttg 31980 attaatataa accttttctt ttgtatcatt gattaattta cgcaattttt catgctcctc 32040 attgaattta atctcatatt tattcattaa ttttcttttt actaaattct gaatttcatt 32100 ttgatgaatc ctttctgata ctgttatcgg attgctaaat tcataatcag gaatccatgg 32160 tagagaattt gcaacaataa ttagagatag gcattggatc acaaatataa aaattacgcg 32220 tggtaataat atcaagttgt ccctaaaatt cataaagttt tgtttaattt ttttttcgaa 32280 gtttcaatta tgttgcagga ttgcattgtc ggcgcttaat tgtgtggaaa gaaaaagaat 32340 aaaatgatta attataatta tcaagctaaa gcgtctatac atatacattt atgtatcatt 32400 gactcaaatt tttcagatag tgctcttata ttcacattat ttgaagattt aatcaaatac 32460 ctgacttttg ctacgaagta aattggtagt tcaatttgtt tatctgagat tatagctgta 32520 ataaacctca gagcttcatc gataaaacta aagctaagta aattttcgat tacattgata 32580 aaaatgtgta tatgttgacc aattgatgtt gagcaacttc taattatctt catcaagtac 32640 tctgggtctt tatacatttt ttttaattga ataggtagga ttgaagtata agcgctatcc 32700 gatggattat cactgttcat tgcaccgtag atgcgttcat ttgccaataa tgcaatgtga 32760 acgccctgga gaaaataagg agagtgattt atacggctta ttattgaact ccaataatta 32820 atattaatag caaaggccaa attgtcagat cctttcagta tttgtatatt tctaagctta 32880 gcattttgaa ggagtatttc ttcgaatact gtaattttcg gatctggtga gctatatgat 32940 aacttttttt ttttctgtcc actagtttgc gttaggaaat catttttata ttcaactaaa 33000 gaattaaaat atattgtatt gtatgataaa aaaaagcaat aagtgaacat ccaaaatcca 33060 atagttttat aaatattatt tggaataaat tgagttaaag ggttcgaaaa aattatccat 33120 ttattatttg aaagatttga agaataatct ttaagataat tgttcataaa cttatttgca 33180 acttgggtta gtctgtgtga agtctcaaag tagccattga ttattaagta gtaatttagc 33240 tctacaatag atatgcttaa caaactagct aagttttttg aacaaaacag atgcggatta 33300 aatattgttt tgaagtaaat tccactacgt tctaagtaat tcttgtatct ttcaatgttt 33360 ttgcaaaaat taatgaaatg ttgataagtt tctgctaaag ttaaatctga agttaataat 33420 ttatttattc cgacaaaata ccactttaag taagaaatat attcaataga cggtaaaaat 33480 gtgaccttat ggtgaattcc ttgagaataa ttattttgat agattatttt tggaagttca 33540 ataccgaaaa agagatttgg tatacaaata ggtttatcag aatattgccg atcattaaat 33600 tggtttatta gtgtagcgct tgaataagta atgcatgtgt ttaatgaaga gataattatg 33660 tttaaaggct cggaatccgc attaatgact tctttttcaa agctgaagta ttggattata 33720 ccactcaaat tctcaagcgc ctcatttatg attttaaatg aatcgaattc ttcataacga 33780 gaattattca ttaaaataaa ttctatccag ctctccggaa ataaagcatt tgtacaatta 33840 ttatatgaac acaaatcgca atatttatcc agtgcattta aatatgctaa agtgactatt 33900 tcttgatacc tctctcctct tttccattca ttcattagaa acttataaat tgatcgaact 33960 atgaaagcac cttttgagta aaatcttaaa aaagcttccg aattagtgtt attaaggaat 34020 ttagtttttt tccagaatat ctgaattaac aatgaataaa attcttcaag ttcagttgca 34080 aaaactgatg ccgaaattgt agaaaatttc aattcacttt tattttcaac tttgtttgac 34140 aaataatgct tgatccaaag gaccatcaga aacagcattt tctcatgctt agaattaaag 34200 catgacaatg atttatttag aagcttaact attgaatgat cttcgatatt atttccgctc 34260 aaatatatat attgtattag atatttttta tgaatctcat taattaatat tttctggttt 34320 ggagagcaac tggagaaaca tgcgcatatc agattaattg atatttctaa taaattaaag 34380 tatattgaat tatagtatat ttttattctt ggtttactaa acaatctaaa tattaatgaa 34440 attaaataaa taactttgat ttcgttgcaa gaaaaccctt caagtatgtt ttttttattt 34500 tgatcttctt ttggtttaat taattcaatg atttcatcaa ctaacttaat atcactgatc 34560 aattgtttta atattgaaag tgatgaatct gaagtatgac ttttataggg aagcaaacga 34620 atgatatatg aaatgtagcg cgtgtagttg tgttcaaaac atccattgat aatttgttgt 34680 tcaaggtaat cgtaaactaa gtcaaccagc tgaaaattac attcttgaga aactttttta 34740 gaacaaaagt cattaagaaa tagctcccag aaaattttat ttgaaaaaat atatagagaa 34800 agatcattgg ctccagaatc gataaaaaca tctgaagaaa aaaatttgta tattgaatta 34860 taatataaat aaaccctata aacttgatca acagacggaa caaaatgatg ttcattgatc 34920 ctcccacaat tatgagtaat aatattcaac tttcctgtta ccatatattt gcctagattc 34980 aaaaatgaat aatcttctga aaaaaactgt aatatatttt caactccaca gctatatgtt 35040 aaatttatat ttctaattaa atttttccag gaaaaatact tgaaaatatt tcttgtaata 35100 attgaataga tattcaaaat taaaatagta ttttttacct tcattatttc taaattagaa 35160 gctttaaaat atttattagt ttctttatcc acggaattta atactaagca cttaacatca 35220 ccgttatgat ctgctttaat tgattttcca acgaaattta aaacttcatc atttgatgtt 35280 ggtgaaagat tgacaaagaa ttggattatg ttctgtatta atgatttaag ttcatccgtt 35340 gtgaaattct gtgttaatat atcataatag atcaagttaa aattatttat ataattagat 35400 gctgggctga aaatttcatt tagcgatgta taatggttaa tagtatcctc taatttattc 35460 ataaaagtaa gcttcttaga tagtacttct ataagctcca taattaaagc gcacactata 35520 attttattat ttattccatc tgagatgtaa aaatcgaata aatcgaataa atgtttgtcc 35580 aatcttgttt tattgctatt aaaaataatt ctcaatacca aatactttaa ttttagataa 35640 tttcttatta tttcaagaga tatgtataga tatgaattac aattgtcgga aattgatgat 35700 ggcttatcca actcttgtct ggataaagaa cttataacag aaactgatat aggagataat 35760 aaaaaagttt tatctttatc tgaaaaaatt cgaatacagt aattgccaat atttttaaag 35820 taattaatta agagatgaag aggcgtgatt atgaatatta tcttttctat atcatatttg 35880 cattttatac gagtctcaaa ctctttattt attagtgctt tcccatctag atatatgtaa 35940 acatttttca ttaaatttac caagtcctca aatttagaag ttgagtataa aatattcaac 36000 agctcacatt gtttcttttc gcaaacaatc tttgaaataa tgctttcaag gaatctcaaa 36060 tatctattaa aagccatagc aactacagtg tttgatttac actcaccaat aactctacag 36120 agaagaaaat ttatgatttc ttcaaagttt gatagttttt ctatattgga aattaacttg 36180 cattttaaat cactaatatc aagtggattt ttaaaagaat tttgtagtcc ctgactccaa 36240 actaacgtag aattgcctag attggcattt gatgggaaat tgttcgaagt gtgatcattt 36300 aagcaaattt tgtgaatctc acaaattgga aggcataaat cataatcatt cataatattt 36360 ccaggaaaag ctgccctggc aatataaatt tctgatttat tacaatagac ctgaggatat 36420 aacaattttt ttttatctcc gctctgttga agatctaaat taagataact tgaaagttgc 36480 aaattaatgt tttcggaatt ataatcacaa aagttcaata tcttatagag ataaactgta 36540 ctgttaatca acaaaacaaa atgatctgtg ccttctagga gggaaattac agggtgggta 36600 tctatattaa ttccaatttc aatatctgtt cttctatcaa ttatattact acaaacatta 36660 ttttctgcag gattaaatct gatcaatata agacataatt ctttattttt ttcagaaaat 36720 gtaattaaat aaatattttc tgtgtttttg ttggttactg tagttttcac tatttgaaat 36780 gcagaaattt cgagagattc tgtaaagata ttttttttaa taaatagtgg taaaaactcg 36840 tctttcaaga ttgttgaatt tgaaattatc ccaaaaaccc ccaattcatt gcatgctaaa 36900 aatagagatt tagttgcttg agtaagtaat ttcaaatttg aaagctccgt tttgaaactt 36960 ctcgttgtca aaatattaaa accatactcg cattccccaa atacagatat ctcaatgata 37020 tataactgtc cttcattaga gcaaactgaa atcgtgagaa ttgatgtgca tctatcgaaa 37080 aataaactac aattacaatt tggattgaat ggtaaattta tattaaataa tgatttaata 37140 tttggtgctc tacatttatc ttgagctctg tctctagaaa atattatagt ggagacattg 37200 taaacacaga ttgtatcgtt atttattact gaggcaatat gtaaatcatc gaacaacttt 37260 aattcattac caaaaaattt gtcaaaatca gaatacgtat ctgtacaatt gtttctttca 37320 tttgttaatt ctgcataaaa attattagag aaagcaaaat agccaatgat ctgctcaaca 37380 tttagaagca ttttgattcc taagttatat aattgatttt tattgctgaa ctttgtaact 37440 atttcgtccc cacttaagaa aaaaaattgt tatacaacgt cgcggcatca tgaactttta 37500 taattgcgat ggattattta ccatttgaca cctacagtgt gaataaccaa atactgttta 37560 taatacaaag agtttttaaa gataatgtat ttttactgtt tatttatatt gctatcttct 37620 tatcaacaac tattgtcaca atttttattt taagaatact tttaaatggg ccaagctaat 37680 aataaaaatc aacaaaaaaa agcaactcaa ctccatcata gaaatgtgca tgccaatgat 37740 gattttttat ttttatctga aagataccct gagcttaaaa aatgtatcaa aataataaat 37800 aataaagtac gtattaatta taaccccgca gcattacact gtatttcgaa agtattgctg 37860 cattacagat ataatattaa ttgggacata ccggataaat ttctaatacc aacaatacct 37920 tcgagagcta actatgtaca ctttatttca gacctgttaa ctccagaaca cttttataat 37980 acggaaaaag taaacgatga aggattaaga aacgatctaa agacaaaaac atgtattgaa 38040 ggtggtacag atgtttgttt ttctgagctt attcctagag gaaaacaagt tttggggttt 38100 gatataggaa ttggtgcaaa ctgcatattt tctcttctat gcaacaaaat ttattcttgg 38160 aatatgatcg gctcagatat ttcaattgaa agcctgagtg tttcagacac tattattaaa 38220 aaaaacaatc tttgtggttg tataaaactt ttgcatcaag aaaaaccaga atatatatta 38280 ttcgggatac ttgacaaaac tgaaatagaa gacttgaaat tctcattcac tatctgtaac 38340 cctccatatt atgattcagt tgaagattca gaaattaata tgcaccctgc tcgcttcagg 38400 agttgtcaaa attacgaaat aataactcat ggaggtgaat ctcaatttat cttgaaattg 38460 tatttcgaaa gcaagaattt ttcgaaaagg gtaatctggt acacttctca agtttcgaaa 38520 ctaaaaaatt tgaaattttt gaaaagtgtt cttaaaaaag aaatcattaa caatgaatta 38580 aaatcactta gatacactac tctaaaacaa gggaagcatg acaagtgggt aattgcatgg 38640 agcttttttg aaaaggaaga gagaacctca atattaaagt ttctcagaaa caataaaaac 38700 atgagctctt gacgaatttt aattaaatat taatgtttga tcaaaatttg catgtgtatg 38760 catgcaaatt attatttagc acatgcgtta attatttaca cagaataata aattatcttc 38820 tgttaacctt attattaata aacttataat aaaatggatg ggacgctaca tcaaaatgaa 38880 attatttctt tgctagacga aaaaagaacc atttcaagca caataaaatt ttcgtcgcta 38940 cttgggtcta ttacaataac caagttagta atatttttgg tcggattctc agatggactg 39000 actcacctcg caacgctggc aatttactac ttgctgaagg atgatttacg cctctctcct 39060 cctgaagtat cggttatata tgcaattcct gcgatacctt ggtttttgaa acctttattt 39120 ggtaagctac aaaccatttt ttaaagtata aataaatttc ttaaagcatt ttgcagtgat 39180 tcaataccta ttgcaggtaa gcttgaagct cattcaaata tattcgtata acatacttat 39240 aggaatgaga agaaagccat acttgatttt tttctcaatt cttcaagtta taggcttttt 39300 actacttgca acaaacgtaa gttaactcta aatttataat aatattttta atagtttcag 39360 gctgatactg tttttaaagc tgcagtctgc ctactactaa tatctttaag tgcagcattt 39420 tgtagcagta ttgctgaggt aagttattag aaaatagatt aattaatcaa taatctctaa 39480 aaaaacaggc attagttgta gagacatctg gaattaatgg tggtgctgaa acagtctccg 39540 attattttgg ctcaaaggca ttgggtgcac tagcaacggc atatttttct ggttcattgt 39600 tagatacata cagtaaacaa ggaatatttc ttacaacgtc aatttttcca ttatttgttt 39660 ttatcgcttg tctgataatg gatgataaaa aacagacaga agatctcact gcgaagaatc 39720 agctattttc gcttaaggaa tttttgaaaa aacccataat ttgggggcct gcaatatata 39780 tttttactta tactgcagga ccagactacg acgatgcaat gtttttttac tttacaaata 39840 ggcttggctt ttcgccgaca tttatgggta gcttaaggct tacttatgga attgcaggta 39900 taattggaat tgttctatat agaattattc tcaagaagac cccattccgt gaaatattgc 39960 tttggactac tttgttttca ataccaattt atattctacc tctagcgctt gtcacaggac 40020 tcaatttgaa tatgggaatt tcgaatagaa tgtttgcatt gtctggtgga tttctgattg 40080 aagccattgc agaaatacag ttgttaccac ttcttgtaat gactgcaaag ttttgcccaa 40140 aaggccttga agggtctgta tatgcggtta tgatgtcaat acgaagtctt ggaatagggg 40200 tttcaaaagt tatctccgca ggattggcct actcacttgg aattacagca ttcaattttt 40260 caaacttggg gttattaatt tggatttctt ccgcatttct cctgctgcct ctcttttttc 40320 taaatttggt tagttatcat actttttcca tatttcaagt tcattttttt aggttgtgaa 40380 tgaagaagag atccaaagta cagaaaatca agtataattc aatattctca aaatgattaa 40440 aattaaataa cgaaatatct tatgagattt tatttagaat tgaaaataat tttctatact 40500 acagtaaatt tttactatga aagtctatat tgaacggtta ctttgcttct ttactttgta 40560 aattcttata acccaattat cagttgtata agcttcctca aaatgagtaa gtgagaattg 40620 agttttacct atgtggtaat tccttactaa gtcaaaacca tccttctgaa gatccgaaaa 40680 cctataatat gatagcttgt aaatcagaga attaagcatc gcattagatg cgtctttccc 40740 cacagtataa tgcccactat ccgatagata atcgttttgc tgaatgtgag gataaattcc 40800 agaagcaatt cttaccatcc ataagaattt attgatatca tcagaagaat attttgcaaa 40860 cccaccaaat acaacaaaaa cataatctac atccagtttt tctattattt tataagcctc 40920 atcctcagga gatgccaatg caagcccaac tgtagcgata tgtgtgttat tccatgtatt 40980 gttatctaca attacggttc tgtcgcctaa ttcagtgcat tgatatccat aatcccacca 41040 agacattatc cttgcattgt aaggggtgtt ctttctcaac caataatatg cttccctaaa 41100 atcatcctga attagtctag ttccatctct aagacggttt gaagtaatta ctgatggatg 41160 cgaatacgca acagcagagc tccaaacact atgcacgaca taactcacac agagcaaaaa 41220 cattaataat acaaaaaaaa ctctgagaat accaaaattc ccttggctcc cgtcaccata 41280 agtggttttt tttgtttgcc ttctgcctat tagactcgat aatataaaag aaaatcccac 41340 tgcagaaagg caacaagccg caggcccaaa aaccagcatc agtctaatca tgacagaaga 41400 aaaatagaca gcaaggactc catatacaat taaaaatatt gcagtatcgg gaatgcacgt 41460 actatgtgat gtttgaaatg gtgaattttt tggtgatttt ttttgagaaa tataaattga 41520 atacgcacag acaaatattc caagagggaa aaatatcgca gtaatatgga ggtcaaaaat 41580 atactgtgac caagttgttg cttggtgttc tgatactgat gcgataatag gaacatgctt 41640 agacgcatat gtgggatcta aaagagtcat acttcttgcc gcccacctag tttttccaga 41700 cagtgtcaat accaagaagg cggagctaga taccaagaag caagaaatcc atatagcttt 41760 tagaagtatt cggcttgaag aagtgctcag aatatatttt gaggagttca atgctataac 41820 gcacactaca agaaatgcca tacagtgtga tgaaaggtgt tccgaagatt ttacggcgcc 41880 cacgttgaca aatggaatgt taaggttcaa tattgtacca ataacataaa atacaatata 41940 aactacaaag tgtttgtatg taaatctgtt gagtaataca agagctagtg tatagattgc 42000 aattgtatta attacaaaca catatcctcc ccaactcata gtcatataat tgtaggcaag 42060 agctgcaaaa aatgaactca atattctacc atcatttact gccctcaaat atagtgcaaa 42120 gctgaataca agagcaaaaa ttgcaactgc ttcattatca taacttcctg caacgctcct 42180 actcagatac gtaggagata tacccgtaaa gagagctgcc attattcccg tttcatttcg 42240 cttagttatt tgaaatgtga gtaagtatga agcaagtgca gttagacttg agattattgg 42300 tcctgtaaaa acacaaatgt gtaaaatact tacaagtaaa cccaattgat gtgcaatata 42360 cctcatcaaa gcagcagtaa gcatgagacc aggaaataga gtttgcccaa taatcctacc 42420 aagagggtac caactcctgc tatcaaacca gttccaaaat gcataaaacc catgttttga 42480 aagaaatttc gaggttctat agttaaaatg tggatcaaac tcgtgtatta tcgcctcata 42540 tcttactacg gcaaaaagcc tcacaaaaat acaaagtccc ataattaaaa ccacggatag 42600 aaacaacaaa acaggagata tccttccagc agattcaaat attgattttt ttttgtattc 42660 attttctaat tctttataac ttatgtcttt cattattaaa tcgaagcatt cagataaaat 42720 tttctaaata atcaagaacc tttaaataag gtaagaattt ttattttctt tgccacaaat 42780 tttatagttt atattaaatg tctcaaaaat aaagttgctg tgtgcctccg tcaggtaatt 42840 cgatggcgca aatttaatat ttatttatta tgggaattgt ttcatcaagt tcaaatctaa 42900 gttcaatcga taaaccaaat gagcaaaata atttccattg tactaatggt catatgaatg 42960 gccacaatct agaaaatatc cagtgtgtaa ttagatggtc atttggtggg gatgaggtat 43020 ttgttactgg tagttttaac ttttggagaa agcaggatga atacaagtta tttaaaagtg 43080 gccacgatca tttgattgca atagagctta ctagaaatat tcactttttc aaatttattg 43140 tggatggaga gtggagatac tctccagaat acccaattga atcagacagc gaaggatata 43200 taaataattg tatagattta acaaaatata aagctccata ttattcaaca ccttgcgata 43260 aatcacgcta cggcgtccag gagttccatc aagagctgcc aacagagttt cctgtagatg 43320 cgccagcatt gccaattctt ttaggtaaga gcagatgccc acttgaaact gccaatggta 43380 tccatattcc gtttcattgt atttcgaacc atatttacta tgactctctt gttcaagaaa 43440 tctttggaac tcatatagtg acattttgcg tgaccaagcg ttggtttaaa gaaaagtaca 43500 tgcaaattga tcattgtatg caaaagttta cgacaatact atatgtgtcc tttagattaa 43560 ttgatgaatt ttatcctata ttaatttgca agaaaaatga ctacaattta aattgttttt 43620 ctaaaagtgt ttctgactca gaaaataaca atcctcacta tgacgcttat ttttcagata 43680 gattgctcac aacggccgaa atgtttgcca caatatttag ataaatcctc gaactcaaaa 43740 tactatcaaa taataacata tgtattttca gggcattata tattacaatt ccagtgagat 43800 acaatcagcg aggtttaata tttttgttaa ggtttagaat ttccgtagca atggatttaa 43860 tttacttttg tttagggcat tcattcacta cctgtagcta aattattcgt atcaataaat 43920 gtcaaattat cagcgcctat ttttagagac gagaatgcgg ttatatttac aggaaatgta 43980 gaaaatatta ctgagggaat ccagattgtt aaattttcac tttaattttc aactgatatt 44040 tcttgcagtg cgtatagatt aacgcacaat ggtagaaaag aatctccctt gtggagaaga 44100 tattcccaag cggcaacttt ttgaaataat aaaggaaatt ggcccaagca gcacgtcaaa 44160 tcaagaaaac tttgatgaga ttcttctagg atggattaac gactcaagca actttaaacg 44220 agatattgcc acttcattca tttatatgct caatgataca tcgaaataca ataaggatcc 44280 aaagtataaa gaaaaaggca gcgatgattt ttctgaagtt cttagagcaa gatggaacga 44340 gctgagtaat aacaaaaatg accaaaattt attagttcaa agttctagag aagatattag 44400 gctgggttgg aactcttccg tttttgtgga aggagtgaac aagataatga aaattcaaga 44460 gtccaagaat aagcaattta atactaattg gaatgacatt attgacgaaa taattgataa 44520 aagtgactta ctagatctaa ctgataatga tggaagcttc gaattatttt gttcactttt 44580 aggtgaatca tatcgaaggg atgaaaataa agtttataac ccaaaatgta acgaactgta 44640 ttataaatta gacccttttt taaaagaatg gaagaatcct gatattcaag ctctttttct 44700 tttgaaaatc tgtaaaacaa gctatgaaat aaataatagg agttataatt tgcatcaaaa 44760 ggactattca aatgatatat tagaggatac attaatagaa aatgcagtaa attttgattc 44820 agtctacacg aatatgagta attactcaat tatattagat gctagctcaa ggattcacaa 44880 ttcgattgtt tcaatgttcg aatcagacga atccattttg gaactcatta aagatggttt 44940 actcggaacc ccttgtgacg atatatttaa atctaaatta ccaaagattg gaatctttca 45000 cactaaaggt tcatgtggat tattaatttg tcatgatttg ctaagggtca ttattaatat 45060 atataatcga attaataatc agagattatt agaactcctt caacatctta ttttttatga 45120 gattccaaga tattcgccta ttggcctact ctcatctctt ggagcaattt atgatgattt 45180 tccacaagcc gtttctgtat ctagtgacct agctaacaag ataagtgagt cgttcgatat 45240 attagtgcca aataataaag tcacgaaggt ttcgttgatt aaaatggcaa tttacagtgc 45300 tcaagagttt attctatttc ctatttctaa aatagttgca acttctcatg aaaaagaata 45360 tgatattggt cctcgttttg gagtgattat gcaaatcatt ttcgcctctt cagagaaaca 45420 taggaatgat aatagccaat cccagtttat gagagatatt atcagccaaa ttgcattctg 45480 ttacagaatt tcatctgttt ttctaatttc aaagttgaac tccctttcga ggagctacct 45540 ttgggagttc ttggactctg aatggaagac cttacctcaa ttttataaaa tgaatgtttc 45600 tgattgttta tctcaaattt tgtctagaga taagcttgat ttactgaaag attcagaata 45660 ttctcatgaa tcgatcgact attatcattg gtggtttctt tgtatcgata ttgttttttc 45720 tggaattagt atatctcaaa attccgaatt ggggttgtct tctagtataa gtgacgaaat 45780 tagctctcgc gctgttaatg tcctttttac tctatttaaa gaagaactag agggtaaaaa 45840 tattgaaact ccactattta ttattgctat tatggactat atccgagcca gacttgtata 45900 catgacatac gcttcagctg aacttcaaac tattctagta aatagaagaa catcattacc 45960 ttcctttgcg gttgataaag ctgcaagttt taaagttttt gctgcaagag ttgttcgtag 46020 aaatttatct ttaggttcat ctcaactata tcagcctaaa gaatgctgga tttcttcgct 46080 tgaaactccc tcattcaagt tttcctgttg gcgtattagc tcatcttcta ttctcgcgat 46140 tcaagaagca attttgaagt ttggagtaag cattaaagga tatgaaagca cccctatatc 46200 tgaacttcat gatgctttac ttgaatttga gaagattaaa gatacaaata atttacaaga 46260 taaaaataac tctttgggct ctattattaa agatgttagc tccgaaatta tgttgaattc 46320 ttccgtaaat gagaaagatg aaaaaactct taataatatt tcacaaactt tattcaccaa 46380 taacgattca gaacacaaaa actccacaaa aattaatggt aaatctgaac tgatagaggc 46440 tgaaaatgaa ggaaatggat ttgaaatcaa tcatgtcaac gattttctga caaaatgtta 46500 ttcaggtgaa attaacacct ctgaattaac agttgaatta aaaaaaatgc actctctttc 46560 aaaccaccca ggtaaaaatg ttaaaatctt caacactttc cttcaaacgt tatttgatga 46620 atgtagatcc tatcctaaat atcctaatca agagcttaaa attactgcag aaattcttgg 46680 gattttagtt aaagaggatc tacttatttc atttggaaat gcactagtgt ttgtattgcg 46740 atgtattatc gaagcactta gaaaaggaca ttggaccaaa atgttttgtt ttggagtatt 46800 tgctatggaa atgtttatag acagatttat ttctttcccc caatttctct ctgcaataat 46860 taatatgtct caacatttaa aacatgcaat agaaccgtat gttacttatt gtgagtcttg 46920 tattgccatc cttccagaaa atctaaagaa taagttatac attgaaagga gtgtaatgga 46980 gtctcttaac ttagatatcc caccaaaacc agagagtctt atttcaaagg tgcacccaga 47040 gatgattaac tttgaattca gacaaaagga gactagcgaa agtccttgta aaccaaatat 47100 gtctattgac aagcttggaa gttttataac cttaaatctt tcagaaagga aggtgcttcc 47160 aactggaatt acaattgatc aacttcaagg ctttggcttg ggtagtttag aaaaattaat 47220 gaatgatcct gaaatattaa acgcactagt aactccctcc gaaaatgtaa ttgaacacat 47280 tttcacgata tgtaatactc ttgcgagcac gaacatcgaa actaaagcta tcgaaatggc 47340 tgatattcta aataaaaacc cggaatattg tcattggttt gcgttttatc tcgtgaagaa 47400 tagggcatct aaggaaaaga ataaccattc tacatatatc aacttcttaa ttaagttgga 47460 taaactgatg ccaaaaaatt cagatttact gctcgaggaa aaaactgagt cggatattcc 47520 attaactcat aaaggaaatg aggaaagtaa aataaacatc attgaaatta ctactcttgc 47580 cagttatgac tgcattaagg ctttgctgcg atatgcaagt atattgaatg aggtctcctc 47640 attcttaaat gtactacgtc atttaggata ctggctaggg caaattacaa taggaattaa 47700 ccgtccgatt atccacaaat acctaaatcc cagacaattg cttatagata gctattctag 47760 aggttgcata gcctctgtcc taccttttat ttgtaaaata ttggagaatg ttaaaggagg 47820 ttactattat ccgccaaacc cttggacaaa caatatttta tatgcactag ccgaaattca 47880 ttcgcttgcg aataattcaa actctcatat gttcgaggta gaacttttat ttaagcaact 47940 ggagcttaat ctagatgatt atgttggtaa gtcaaactac ctaggtttga gcagccatac 48000 ggactatact gagcataagg ctctgggcga aaaacaaaga ggtcataata tttatccaaa 48060 gacccagaca gagcataatt ctcacataac tttaggaagc tcatttgagc gcccaaatat 48120 tacaaataat gtaattaaca gctctttaaa ccaatctgct ggcttatatc aactaagtgc 48180 aaatattgga gatgcacaac ttgcttcgac cttcatgcca cctaatcatt catcccaaat 48240 gatgcatcaa caaactccac agcagatacc atcatccgac atacaatttt gggccaacaa 48300 agtactaatt tctccttcta ttgtattatt tcagatacaa ccatcgctca gacctttggt 48360 accgctagct ttagacagat caataagaga aatactacag gttgtgattc caaggtcagt 48420 tagaattgcc gcaattacta ctaaagaaat tatcggtaaa gaatttgctt ttgaggctga 48480 tgaaaacatc tataaaagag cagcccatct gatggttgcc gctttatctg gctctatggc 48540 tattgcagca tgtcgtgaac cccttagagt tgcatttact gctcaactaa gacaagtatt 48600 acacccaaca ccatcgagag atggtgagga ccatgtatta attgaacaag ttgtccaagt 48660 catttgcagt gataatattg atttaggttg ccaaatcatc gaacaggccg tggttgaaaa 48720 agctattgaa gaactagacg aagttatctc tccaggaatt attgcaagga ggaaatccag 48780 agaaaccggg catcaattcg ttgatacaga cttttatgga ggaccgaaca ctcagaattc 48840 cgctactttt tggtcatcac tacctgaaaa tttaaaatac agacataatt ctatgagaca 48900 tttgcagctt tataaagatt tcttacaatt caccttaatg agaaatttgg aaaggaggga 48960 ttctgttacc caatatgagc tacagaatag tttacaatca aatcaaataa catctctata 49020 tcagcatgga agtaatgacc aatttaatag ccaaacccaa caatggaata attctaatgc 49080 aattcaattt tcacaccaag ccgaaagtat ccaaaatttc aatacagtta ggagtgacaa 49140 tacaagctcc caaatgcctt cttctaatca ttctcaaatg aatacctcgc cttcaactat 49200 tgtacaacct ccggaacctg taagggtgcc tcttgttttt gaattggcat atctgcctct 49260 aatgatgcgt gttgacgaat gtttgggtca aataaaggat gtaatacgcg agattgccct 49320 atatcctccg attttctcta aacagctgat tcctccagta agcaataatt taagcgaagg 49380 aatgtctgtt aaccagaata tatactcaaa acctctcggg tctaatattt ttacatatac 49440 tccgaaatca accgcacatc cagttctttc tgtgttatct tctctccaat ctgaccacat 49500 tttattttat ttatgcagag tattatattc tattggaaaa tctgcgtcac aaagagaaga 49560 tgttttgatt gggatctctc aaaaattgtt taaaacactt tttgatgcag gcgccgcctt 49620 tcagcagagt acaacaggta tacttccctc gagtagatgt attgcctctt cacttggttt 49680 cgatgcagct ttgttacata tcgaggtatt tctggcgcta tgtaatcaaa tttcgtatta 49740 tagctcaaaa ttttggctga agcttaggaa agaggcaata ggatggttta tttacactat 49800 agaggatcca aaatattccg ttgatattgt aattggtgca ttaagatatg atcttatctc 49860 ctcagatgaa cttgatgttt ctctatcaaa cattttagaa actgcaattt ctacgcttaa 49920 tgactctaat ccagcaatcg gtggaaatag tagatgtttg agaattgtcg agtttattta 49980 caaactgttt ttcagaagta ttgaagattg gcactaccca attactaaga agttaccaag 50040 cgctactaaa aacttgaatc gattatcaaa taacagtgta gccttccaaa acagtaattt 50100 ttctgcaata ccaatagtgc taccagggct ttattataag ccatactctt atacaactaa 50160 tcttggagaa cttaagaata aagtagaaag tattctattg gaattggaat ccaacaagtc 50220 aattaagttt catgaaatgt ggataggtga ccaatgtcca gagattttga acttttactc 50280 agtaattcag tgtaatttgg atacaatatt gaacccgaga tatattgcgt tacctactcc 50340 tattaaacca cctcctgata tttcgaaagg aatcaacacg atttttgatg agtggatatt 50400 gctattgaga ataacaattt ttaatggagt tggaggttcc gagcgtaata atccatacag 50460 aaatttattt cttcaaaggc tttctaggca aggtctacta agaatggatg acactacaga 50520 aaaattattt actgcttgta ttgagagagc aatctattta agcttaaatc acaatagcag 50580 tgactcagac gccttgaata actctatttc ggaaaatgca aatgactcac gtaataatat 50640 ggatccgttc ccgatagatt cattggttag attgattaca actatggcaa ggtatgtaga 50700 tccacagcaa atggctgcag tagtaataac tcacaaattt ctttcaatac ttactagagt 50760 tattcataaa gatgcagagt cacatggctt taatcaaagg ccatattata ggatttttta 50820 ctctttgtta caagaatatg aaagtatagg attcaatact gaaatgatac actttacctg 50880 cattctaagt gtagttcatc accttcaata tttaaatcct aatagagtcc ccggatttgc 50940 atattcttgg atccagatta tctcaagtaa ccgattcttc ccgtacctac tacgccatgt 51000 aaaaggttgg caaccttatc aagctctatt attgcaaatc ttcattttta tatcgccatt 51060 cttgagaagt gtacagctat caagtaacat taagacaatt tatggtgcgc tactccgtat 51120 ccttctggta ttactacacg actttcctga gttcctttgt gactacagtt gcagtttctg 51180 cgatgtcctg cctgttaact gcatacaaat tagaaaccta atcttatccg cattcccaag 51240 aaatatgaaa ctgcctgatc catttttacc caccttgaaa attggaaatc ttcccgaaat 51300 gaagctaata cccagaatga ttgcaaatta tggtgcttac attctataca aggatctaaa 51360 ggtaaatata gataaattct ggattactag agatgcatca atccttcccc tcataacaga 51420 aactatcaaa atgccaaggg atgaagcact taaatgtggt actaaatact ctttcccgat 51480 aataacaggt ttattgcttt atattgggat atacctcccg aatggaaatg aatctaattc 51540 aagcattgac ggttctcata atgggatatt taatatcttt aattctgatc catcaactaa 51600 ctcgattgaa tctgcatcga aactggatca aacaccaaac ataaaaagtg atcaattaga 51660 aacttttgaa gatccgtccc tttcaataat tctatttctt tgtaaggacc tggatatgga 51720 agggcgtttt gtcctcatat ctgcaatgac aaattttctt ggatatccta attcttacac 51780 atactacttt agctcactta ttttatggtt gttttccaaa agcaatgact ctattgtgca 51840 agaacaaatc acaaggatac ttctagaaag gcttatcgtg cacaggccac atccatgggg 51900 tcttctaata acatttattg agttgattaa gaatccaaaa tacgcatttt ggagctgttc 51960 atttgtgcat ctggctccgg aggttgagaa gctctttcag tctgttgctc aaacttgttt 52020 aggtcaagcc cctaataaaa cgaatttggt gaaccataca taagttttat ttattaagct 52080 atttaattag agttagataa tatatatgtt gatattattt gttttaatat ttttgcaaca 52140 attagtgaac ttgatcttac cctaatttca accggctctt cattttttcc attatcatga 52200 atcttgtctt ctctaaaaga ttcgctgtgt tcattacaag gtattatttt aaacatgcaa 52260 ggtattcttt tgcgattaga cagatacaaa gtggatgttc ctccataaat cattggagtt 52320 gaaacgcttg gatcgtcaat actatccatc acactatata cgcaaaagtt tagcagttgt 52380 agaaagattg gaatttcgag agaagaaatc cctaagatcc ctttcactga agattgagtg 52440 agtttttccc aaaaatttcc aaactctttc ttgcccattt ttttagttgg agccataaag 52500 tttgtaagta ttactggaag cctaaaattt atacaaataa gctttgaata attactatct 52560 gtgttaattt cttccatttg ttcatttatt tccgaaactt catcttttgt atttgagaaa 52620 gctcttatta gtagtgaaat cattggagga ttcaaataag ggccattgca aattaaatta 52680 atcctttgct caaattttcc ttcattatca aaacctttat tagtttcact gaaaacttca 52740 acatgaaggt tttgcgatgt agaatcaaat atatttggaa atgaattatt agaacttttt 52800 acagacaatt ttttctcctc gcaaaaagta gaaatacgct ctatgttaaa tatcctattc 52860 ttctcatttt tccccaattt cacaataatt gttgactctc cagaggaata tttaaagctc 52920 ccgtttgaaa atccaattga taataggcta aaattatata aacttccttt atttgataga 52980 cataaggaaa gccaggtatt tctactgcaa tatccatcat cgccatattt atttgattgg 53040 ttgggtatta aaaggctatt ttttttgtga ttaaatactc catccaataa ctgatctcta 53100 tatctcaaag ctgaattatt ttctgatggt atgtttaccg gagttatatt cccatttata 53160 ggggtaacac tatattgtgt atttgcaaca attgcattta aaaattcggg tgagcctata 53220 caattacctc tgtttccaac accgattttc agaatttcgt agattcgatt ttgggtctca 53280 tgatcagagt tacaataaag cttcgtaaaa gaaagtaaaa tagtactttt taatagtgga 53340 atatgatcat cttcagaatc gatagttaat atatcatgaa taatcaacaa tatttcaact 53400 tgcttaataa ttgatacttt acttgatatc agctttccat attcacctaa aataaaacat 53460 aaccatctta ttccagatct tgggttgaaa ttatctcttc gattttttaa tttatttaat 53520 aatttatatg attttattgc tacatatttt tgagtactta tatcatttcc aagtgattca 53580 ttcactttct ccattaaatc agtgcattca attttcaatt tagaatcaaa tagaatatct 53640 gttatctgaa aaagagtatt ttctgtaatt tcaaatcctg gatttgggtt acagaaagac 53700 ttctcgagag tttgaaataa tacattaatt gtcttatatg aaacttcatt tttgtttgaa 53760 aatcttctaa ttgtatagca aactgataaa attatctctt ccaataatgt ttttgtcttt 53820 tctggaaaaa aacaatcact tgaacctttt tgtatgttat gtatttcttc aatatttaat 53880 ttataagggt ttaatatata cttgtaaaga actccattaa gaatttcttt tgtaataaaa 53940 acccaattat ttttgttgca gatataagaa aaaatattta atagattcaa agttgtatct 54000 tcatcaaaag agcatgtaaa tctcaacaat gtaaccaaat ttttttttac taattcttga 54060 attgccttat ttgttttcga ttcaaaaatt agtgaaatag aaatagaaat ataatctttg 54120 ttatccgcat atagtaaacc ttcaataaat ttccccacta aatgaccaat attagcagat 54180 aattcttgat tacaaatctt gttgaaatat tttgtcatct caattgcaat accaattata 54240 catagaaact caatttcatc aaaggtagaa ctaaaactta aattacattc atttactgat 54300 ttaattgcat ggattgcatt aaaaaatacg tcttccatta tcttattcat cttgtaattt 54360 acaaaatagt cattaatcac ttcagggaac aattctaaaa tctctattaa tttaacctga 54420 agccaaaaca ttggaatttt atgaaatctc caatttttta tttctttgta taacctaatt 54480 ctcgataacg taaatattaa atttggaact ataaattccc aaatctttac taatttaata 54540 tttaaattag agttgattat ttctctttcg tctcttaata cggaatgatc tgtttcttgt 54600 tcatttaaat ttttgtaggc attttctgtt ggttccaagt actcagtgtt taaataatac 54660 ttcaaactat tctttataaa aacgcattga gaaattagac agtcagcatt cctttcgaat 54720 tggaaatatg ataatagtct ttcaccccac tcgtttgccc tgagccgatc agggcaacat 54780 tgaaataact tcgttaaaca acaaattgcc ttcgatctga taatatttgt gtcattaagt 54840 tgtttttcaa ctggtatttc tgcgagtttc tttatgtcga taaaaagatt gtccgcaaag 54900 tctaaagttg gagaattccc tatgaaattc aatgctaata agcaattttt gattttttgt 54960 tttctaatat cgatgttttt acctgaatga ttttttccta ttcctactga aaattgtcta 55020 aaattccgtg atttatcaga acctttttct ttgtctacta atacttcaaa gcaattattt 55080 aagtcatttt ttattgtatt tattaataat ctaagtaatt ctaaatttcc cctgtaaatt 55140 aatgaagcag cgatataacc acattgctta aattcgaaaa tatttgagct aaccaactct 55200 aaaatttcaa gccatccaaa gtcaatttca taccccatta cgcttatgta tgctaatctc 55260 caaattatct tttctttatc cttgagagat aattgaggac cttttttgga gctatcatct 55320 gtcttctcca aaatagattt tagattagaa atctcatatt caactaattc catattatat 55380 ttatacaaat gaatatgaga atattattta tttatattaa caccatactc ttttagatca 55440 accaccaatt tgtgagggcc gtaataaaag catttttttt tttttttact gcgggtagaa 55500 atgctttatg tatatattaa attatcgata aagatttaaa taaatagaat ttctttacaa 55560 atatttgtat ttcactcaat cactgtaact tgataaacac cctatttttt gtgcgtgatt 55620 tagttacttt ttgccttctt taaaaatagt ctagtaattt cactattcta tagcaagtat 55680 ttaactgata catagattta tattcatttt ctatttgaaa ttttttgaca ataattattt 55740 agcatgatta gtcctaagaa ttttttatcg aaaataaaat attgaaaatg atttgctttc 55800 attcaaaaaa taataataaa tagtacttct gaatgtcatt gaaggttatg atttcaagtt 55860 ttaaactaat tgttaatttt ctttatcgaa tcataaatag tataataact acttctaatt 55920 tctatattta atcttttgaa taatatatgt aaaacaaatt aaatttatat ataaaaacta 55980 agtaatcttt aatatctgtt taagtttaaa ctccatcgaa tcttcaattt ttttttcgca 56040 ttctggtacg ttggagattt catcattctt ttttatctta ttctctatgt ttcttgaatt 56100 taggtagtct agcccatttt tctttttatt gatattaatt gacagcaatg actttaattc 56160 aacagctaag ggatgctcat ttgaattaat atgtcttatt tcagttttat tcttcgtctt 56220 tcggcatctt aaatctgagt cattacaatt attattatta ttattattta cttcagagct 56280 attttgttta tctgtgattg atatttcttt ctttcccatt tgctttaaac tgctattttg 56340 atttttcagc caataataac tataattatc acattttttt ttgctttttt ccgctttatt 56400 attaaactgc aaatcaaagt ttaatatttg tttacttatt atttctaaat gattttcatc 56460 tttaagattt tgaatacatg gtagcaaagg aatacactct gaagatttaa cactaatgcc 56520 tcttaattta gagcacatac cgtttaaatt gtttgcatta atttgagttt catccaacaa 56580 aatatcaata attgttgacg gataatattc cttgttaact tttactccac agcatacaga 56640 tacaactgtt ccaactgttc caaagggcaa caatccatca tttgtattta tataaactac 56700 cctagtcccc aaagggtacc caaatcttga aattaattta tttatgccag agttaaactg 56760 attctctgac tgataaaaat gctcattcct acataatttc gacaaatcta tatatttatc 56820 atttgtctcg atttgtgaat ttagaattag tctttccaat ctagaaattg ttgaaggaag 56880 cattgttgct gaattagaac gtgaagataa aaatggtaaa tttctatata atttaccctt 56940 aattatcttt atcattacat taaatttgaa gtcttgatct tttgaatcaa tcaaatccga 57000 gaaaatatcc ttgacatgta ttttctttaa gttaaatacg ttttcaacct tttcattgtt 57060 attgtcaata ttaaatgtgt ttgagcaaat attttttcct tcagaattta ttatgtgttc 57120 atctaactct ccctggtcca ataataccaa acttttaaaa atagccaaaa caaaacaagg 57180 gaaaaggtct aaatattcta atattgattt tattgcttcg ttgctaacaa taggagtccc 57240 aagttccaca ttagtattcg atgatgtaat aaatgaattt tcatctacga atttactata 57300 taaaggtaaa taaagcttat tttttaattt attttcaaat ttaaaaagat tcatcgatat 57360 ttcttgcaaa gtactatcgt tgcacttcac ttgaattgga ccaataatta gtttcttaag 57420 aaataaggtt gaaaagcgca tatactcaag tgatttagta gcatctaatc cgcattttgt 57480 gtttaagcaa tttaacaagg gtttgtatat tatttttgca acatcctcaa tagaaaacca 57540 ttgtattgaa gataagtctt ccaaagcata ttcaaaaatt ctcttttgaa gttgttccac 57600 tttattcttg ttttttggaa ttcgaaatag tactaatctt tgagcttcat ctacaattgt 57660 gccgatatat ccatagtttg gattttgttt attaatgcaa aggagctttt cacctaaatt 57720 tattttacta atctgcttaa aacgttcagc ctcactctct tttaaatctg aatacaataa 57780 tattagtggt agcaaatagt gttttgatat ttttgagaag ttaaattcaa ccttgccact 57840 ctcagtaatg taactgctct cagcaactct agcctcaaca agacaattta ctggatcaga 57900 tatatccttc agttcagcat caaaatcttt caagtctaaa acttctttgc aattaaccaa 57960 ctttgaaatg gaaagcaatt cttcatcaat attcattgaa tttgaggggg aaagaacaat 58020 tcctcgtttc tttagaagtg aattttcatt aaataataaa cgaataaact ctttaggatt 58080 gttgttaggt tcaataatag gttttgtcag cagcttcggt cttagaatat atctcaaagt 58140 ttttattgaa acaagctcag caaccttcat tctaggatag tcaaccaata ttcttatacc 58200 tggatttctt gtattaatgg aaggtagaaa gagattattt aaaaactcct ttgaaatatg 58260 agatgaagat attgaggaaa aaattgattc attactactt tcagtcccaa aaacacttat 58320 cccagaacta taataaattc tataaggaat tctgcaaagg tttgggaacc aatcgaaagg 58380 taatattgta ttatctagta aataattagg gaagtttgat acaccgtcag ggaatgccgg 58440 atgataaaac gattttgaaa ctaccttcga gttagtaatg gaaggtaagt ataatttcac 58500 tgtgctttca attttaggag taggaaatgc atagttatcc aaatagaaaa taaatggtct 58560 cccttcctga tttctggctt tatcttcggt atctaaaaac tccgaactat tgataattga 58620 tgaatgagta tttgtattag catcaattaa attttctaat tcacttgtac ttaaataaaa 58680 tttataccga tttttcagtt catattcaaa attagtcttt attttgattt caaaatttct 58740 aaaatcagag tcataaaaac cacattttga aagaacatat gtaattgaat ccaatagaag 58800 gaactcatca ataaaaggaa ttaatgttac gccaccccaa ggaaccttga cgccttccat 58860 atcaacttca aagttagccg gataaaaact caataatggt gagtttggat tagtaaacag 58920 cttcctaaat ggctttggta aaagatcctt gctatttgaa ggaagaaccc ccattagttg 58980 ctcaaacggt tgcaatggat ttcctttaat aaaccgaaat gctaacccat taagctcgat 59040 ttcctccgaa tacattgtgt gaacaaacat caatttagaa gaaccatatt taacaatatc 59100 atctctactc aatgttgaaa tactcttatt ttgtagccaa caacttaaaa taatggcaat 59160 atcacatgca aatggcgcat acctataagg gtaataccat ctccatgaag gaacacctct 59220 aaaataataa tagctgaccc attgcaaacc ctcaaggtaa caaaaaacaa tatcttcaat 59280 agaattagtt gttatcctat tggaattcgt gttagattta atatttgctt gattttgaac 59340 atttgaattt tctgggaaaa acttatcgat atttattgcc atttttacaa agtaatatct 59400 ccaacgcata acttcaaaat tttcgggtct agtttcttgc cactttgccc taaatttatc 59460 ttgttctata gagtctgatt tcataaacct tggaacacct attgaatctg aactaatttt 59520 gttttggtct gggattacct ttttccaata atctggatta gaaatacgtt tttcagcttc 59580 ttctttctcc gaaataatat gccaaattag aaaccgtaac aaatttacat aattgattct 59640 accacaatct tcgagtatcc atggagaact actcaacttc attaaaaaaa aatgagccaa 59700 atactttcta taactactta ttatccttgc tagtccttga tcaacagtgt ggaatgggat 59760 atgagggaga aaatcatttc caacaatgaa gcctaagata ataaaatcgt caataaggcg 59820 ttctccatct attaaaacgt tatcgaaaat actaatttga tcctgtgaat catttttcag 59880 tccaaacttg tatccgaaac ataaatttaa ctcctcaaca cttaaatttc taggatttaa 59940 atcatttata atataatctc ttaaaataga gatatgcaga aattgaaacc tttcctttgt 60000 gcaaaccgtt ctactctcat aatttcttga acttgaaaat tttatttctt ccctcaacaa 60060 agaaaaatgt ggttcatgag aagcaagaga aagcattatc aaatcagcat caaggccata 60120 caagcaatga gtcgtgttgt tattatagtc tctttgggat ttaatacaac gtatgaaatc 60180 cattatttta tgttcgcctt caccaggtac gtcagctcca cttaaaacaa cttctaaatg 60240 tttccataat ggatcagtat gtatttgatg gtaaataaaa aattccagtt gtctccgcaa 60300 ctcatgcata aaagtcgtac caggagttat acaattcgaa tcaaaaacgt tattttgtgt 60360 tgtatcaata gaattatttt tctccatttt ctttagaaac tctgaatccc ttgcagacct 60420 aaatcttcta cttctttgtt gattcatttt agctctagga gcaaccccat caactgcaat 60480 ataaagtaat ttcctaggtt tagctatgta taccaactta tttatatatc taaaaatagc 60540 agcccaaatt tctggagacc ctttttcaga taaattaatc ccattacttg gagtgcgcat 60600 ttcactagaa ttcacactat tgtgagcaat tccatttaca tccaaatata aattatcaaa 60660 tggtggtata attccatctg atatttcctc attaatttga ggataccttt ctgatatcca 60720 tctatagaat ctcgagattc ccattttgaa actaacattt ataaataact taaaagaaaa 60780 aaattaaatc ctcccgccaa atcaaatttc tgtctaattg attttttttc ttagatcaac 60840 tcaaaaacga tcaaaataag aatatgctaa aaatctaaga acgttagata caataccatc 60900 ttttagttgt attttaataa aattttaaca ttttttaata gaatagattt tagaaattat 60960 taatcctctt aagtatgaaa taagaaacac attgaatata tgaaaccctt ttctatgtaa 61020 gaaagcgact tggcaaacga aactcggagt tgtagaggtg ggatggccga gcggtctaag 61080 gcgctagttc gaggtgctag ttttcgcggg ttcgaatccc gttcccatca ctttttgaca 61140 ctggaaaaca aagaaaagtt aaaattctat cagattaaac tatcaaagta gtttggcctt 61200 ttctagataa ttttttgata atagatgttt ttagcgataa taaaattatt attcattgaa 61260 attcagattt atgcagcgtt acaagttggc ccaatacatg ttgtttcata tgacacaatt 61320 ggggtcgtta taactttctt gacgcaacct gtactagttt tttctgtatt gtctgggcat 61380 ctactatctg gttttgttac caaaacgcac tccttattgt caaacaaagt atatggatca 61440 ttgcatgtta atgagaatgg agcatattca acaatatagc atgtgccatc ttgatcggta 61500 acatacccag atggacaact ggaaactact ggtgtagatt ctatcttttc acaaccaccg 61560 gatgggcaca ccttaggtgc gaatcttgcg catctatcac ccaatagttg actatcagct 61620 gtacagctta atgaatatgg agcaaaaact tctcttgaac aaacagttgt cctcacaatg 61680 cttacttcct gagtaactgc atgatgatga tgatgatgtc ctagagcgtg atgatgtcca 61740 gaactatgat gtggatgatt tgaagctgtt cctaaagacc ttctggttcc ttgggcaact 61800 ccagcctcca ttgtgtaacc cggtgggcaa acaagctctg gtgggaatgt tcttattctc 61860 tcgcaatcgc catttggaca gactttatct gcatattttg cgcacttatc gctgataact 61920 attaatccag gtgggcatgc caattcataa tctgcatatt cattagcaat acacctctgt 61980 ccacttctct caaagttttg aggacaagag agaattggtg gtatataact gattctttca 62040 caattgttgt ttggacaaac ctttgctgtt gattgttggc acatatctcc caataatctg 62100 aatccatctg gacattgtag tgcatatggc tcataaacat ttctcataca ttgacttgct 62160 ccactggctt tatgaccatg ggatctgatt gcaacaccag cctgaggtct ataatatcca 62220 ggtgggcact ccatcgttgg tggtgaacga atgagttgtt cgcagtttcc atctgggcaa 62280 actttaacag tatggatcgc gcatttatct ccgattattg aataattatc tggacaagaa 62340 aggtcataag gtgtatatat ttgttttgta cactcttgta ataactggtt tccatgtcca 62400 tgtccatgtc ctgcatgatc tgaatgatga tttgattgtg gtctagtaaa tccaggtggg 62460 cataccatat tggcaggttt acttatcaat ctctcgcagt taccatttgg acagatttta 62520 tttgtaaaca atgcacatct atcccctaat aacacgaacc ctgatgaaca tactaaacta 62580 tactcttcaa aaatagttct aacacattct gctgaaggtg ttgaataatg atgtgttccg 62640 gcggtatgct tgcttgaaat tgctgtttga attccgttgg catttctgta accttgtgga 62700 catgaaagag ttggtggaac gacgacttgt tttctgcagt caccttgtag acaaattttt 62760 tctgtttttt cgatacattt gtcgccatct aaaataaatg gagattcgca ttttaaaata 62820 taaggtactg aaattgtctt aatgcactca ggggaagcta gatttctgtg atgatgatga 62880 tgatgatgat gatgtccttg ataaattgtg ccggtttggt aaactggagt tggataagtt 62940 gttaaaattt cctgtgtttg atgaattact tgtggctgat aatttacttg tctttgttga 63000 attacttgtg gttggttaat agtttgtggt tggacgggaa cagaaacatt tggatggtta 63060 atatgagaat ggcttattgg ctgaattggt ttagatgatg agaatcctgg agggcatact 63120 aattcagcag gctcattata tatacgctcg caattaccat ttggacatat tttatccgaa 63180 tatgtaacac atttatccgc tacaagtctt gatccaacag gacagataag attatatgga 63240 gcatggtctg catggatgca tttcatttgc tgaataggag caacaactgg aactggttgt 63300 ggtggctgag aaattacagg agtatattta tgtccatgag aatatccctt ttgagaagta 63360 gattgaaccc cagctaagtg tctgtggtgt ggatgagcag aaatattcat aacttcatct 63420 atttggtgat atccaggggg acaagttaaa gaaacaggcg cagttacaac tctttggcaa 63480 ctttcattcg ggcaaacttt ttctcttgtg gtagcgcatt tgtctccaac caatgcatat 63540 cctgcagggc atgcaagatc atatggtgaa tatttgttaa cttgacattg agatcctaca 63600 tctacagaac ctggcggaca ttttaaagta ggtggagttt ttacaaactt agcacactct 63660 tttgatcttg ttccatcttg acaaattgct tctaatgggt agtaatttgt taatacacat 63720 tcttttccta caagttcagc attgttctcg caaataggca cagctgggac agaatcagct 63780 tttgtacaag tttttggggt aatttttgga acatcttcac aaattggcct tggagcctct 63840 ttatgatgtt gtgtatgtat attctgggtt tgaattggta taacttgtga atgtgaatgc 63900 ccatgagaat gaccatgatg atgtcctgga ttttgtatct cagatgtttc ataaactgtt 63960 gatgtttctt ttgcagctgc taatcttctt agtgctttaa catcaggtgt tttacactct 64020 gtaacataag ttgttactga acagtcatat tggcttcttc taacacattg tttacctgtt 64080 tccaatgaat aaccaggtgg gcattccaaa acaggcgaac taatcgatct ctctacgcaa 64140 acgtccccct ctctagtacc aacaggacaa acgtattgaa gagccccggg aacaagtctt 64200 gtgcacgagt ctccatccaa cgcagaacct ggaggacatt ctggtgtttt tggtgaggta 64260 gttgtttgaa cacactgttg attttgtaga gtgaatccag taggacatac agttgaagca 64320 ggagaaaatt ggatacaatc gtctccattt ggtaaaaatc catctgggca gatagcatcg 64380 atcattttta gtgcagtaca ttgatttcct tgaagtttat atcctggtgg acattctgaa 64440 acaggaggtg cttcttttat ttgctcgcat tgggatccag aaagtgtata tccaggtggg 64500 cattctctgt ttggtgcgga gaaagaagca caatttgtac cattatcaat aaaaccagtt 64560 ggacatacca tatttatcct ttcaactttt tggcattgtc catttaccag gtttgtacct 64620 ggaggacact ctggttgtgg tggagcttga gtagtttggg tacattgttt tccaattaat 64680 acaaaacctg ttggacactc cttttctggt tgagaaaact gaacacagtc tgatccattg 64740 tcgagatatc caggtggaca aatggcatct tctagttcat atgaaataca actgttattt 64800 tccaaggttg tacctggtgg gcattctgtt aatcttggag cagattctgt tttaacacat 64860 tgttttccgg aaagagaaaa tccttggggg caaattttct ctggtgcaga aaattggaca 64920 caatcttcgc cattatctac aaaccctggt ggacaaattg tatcaactct ttctatcgat 64980 tgacattgat catcttttaa aattgtacct ggtgggcatt cgaattccat gggagcattt 65040 accatttgaa cgcacctgtt cccactcaat gtaaacccgg gtgggcattc ctttgtgact 65100 ggagaatatt ttacacaatc ttttccattg tctaaaaatt ctggtggaca gatagcatcg 65160 attgcttgga ctgaaataca ggcattatct tgtaatactg tacctggagg gcagacaggt 65220 tgagttggag cagaactagt ttgtatacat tggaggcctt gtaaaatgaa atttggtggg 65280 cattcctttg caggagctac atatagtaca caatcatctc ctgagtctgt atatcctggt 65340 gggcagacca tatcaatatt tttaatcact ttacattttc cattttcaaa tattgaatta 65400 ggtgggcatg tcgattctaa ttcagctgat tctggtgcca tacattgttg tcctgacaaa 65460 ttgaatccag gaggacagat tttatttgca gggagatatt gaacacatct atttccttct 65520 tcaacaaaac cagaaggaca aacggtatca atttgttgaa ttaatttaca tgtgccattc 65580 tccagtatag tgcctggagg acattctgga ttaggaggag ctgtgtctga ttgaacacat 65640 tgttttccag aaaaaacgaa tcctggggga catgattttt caggcatagt gaatgcaaca 65700 caatctcttc cattatctac gaaaccaggg ggacatactg tatctatcat atgtttagag 65760 atacactctg ttccatgtaa tgtagtacct ggaggacaag atggttgttg agaagcggtt 65820 ttagtagcag tacattgctt tcctgatatt tcgtatcctg gtggacattg ctttagtggt 65880 gctgttacta ttaaacaatg cccctcgtga tagattgctg gttcctgaca gaaaggcata 65940 ggaggaacct ggatacgttg gacacaaact ccattctcca ttaaataccc tggagggcag 66000 ctttctagcg gagtttcaac atacgtaggt acctggcctt ttgtgtgtgg caaaggggaa 66060 gctacaccag gaatgttggg tttaacctga ggcgcaccaa atgcaccaat aataaacgac 66120 aataataaaa tacgtttcat aataaaagaa aaaactattt aatgaaattg acctaataat 66180 taaattaggg cagatattca aatagtaatt gaaattatta ggtaaaaaat tgttgattga 66240 ggaaaaacaa taattcgagg atacacctaa cgcatatacg atacaattta cacaaattcg 66300 tgagtgctca tattagagtt taatctaaat tgttttaaag aaacatatga taataaacta 66360 tcgctttgca ataaattaaa atattgaact ttcgttggaa tgtgtttttt caataatttt 66420 tgatcggatt gtaaatttat tggataaaac caatcaattt tttgggcgca attttcagag 66480 atggattaca ttaacattaa atattatgaa tactgacata cttgaaagta gaaataatag 66540 aaggccaaga tatgtttcaa ttagcttaaa taatgatgaa ttaacacatt tttgtccatt 66600 cttatctgat aatgatgaaa aagtcctgtt tacgttgaca tgcaattatc tttctttggg 66660 acaatttgag ttagcaagat caagcatact tcagctaagt tcattaaact ttagaaaggt 66720 gtctgagcta ctttattcca taatttatta tggtcctccc ccagactggc agctttctgt 66780 cacaatttct acatcagcac attttatact tgcatgcata agagaatatg aatcattttt 66840 cagaaataat actaaaattg aaaactacat aattaaaagg actgagtttg atttattaat 66900 tgggcaaatg atagcggata tctccgatgt taaaattagt tttgaaataa ttaagaaatt 66960 aagaaatttt tattccgttt ttttaattaa tggacttgac gtcaaagttc cagaactgaa 67020 aatcttgcca aaaattgttg gattaccttc ttatcttagc aggcttcctt cattttcgac 67080 aaatttgaac aaattatctg aaaatttatt gattgacttg gcaaataatc aaggaaagta 67140 tctttcgatt aaaataatta atgaaatgtt tgaatgtttt aatacatctc ggagatattt 67200 tattaaaatc gcaaaaatac taattatgga tcattcgaat acattaaaat tcgagactag 67260 tttgatcaaa gacaactcaa tacttaaaaa tacattatct tctatcaatg ttagcttaaa 67320 ttggattgga ttgaagaatt gcattattga tctgattttt ttcaaggaaa attgtggaat 67380 aatatcctta aattcgcttg gagatatgat ttcaaaaaat atttcaataa ttaaattaga 67440 agatattttc gaaaatcatg atttagagct tactaatttt acatatgaaa aagatgtttc 67500 atcaaaaaac aactttacta aaaataaaaa gccagaggat gttatttcaa tagtatttat 67560 gatattttgc ataataaatc tattttgtaa atataaactg aacaaggaga tttttcttaa 67620 gtttttagct gaccaaattg aaattttaaa gtcttgtgaa gatatttgta acacatgtat 67680 atttgactct agaataattt caaatgttaa aagtctattt tcagaaaatc ttcttaagca 67740 attaagtaat tttcaaagca ttcatcaata cttaaatgat ataaatacag ttaaattttt 67800 cgaaaatgat acttattctg atcatattgg aacacttgtt aattcaaatt ccgaccaaaa 67860 tttattgttt aaaattgttt tggaatttga caatattttg tttaatatta attttggaat 67920 aagtaattct ttgggaattt tttcattacc aagatatatt gagaataaaa ataaagttac 67980 tttaaatcct attttaactt acgatttttc aataaattgt aaatttgatg aaaagccttt 68040 tttttggtgc gaatatttaa ggtatttaaa tttatctaga aataagtttg aagagctgcc 68100 cattattttt ataacaaatt tatttgacaa gatttataaa accaaaagaa atattaattc 68160 ttttgagtct gcaaataaaa ttattctttg tttcccacat ttaagagcaa tttgcgttca 68220 tcttgggatg aaaaacgatc ctatttttaa ttgggaacta cttaagaaca tctggttgcc 68280 atttagatta attagcaaaa acgaagaatc acttgttgag agcaaatctt tggaactcga 68340 tctagatgaa atatcaagaa gacatgcaat ttccatgttt atttcggaag aacttaattc 68400 gaatttttgc agcgagttta gatatagttc aaacaaaaat attagaaaaa tatttgaaga 68460 aataactctt aaaaaatcaa ttataaatta ttattttgat aattttaact cttccaaaaa 68520 gtacacaaca ataaattggg tttcactgga agcatttatt tcaaacataa tttctcttcc 68580 actaacaatt ggtagtgata tttcgttatt aacaaaagaa agagactgca taaaatcata 68640 cttaatacaa aaagaagttt tcgagacaat tgatggaaag ggaagttcaa gctttgggtt 68700 aaatttacag aacacatcta ttgaaattgg aaatattcaa actgaatact ttcataatga 68760 attaatgtat tgtttttttt gccgaatttt tatttattta aagaaaatta aaataaacaa 68820 ttcgattatt catactaaaa acaaaaaatt attcaacatc acacaattgt tgtactactt 68880 atttatttgg agtgatagat tatccagaaa ttttcaaatt gaaatgcttc cagttattga 68940 aaaaactttt gaaaaaacaa ttttgatgat tgaattgata ttattttcat ttccatttaa 69000 tattccaaac ttgccactac caaaaaattt ggaaattttt tcttggtggt atcgctttat 69060 tttatcaacc ccatcaagag tattcaatac agggaaaaca ataatccgct caagtaaaag 69120 ttcagaatat tttcaaaatt tgtttaaagt tagatgggtt attcgcggaa tttatgaaaa 69180 taatccatgg tattcaaatt tatttagttt tgaatcattt tgtgttttaa attattacaa 69240 caattcgact agaaataatt actcagaaat attacttttt ccttctattc ttattctaaa 69300 gctattatct attcagaact atgaaaaatc tattgaatta atattgagtt caaaatttca 69360 cgtaaatata acgctaatag ttctggacgg ttttattttc cattcactct ttgaggaaga 69420 agtaattaat aattattgca atattaacaa aatttatgga ataattaaaa acgcgaaaac 69480 tgaaagatta acgaaacttg atttattaaa tattaacaag atttctttat tttcggcaat 69540 ggaggtattt tctatatata ttcaaacaaa atttatcgat aatgaaaatg agataggctg 69600 gatattttat gcttttccta aactatatca agcattcgta ttgatagatc tctatatttg 69660 ttttggaaat aaaaattcct caaatttatt tgaagatata gcgaaggcta gagatatgat 69720 agaacagtac accagtaatt taaatattaa atacaagtca gtggttcaaa ctactttaga 69780 cagacttttt atattttatc agtcttccag cacaatttct ttaacaagat atataatgga 69840 gatagataca ttaccaaatc aaagttctca gataaaaact catttaagca gaattcatga 69900 tagaaaatta ataactacaa cgttaactca gtcattaaat tatttatctg gcttttcttt 69960 aattaaacca caagacgaaa gtttaattaa tattaatcaa attttggaac aaagtttatt 70020 ttcaaatttt aataactcta gctatctatt ttcagtaatt cattatataa agtctatcat 70080 ttcagagtta tttttaaatc tggataagca caaagcattt ccaattttag cattgactcc 70140 aaatgaaata atttcctact ctttctttga aagaaagtta aaaggaggta gtaagagact 70200 ttcgagaata atgaacgcag acttgatctc tgttataatt aaagctctta attgttttca 70260 ttgctcaaca tcaatgttca ggaacagtga aattagctca catagtgata tggaatttct 70320 aattaaacag cttgatccat ttaagggaac atttgcaatg ttgtcccaca atattttaca 70380 gtctaaagat gtacaatata gctattttat atggaaattc gccgtcaaga atcgaataaa 70440 ctttagtctt gaaaaaaaac aaattattct aatttcattg gaataccgta tctggaaata 70500 cgatatacat aagcagcttg aatctaaaat atttaatgcc aatatagata agttatctgg 70560 attctttaca tctaaatctt tggaatacta taattattta ttaaagtggc tatcgataat 70620 aattacattt ttttctctaa attgtaatga tttacatctt ataatgaata aatttgataa 70680 aactattggt ttaattaatt atttattccc aatttattct attagtcgta ctaatactgt 70740 tgacaaaatt ttgtttcagt tatccagaga aatgttgaga aatcaaaata taatcaacct 70800 acatttgaat aattttaaac gaataataaa attaaataat ccatattatc tttataggtt 70860 aataactaga gctcagggaa tatcagatcc atattttatt ttgtcaatca ttgaagttac 70920 tagcaaaagt ataaaaaata gctattctgg aattaaaaat tctaaatcaa ataatttcaa 70980 aagtaaattt attgaagaaa atatctatgt aattttaaat gtccaaaggc tattttgtat 71040 ttcatatatt aaacatttta aaaataatag cgatatatgg aatgagtggt tcattttgat 71100 taaaaagtgg gaaacaacga aaggtataat atcaatgttg aaatactcaa tagaaatgaa 71160 gatgttcagt atagcaaaaa ttatatctca acaattgcta tttaaagtta taaagcattg 71220 tggggatgtt caatatacag aaaaagaatt gaaacatttt ttttctgggg ataacttttt 71280 tgaagaaata ggaaaagata ataatttcaa taatttttct ttagtttcaa ttgtcaatta 71340 tattattaca aatattagag atgaacaaat ttttcagaaa tataacttta ttaaattatc 71400 taaagctgtt gtttcaaaat ggattggtag tttcgaaagt ataattggat ttttaaatag 71460 taatattggt atagatgaaa aatatattat tttggatgcc tatattaatt cacagaactc 71520 ttcattagat gccaatcgta tcaatatatt aaagactata ctgacttcca taaaattggt 71580 aaaagacctt aggatgagta atattcaagc gtccaattta ttttctccat atttaattat 71640 taggatggca attatgtcaa ggaatagtag attgattgaa atattaaaga gagatattga 71700 tatttttcta aaatccgggg atgttttatt aaatttaatt gaagagtcat ttggaattat 71760 tgacgataaa cagatactaa aggaaaagat aaaaaaatct tatatttttg ggcaagctga 71820 cataaattcg tgtataaaat tatttccaat gattgaaaat aatattaaat tatgtttaaa 71880 tatagaagat attccatcat acaacaatga tatatatttt tctatcaaac tgatttctat 71940 tatacctaaa aaaaaaacaa tgagtgatct tttatttagt atctctaaca agatatcaaa 72000 ccagctttac agaatattta gaaattctat ttcagataat tcattaatcc gcacaaggtt 72060 tgaatgtagt aattcgcagc tttgggttgc tgaggaaggt tcattaaaaa taattcctaa 72120 cagcctacat cctttttcca aattttcaaa agcaaaaaga aactcgtatt atatcaaaaa 72180 tcggtgtcta aataaatata agagcggatt aaatataaat ttagaaaaga gttataataa 72240 tagaaatgaa atcaacatag aaaatatttc aaaatcaaga attttattta gaaatttatt 72300 tgttttatta acttgggcaa gcgattatat tgggaaaagc gaatttagca aatcaattaa 72360 aagcttaaca catttatttg aaatttggca aaggaatcct gcattatcat tctctttgaa 72420 agattttgtg tatataagtg attatgttat tctagttcta gttttctctg accaaataaa 72480 gatattaaaa gatttgagtg aaaataatat tcttgaaaca aatatgaagc ctttagataa 72540 aaatcatgaa ccaaaaaatt taattatttg taacatagtt gcaaattcaa ttacaatttc 72600 aagaatgaac cagtattata acggaaaaag tgaccattca aatcccacta ccctaaatgc 72660 atctaaatcg agaaatgaaa tgataatgga tatattttgc aataaatgga agattaatag 72720 aggaaaagtg accaatattg aaattttaaa gggcaacaca gtgaggatta ttgaaaatct 72780 tttacttcta aagcctgata tcttgaattt aagtatttta ataataatat taacttacca 72840 ttcatgggga aagcttcaaa tgtatattag tgggcaaata aaacgtatgt tacactttcc 72900 atgcgcatca atattaaatt ttaaattttt caattcaata tttgaattat atgaacaatt 72960 atttatctca aatgttaaaa ttagaatgaa tgaaatatat acaatttcag atttagtttc 73020 aaactggaaa gatttttctg ttggaattga aaataattca gatttttttg aaaattctaa 73080 tgataatttg agtttgtcat ataccacatt ttttttagag agaataacaa attctattaa 73140 aaaaaactta ttcttaaaat ataaaaaagc aattaaaatg ataaaatgca ttcaagtaaa 73200 aaaattgaac aatataataa tttattttaa ccaaaaaaaa gtgaacaaca aaaaattctt 73260 tcaacttgac ttcattaatc acttattaat acaaagtact ccaaatcact attttttaat 73320 aaatactttt gtaaaattaa atctatggtg ggaattaatg gtttacattc taagatggag 73380 ttttgagctt ggaccttcca aatcgagatc tttattaata gataaaagac gtgaaatacg 73440 agcaaacagt gaaaaaactt ttgaagagat attttttgac ctggttataa aaaaagctca 73500 ttcaaaaggc caattaaatt ctctgatcaa tgcaatgaat gaagccgtac cattgggagg 73560 cccaagagcc gctatagttg tgaaaaagtg caaaaatgca atacaaaaat acttagaatg 73620 tagtggggct ttggaaatgt tatatagatc ttttctatca ctttatattt caattgagat 73680 tcctcacgca cttatggggg caatagcaat ccatctttca cttctagacc atttaaatat 73740 tgattcaagg actggatact tagaatcagc tttgtaccat ttcaaaaatg caaatttggt 73800 attgaaaaaa aaagttaagt caggagtaat taaaaagaac aaacatgcat ctccaacaaa 73860 ctgtattgac tgcgaaagaa ctgcagaagt cccatttctt tctcaaaatt tgaatattgg 73920 gaaattaaac gacgtactta ttccatttat cgctgatatt ccaatttata aaattaatat 73980 ttctccttgg ggtggtcttt cattgccaat aatacaaaga ataatcagat tagtcgagct 74040 gcaaaattcg attattaaac tcttaataaa agaaaattca tcaatgtcaa ttttaagtcc 74100 aaattataag gatagaagaa atgttatagt agttctcttt cttattcgtg aatatcctct 74160 tgcatttaag acttctaaca tgcttgaaat accattgatg gaaattttgg tgcatgcgac 74220 taaagaactg atagtttctt atccaaacag caattctctt tattgttttc tggattcaat 74280 gaaactatgg ctgtcagaat acgactcgga tgcactaata tcaaatgcaa taaatatgtg 74340 gatctcggaa aaaaaaataa atcttaataa tatcccagaa ttagaaaaaa acagtataat 74400 ggaactcgtt aataggctgt caaacccttt gagcagaagc gaagcattta aaatgataaa 74460 tcctttggaa agagttaaat taagttaata ttttttcata attaaaggca gctaagtagg 74520 tatattgcaa aactataaaa aatgagtgca agcggtaatt gatatattat aaagtcgtat 74580 aatgcattaa ataaatatat aaaccactct ttaagatttt tttttaccga ttttattgaa 74640 atatttgtgc atgaaaattc tggattacaa aacttgcatt gcttattaca tgcggacttc 74700 attctaatca aagaacgcct aaatttactt ttctgccaag tattatttag ttttgtatta 74760 catcgtttat ttgataatga tttatctacg taatttgtga acattaccat attttttgga 74820 acagtattca aaatatcctc tccgaaatac ttcttaggag tagttagaag tttatttgat 74880 tgtgtaaatg agtattcttt attatatctc tttttaactg gggttctgtt caaacaatca 74940 ttaactaata atcgatcatc ggatcctaaa taatccaaat gttcttcata gttctcactt 75000 ttggaatttt gtattttaat aagtagatct gtttcctcaa acaacgagag atttttttca 75060 tttgttcctt tagagaaggt atttattaac tttaaatcat ctctttcttt ttcttttggt 75120 gtttcaggaa gaaataaagt gctcattatg taaaatataa acaaaattta caaaaatgtg 75180 ggtcaagtta cggcgggaat atcgaaacta aatttttaac tctggagatt gagtataaaa 75240 aaatttgtta tcccattttc ttccctttac tggtaaaaca gtaattccta aaaaatattt 75300 ggcctaaaac aatgtatcat aaacatacct ggtccatcac tagatgaaaa ccaataatta 75360 tctctttttg caattctcga gctttttctg atctcagaat cggttaataa taattcagcc 75420 taataaatta gtttaatttc tctgagtcct tccaaaactt acctttgaat atagggatat 75480 tagtgctgtt tctactttaa gaattctatc tcccatattc acgactttga atcccattgt 75540 aattaagtct gtcaattctt cttttgtcca acctccttca ggtccaatag caaggataat 75600 tggtccagtg tggctttgta atcctatttc agtaatcttg ctgttgccta agacatcagc 75660 aacaattcca atcatgcttt ctttgcaaaa aaaataattc ctaatattgt taatgtttaa 75720 caaaaaggaa gtaaagtttc ttaccttaaa tgctgtagaa aaaatgtcca agaagcataa 75780 acataaacat ctgggcacaa tgtttttgat gcctgttcaa gcccaagctg tacaatttca 75840 tcaatagact ctttttttaa cttggatgaa tttaaatacg acttttcgga tttatcagta 75900 caaacaaaaa ttattcttcc aactccaatt gtaaccgcat tttgaagtac tttctcaaac 75960 acttttggtc taggaagtgc aattaataag tcgattaagg ggtaatcatg tttatattcc 76020 tctttatgaa tttctgtatc taacttaatt gtaactgcat attgaaaatt atttaaatta 76080 ttgatttccg aaatagatct tctggatcta ttatttactt tattacctat ttcttcaatg 76140 ctagataagt taattattga ttctgatttg gttattccat ctttattagt ttcttttctc 76200 attattttgg ttactgttgc agttcctttt ccagagtttt taactccaac attaactttt 76260 gagccaatat taatttccag aactgagata caatgattgg actgtctgga ggatagctca 76320 acagttaggt cttcattaat atccttttta tcaataatga ttaagttcat tagaattgat 76380 ggaaaaatga ccattatggc ggtatgatct gcattaagac tggatcttct cgcatggtgc 76440 ataagaaaga tacgatcaca taataaaact aagctaacaa aagattgggt ttcgtattga 76500 aagatcattt attttttatt tgaaaatgcc tttatatttt caagatgact tattctcaat 76560 aacgaaagat caaaaaaaat atattaacat caaaaggtat actttaaaat caatcatttt 76620 tttttaaata acttttttag tctgataaga acaaccttta cttttgttta cagtattttt 76680 atttttacta ctgcagtttt agtcttaaat aaatattatt catttcctgg aattgttgca 76740 attggtaagt taataataat atattaagca atttcaactt cgtatcctaa gtatttaaca 76800 gtggaatttt ttttttctta tatataatta ttgagtggtt taatcttcca aaatttttaa 76860 attggaatat ttcatatttt ttgattaaaa gaaaactcta taaatcatct acaagaattt 76920 ttcaaaaaaa agtcagcaga ccgtaagttt ataatgtttt ttagaaaatt attaatggtt 76980 tttaggtatt tgaaaaactc gaaccttaaa aaaaataaaa atgaactaga agagtgtatt 77040 gagattgatt gtaaaagtgg aagttcagaa tcgtttaatg aaactggatt atatgactat 77100 gcatttgatc attcaataat ttctaaagct gcttcagatg attgggtagc tgcaatacct 77160 cgaatacata aacaagtggg tttaaatttt tcaattttag atttatcatt tttaggttat 77220 gagcaatgaa ttttattcaa acaaattatt tacattaaaa ggagttgaat tgattcattg 77280 gggaattcat acaggttacc caagtttttc ccaattatta aaaaatacat cagaaatact 77340 tagttcattt attgaaatgt gttttaggca aacacaaacg cctgcatatc aaacaggtaa 77400 ggttttatta acttattttg ttaaaataat atttttaggc cacattgtaa aatcaatgtt 77460 gtggtccggg gtgtatattt gggcaacaac aaagatgtat cgtaaaaata ataatgaccc 77520 atggaatttt aaacgtattc cggagattta ctattttttt gaatcaacat ttttgtggat 77580 gctttcctat gattatgtaa ttggtacgca ctaaaataat ttatatgaaa agttgatttt 77640 tgtttcaggt ttaatagcaa tatggtatag aaaagagagt acaatacgtt atattttcaa 77700 tttttatcaa attattgact ttttgtcgct tcctccgttc ttattgttta taattacaat 77760 aagcccacaa tacgaatcaa gccaaagttg gttattaatg taagtttagc aatatttttt 77820 tcaaacttca aatatttgat tagattcggg tggttaagat ttttaagact atttcgaaca 77880 gaaaaagtgt tggatatgtt atttccacat gtttctgcaa taaaaagaag agttatcggt 77940 aagacattct ttctttaatg atatttttta ctaaaataat taggaatttt tgctggtgcg 78000 ataaccatca ttttaactca tgctggtgca atatttacga tagagtcacc atcggaaaaa 78060 gaatataata gtttgtttga ttatttgtat tattcagttg taacaattgg tactgtaggt 78120 tatggcgact tttcacctcg aaaaagagaa ggaagactag ctacaatcat tttgatcacc 78180 ctgaccttag ttttacttcc tcatgaattt caaagactaa aagaagcatt aaatacacca 78240 ccagattcaa ttggatcatt tattaggaaa aatgatacat atttgtgtat tattggacca 78300 attccaccaa aggtaagtta attatgctct ttgcaatatt ttaacacttt cttaaagcag 78360 ttgttattta ttacaaaaag tctttctctg caaaaacaaa gaagattcaa atctatagta 78420 cttgtgacac caatacaaat tttagaatat caaaacatag ttaaaatatc acaacaaaga 78480 ggttatattc gattaagtat aaagcagggt tttttaggat ctgcaatcaa taacctggtc 78540 caatatagct ctattgtttt aatttatgga tcagagaagc caatattaca tgatgttatt 78600 tgtaatgaag gaacgcaaaa aagtgacttt gatgcactga ttactgtaat gtgtaagcga 78660 taaaatgaat aattatgatt aataatatga attttaggtc tgactaattt attggggcta 78720 aaagataaat tttttcttat tttttattca tcgcaagttg caagtttatc aaagataact 78780 ggagtattag ggagcctatc tttagagaat ttaagaatta aattattaag caaatgtata 78840 agtaattgtc cagggtttct accaattatg ctacatctta ttataccaaa gaatgaaaac 78900 atgcttaaaa agctccagga accaccaaaa aatacatttt ttactagtaa tatcgataga 78960 tcaaaaatta ccccatatga atggaaatgt agatggaggg gactgcattt taaggtttat 79020 acattaagat ttccaaactc attttttgga tatccaatac aaatttttac gaagtatatt 79080 tataaacatc taggaatatt tttaattgga gtttattgcc ctgatacgaa tagaacttat 79140 ttaaaccctc atcaatatat aattggagat aattcaaata tttattatga ggggtcaaat 79200 taccttggaa ttgtgttatc ttccagtatt tcccttgtaa agaaagcaga gttattggaa 79260 tcaccaaaaa attacacaga agaatttatt aaaaaagtgg ggaggaggag ctgttctcca 79320 tctaacgtat ttcttaatag aagtaaaaaa ttcaagtcaa aagagtatat tgaaaacaaa 79380 gatcaattaa atatttatga ttgtacaatc atatctgaaa ttcataagca ttgtgattcg 79440 aatgataccg ataaaaatat cataaaaaat ttgataaata ttaatgaaac tgggaaaaat 79500 atacaaatta ctcagcataa tcgtgaattt acaaatttaa tcaattattc aagttcttac 79560 gatgatcaat gtttatttga ccaggtaaaa aatatggttc tccccaatca ctttgaagaa 79620 actgcatcta atatgacgaa tataaatagt aaatttgata tctgcaaaat caattctaat 79680 acagttagtt tttctcaatc aaatttaaat tttctacctg aaacaaatac cggtattcca 79740 gtagtttcaa attatttaga agcaatttca aaagttttta caaatagaga atacccaatt 79800 gtgctgatta tcggatggtg cgaacagatt gatatgctgg taaaattaac cttttcactt 79860 aaaccttcaa attttattgt tttatgtgaa aaaattatag attccagttt tataaatgaa 79920 gagttttttg gaagaattgc ccaaatatct ggtgttggga catcagaaca tgatttaaaa 79980 caagccggag tattggttgc atcaagaata attatttttg attcaacagg gaatccttca 80040 aatatatata aagaagaact aattaacgtt agatatggaa aacatgccat tggcacttgg 80100 atagtagtat gttatttatt ctccaaatac aaaaaaaaag ataattgttt agtaggatcc 80160 caaaaaatgc caccattaat aatagatatt aaagagacag acattgggtt attattattt 80220 cagtactcgg atgattggcc aaccaggatc gatggatcaa cttactctat tccatataag 80280 aacgaactag atttttttta ttcaagacaa tttcttagtg gacagttttt tgtggataat 80340 attattgact cattgattcc ttttttagtt ccaatattag ataacaatcc tataaataaa 80400 tcatttatta atcaaataat ttatggaaat ccaactagta aaccattcgg aaatgttagg 80460 tattataacg aaaagaactg tagtttgcaa atggaagaga ttccatttgt atttgtgaat 80520 tctacatttt ataacttatt ttcagtaagt tcgttttcgt tttataatta taaatttatt 80580 cttgatttaa cagcaacttc ttaaacatgg caggctagcc attggaattt atagaccaat 80640 tagagtggac gatcatcaat ttaataaaaa acaagatata aaaactcaaa aattttctcc 80700 aagtttagga gatttagagt cagttattaa cgagaattcg gtaaaagaat gctttaattg 80760 cgaaactcat ttaataatat catgcccacc accaaaattc aaaataaatt taggagattg 80820 tgtttatttc ttatagagta tatatcttaa atatatttat taaaataatg acttttattt 80880 gattgatttt tttttttggt agatatttca ccattatttg attggtgtgt gcacaccaaa 80940 gttaaactat ttgaaaataa acaaatcatt atgtggggat ttatagaaca attaagaaca 81000 aaagcaggag gatttattga cttatccagt ggaattattt gttgtgcagc atttggaagt 81060 ttattactga tatttcgtaa gtacattccg catccatctg aattctttcc atttggttgg 81120 ttttttttaa aatttggtat tcataagcat gacaatttta atttattatt ggatattctc 81180 gagggagttg atattcctga aacaggaaag tattcaataa gaattaaatg cggtagagaa 81240 actgacgaat catctacaac tatttgcttg gaagatgtaa acaagtttac taataacctt 81300 acgaaatcat ttaaatgttg ttggaaagaa aagcgtatta ttcttgtaag acaaagggag 81360 aactatttat ttattgaatt aatcagtcat ggcactttta ccagttctgt aattggagaa 81420 tgtagaatta gcattatgga tataattgat gcaaagtttc caaaaaaagt aacttataac 81480 atccaaaaag acaggaaggt tgttgcgaaa ctactgttat ctttttatag aatttctgca 81540 aacataatag ttgaggatac aaatccagta ttatttcaag caatgattaa tttacaaact 81600 gatgcagata tttcaggcaa taaatcaatt tatgcagatt ttgacaaaat gtcggaaaaa 81660 gaacaattaa tattcttttc taaagcatta caaggaaatc tatattactt agaaaatggt 81720 gacaaaggta agttatcaca attcttttta aacctttaac tttttttttt taagataaac 81780 tgagaatgtt ttatttcaga gcagtggaag ttacaattaa taggtaagtt aataaaaaaa 81840 atataaagca attattaata ttaaacaggt gggagtggtg ctattgggca accgaaggcg 81900 attatttaaa aggaagtagt aaattaggag gatatccatt tcttgctatg agtgtggtca 81960 ttccagataa aacggataga gatcaaatat acatcaaata tcatgattta tacgggattc 82020 atgacctttt ctttaaagcg atagaaattg atagagatgt ctggtcagaa gcaatatatg 82080 aatttatcga gagagtaagt taaattcagt actacaaaga cgattaataa atattcttag 82140 ttaagatcat atctggatca tataaaggat ggcgttataa acaccgaaca attaaacatt 82200 cacttggaga acgctaattt ttcaaattca ccctcggaac agccattcat tataaactta 82260 aactcaagag acgaaagcct aacaaataat actcaccacg aaataacaag cgaaaagaag 82320 gacaaacaag tgacgatgat tgagtccaaa gtaaagaact caagaacagt aaaaatacac 82380 acaaacgggg gttcagaaaa gcattccaag aacatcaact caccaaattc tataaagaga 82440 atgaagatgg aagtaaatcc atcaagatat gaaaacgagc cattaatata aatattgagc 82500 tactggaatc tcgaatctca aagactcaga agaatcaata ggaatttcca atttgcatca 82560 gtaaatcaga tactatctag caatggaact tcaatcatcg tcattgagaa gaaaatgttt 82620 acattttttg ctcagacagg gataacaata atactcaacc atcaaaaatt gatgtctgta 82680 ggtgtcctaa gaaaatcatc atcacaaaat gtaaacttta aagtttagga actaaaacaa 82740 aaggcattat aaaaatatac tgccagtaga gagtttcttg ttccatcaaa ttatagcaac 82800 gttctcttca ataccgatta gagaatatac aatcactatt gtacacacta gttgcaaagt 82860 gattttttac aatagaacga gacaataaaa ttgagacaac agtttgcgcg gttatttttt 82920 ttttgcaaat gaaagcttga gtctggtcga attgtttaat agaaagttgt ctagtttaga 82980 gattgtagtt gctgcttgta cttgagtgaa aaagtctaca aatgcgactt gatgattctt 83040 atacaagtaa attaaactat tgagtaatct ttagaaactt actggtataa aacggagttc 83100 tttgaatcca tttatgtgac taaataaagt atacagttct tcttctctac atatccccat 83160 ctccatttgt acaaaaactg tattgcttac ctcaagcggg ctagcgttgt ttggaagcct 83220 gtgtagaaac gatgtagtgc catcaatttg catctctggt cttcctaggg agttttgtgt 83280 aagctcatta ttaataaagg gatgaacatt caattttttg tttttgttag aacgcatgag 83340 gttcacgtag ttgttatcat ataaaacttg attgttaact agtttaatct ggtttttttc 83400 aggatcgtcc agagtttgaa ataaaccatt gttttcagca ttttttaacc attgagccag 83460 gtattgcttg tataattcaa tgcgagcatt aatagacttt ggtaccatcc gaatgttgga 83520 actggaattg cgaacttgga tatttctacg ttctgactct ttagctgcaa atgaaacatt 83580 tacgtgcttc ccaaatattt gcgtcccaat ttcattattc acagcattta cagcagaaac 83640 agcgttagaa aatgttatcc atgcctgacc tttcctaaaa tatgagttaa acattttaat 83700 tgattcaatc ttgccatatt ttgcaaaaat ctcttcaagt gacaattttt gctttgggat 83760 agatatactg tcatttaaat tgttaatata aatagttttg ttttcatcat catcagatag 83820 gctgttcttt agtactttaa taccattttt tctatccatt aaagaattgt tcattgcaaa 83880 caaaatacat aaaaattcag attacaagtt atctaaggcg ccagatggat tttttatttt 83940 agttcttgat aatgaattac gatgaaataa gaagcctcat attagatacg gcggaagaat 84000 ataagcattc gcttggtagc gattataggc tagtgttgaa agaaataaat caattaggtg 84060 atatatctgt gagcgatagt tttcaaaaga ataaagttga attttctttt tcacttaaga 84120 agggtaaaga gaaatataaa atttctatct atgtaaatag aaaattatta gagagtaatg 84180 aatctcttat cggtaataaa catctaaaaa aatcagttga aatatctgcg ttggctttct 84240 gggaattttc ggataaaaaa aaggaacttt ttgaatattt tactgaactt tttgaagagg 84300 acggcatttc accatttaaa aatagcgaat cccatgatat attttattct aactgtgaag 84360 gtttgaggga tgagctaatt ccttcctgcg acataatatc attccttgac attttcttac 84420 tctcattatt cagatccaaa atacctcaac tcattagaca cttgggctgg gatgagagta 84480 ttccccttag aaattcacta atagaacaca tagcatccga tgaatgtttt aaggacctag 84540 atgaaagcag tacagacagt gagtctgagg agagtactgt gaaaaaaaga gtaaatagga 84600 gatactaaaa gtattaatct ttaaggtata agttattact atgaaccact aggtaaatct 84660 agtactaata aagttgaaat cgctgattaa cgcacaacaa ttttcagttc aagataatac 84720 catcatattt agactggaac cattgcatag cttcatcctt tgtaactcta ccgtgcttac 84780 caacctttga tctacatttc cttcttaggc taacacgatt acctggtcta gttaattgta 84840 cgaagaagtc cataccgtaa ataccggtgc taggatcgta tttaataccc aaatcaatgt 84900 gctcatctat tccaaaacca aaattaccag ttgcagagaa gttcctttta cgaagttcgt 84960 attccttaac tttcaaaccc ttctcaagaa tctcttcggc tttatcacca cgaacagtaa 85020 cataacatga tatcttttca gccctacgga tagagaatga acggatagtg aaacgagcct 85080 gaccaaagac aggtttttgg tcagtcaact gttccaatac cttcgctgca cgggtgaggc 85140 gatcacctga ctggccaaca ctaatattaa tgaccaactt ctcaattttg atatttttca 85200 tagggttcac ctctgctgtc atattgaatt ataattattt aaatcctgaa caataataac 85260 ttaaggaagt attggatgcc tgtacaaact gaaattaaaa aaaccagttc cgtgaacctt 85320 atttaaacgt ctgtagacgg ttatttgtac aaatataaat tgatattttt gagacaggtt 85380 ttcgtctctc aatagcgggc aatttagttg cgcgctgaat ttttttaata ttcataaaaa 85440 aaaatatata atataatata tataaattac agcgtgtgat aaatgtcaag caagaagaaa 85500 aatgctaaca aggtaagtag tacttatatt tattgttacg ctaggaatac ttctgttatt 85560 tcagaaacag gatcaagatc tcgaggattt ggatgcacta ttagctgaat ttggaacggt 85620 tgataatgaa gaggtaaatc aggaattaca agatcttaag gaagtgaaag cagaggaaac 85680 tgtatcttct caaactctta aaaacaggct taagaaacag aaaaaaaaac aagcaaagtc 85740 acatgctcta gccactgagg atgttggttc ggaagccatt acaaaaccca agcctataag 85800 tgcagctgct aaagcagctg cagaaagatt gaggcagatt caagaaaatg aacaggaatt 85860 gaaaagaaga gaagaggagg agaaaaagaa ggaagaagag aggcggaggt tggaagaaga 85920 agaagcattg cgcattcaag aggaaaaact tcaaaaacaa aaacaaagga aagagcgcag 85980 ggaacagtta aaggctgaag gaaaactctt gtctgctaag gaaaaggctg taaaacaaaa 86040 gcgtgaacaa tttgttgaat accttaaaca gcagggcgta gtttctacaa gtgaaggtag 86100 tcttgccaat tcaagtttca gtagtgggct tgcaacaaga aagaagaaaa ataataaatc 86160 taatttaaag gaagatattc tatctacgga tattatagaa aatagtgaga gaaatcaaga 86220 tatggaaact aaaacggtta tacaggagtt tgtattagac tcatgggaga aagcagttga 86280 ttatgaagcg ggatcaaaaa gtccgaacgt ttctacaaaa aatattagag acctagttcc 86340 gccaaagaaa gtggacggca tcgcggatac aaactgtatt gaacatattg cagaggaatc 86400 atgtgaagac ctggggttta gatcaccagt ttgctgtatt ctaggccatg ttgatactgg 86460 aaaaacaaag cttcttgaca aaatgaggaa aacaaatgtt caggataatg aggcaggagg 86520 aataactcaa cagattggtg caacttactt tccacctgaa atgctttccg aacaagttaa 86580 aaaagttgaa gctgactttg aattgcaaat acctgggtta ttattcatag atactcctgg 86640 ccatgagagt tttaataatt tgagatctag aggctcgtca ttatgcgata tagctgttct 86700 tgttgttgat attatgcatg gacttgaacc gcaaactaga gaaagtattg gattactcag 86760 aagcaggaaa tgccctttta tcattgcttt gaacaagatt gacagacttt atggctggat 86820 tgagcagaat tggtcttctt ctaggtcaac actttctatc caaaatgaaa gtacaagaga 86880 tgagtttgac actcgtctta atagagttct tctagaactt tctgaggaag gtcttaattg 86940 tgatatatat tggaagaatg atgattttag aggaaatgta tcaattgtac caactagtgc 87000 tgttactgga gaaggagtgc ctgacttgat ttatttaatt gctcagctga ctcaaaatta 87060 catgggcctc cattgtttgc aattgaatac acgtgaatta agctgcacga ttcttgaggt 87120 aaaggctatc gacggtttgg gcgtgacaat tgatgtgatt cttgtatcag gaatattaag 87180 ggaaggagat acaattattg tttgcggtct ttctgctcca attgtaacta caattagagc 87240 attattgacc ccccaaccaa tgcatgagat gagggtgaaa ggcgagtata tacatcatcg 87300 ttttattaaa gcttcaatgg gcgtaaaaat atgcgcgaac gggcttgatg acgctgttgc 87360 gggaacacag ttactagtgc aatctaagaa ttcaactcca gaggagatcg aatcattgaa 87420 agaggaagtt atgaaagata tgggagacat ttttagttct gttgatagga cagggaatgg 87480 agtatatgta atggcttcaa ctcttggatc cctagaagct ctattagttt ttctgaagtc 87540 ttccaatatt ccagttgttg ctctaaatat cggcactgtt cataagtctg atgttagaag 87600 agcatctatt atgcacgaaa gaggttttcc tgaaatggca gttattcttg catttgacat 87660 taaggtagat gcagaggcag aagttgaagc caagaaactt aatgtaagaa taatgaaggc 87720 aaatataata tatcaccttt gtgatatgtt caccaagtat tatagcgatg tccaggaaga 87780 gaagaagaag gagaagtctc agaaggttgt atttccttgt atattaaaaa ttattcctca 87840 atacattttt aatgctagag atccaattat ttgtggcgtt tatgtggagg agggtatttt 87900 aaaaccagga acacctcttt gcattcccga gaaagataac ttgatgattg gccgcgtaac 87960 aagtgttgaa tttaacaaga aacctgtaaa tgaagggaag aaaggtcaag aggttgctgt 88020 gaagattcag ccttttgcat cagatacaaa catcacatat ggtaggcact ttgatcacaa 88080 cgataaacta gttagtcgta ttacaagaga ttccatagat attttgaaac aacattttag 88140 agatgatcta tctaaagatg attggaaatt agttattcag ttgaagaaaa cattcggaat 88200 tccttaatta gattatattt gtagttttga tatattaact ttaaattttt tctttcatgt 88260 ttaaaaactg agttttttag tacttttttt agtctttgat ttaatgacag gttaattctt 88320 ccacccgcac cctccagttt atgataaaaa cgcctcaatt attctggata atccagagat 88380 gattaatcca gtagcggtta tgcatatctt attgttatta agctactttg tttggcgcag 88440 tgatggatgt tgatcttcca atttttgaat tcactcttaa aaatcttgag aaaaaagagt 88500 taaacaattt ggcagacaac aagattggtg ttacggaaaa taatccagat gaatgtgata 88560 tatcggttag attagagaac atttgctgga aatctcaatg ttatgtcgac attttgccat 88620 tagacttttg tgatgtaaat cattcacttt taaatgagac taattcaaga aaaacggtcg 88680 gaatgggatg ggaattacag cgtaaactaa ctaaactggg atacaacaat ttcaataaat 88740 acttaattag taactatatt aatttatttc caattggtat ttctaagtcg aaggctaaac 88800 tttgcaattg ggcaaactat aaggatttta ctcaaaagaa attattgaaa ttatgcatta 88860 acggaattcc atccgatatt cgaggagaag tttggtgcta tttattagga agcgatcgaa 88920 tgctgaggaa taactcgaat gtatatttta atgagcttaa tgggagcatc gataaaaata 88980 ttgaaaacca aataatccta gatttacaca gaactttccc aaactctaaa tattactcta 89040 attctagcaa ttttaataaa gttggtacat taagtagagt tttatacgca tttgcatcgt 89100 atgacaaagc aataggctac tgccaaagca tgaattttat tgctgcaatt ctactaatta 89160 atatgaagga agaagcagcg ttttggtctt tagtacaact agttagcagc aatagaaaca 89220 aagagtttat ggtttgtagc tggggagatt tagagacata ttatggagaa aggatggatg 89280 gagtaatacg tgacattgcc attttagaga ctctgtgcag acaatttatt cctaaagttt 89340 cacaaaaatt agaaaatata ggagtaaact tccaatggtt tgcattagag tggttcctct 89400 gcttttttgt tacatctttg cctttaaaat caattatgga gatattagat ctcatttttt 89460 gttttggcag tgatgtttta tttaatattt cgattgcatt attagatata aataaaaaaa 89520 aattactttc atccgttaat atggaagaat gtatggaaat tttgaagaat attacaagaa 89580 atattactga ttcaactaaa ataatcagaa aagcaatgaa atataatatt tctagtaacc 89640 atatacgaaa attaagagaa gaaaattaat ttagtagaat taattaattt acccaaattg 89700 gtttatttct ttctaagtat ttatctctta ttttctcgaa tttataagta aattctgata 89760 ttggaagctc tacatcataa catatttcga tatttatttt tttgttgttt ggctctactt 89820 ccatattgtt taatatccat tcattcagtc ttcttgggtc aaacgagagc ctaaatactc 89880 ccatctttct taagtttgaa gtaaaggtca tatttcgata attaattagg ctacactcta 89940 aattacaaca tcctgctaaa tcgataattt ccaaaaattt gcaattactt aaaatttgta 90000 tcagggcatt agctttgaaa tttagacaat aagataattt gagaccagat atcattttta 90060 atttacttgg tatcttatca aatatttttt cagaaatgtt ggcacaataa gaaacatcca 90120 atatttctat cccaggacag ttttcaaaaa ttatatcaag taattgatca ctaatgtttg 90180 tatttgataa taccaaaact tttagatttt tagagtaacc aaaaaaatgt tcaagttcaa 90240 tatttaaaat atttttgcag cccctcaagt ccaaatattc aatattgtaa caatattctt 90300 taacagtttt gggaatctta gtaattccga cgcagttttg aagttctaag tgtttaatgc 90360 taggttttgt acagacactc acatcagttt tgattaattt tttttggtag tttgaattga 90420 ataaatcttt caatgagttg tcattaattg aaaagttttg tttaatcctt gatcctatct 90480 ttaaaaattt aattgaagac ttcatcccaa ttactcgaaa agaagagctt ccggtaatgc 90540 tattacaatc aatgatttct aagaattcca gattccttaa tcttaaaagg aaaagcgata 90600 gcgcagagtc agtaatgtta ttatttccat tgatagttaa tttgtttata gatgccggga 90660 gatatttttc ggaagcaatt tctttaatat cagagtcttt aatattttta cttgaaatga 90720 tcaaacttcc taaattctta aaattgatta tattgttaga ttttttgcgt aatctatgat 90780 taattaaaat ctcccatacg ttaccttgga aaaatttcaa tagcctactg tttacaaatg 90840 ctaaattggt tgatttaatt aagtcatctt tatagaatga attataatta atgtaggatt 90900 tttcattttc ttcgtaaaca accggagtaa caagcattga gagaacataa aataataaat 90960 cttccggtag gctgctgatg caattggttg aaatgattgg gtaattgccc gagttagttt 91020 tacagaaggt attctgcctt tcaccttcaa acgtactttc tgaagcacag tttttcatct 91080 tctgatgatt catttgttca aattttgtaa ttttttgttt tttggtgatt agcattttat 91140 cattattatc gggaaatttg tttttgaaag ttagagatga tttataaata atatcattcc 91200 tagagctgaa tctgggagat aaaaattggg gagcaatacc tgatactttt tcaataaagc 91260 aaaaaatcga gtttggcgtt gcattcgaga aaaaaaaata tatgtatttt ttgctaacca 91320 ttgtgaattt tccatctaaa tagaacttcc aaaaaattga taaaatatcc tttttaaatg 91380 aataattaaa ctcacgatta aacccatgga tacaaacttt gatgtcatca gcaaataatt 91440 tataatatat tcctgttcca ttggcaccca aattttgcat gattcttttt ccagaaacat 91500 gaacctcaac atttgaggca aaaaacaacc attctgattc actcaacatt ccagccccga 91560 cttacatctt tattttcagt atttaagttt ttttcgcaca aaatcaaaca aaaattaatc 91620 agcttattct gtgagaaatc ttattaagag atacccggca aaatactaaa tttttgtccg 91680 gcagatattt atatattagt gctttggcaa atatgattaa aattttttaa acgaaaatta 91740 actaatttct attcaaagtt actagttgga tagtgccttt agctcacatt gttggctgtt 91800 gggtataatt tccctcggcc ataactgggg gaccaaagta tgcaagcgca tttctacttt 91860 tatagtattc atttctaaca gctgattggg tcattttgat tctagacttg agagcgatac 91920 tttcaaaaat tccagcccaa agtgttacat gagtaagcgc aatgattgtt accaatgttg 91980 caagagaacc aatagttttg caatctggga agatgattgt taaaaacata aaaataccgg 92040 aaactttgca caataatatg ctcgcaaatc ctagtatctc tctggttcca tcgcttgaat 92100 aaacttcaac gagaaatgca gcgaatccaa gtattgttag tccaacactc gtaatagcta 92160 agcctgcatt atacatacat ttaattacgt ggttgagatc tagaccagat agagcattac 92220 cttcatagta acccacattt gagtacaatg taactacaat aagtatcaat cccaacaaaa 92280 taacgagaga gccttggcca aatgctttac atcctgacct aaacccacgt gtttggctaa 92340 tactgccatc atcgtccatt agaattgtcg acatatttaa gcatattaca ccaaaaatca 92400 taaatatcat aaagaacaac gctgttccag ttgagtttgt tgcttcgtta aatgggctta 92460 agaactttgt tatgtgcaat ccaaatccac taaatattga gcagatcgtc aggccaataa 92520 aacccaaaag cattgaacca aatcccagtt tgaggttgtt tactaaaaag ttttcaaaaa 92580 ataatggttg cgaaggccct cttccttcag gagcatctcc tggtcgaact acatacgtaa 92640 aatacatggt ctcgatttag attcaattaa taaaatcaga gctattaaaa tttaagatta 92700 aacggtattg tcctttaaaa gccgtttaaa ttgataaagg cactaacagt acatgtattg 92760 tttataattt ttattgcaaa acccaagcat gtctccgtaa ttaaattttt attgcacgta 92820 cagttctcca tgcgtgagta cacctttcca caagaaagtt ccccattttt gaagatgcgg 92880 atcgcttttt ttttttttaa atttttaaag ggtagacaaa acgtttatca taaaaacaat 92940 ttcctccaaa aatgttataa acgatactta cagttattaa ggcaatctaa tatctaagtt 93000 gttttcatta gatagtgagc tgttgtttct aaattaggga gtagaaaaga cttggccgaa 93060 taatttgttt attaagtgct aatctatgcg catcacttaa atgagatcaa aatactctag 93120 taattgtttc tatttacgat tgatttagcg ataaagcaat ttcaatttga aaatacatgc 93180 ttaatttgtt aatgttacat taatcttcaa atctagttta gagacaagac atcacattcc 93240 ggttgtcgct aataggttta taaatacaat aaggacaata cgacgtggcg gatttgtgtg 93300 gcatgcatac gtattaatta attgggacat ttaataatag ttacaaaaca gcgcatcaga 93360 gctaggaaat acaccaacta ttaatttgtt tattagaggt actaatagac atattatatg 93420 tattaatttg tctacataat gtagcctgga tacaaaaacg ttaagcacat acacacaaaa 93480 ttattgcaag agattattct taagaactaa atattaaacg cacaggaggc aagttagatt 93540 ctattggctt agcgcgattt tttttttttt ttccagaact aatcttgatt tcggaagtca 93600 taaacaagca ctcataaagc taagtgataa gaataataaa atgtattctc aaccatattc 93660 tggagaggag cctatgctct atcaaagtag aggtccaaac aagtcctcta ctaatcaatt 93720 taatatgtct gcaactaatg gcatgcaggt aataaacgat ggaataatgg acgatgaagc 93780 cttaaatgaa agaggtgaag agattacccc acttatatcg aatttctctt gtattactct 93840 caggactggg acgttattgc aatgcgcaag cttacttgta ttgattatcc tatataatgt 93900 atttggcaat aaag |