LOCUS       NM_078717              17943 bp    mRNA    linear   INV 26-DEC-2023
DEFINITION  Drosophila melanogaster kismet (kis), transcript variant A, mRNA.
ACCESSION   NM_078717
VERSION     NM_078717.3
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 17943)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 17943)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 17943)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 17943)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 17943)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 17943)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 17943)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 17943)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 17943)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfield,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 17943)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 17943)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., Woodage,T.,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 17943)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 17943)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 17943)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 17943)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (10-NOV-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 17943)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 17943)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 17943)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 17943)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 17943)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   21 (bases 1 to 17943)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-OCT-2015) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   22 (bases 1 to 17943)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   23 (bases 1 to 17943)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   24 (bases 1 to 17943)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   25 (bases 1 to 17943)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NT_033779).
            
            On May 8, 2012 this sequence version replaced NM_078717.2.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..17943
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="2L"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..17943
                     /gene="kis"
                     /locus_tag="Dmel_CG3696"
                     /gene_synonym="136/31; 2532; 5841; anon-WO0172774.164;
                     anon-WO0257455.5; BEST:GM02209; CG18326; CG3660; CG3696;
                     DmelPex20; Dmel\CG3696; EC2-7; EK2-4; EP(2)0474; EP474;
                     FBtr0078144; GM02209; Kis; KIS; KIS-L; Kismet; kiss;
                     l(2)07812; l(2)k08827; l(2)k11324; l(2)k13631; l(2)k14112;
                     l(2)k16510; l(2)s3527; l(2)s4771; l(2)s4793; Pex20;
                     Su(Pc)21AB"
                     /note="kismet"
                     /map="21B4-21B5"
                     /db_xref="FLYBASE:FBgn0266557"
                     /db_xref="GeneID:33185"
     CDS             525..16493
                     /gene="kis"
                     /locus_tag="Dmel_CG3696"
                     /gene_synonym="136/31; 2532; 5841; anon-WO0172774.164;
                     anon-WO0257455.5; BEST:GM02209; CG18326; CG3660; CG3696;
                     DmelPex20; Dmel\CG3696; EC2-7; EK2-4; EP(2)0474; EP474;
                     FBtr0078144; GM02209; Kis; KIS; KIS-L; Kismet; kiss;
                     l(2)07812; l(2)k08827; l(2)k11324; l(2)k13631; l(2)k14112;
                     l(2)k16510; l(2)s3527; l(2)s4771; l(2)s4793; Pex20;
                     Su(Pc)21AB"
                     /note="CG3696 gene product from transcript CG3696-RA;
                     CG3696-PA; kis-PA; peroxin 20; Kismet-L; lethal (2)
                     k11324; lethal (2) k16510; EP474; kismet"
                     /codon_start=1
                     /product="kismet, isoform A"
                     /protein_id="NP_523441.1"
                     /db_xref="FLYBASE:FBpp0077803"
                     /db_xref="GeneID:33185"
                     /db_xref="FLYBASE:FBgn0266557"
                     /translation="MDGNAQNMFHGNHQGDPYYRYDAAAAAAAAAAANPRAMGPPGGY
                     PPRHRPPHPLQQQPQYPSYQPTAENLYGLGAADHQAAMGLGVGVGVGAGMGDLSGWGA
                     APSVPGQAAAYPGAGLGAYGHPQAAPPSQQQRQSNASAYRQHMPAYGQQDQQKLASYG
                     AQQSMMAGYGSLAPQQQQQQQPPTPQQQQQAQQVQQQAQQQQQQQQQQSQQQARLPQH
                     YPQAPGQHPLYPGVGAPTVQAGQPPTPQAYAHSPYGSPMQHQPSRGAPQHVGYGHTGL
                     DQASAYGQHVPGGAPPGHHLSAQQQQAAQQLGAHQQQQLQSQQQQQAQQQQPHVSLMQ
                     QQQLPGAGLPPPHMGMPPSYGSHHQQQQQQQQTQQQQQQSAPPYMSSSKHQQAAAQLN
                     SSPQYRAPFPQLSPQMSPRPPTMSPHPQMSPRPVGVSPAKPPPPQQQQQQTQAQQPQQ
                     VVSNQPSQHSIQGMVGLPSPRPQPPTSGAGGAGAKGPAPVAAPVNTLQALEQMVMPSQ
                     ALGLPPMDYPSAYRSAVGPRMPASPQQQQQQHQQWGAAHGAQHLAALQQQQAQQQQQQ
                     VQQQQQAPPQQQQQAAMLQAQQQHHQQQLSMYGQPMGQQPQTAQQQVVPPVTSQPPVV
                     AASQQQQQQQHQQQSLNDQLQHSINDIVGGGNSSGATGGAQNQHQQQPQLAAMSSPLH
                     QQNLPNMHHLIQPTMETTQPTQQQQQTSQQQQQPQQPQATLTSPHHQQMQHQSPIHQQ
                     QQSPHHQQHLPSYNQSQQQFDPFANILGDNQPAASQQTMSPAHVSSPVPAQMQQQQHQ
                     TQQQPSQQVPQSQQQQVSQPQQQTQQQQPSTPSHHLSSILNETANSNDSSTLAVAAND
                     SNNSSSNSSTNSIVNNQPTSVHDSNSQSSINSSSNNQQQLLMDNTNSHGGNSESAQNA
                     GVDVGLFDNNSMSSNSAAVAAVASNAASAAAALLDSNSQSSNAEFEKNQSEGEGLGVV
                     EGVIANETDLPQDENSVSKTQDITLSTELPTIQDDLQEKPTAEMEQTQLGQSLGDLSK
                     SLLEEPKTVEESGEDKSQSAPGVAPPADPIIPVTQPPQQQMPPIGMPMHPIAPGYEAQ
                     AQGQGHMPPMMPPYGGAPGMYPPYPMLHQQEIAALQQQIQELYCMPPGHELQHQDKLM
                     RMQERLNLLTQHEVNDQCAGGPQCLLFQNVPPMYGPPGNPLLNQQMVESPQVSSTTGR
                     GRGKSANKPRKPRAKKGEKAQVGQQQDLMDISGNVAMGAANAVSSIPTTQLPVSEDCV
                     TQGAGDTSVVGMLEYSEGMGDLSQDVHSANELDTSTDGSGKKKKPRKPRTPKDPNKPP
                     RDRTPKAKKPKDPNDPNSTETPAAVKKRAGASKRRKQYGEDGAEQTGEGSEVEDNKPL
                     VPKAGSADGEAETTSGGTGVDGESPDYDDIPVSKIPRGSNEDEKEAEAGDETVDSVPD
                     SAGDPASTPSRRDRKRSTARSRRNANSEEGGSARKNRGSLSAKALKKRRNRGRIVPES
                     DGEDDTLDRTPPPSPPPDSEMDSNKRRSSRNTQRKKYIDDVMLRFSDDENSLLVASPV
                     KKDKKPSANASNSNAGSDVEKTEPQSGAEGDAAQEVGEEKSNLPLDESSQLEASSSTS
                     AVAEKERQISTDAANAAMSSKPNYVYINTGDEDSMVVQLVLAMRMGKRELILDKPKEK
                     APEPKQDEEKSELDEATTDKPEGDEKFKTEGESKKDLTDSEETKLESSAMEVDSKEES
                     EPDDSKKSDEDNKDKDKMEVDDEVGKSDKESKPEEQSETVKTEENSKAIEEDKSSTVL
                     TADHAKEPETVLEKMEVDEKANDDQSAVSKAEGSDEKSTDDSNPEEATTEKNKESLEI
                     EGEKERVKEGEESVKKENDEKTEADMENKPEPVFIDVEEYFVKYRNFSYLHCEWRTEE
                     ELLKGDRRVAAKIRRFQQKQSQQLNIFENIEDEPFNQDFTEVDRVLDMSVHTDETSGE
                     TTKHYLVKWKSLPYEDCTWELEEDVDNDKIEQYLRFNKIPQRSEWKSKKRPHPELWKK
                     LEKTPVYKGGNSLRPYQLEGLNWLKFSWYNTHNCILADEMGLGKTIQSLTFVHSVYEY
                     GIRGPFLVIAPLSTIPNWQREFEGWTDMNVVVYHGSVTSKQMIQDYEYYYKTESGKVL
                     KEPIKFNVLITTFEMIVTDYMDLKAFNWRLCVIDEAHRLKNRNCKLLEGLRQLNLEHR
                     VLLSGTPLQNNISELFSLLNFLEPSQFSSQEEFMSEFGSLRTEEEVNKLQALLKPMML
                     RRLKDDVEKSLAPKEETIIEVELTNIQKKYYRGILEQNFSFLKKGTTSANIPNLMNTM
                     MELRKCCIHPYLLNGAEEQIQYDFKSQHGEDPESYYKNLILSAGKMVLIDKLLPKLKA
                     NGHRVLIFSQMVRCLDILEDYLVYRKYPFERIDGRIRGNLRQEAIDRYSKPGSDRFVF
                     LLCTKAGGLGINLTAADTVIIYDSDWNPQNDLQAQARCHRIGQRKMVKIYRLLCRNTY
                     EREMFDKASMKLGLDKAVLQSMNTQGSKDGNNKQLSKKEIEDLLKKGAYGAVMDDDNA
                     GDKFCEEDIDSILKRRTQVITMESEKGSTFSKASFAASGNRSDITIDDPDFWTKWAKK
                     VDIDPDACERDETEDLVLSEPRRRTQIKRYGHEDVMELNSEESSNENSDEEGGIGLRS
                     RRRKEKRDRAREKKGNDEYIPRERDALAALGLEEIQYGNWAKSECFKVEKGLLSFGWG
                     RWSELLELGQFKRGWRDVDVEDCARIILLYCLQVYKGDEKIKTFIWDLITPTEDGEVQ
                     KISRDHSGLHNLVPRGRNGGKSNKESTGVSTPAPGSASGSNSGNTTPAHKSSALDGSD
                     KDSGGINASAVAAAVTSIHDPNHWSKKEKYDADAYLEGAYKKHLGRHANKVLLRVRML
                     YYIQHEVIGDFVQQIKDNTPISELPIRPPPTPDQVPSSWWNPICCDKSLLVGTYKHGC
                     EMYRQMRADPNLCFVTHVGAGAAGDDLAVTSLPINEDDANSKHDDGDEVDDDGTTTTK
                     DSDSTKLTGGDNKDSLLDPERPSSSGLDEEESVAGSYPPTVAAVEDATTMWPSMQDLN
                     TRLRRVITAYQRNYKKEELKLQQKAKLQALVSSTPPLSVPTTPTLQSMQMQLQSGSGS
                     GGSGIGSSAQHDASGRSLQALMQSQGASTSAAAAAAASSAGGSCSGSGQVGMGGAGVM
                     PAMPTAADISLMLSILNTTDVNQLANLDINKLAMYLVSMNNKMERQGKMELAAKERES
                     QRLQLIPKKWNRREEYEFLRVLTGYGVDLHVSTPMASSNGSSLSPDWTKFKQMAHLER
                     KSDETLTDYYKVFVAMCKRQAGLKLSESERGLEGIIEEIEKEHAKLILDRLEVLAKLR
                     EVARNPMLEERLKLCTKNADTPDWWEPGRHDKELITAVLKHGLYRSETFIFNDPNFSF
                     GESEKRFIRELEAQIQRTIKLEAFNAEKAEKLAAAEKDAATEKAAATEKAAAEKAASV
                     KNEVIDLDDELMTNESVIKKESPVTPIKEEIKSEESPEKHDTADNLEDKNSDAEESTK
                     IKGKEDFNPETTEKEALDESTNLNKDKSSTETLVPEIEDSKPSEDKMEVEELPVAENG
                     EKEGSPEKCSDAEDKKFSEEKPSSPAVPDGKVSEMEADEAAPNTEENCSGDSESPEKE
                     CEDKVVEKVEEEKGENSYEKAEEQKEETEKLEKSSIESEPSEKSETVLEPEVAVPKKS
                     TGDQDILDLVAGPSDPDDDEVMKEKEKAVEEECKKQAAELKARFPDLEVIQPATVKQQ
                     KLEKPKLEMCMIRWFKDFALERRIAHIVACVESGNWPVDSKYSAFATCKGATDLSIAL
                     HESIPHLSSLERRSTTPDVITITTDQGVTKHLQTSHMQQVASSASAASTSSGVPQVTQ
                     SKPSSSNSLPGLDAKSINAAVAAAVAAAAAGGNATSLSSLLPGMSLSAATGSSAGGVS
                     GLSVPSAGSGVGKKRKRHIAIDVETERAKLHALLNSSTMAPKDWESEIANMEALSGSS
                     GRGRGGNSSASSGMQPPPAHQHASLSRQSSGQFSKPAVPAMKTPPPSMGAPMDLSSSL
                     PKMNMTEMLKSASSSGAIDLSEVQDFSMPSKKSSVHAALSSAFPSMGNKSKLDDTLNK
                     LMKKNNCTIEEPVIGKEKKRKKLDEIVLGLSAAKEQKTFPDPSLPSSKKPQIPPSVSV
                     TPANLQSSSNQQSNQKPFTITVTTVPGKSKSGSSNSGSGTGGSSSASGGGAGGSGLSA
                     LQNMAMGGLSSKDSLNALLAQTMATDPQTFLKQQQKMMQFLPPAQRKAYENMLAEMEQ
                     AMKISSKFSTNSPHDVKVNKWLSDMTSPLGDQLSIDYVGGGGASGSGSGNSRRSNRQQ
                     GNSQQSSSAAQLQKQQQQQQQQQQQQQQHSMPGPQNLTGEEPVPVINKQTGKRLGGNK
                     APQLKRLMQWLTENPNYEVDPKWLEQMQNPMSTPSPRPASMESGYGSSAVKSHGGRPL
                     SNLSSTSSSSHTQQQSSAAQSQAGGNSGSSKKNSRQQTAASAALDQAALQFGSLAGLN
                     PSLLANLPGLGAFDPKNPLAAFDPKNPLLSMSFGGMPGMGNIPGLGNLNNMNLFASLA
                     GMGGLGNLAGMDTQSLAALMAAAGPTLGGLTGASGGAGSGKSQAQSQSSATSSSSSAS
                     KKKQQQQQQQAQNEAAQLAAAISASTGGSGASGGKNAAASASQLAAGFPFLFPNPSLL
                     YPPMGLGGLNPYSLGSSGLGSAYDQLAQQYNLLNGATSSASNTSSTQSKSHQSQSKSS
                     QSRNTTASANSAASLMNAMASMGGGASTVTTPSTSASGSGRGRQSSSRNQSQTTPTAA
                     DMAQLSSLLMPGADPHLLESLSRMSNMDLAQATRLMSSLGMPPLSGTPSSSGGGNTST
                     SKRSSQAANEANVQAKEQQKWLESLARGALPTDLAALQAFSQGKMPSTSGSNTGTSTS
                     SSKSSKAAATAAAAQLPQIPGMSSDFPQAFLAEMAAQAMAAAGGSLPLSGPGSLASLA
                     GLTGGSAGGGGSSASGSTSHSSSKRQREQDAFKQQMDYYTKTLGLGSGISLIPTSSAG
                     GSSASSSAANAAAAAYAAALDAEQQHQQQQKALSKRARGDLHPTKEELAALAAGLPLN
                     LGASMSSIEKSLRGGSSSSSSSTPAPMTAEQDKVTLTPLNASGGGSGSSSSSAAAAGL
                     AANLPSQTTITIAPPISSGASTSSERSERSESRISLTITNAADAAKLPPPYEEADELI
                     IQPILKKPTAANPGSSHGGSVDDLDTAENTSAANLSGSSSSSAAAAAAAAAEENRRSS
                     NRLKRPRSGNEQGSGSVEGQPPEKRRELRSTRHTRSSADASTLNLSTGSESGAEERNE
                     "
ORIGIN      
        1 aaatggaggt ctttgatgtg tgtgtcgcgt tgtcttgaaa tttgctgcta gaattttgcg
       61 ttaaacgctg tgagcgaatg tgtttcactg cgaatatgtc aaaggtcggc catgaagtgc
      121 gcgaaacacg aaacacaaat gacttttacc ctataaaagc aacaacaaga cgttaaattg
      181 cacaccagtt gtaattaagc ttagtagtta gaaccaacaa ccaactaata tataatcata
      241 atcgataagg cgaaggtgga ttgagacacc agagaccgac ctgatccacg gggtaaagtg
      301 agaagtgagg aacaaacatc ttatcccctg agaagtgaaa cttctccgca attgtctggc
      361 tctcaattat cgactacatt taggcttccg acattttgac aacggccacg cgcagaaagc
      421 agcatcaaaa ttaagccaat tgacgaccaa atctgaaacg cagcggcgtc gcgtctttcg
      481 acgcgtaacg gacatttctg tgtgcgctgc gtttaaaaaa tctgatggat ggaaatgcac
      541 aaaatatgtt tcacggaaat catcaaggag atccctatta ccgttatgat gcggcagctg
      601 ccgcagcagc cgcagccgcg gcgaacccgc gtgcaatggg accgcctggc ggctatccgc
      661 cgcgccaccg ccccccacat ccgctacagc aacagcccca atacccatcg taccagccaa
      721 cggcggagaa tctctacggg ctgggggccg ccgatcacca ggcggctatg ggcctcggcg
      781 ttggagtagg agttggtgcc ggaatgggcg atctaagcgg ttggggagcg gcgccctcgg
      841 ttccgggcca ggcagcggct tatcccggag ccggactggg ggcctatggg catccgcaag
      901 cggctccacc gtcgcagcaa cagcgtcaga gcaacgctag cgcatatcgt caacacatgc
      961 ccgcctatgg acaacaggac cagcagaaac ttgcttcata cggcgcgcag cagagcatga
     1021 tggcggggta cggatcactg gcgccccagc agcaacagca gcagcaaccg cccacgcccc
     1081 agcagcaaca gcaagcacag caagtgcaac agcaggcgca gcaacaacaa caacagcagc
     1141 agcagcagag tcagcagcag gctcgactgc cgcagcatta tccgcaagct cctggtcagc
     1201 atccgttgta tcctggagtg ggagcaccca ccgtccaggc gggacaacct cccacgcctc
     1261 aagcttatgc tcacagtccg tacggaagtc ccatgcaaca ccaaccaagt agaggagctc
     1321 cccagcatgt gggctacggt catacaggac tagatcaggc ttctgcttac ggacagcatg
     1381 tgccgggagg agcgccaccg ggtcaccact tgtcagcgca gcagcagcag gctgcgcagc
     1441 aattgggtgc ccaccagcag cagcagctcc aatcgcagca gcaacagcag gcccagcagc
     1501 agcaacctca cgtctcgcta atgcagcagc aacagttgcc gggtgctgga ttgccgccgc
     1561 cgcacatggg aatgcctcct agttacggaa gccaccatca acagcaacag cagcaacaac
     1621 agacccagca acaacagcag cagtcagcgc cgccttacat gtccagcagc aagcatcaac
     1681 aggcggctgc ccagctaaat tcctcgcccc agtatcgtgc acctttccct cagctttccc
     1741 cacaaatgtc gccacgtcct ccgacaatgt cgccgcaccc acaaatgtcc ccgcgaccag
     1801 ttggcgtgtc gccggcgaaa ccaccgcccc cccaacagca gcaacaacaa acgcaagcgc
     1861 agcagccgca acaggttgta agcaatcagc caagccagca cagcattcaa ggcatggtag
     1921 gcctgccatc gccacgccca cagccgccca caagtggagc aggcggagcg ggagcgaagg
     1981 gtcctgcgcc agttgctgct cccgttaaca cattacaggc cctcgaacaa atggtgatgc
     2041 catctcaggc tctcggacta ccacccatgg actacccatc ggcttacaga agtgccgtgg
     2101 gtcctaggat gccagcttcc cctcaacagc agcagcaaca acatcaacaa tggggggcgg
     2161 ctcatggagc acagcacttg gcggccctgc agcagcagca ggcccaacaa caacaacagc
     2221 aggtacaaca gcagcaacaa gcacctccgc agcagcagca acaggcggcg atgttgcagg
     2281 ctcagcagca acatcaccag cagcagttaa gcatgtatgg tcagccgatg ggacagcaac
     2341 cacaaaccgc gcagcaacaa gttgtaccac cggtgacttc acaaccgcca gtagtggccg
     2401 cctctcagca gcagcaacaa cagcaacacc agcagcagtc actcaacgat caattgcagc
     2461 actcgatcaa cgatattgtt gggggtggta acagtagtgg agcaacaggg ggtgctcaga
     2521 atcagcacca gcagcaaccc caactggcgg ccatgtcttc gccgttgcat cagcaaaatc
     2581 tgcctaacat gcaccattta attcaaccta ccatggagac aacacagccg acgcagcagc
     2641 aacagcaaac gtcacaacaa cagcagcaac cacagcaacc gcaggcgact ttaacatcgc
     2701 cgcaccacca gcaaatgcag caccagtctc caatacacca gcagcaacag tcgccgcacc
     2761 accagcaaca tctacccagc tataatcagt cacagcagca atttgatccc ttcgctaaca
     2821 tactaggaga caatcaaccg gctgcatccc aacagacaat gtcgcccgct catgtaagtt
     2881 cgccagtacc cgcgcaaatg caacagcagc aacatcaaac ccaacaacaa ccatcccaac
     2941 aagtaccaca aagccagcaa caacaggtgt cgcagccgca acagcaaacg cagcagcagc
     3001 aaccctcgac cccatcacat cacctgagca gcattctcaa tgaaacagcc aactcgaatg
     3061 attcctccac tctggcggtg gctgctaatg acagcaacaa cagcagtagc aatagtagca
     3121 ccaatagcat tgtcaacaac cagcccacgt cggtgcacga cagtaattcg caaagcagca
     3181 ttaacagcag cagcaacaac cagcaacagt tgctcatgga caacacgaat tcgcacggag
     3241 gaaattcgga atcggcccag aatgctggag tagatgttgg tctttttgat aacaactcga
     3301 tgtcttcaaa ttcagcggca gttgctgccg tggcctccaa tgcagcttct gcagcagcag
     3361 cacttttaga tagcaattct cagtcctcga atgcggagtt tgagaagaat caaagcgaag
     3421 gtgagggact cggtgttgtg gaaggagtta ttgccaatga aactgatttg ccacaggatg
     3481 agaacagtgt ttctaaaact caagacataa ctctaagcac ggaattgcca acaatacaag
     3541 acgatctgca agaaaagccc accgctgaga tggaacaaac tcaattggga caatctttag
     3601 gggatctatc taaatctttg ttagaggaac ccaaaacagt ggaggaatcc ggagaggata
     3661 aatctcaatc ggcccctgga gttgcacccc ctgcagatcc tattatcccc gttacacaac
     3721 cgcctcagca acagatgcct cctataggga tgcctatgca tcctatcgcc cccggatatg
     3781 aggcccaggc tcagggacaa gggcacatgc ctcccatgat gcctccgtat ggaggagcgc
     3841 caggaatgta tccaccttat ccgatgctgc atcaacaaga aattgctgca ctgcagcagc
     3901 agatacagga actgtattgc atgcctccag gacatgagct tcaacaccaa gataagttaa
     3961 tgcgtatgca ggagcgatta aatcttctga cgcagcacga ggtaaacgat cagtgtgctg
     4021 gcggtccaca gtgcttattg ttccaaaacg ttccgccaat gtatggacca ccaggaaacc
     4081 ccctgcttaa ccagcaaatg gtagagagcc ctcaagtatc tagcaccacg ggaagaggtc
     4141 gaggaaagag tgccaataag ccgcgtaagc ctcgcgccaa gaagggtgag aaagcccagg
     4201 tgggtcagca gcaggatttg atggacatca gcggcaatgt ggcaatggga gcagccaacg
     4261 ccgtttcaag tattccaacg acacaactac ccgtctctga ggattgcgtg actcaaggag
     4321 cgggagatac cagtgtggtc ggtatgctgg agtatagtga agggatgggt gatctctctc
     4381 aagatgtgca ctcagccaac gaactggata cctcgactga tggcagtggc aaaaagaaga
     4441 agccccgcaa gcctagaaca cctaaggatc caaataaacc cccacgagat agaacgccga
     4501 aggcgaagaa gcccaaggat ccaaacgatc ccaactctac cgagacacca gcagcggtaa
     4561 aaaagcgagc tggtgctagt aaacgcagga aacagtacgg agaagatggg gctgagcaaa
     4621 cgggagaggg tagcgaagtg gaagacaaca agccattggt accaaaagca ggatctgctg
     4681 atggtgaagc tgaaaccact tcgggaggaa ctggagtgga tggagaatct ccagactacg
     4741 atgacattcc cgtttcaaaa attccgcgtg gctccaatga agatgaaaag gaagcggagg
     4801 ccggcgatga aacagtagac tctgttccag attcggcagg agatcctgct tctacaccaa
     4861 gcagaagaga tcgtaaaaga tccactgcaa ggagtcgccg caatgcaaat tctgaggagg
     4921 gtggcagtgc tcgcaagaac cgtggatctc tttccgccaa ggcactaaag aagcgaagga
     4981 ataggggtcg cattgtacct gaatccgatg gcgaagatga cacactggac agaactccac
     5041 ctccatcacc gccaccagac tctgaaatgg attccaacaa gcgaagatcg tcaaggaata
     5101 ctcaaagaaa gaaatacatc gatgatgtta tgctaaggtt ctccgatgat gagaactctc
     5161 tcttagtggc atctcccgta aagaaggata agaaaccatc tgcgaatgcg agtaattcaa
     5221 atgcaggaag tgatgtggaa aaaacggaac ctcaatctgg tgccgaggga gacgctgccc
     5281 aggaggtggg tgaagaaaag tcaaatctac cactggatga aagcagtcaa ctggaagcca
     5341 gttcgagtac ttctgcagta gcggagaagg agcgacaaat ctctactgat gctgcaaatg
     5401 cagccatgtc ttccaaaccc aattacgtgt acataaacac tggcgatgag gacagcatgg
     5461 tggttcaatt agttttagcc atgcgaatgg gaaagcgcga acttattctg gacaaaccca
     5521 aagaaaaagc acccgaacct aaacaagatg aagagaaatc tgaattagat gaagccacta
     5581 ctgataaacc tgagggagat gaaaaattca aaactgaggg tgaatccaag aaagatttaa
     5641 ccgattccga ggaaacgaaa ttagaatcct ccgcaatgga agtagacagt aaggaagaat
     5701 ctgagcctga tgattctaag aaatcggatg aagacaataa ggacaaggac aaaatggaag
     5761 tagatgatga agttggaaaa tcagataagg aaagtaaacc agaagaacaa tcagaaacag
     5821 tgaaaactga agaaaattct aaagccatcg aggaagacaa gtcctctacc gttttaacag
     5881 cagaccacgc caaggaacct gagactgtct tggagaaaat ggaggttgat gaaaaggcaa
     5941 atgatgacca atcggctgta tcgaaagcag agggatctga cgaaaaatct actgatgact
     6001 ccaaccccga agaggctact acagaaaaga ataaggaaag tttagaaatt gaaggggaaa
     6061 aggagcgggt caaggaggga gaagaatccg ttaagaagga gaatgacgag aaaacagaag
     6121 ctgatatgga aaataagcca gagcccgtct ttatcgatgt ggaggagtac tttgtgaaat
     6181 atcgcaattt cagttatcta cattgcgagt ggcgaacaga agaggaacta ttaaagggtg
     6241 atcgccgcgt agctgctaag attcgtcgct ttcagcaaaa gcagtctcag caattgaata
     6301 tatttgagaa tatcgaagac gagcccttca accaggactt tactgaagtg gatagagtgt
     6361 tagacatgtc tgtgcataca gacgaaacca gtggagagac tacgaagcac tacctggtca
     6421 agtggaagtc gcttccctat gaggattgca cttgggagct agaagaagat gtagataacg
     6481 acaaaattga gcagtacctg cgctttaaca aaatccctca acggagcgag tggaagtcta
     6541 aaaaacgacc gcatccagaa ttgtggaaaa aactggaaaa gacgcccgtc tataagggag
     6601 gaaatagcct tagaccctat caacttgagg gtcttaattg gttgaagttt tcttggtaca
     6661 acacccacaa ctgcatactc gctgacgaaa tgggccttgg aaaaactatt caaagtttga
     6721 catttgtgca ctctgtatat gagtacggca ttagaggacc tttcttagtt atagctcctc
     6781 tatctacaat tccaaactgg cagcgagagt tcgagggctg gacggatatg aacgtagttg
     6841 tttatcacgg ctcggtgaca agtaaacaaa tgatacagga ctatgaatat tactataaga
     6901 cagaaagtgg aaaggtattg aaggagccca tcaagtttaa cgttttgatc accacttttg
     6961 aaatgatcgt gacagactac atggacttaa aagcctttaa ctggcgcctt tgtgtgattg
     7021 atgaggcaca tcgtcttaag aataggaatt gcaaactcct tgagggtctg cgacagttaa
     7081 atttggagca cagagtattg ctctccggaa ctcccctaca aaacaacatc agcgagctgt
     7141 tctcgctgtt aaactttctg gaaccctcgc agttctcctc acaggaagag ttcatgtctg
     7201 agtttggaag tcttcgcact gaagaagaag taaataagct gcaagctcta ctgaaaccaa
     7261 tgatgttacg tcgtctaaaa gacgacgtag agaaaagttt ggcgcccaag gaagaaacca
     7321 ttatcgaagt ggagctaact aacatacaaa agaaatatta tcgaggtata ctggaacaga
     7381 actttagttt cctgaaaaag gggaccacat ctgctaatat cccaaacctt atgaacacca
     7441 tgatggagtt gagaaagtgc tgcatacacc cttatctctt gaatggagcg gaggaacaaa
     7501 tccaatatga tttcaagtcc cagcatggcg aagatcctga gtcctattat aaaaatctga
     7561 ttctttccgc tggtaaaatg gttttaattg ataaattgct acctaaacta aaagcaaacg
     7621 gccatcgcgt tctaatattc agtcagatgg tgcgttgctt ggatatcctt gaagattatc
     7681 tagtgtacag aaaatacccc tttgagcgaa tcgatgggcg cattcgcggt aatctccgcc
     7741 aggaggccat cgatcgttac tccaagccag gttccgatcg ctttgtattt ctcctttgca
     7801 ccaaagctgg tggattaggt attaatttaa cagctgctga tactgttatt atttacgatt
     7861 cggattggaa tccacaaaac gatttacagg ctcaggcccg atgccatcgt attggccaga
     7921 gaaagatggt aaagatttat agattgcttt gcaggaatac ctatgagcgt gaaatgttcg
     7981 acaaagcttc aatgaaactt ggattggaca aggctgtatt gcagtcgatg aacacccaag
     8041 ggtctaaaga tggcaataac aagcagctgt ccaaaaaaga aattgaagat cttttaaaaa
     8101 agggtgctta cggggccgtc atggatgacg acaatgctgg tgacaaattt tgcgaagagg
     8161 acatcgattc aatccttaag cgacgcactc aagtcatcac aatggaatcg gagaagggtt
     8221 caaccttctc aaaggcttca tttgctgcat ccggtaaccg gtcggatatt acgatagatg
     8281 atcctgattt ttggaccaag tgggccaaaa aggttgacat tgatcccgat gcttgcgaaa
     8341 gggatgaaac ggaggacttg gttttgtccg aacccagaag acgcacccaa atcaaacgct
     8401 acggccacga ggacgtaatg gagctcaatt ccgaggaatc ttccaatgag aatagcgacg
     8461 aggagggagg aatagggctc cgttcccgtc gccgcaagga gaagcgagat cgtgctcgtg
     8521 aaaagaaggg caacgatgaa tacattccac gagaacggga tgcattagct gctttgggct
     8581 tggaggaaat tcaatatggt aactgggcca agtctgagtg ctttaaagtg gaaaagggac
     8641 tgctttcctt tggttggggc cggtggtcgg aacttttgga gttaggacaa ttcaaacgag
     8701 gttggcgtga tgtagacgtc gaggattgtg ctcgcatcat acttctttac tgcctgcagg
     8761 tgtacaaggg tgatgagaaa atcaagacgt tcatctggga cttgataaca ccgacggagg
     8821 acggggaagt gcagaagatt agcagggatc atagtggcct acacaacttg gtgccacgag
     8881 gtcgcaacgg tggaaagtca aacaaagaat ccacaggagt gtctactcct gctccgggct
     8941 ctgcatccgg aagcaatagt ggaaacacga ctccggctca caaatctagt gcgctggatg
     9001 ggtccgataa ggacagcggc ggcataaatg ccagcgcagt agccgcggcg gtcacaagca
     9061 ttcacgatcc aaatcactgg agcaagaagg agaagtacga cgctgatgcg tacttggaag
     9121 gcgcctacaa gaagcattta ggcagacacg ccaataaggt tttgctgcgc gtgaggatgc
     9181 tctattatat tcaacatgag gttattggcg actttgttca gcagataaag gacaacacgc
     9241 ctatcagcga actccccatt cgcccgccgc caacgccaga tcaagtgcca tcttcatggt
     9301 ggaatcccat ttgttgtgat aagagcctgt tggtgggaac ctacaagcat ggctgtgaaa
     9361 tgtatcgtca aatgcgcgcc gatcccaatc tgtgctttgt cacccatgtg ggcgccggtg
     9421 cggccggtga cgatctggct gtcacgagtt tgccgataaa cgaagacgat gctaattcga
     9481 agcacgatga tggtgacgag gtggacgatg acggaactac aacgaccaaa gactctgact
     9541 ctactaaact caccgggggc gataacaagg actcgcttct cgacccggag cgtcccagtt
     9601 cgtctggctt ggacgaggag gagagtgtcg ccggcagcta tccgccaacc gtggcagccg
     9661 tcgaggacgc cactactatg tggccatcta tgcaggacct taacacgcgt ctccggcggg
     9721 tgatcactgc ctatcaaagg aattacaaga aagaggaatt aaaactgcag cagaaggcca
     9781 agctccaggc attggtttca tcgacaccgc cgctgtcggt tcccaccact cccacccttc
     9841 agtctatgca aatgcagctg cagagcggct ccggcagtgg tgggtccgga ataggatcct
     9901 ctgctcagca cgatgccagt ggccggagtc ttcaggcgtt gatgcaatcg cagggggcca
     9961 gcacatccgc agcagcagcc gccgccgcat caagtgctgg aggatcctgt agtggatccg
    10021 gccaggtggg catgggcgga gcgggagtca tgccagctat gccaaccgcc gccgacataa
    10081 gcctaatgct cagcatactc aacaccacag atgtcaacca gctggccaac ttggacatca
    10141 ataaattggc catgtacctg gtcagcatga acaataaaat ggagcgacag ggaaagatgg
    10201 aactggcggc gaaggagcgc gagtcccaac ggctgcagct cattcccaaa aaatggaacc
    10261 gccgggaaga atacgaattt cttagagttc ttaccgggta tggtgtggac ctacacgtct
    10321 caacacctat ggcttcctct aatggaagtt ccctttcgcc cgactggacc aagttcaaac
    10381 agatggctca cctagagaga aaaagcgatg aaacactgac ggactattat aaggtatttg
    10441 tggccatgtg taaacgacag gcgggtttaa agctctctga atcggagcgc ggcttagaag
    10501 gtatcatcga agaaatcgaa aaggaacacg ccaagctcat actggatcgc ttggaagttc
    10561 ttgccaaatt gcgagaagtc gctcggaatc ccatgcttga ggaacgctta aaactctgca
    10621 ccaagaatgc ggatactcca gattggtggg agcccggtcg acacgacaag gaattgatta
    10681 cagccgtcct caagcacgga ctttatcgct ccgaaacctt tatctttaac gatccaaact
    10741 ttagttttgg agagtcagag aagcgtttta ttcgagaact agaagctcag attcaacgta
    10801 ctatcaaact ggaggctttc aatgctgaaa aggcagagaa gttagcggca gctgaaaaag
    10861 atgctgcaac tgaaaaagct gcggcgactg aaaaggcggc agcagaaaag gcagcttccg
    10921 tgaagaatga ggtcatagac cttgatgatg agctcatgac taatgagagt gtcataaaga
    10981 aagaatcgcc ggttactccc ataaaagaag aaattaaatc tgaagagagt cctgaaaagc
    11041 atgatactgc tgataatctc gaagataaaa attcagatgc tgaagagagt acaaagataa
    11101 agggtaaaga ggattttaat ccagaaacca cggaaaagga agcactggac gaatctacaa
    11161 atttgaataa ggataagtct agcactgaaa cattggttcc agaaattgaa gatagcaaac
    11221 ctagcgaaga taaaatggaa gttgaggaat tgcccgttgc cgaaaatggt gagaaggaag
    11281 gttcgcctga gaaatgcagt gatgctgagg ataagaaatt ttcagaagaa aagccaagtt
    11341 cccctgcagt tcctgatggt aaagtgtcag aaatggaggc agatgaagct gcccctaata
    11401 cagaagaaaa ctgttctggg gattcggaat cgcccgaaaa ggaatgtgaa gacaaggttg
    11461 ttgaaaaggt tgaggaagag aagggtgaaa actcatatga aaaggctgag gaacaaaaag
    11521 aagaaacgga aaaactggaa aaatcgagta tagaatcaga gccctctgaa aaatctgaaa
    11581 ctgtcctcga acccgaagta gctgtgccca aaaaatccac tggcgaccaa gacatactcg
    11641 acctggtggc tggtccttca gatcccgatg atgacgaggt gatgaaggaa aaagagaaag
    11701 cagtagaaga agaatgcaag aagcaggcag cagaacttaa agcccgcttt ccggacttgg
    11761 aagtgattca accggcaact gttaagcagc aaaagttgga gaagccaaag cttgaaatgt
    11821 gcatgatccg atggtttaag gattttgcgt tagaacgccg tattgcgcac attgtcgctt
    11881 gcgttgagtc aggaaattgg cctgtggata gtaaatacag tgcctttgca acctgcaagg
    11941 gggccactga tctaagcatt gctcttcacg aatccattcc tcacttgagc agtttggaac
    12001 gtcgctccac aacacccgat gtcattacca taacaacgga ccaaggtgtc acaaagcact
    12061 tgcagacctc ccacatgcag caagttgcca gctcggcgtc cgctgcatcc acttcctcgg
    12121 gagttcccca ggtcacccaa tcaaagccct cttcttccaa cagtttacct ggcttggatg
    12181 ctaaaagcat caatgctgct gtggcagcgg ccgttgcggc agctgcagca ggtggaaatg
    12241 ccacctccct atcgtctctg cttcccggaa tgtcactgag tgcagcaact ggctcttcgg
    12301 ccggtggagt gagcggattg agcgttccga gtgccgggtc tggagtggga aagaagcgca
    12361 agcgacacat cgccattgac gtggagacgg aaagggctaa gctccacgct ctgcttaaca
    12421 gttctacaat ggcgccgaaa gactgggaga gtgagattgc aaacatggag gccttgagtg
    12481 gctcctctgg tcgtggtcga ggtggcaatt cttctgcttc gagtggcatg cagccgcctc
    12541 cagctcatca acatgcttca ctctcgcgac aatcatctgg acagttcagc aagccagctg
    12601 tccccgcaat gaagactcct cctccctcaa tgggtgcacc aatggacctt tcatcaagcc
    12661 tgccgaaaat gaatatgact gagatgctta agtcggcttc cagttcggga gccatcgatt
    12721 tgagtgaggt tcaggacttc tctatgcctt ctaagaaatc cagtgtacat gctgctttaa
    12781 gttccgcttt tccctcgatg ggaaacaaga gcaaattgga cgacacgctt aataaattga
    12841 tgaagaaaaa caattgcacc attgaagagc ctgttattgg taaggagaaa aagaggaaga
    12901 agttggacga gatcgttctc ggcctatcgg ctgctaaaga gcaaaagacc ttccctgacc
    12961 catctctgcc atcatcaaag aaaccgcaga tccctccaag cgtatcggtg acgccagcaa
    13021 accttcaatc ctcgtccaac cagcagtcaa atcagaaacc ctttactatc actgttacca
    13081 cagtgcctgg aaaatccaag agcggctcgt ccaacagcgg atccggcaca ggaggaagct
    13141 cgtcggccag cggcggagga gccggcggca gcggcttgag cgccctgcag aatatggcaa
    13201 tgggaggtct gtcatccaaa gacagtctca atgccctgtt ggcacagaca atggccaccg
    13261 accctcagac gttcctcaaa caacagcaaa agatgatgca gttcctgccc cccgctcaac
    13321 ggaaggctta tgaaaacatg ctagccgaga tggaacaggc catgaagatc agttcaaagt
    13381 tttcaacaaa ctcaccacac gacgtcaagg tcaacaagtg gctttcagac atgacaagtc
    13441 cactgggtga ccagctgagc attgattacg taggaggagg cggggcaagt ggttcgggaa
    13501 gtggcaatag tcggcgctcg aaccgccagc agggaaactc acaacaatcc tcaagtgcag
    13561 cccaattgca aaagcaacag caacaacagc agcagcagca gcaacaacaa cagcaacact
    13621 cgatgcccgg gccacagaat ttgacaggag aggagcccgt accagtgatc aataaacaga
    13681 caggcaaacg cttgggaggc aataaagcgc cgcagctaaa gagacttatg caatggctta
    13741 cggagaaccc caattacgaa gtggatccaa agtggctgga gcagatgcaa aatcccatgt
    13801 caacaccgtc accaaggccc gcatcaatgg agagcggcta cggctcctcg gcagtgaaat
    13861 cacatggtgg tcgtccatta tctaatttga gtagcacctc atcatcgtcc cacacccaac
    13921 agcaatcatc tgcagctcaa tcgcaagcgg gagggaactc aggcagctca aagaaaaatt
    13981 cccgccaaca aactgcggca tcagctgcac tggatcaagc tgccttacaa tttggctcct
    14041 tagcgggtct gaatcccagt ctgttagcaa atcttcctgg actgggcgca tttgatccca
    14101 agaatcctct tgctgcattt gatcccaaaa atccgttgct atccatgtct tttggaggaa
    14161 tgcccggaat gggtaatatt cctggactag gtaacctgaa caacatgaac ctatttgcca
    14221 gtttggcagg aatgggagga ttgggtaatc ttgcagggat ggacactcag tcgctggctg
    14281 cgctcatggc tgctgccggg ccaactcttg gcggattaac tggtgcatcc ggaggagcgg
    14341 gctccggcaa gagccaggcg cagtcgcaat catccgccac ttcgtcatcc tcttcggcta
    14401 gtaaaaagaa gcagcagcaa cagcaacaac aggcacaaaa tgaagctgct caattggcag
    14461 ctgctatcag cgcaagcact ggaggatctg gcgcatcagg gggaaagaac gcagctgctt
    14521 ctgcatcaca attggcggcc ggatttccat tcctatttcc gaatccatca ttgctgtatc
    14581 cgcccatggg actaggtggc ctgaatccgt attcccttgg atctagtgga cttggttccg
    14641 catacgatca gttggctcag cagtataacc tgctaaatgg agctacttca tcggcctcaa
    14701 atacgagctc cacgcagtct aagtcgcatc agtcgcaatc aaaatccagt cagtccagga
    14761 ataccacagc atctgccaac tcagcagcaa gtctgatgaa tgctatggcc agcatgggcg
    14821 gtggcgctag cacagttacc acgccttcaa cttcagcatc aggatccgga cgtggcaggc
    14881 aatcaagcag cagaaatcag tcccaaacca cacccacagc agcagacatg gctcaactca
    14941 gtagtctact tatgccagga gcggatccgc atcttctcga gtccttgagc cgtatgagca
    15001 acatggattt ggcgcaggca acacgtctga tgagctccct gggaatgccg cctctttcag
    15061 gaacgccgag ctcttctggt ggaggtaaca catctacaag caagagaagc agccaagccg
    15121 caaatgaagc taacgttcag gccaaggaac agcaaaagtg gttagaaagt ttagcccgag
    15181 gagctctgcc aactgattta gccgcgcttc aagctttctc gcaaggcaaa atgccatcaa
    15241 catccggatc aaatacggga acatccactt cctctagtaa gagctcgaaa gcagctgcga
    15301 cagcagcggc agctcaatta ccacaaattc ctggcatgtc cagcgatttc ccacaggcct
    15361 ttttagctga aatggctgca caggcaatgg cggctgcggg gggatccctt cctcttagtg
    15421 gtccaggttc attagctagt ttggctggct taactggtgg ctctgctgga ggtggtggtt
    15481 cttctgcctc aggaagcact agccactctt cttcgaaacg gcagcgtgaa caggatgcct
    15541 ttaagcaaca gatggactat tataccaaga ccttaggtct gggatcagga atctcgttga
    15601 tacccacatc ctcggcggga ggatcatctg catcctctag tgcagcaaac gcagcagcag
    15661 ctgcgtatgc tgcagctttg gacgcagagc aacagcatca acaacagcag aaggcgctat
    15721 ccaaacgagc acgtggggat ttacacccca caaaggagga attggccgcc cttgcggcag
    15781 gcttacccct taatcttggt gccagcatga gtagcataga aaagagtcta agaggcggaa
    15841 gcagttccag tagcagctct acaccagcgc ccatgactgc ggaacaggat aaagttacat
    15901 tgactcccct taatgcctct ggaggcggat cggggtcctc ttccagttca gcagctgctg
    15961 cgggactggc agctaatctg cccagtcaaa ctacaataac catagctcct cccattagca
    16021 gtggagccag cactagcagc gagcgttccg aacgctccga gtcgcgcatt tcgctaacca
    16081 taacaaatgc agcggatgcc gcgaaactac cgccgcccta cgaagaagcc gatgagctaa
    16141 ttatacagcc catactcaaa aagcctaccg ctgccaatcc aggcagtagt catggcggaa
    16201 gcgtggacga tctggataca gctgaaaata cttcagctgc taatctgagt ggctcatcct
    16261 ccagttctgc agctgctgcg gctgcagcgg cggctgaaga gaatcgacgt tcctccaatc
    16321 gcttgaagcg tccgaggtct ggtaacgaac aaggatctgg ttccgtggaa ggacaaccgc
    16381 ctgagaagcg acgggagttg cgatccactc gtcatacacg ctcttcggca gatgccagta
    16441 cgcttaatct ttcaactgga agcgaatcgg gagcggagga acggaacgaa tgagtattgc
    16501 gcagatccga gcgccagcgc tagcagaatg acacggccca tactaagggc cggagaaaat
    16561 cgcgtagagg agaagccaat ccggccaata catgcaagtc accggagcag aaaatcaaca
    16621 acagaaagat gatgagaaac cagctgcaga tgcggatgca aacgagatcg aggcgaaatt
    16681 taaatgctag gttaactgtt accactagat cccttaagct taaagttgtg taaagtataa
    16741 gaaccatata taatctatat atagagatta ggagccaatt tggagaaaac ggatttgggt
    16801 ttgtctcaac gccataaagt tcgacactgc tgcaactccg tctgaactgg agggcagagc
    16861 gccttaccca tgacttactc aagtttcatc tgctgcttat atggtgcttg ataagtcaat
    16921 agtgtccctt cctctggctg cttaattatt tatataatta cgaacataat taaaaagcgc
    16981 ttataattta attgtaacaa gagaaacaaa gtgctgttgc acgacgagga ttttccgttt
    17041 aaattattgc taaaacgtat aattgataag cgacagaaaa atgtttaaac aaaacactat
    17101 ccatgcataa aatgtagaga cacatatact tgggtttctg gcgctgtcct ggcacatgca
    17161 gtctctatac atatataata cctgtacata attcataatc ttaaaatgac tcgtattgct
    17221 taactgtttt taatatagta aggcaccatg tattctcctc aaactaaacc attttccaca
    17281 cttccatccc caatatgttg tatactgaac tctctattct ttatgtgtgt ccccgaaagc
    17341 tgcgtgtaat tttgatcatt ttaattctgt aattagtaaa tgtgaaagga tggaatgcta
    17401 atggcgcaaa tcaatctttc aagcgtatat tttaaactct tgtatttgta attcgtatgt
    17461 tttattatac acttgtgtgt cccatgaacc accgacacca acatgcaagt cctggcgcgt
    17521 aactaacagg tgggaagtgt ttattgaaaa atagaatttt ttaattaaat caattttgtt
    17581 gtaattgaga agcccgagaa tatgaatctt gcacgcttga tgagcagcaa tgcaatattt
    17641 aggtcatttt gtataattat tcagctgagt aaatgaatta tggacaaatt tgatccgcac
    17701 tttatgagaa caatttattt tgagacgtta acaaggcaca caaaaaaaaa gcaaaacaaa
    17761 caaccacata tattgcatac ttaatttgta agtattactc gtaattaaaa ttattatata
    17821 actgcagaac aacaaacaaa agcaacatca accagaagaa gcaacaacag caagaaacac
    17881 ccaagcatca actattgata ttgatgataa ataaatatat aaaaaataaa ttttaaaaat
    17941 tac
//