Dbfetch

LOCUS       NM_001103423           11466 bp    mRNA    linear   INV 26-DEC-2023
DEFINITION  Drosophila melanogaster uncharacterized protein, transcript variant
            D (CG34417), mRNA.
ACCESSION   NM_001103423
VERSION     NM_001103423.3
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 11466)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 11466)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 11466)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 11466)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 11466)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 11466)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 11466)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 11466)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 11466)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 11466)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 11466)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 11466)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 11466)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 11466)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 11466)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 11466)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 11466)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 11466)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 11466)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 11466)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   21 (bases 1 to 11466)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 11466)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   23 (bases 1 to 11466)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NC_004354).
            
            On May 8, 2012 this sequence version replaced NM_001103423.2.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..11466
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="X"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..11466
                     /gene="CG34417"
                     /locus_tag="Dmel_CG34417"
                     /old_locus_tag="CG3950"
                     /old_locus_tag="CG3960"
                     /old_locus_tag="Dmel_CG3950"
                     /old_locus_tag="Dmel_CG3960"
                     /gene_synonym="BcDNA:GH23906; CG3950; CG3960;
                     Dmel\CG34417"
                     /map="6B2-6C1"
                     /db_xref="FLYBASE:FBgn0085446"
                     /db_xref="GeneID:31591"
     CDS             569..10720
                     /gene="CG34417"
                     /locus_tag="Dmel_CG34417"
                     /old_locus_tag="CG3950"
                     /old_locus_tag="CG3960"
                     /old_locus_tag="Dmel_CG3950"
                     /old_locus_tag="Dmel_CG3960"
                     /gene_synonym="BcDNA:GH23906; CG3950; CG3960;
                     Dmel\CG34417"
                     /note="CG34417 gene product from transcript CG34417-RD;
                     CG34417-PD"
                     /codon_start=1
                     /product="uncharacterized protein, isoform D"
                     /protein_id="NP_001096893.1"
                     /db_xref="FLYBASE:FBpp0111616"
                     /db_xref="GeneID:31591"
                     /db_xref="FLYBASE:FBgn0085446"
                     /translation="MEVECDLGDIQNEELLRKMWQQSEDSERKRQIRSHLYKLRESRL
                     CNLYRHETDPMSEPNGNGHGNVNTLAGYAGKDPLATSHGDALLDQNFQSLKSKEVRDS
                     TSPTHELRFHSMTLSQPNTTGWDVQTSSEVSPDGRAYRTETLAKTDGVEKLNGGGLAE
                     FKGRNEQRSSASHQGDDKNFVKQASDSSKTQLQETVVFGDETSGRTEMKMSSTSTSSS
                     SKVVSSSSTVEYGDEPRYLLDNQEKPHKQHQTREWEQDQRRQEQRIQEQRQEQQIRQE
                     QLQRQEEFQRQEQIQRQEQIQRQEQVQRQQERFTESREQSNSTRNVQTQQHYEENKRY
                     VDMDKASPEYQRHVQHLMEQPGEIISNTVEYPKPNVKMITTVKRLPDGTIVKNKRYET
                     EQLTPSQSHTTHKQTNNQTHNQTRNQVHNQRREQDEQDSQTRDVVDNVEQKHVTESQS
                     FSSVKKSSRRFSTETTSETVEEYDDRDQPTSRVVQKAPTPSYAPPSHSPRQSPAKDFS
                     THGFPSVRPNKATQEYPSQRPGIAGEEVVVVRSEKSRQVKQQTSSQRTIETEVVGDDY
                     QEPQRSPQKLREAPTPSWEQPATRRQPVEEDFSTHGFPSVRTTTTSTRPDQPDGEVLH
                     TSKTVSRNQSANRKTNTERIIETQVEHPNAPSHSTPRSGSPRRSQPRDDYASSPRGPA
                     GKPSQSRPSNTGGSTTTKTTTTSEPLTRRQLQKEREVDAAHRAFAASLRSSSPADSTT
                     SVGSHHQTPRSSVSSNRTFRREMREGSHDSQAPSESSRISSTTVTRHTTGGNVTSNTI
                     KTKTTKPVKTASEPRSPTPKAGGSVTTTTTTITTKSSPSPAPASPTTAAPPTSAPASS
                     APPDNKLSQYTYTTTKPGDIFSLPPTTPPTINNEPTLTTTRNTTTTTTTTTTATSDNL
                     QNHPKKSAIEPATDTDSQPIRKVKLSANEAKVVEEAPCVRRQYYQLGNEGENPETPES
                     SGTPANKKPHQMRRPHDEPEPQLRRSSKSPSVEPRQVQRETTFEGRRVSQDREISIDE
                     LILIEETSGAPGSPKIPSPRAQSPGKPATRSQSPEKPQPRATPRQSPEKEQPAFKQTH
                     EVYTPSQPEKQFPRARSPEKTPGWTQPQVSPRQSPEKQLPRAQSPEKVPAVRQPSVSP
                     RQSPEKQIPDPKTRDQGPGLPRISPRQSPEKQLPKDVPQKSRQSPEKDLTNQQRREEE
                     IFRSTITTTQKRTTNNLNEEFITNERDNQNQPISEKKPQIPANAEPNTKPSETIESPD
                     GGFPSKTTEVEAQPEVKESPTYRKKGLTRRETFEDRCRQILGMEEDGDTQGTYTERPN
                     NEQEDVNVSHTTIETIQVKIEDCPNDDEDDKPRRVTETYVVRTQPKIKVEEELFVDVT
                     EAEDVEILVNPSKKSPKEEDSPKYPKGPETPKSPRNDQRIPSIPKKGQSPVQFKTEET
                     PRYPQEQERYPKEPETSQYPKESPRNPKEDAETININEETTIVITKEGSKSPSPRWSP
                     SPERRVPKSQQPPSPTASPSVSPVSGRKIPNEVESNFVTEKIIDCRGKTVVEKISQRP
                     RTPSPTTPKKNTKPSQKIPERVPETESEPEKDSESETKKTTSVSVTKTETERRNSRTT
                     KTKQPLPLKEPQSKVPAGKSPRKDSLTGRKRDSLVEETRITTTTTTTRQGRKPSDTNG
                     SPSIKDRLRSSPRKQKTSPQQTRTPTPAQTRNPEDDVDGDSSSPDASPTRVGNERRRS
                     SNISVHTEIIIDHMAPKSPKTERRSQGGTGNVPSPIRKLPVTERKESAPVPRVTRRDK
                     AEKVTRSTSENIIKMSGTKPHPEMSSLKPGGDRSRPSKCCTTKTINLSEQRINTATDI
                     EGVIIDIQQAKSSREPSPDRIVPTPVPAELETGKPRYPDVVQEPDDEPRRKPQVTNIP
                     IFEEESQTYVGCQISELHSSNGIEVDILDNPTVEAPKSLDYPVNTPDTDESLLSVHEK
                     VSRFTHSAEKVKEPKVSAPFSREFDVNAKIPENDDCLLSINQKVDKFLRTAENVIRPT
                     SLPSRPEIERPGLEEIDEELLRDDCTLSVSQKVHKFIDTAEKLAPTMPQKSPRLVANI
                     ERHISRQSEPERELDEESEPELDRDTDVEDDDQTSQLETEEEITQTVTKKETLKEFKQ
                     QTKETRETRRDSKAEPEKLQKKSPQTKVKEESARVPKYQAKVSQKVSQWEPKKQPQRE
                     PKVTQKETPLEPKKQPLSKVKDEPEKVNKREPKVPQKESQTKLKEEPERVTKKTPQKE
                     PRKEPLRQSEDEPEFSPEEEFDDEPLPMTKTHTTAIEMKRQKDILNRPSVFGQRTPER
                     KSSTTPSPTKLNGTRGRPSPSTNLITEEKRSYRNQVTNVSKPGTRKTTPSANSPAQSP
                     PPKTTSISKRMEQISQQSWVVQDVDVDVEVVGPAPPSHISEKPQGKSPSPTSSRSLSR
                     SPSRSPSRRTSTNLNTTSTNTTTTTEHPSTIKTTTPKPTSTNPSKDEPEIIPIESLTE
                     KSITTTYTTNTTGRNVASRRNVFEPVHETHVDSEPTGRRPSYMDHTKSSLEHIRRDSL
                     EINKSHYSRKSSMEDDSPVEPRNPNSSVKFDVPRKSSSRGADEPRKTSLKGKDEDSDL
                     ELEIEEIFDLQRLEKLLETVASYEMRRRIRAQMRLIRKNMINAGTTTTITTITTSTTP
                     GKSSPLPKIRRDQSPAGAAEVKTKEVRTTTSRRQQQQRVEQVDSTTPIAPGKTSPHGK
                     PPVKPRERSASPAQKRRISPPGKQSPGDRSTTTTTKVTTTSTTRGAPSKPAQGPIWAD
                     RSKVLKGHATVPQTNGSTPRKGSTSSTTSSSGKITRTMTSSSTTTSSSSTTNTRNKQR
                     EEDSITSSYGVGPTDENGLPLFGIRALKKKATPPAEEPCETKQEVTGYVIEEQFYSDN
                     KSPPRHERKELIYSSNADELAAIKQQLQDEDDSSPPLLDARVVREFKKVESQQSLPED
                     ARYVRRGSVKELSEKFIRKESSSSTHSTHSSIAQSLVRHEDETEDDSESNDVCSVIEA
                     PQMRQNQSHTMSTTRSSNTRSFLNSSADQRQVTSVDDVLERMRNADNVEEPGDSSEDR
                     EARALLNKFLGASVIMQGVESMLPPTATGQRLNSQGVKTTRITNTYSKSGNSTSTTNN
                     TSNTTNKVSSSSAPVTRTTCDIEEIWDEQVLKQLLEQASTYEERRKIRARLRELMAER
                     EAAQHKSSSSSEQRTETKSKDGGATVTTTTTKVTTRTVSGSAASKNISPLAKFKQLDK
                     QAAAQQAQKSSPTTSTPTTPGGSAQPLFKFTDPALNARAATVKDQLLQWCKHKTQEYE
                     NVQINNFSSSWSDGLAFCALIHHFLPDAFDYTTLTKQTRRHNFELAFSVADEKAGIAP
                     LLDVEDMVEMSRPDWKCVFVYVQSIYRRFRNCQ"
ORIGIN      
        1 ccaaccgcat cattcgccac gcccctcagc cctccgcctc ctttggcttg ccgcacaaac
       61 ccaacatcaa aactcactcg ctgttgcatt cgaaaaataa aaaaaaaatt tttaaagtgt
      121 ttacgcgttt aatttttttt tttgactgtt ttgttgcagg acactgcaac acttcgtgct
      181 ttattttttt gtgaaataat tagtatttgt tttaaggtga aacccgactc cttcgcgtta
      241 ttagtcgcgt gtgacagcag caacatccag caaataaacc gcaaatcaat caaacaatcg
      301 gcagagttga aataaaacaa aaaaaaaaca aagtagcgcc cgcgttttta accgcgcgca
      361 agtcgctggc aacttttcgc cggaaaggca ggcgcaactt gatctttccg taaattaaac
      421 tatttactgt ttaaatattt gtcaagtaca caaaaataca tacgttgctg ctcgttgttg
      481 cgtcgtctaa agttacaaaa gccttcaagt cgacgtcttt ctctgataat aagctcaaaa
      541 aacagcagca gcaacagtca ttcctgccat ggaggtggaa tgcgatctgg gtgatataca
      601 aaatgaggaa ttgctgagaa aaatgtggca gcagtccgag gacagtgagc gcaagaggca
      661 gatccgatcg catctctata agctgcgcga gtcgcgactt tgcaacctct atagacacga
      721 aacggaccca atgtcggagc ccaacggaaa cggacatggc aacgtgaaca cgctggccgg
      781 ttacgcgggc aaggatccgt tggccaccag ccacggcgat gccctgctcg accagaattt
      841 ccagagcctc aagtctaagg aggtgcgcga ttcgactagt cccacgcatg agctgcgatt
      901 ccattcaatg acgctcagcc agccgaacac caccggttgg gatgtgcaga cctcctcgga
      961 ggtcagtccc gatggacggg cctatcgtac cgaaactctg gccaaaacag atggcgtgga
     1021 gaagctcaat ggtggtggcc tggccgagtt taaaggtcgc aacgagcagc gctcgagtgc
     1081 ctcccatcag ggcgatgaca agaactttgt gaaacaggcc tccgatagct cgaagaccca
     1141 gctccaagag acggtggtct ttggtgacga gaccagtggt cgcaccgaga tgaagatgag
     1201 ctccacctcc acctcgtcct catccaaggt ggtgtcttct tcgtccaccg tggagtatgg
     1261 cgatgagccg cgttatctgc tggataacca ggagaagcca cataagcagc atcagacccg
     1321 cgagtgggag caggatcaga ggcgtcaaga gcagcgcatt caggagcagc gccaagagca
     1381 acagattcgc caagagcaac tgcagcgcca ggaggaattc cagcggcagg agcaaatcca
     1441 gcgtcaggag cagatccaac gccaggagca ggtccaacgg cagcaggagc gattcacgga
     1501 gagccgcgag caaagcaaca gcacccggaa tgtccagacc caacagcatt acgaggagaa
     1561 caagcgctat gtggacatgg acaaggcatc gccggagtat caacgccatg tccaacatct
     1621 aatggagcaa cccggcgaga taatctctaa tacggtggag tatcccaagc ctaatgtcaa
     1681 gatgatcact actgttaagc gcctgccgga cggaaccatt gtcaaaaaca agcgctacga
     1741 gacggagcaa cttactccca gtcagagtca tacaacgcac aagcagacca acaaccagac
     1801 acacaatcaa acccgtaacc aagtccacaa tcagcgccgc gagcaggatg agcaggatag
     1861 ccaaactcgt gatgtagtgg acaatgtgga gcaaaagcat gtgacggagt cgcagagctt
     1921 ctcctcggtg aagaagtcca gtcgacgttt ctccacggaa accacctcgg agaccgtgga
     1981 ggagtacgat gaccgtgatc agcctacatc cagggtggtg cagaaggcac cgacaccctc
     2041 ttacgcccca ccatcccatt caccacgtca gtcgccggcc aaggatttta gcacccatgg
     2101 tttcccctcg gtgcgtccga ataaggccac acaggaatat ccctcgcaaa gacccggcat
     2161 tgctggcgag gaggtggtgg tggtacgttc ggagaagagc cgccaagtga agcaacagac
     2221 cagctcacag cgcaccatcg aaacggaagt tgtgggcgat gattaccaag agccgcagag
     2281 gtctccacag aagctgaggg aggcaccgac acccagctgg gagcaacccg ccaccagacg
     2341 tcagccggtg gaggaggatt ttagcactca tggttttcct tcggtgcgta caaccacaac
     2401 aagcactcgt cccgatcaac ccgatggtga ggtgttgcac acgtccaaga cggtgagtcg
     2461 caatcagagc gccaatcgca agacgaacac ggaacggatc attgagacac aggttgagca
     2521 tcccaatgcc ccatcgcact cgactccgcg atccggaagt ccacgtcgat ctcagccacg
     2581 agacgattac gccagttcgc cacgcggacc ggctggcaaa ccgagtcaat cgcggcccag
     2641 caacactgga ggtagcacca ccaccaagac aaccaccact tcggagccgt tgacgcggcg
     2701 tcagctgcaa aaggagcgcg aagtggacgc cgcccatcgg gcatttgccg cctcgctgcg
     2761 cagcagttcg ccggcggaca gcaccacctc ggtgggttca caccatcaga cgccacgatc
     2821 gagcgtttcc tcgaatcgca ccttccgtcg ggaaatgcgt gagggttccc atgacagcca
     2881 ggcgccatcg gaatcgagca ggatcagttc gaccactgtg actaggcaca ccaccggtgg
     2941 taatgttacc agcaacacca tcaagaccaa gaccaccaag ccggtgaaaa ctgccagcga
     3001 gccaaggtcg cccaccccaa aggcaggagg aagtgtgacc accaccacga ctaccattac
     3061 caccaagtcc agtcccagtc cagcaccagc ctcacccaca accgctgcac cacccacctc
     3121 agcaccagcc agttccgcac caccagataa caagctatca cagtatacgt atactaccac
     3181 taagcccggc gatatattct ctctgccacc aactactcct cctactatta acaacgaacc
     3241 aacactaacc accacccgca atacaacaac aacaacaacc accaccacca ccgccacctc
     3301 cgacaatctc cagaatcacc cgaaaaaatc tgcaatcgaa cctgctaccg ataccgactc
     3361 ccagccgatc cgcaaggtga agctgagcgc taatgaggca aaggtggtgg aagaggcacc
     3421 ctgtgtgcgg cgtcaatatt accagctggg aaatgagggg gaaaaccccg agacgcccga
     3481 gagctcaggg acccctgcca acaagaagcc ccaccaaatg aggaggcccc acgacgaacc
     3541 ggagccacag ctgaggcgta gctccaagtc gcccagtgtg gagccacgtc aggtgcagcg
     3601 ggaaaccact tttgagggtc gtcgcgtctc ccaggatcgg gagatctcga ttgacgagct
     3661 aattcttatc gaagagacga gcggagctcc tggttcgccg aaaattccat cacctagggc
     3721 tcaaagtcct ggaaaaccag cgactaggtc ccaaagtccc gagaaaccac agcctagggc
     3781 aacacccaga caaagtcctg agaaggagca gccggccttt aagcaaacac atgaggtcta
     3841 cacgccaagc caacctgaaa aacagtttcc ccgtgctcgc agtcccgaga agactcctgg
     3901 atggacccaa ccccaagtct cacctcgtca aagtcccgaa aagcaattgc cccgtgccca
     3961 gagccccgaa aaggttcctg cagtaaggca acctagtgtt tctccccgtc aaagtcccga
     4021 gaagcaaata cccgacccaa agactcgcga tcaaggtcct ggtcttcctc gtatttcacc
     4081 tcgccaaagt cctgagaaac agttgcccaa ggatgttccc cagaagagcc gccaaagccc
     4141 ggagaaggat ctaaccaacc agcagagacg ggaggaggaa atattccgca gcaccattac
     4201 cactacacag aaacgaacca caaacaatct aaacgaagaa tttataacga acgaacgaga
     4261 taaccagaat cagcccattt ccgaaaagaa accacagatc ccagctaatg ctgaaccgaa
     4321 caccaagccg tctgagacga ttgaaagtcc agatggtgga tttccttcca agaccactga
     4381 agttgaggct caacccgaag tcaaggagtc acccacctac cgcaaaaagg gcttgactcg
     4441 tcgcgagacc ttcgaggatc gctgtcgcca gattttgggt atggaggaag atggcgatac
     4501 tcagggaact tatactgagc gaccgaataa cgaacaggaa gatgtaaatg tttcccacac
     4561 aaccatagaa accattcaag ttaagatcga ggattgcccc aatgatgacg aagatgataa
     4621 gccgcgtcgt gtgacggaga catatgtggt acgcacgcaa ccgaagatta aggtggaaga
     4681 ggagctcttt gtggatgtga ccgaagcaga ggatgtggag attctggtta acccatcgaa
     4741 aaagtcaccc aaggaagagg atagcccaaa atatccaaaa ggaccagaaa ctcccaaatc
     4801 tcccagaaat gatcaaagga ttcccagcat tccaaagaag ggacagagtc ctgttcaatt
     4861 caagacagaa gaaactccaa gatatcctca ggaacaagag aggtatccaa aagaaccaga
     4921 aacttctcaa tatcccaagg agagtcctcg caatccaaaa gaggatgccg agaccattaa
     4981 tataaatgaa gagaccacca ttgttattac caaggaggga tcaaagtctc cctctcccag
     5041 gtggtctcct tctcctgaac gtagagttcc taagagtcaa cagcctcctt ctcctactgc
     5101 atcgccatca gtatctcccg tttctggtag gaagataccc aacgaagtgg agtcaaactt
     5161 tgtgaccgaa aagatcatcg attgccgggg aaaaacagtt gtagagaaaa tcagccagag
     5221 accgcgtact ccaagtccaa caactcccaa aaagaatacg aagccatcgc aaaagattcc
     5281 agagagagta cccgaaactg aatcggagcc ggaaaaggat tcagaatcgg agaccaagaa
     5341 gacgaccagt gtaagtgtca caaaaactga aaccgaacgt cgcaattcac gcaccactaa
     5401 aaccaagcag ccactgccgc tcaaagaacc acaatcaaag gtcccagccg ggaagagtcc
     5461 tcgcaaggat tccctgacgg gtcgcaagcg ggatagtctg gtggaggaga cacgtatcac
     5521 aacaacgacc acaaccactc ggcagggtcg aaagcccagc gatacgaatg gctcgccttc
     5581 gattaaggat cgtcttcgct cttcgccgcg caagcaaaaa acttctccac aacagaccag
     5641 gacgcccacg ccggcacaaa cgcgtaatcc agaagatgac gtagacggag actcctcttc
     5701 cccagatgcc agtcccacca gggtaggtaa cgaacgtcgc cgatctagca atatttccgt
     5761 acacacggag atcatcatag atcacatggc gcccaagtca ccgaaaacgg agagacgatc
     5821 ccaaggtggc accggaaatg tacccagtcc aattagaaag cttccggtaa cggagcgcaa
     5881 ggagtcagca cctgtgcccc gagtgactcg acgcgataaa gccgagaagg tcacgcgctc
     5941 cacaagcgaa aatatcatta agatgagcgg tactaagcct catcctgaga tgagtagcct
     6001 taagccaggt ggagatagga gccgacccag caagtgctgc accaccaaga cgatcaatct
     6061 gagcgagcag cgcatcaaca cggccactga tatagagggt gtaatcatcg atatccagca
     6121 ggcgaagagc tctagggaac catctccgga taggattgtg ccaactcctg tgcccgctga
     6181 actggaaacg ggaaagcccc gttatccgga tgttgtacag gagcccgatg atgagccgcg
     6241 tcgcaagcca caggtcacaa atattccgat tttcgaggaa gaatctcaaa cctatgtcgg
     6301 gtgtcaaatt tccgaattgc atagctcaaa tggcatagag gttgatattc tggataaccc
     6361 cactgtggag gctcccaaga gcctagacta tcccgtaaat actcccgata cggacgaaag
     6421 tctattgagt gtgcacgaga aggtctctcg attcactcac tcagccgaaa aggttaagga
     6481 gccaaaggtt tccgcgccat ttagcaggga atttgatgtg aacgccaaga ttccggaaaa
     6541 tgacgattgt ctgcttagca tcaatcagaa agtagacaag ttcttgcgca ctgccgagaa
     6601 tgttatcagg cctacgtcac tcccttctcg acctgagatc gagcgtccag gattggagga
     6661 gatagatgag gagcttctgc gcgacgattg tactctgagc gtctcccaga aagtacacaa
     6721 gttcatcgac acagccgaga agttagcgcc cactatgcca cagaagtcac cacgtctagt
     6781 agccaatatt gagcgtcata tttcccgaca gagcgagcca gaacgcgaat tggatgaaga
     6841 gtctgaacca gagttagatc gggacacaga tgtggaggac gacgatcaga ctagccaact
     6901 cgagacggag gaagagatca cccaaactgt aactaagaag gaaaccctaa aggaattcaa
     6961 acaacaaact aaggaaacca gagagacacg tcgggattca aaggccgaac cggaaaagct
     7021 acagaagaaa tcacctcaaa ccaaggtcaa agaagaatct gcaagggtgc ccaagtatca
     7081 ggcaaaggtt tcacaaaaag tgtcacaatg ggagcctaaa aagcaacctc aaagggagcc
     7141 aaaggtaact caaaaggaaa ccccattgga acccaagaaa caaccacttt cgaaagttaa
     7201 agatgaaccc gaaaaggtga acaaaaggga accaaaggtg ccccaaaagg aatctcaaac
     7261 caagctcaaa gaggaaccag aaagggtgac caaaaagaca ccccaaaagg aaccacgcaa
     7321 ggaaccactg agacaatccg aagatgagcc cgaattctcc cccgaagaag aattcgatga
     7381 tgaacctctg ccaatgacaa agacacacac cacagctatt gaaatgaagc gtcaaaagga
     7441 tatccttaac cgcccgtctg tatttggcca gcgcacgcca gagcgcaagt ccagcactac
     7501 gccatcgcca actaaattaa atggaactcg tggtcgacct agtcccagca ccaacttgat
     7561 taccgaagag aagagatcgt acagaaatca ggtgaccaat gtgagcaagc cgggaaccag
     7621 aaaaaccact ccctcggcca attccccagc gcaatcgcca ccacccaaga ccaccagcat
     7681 ttccaaacga atggagcaaa tcagtcagca gtcatgggtt gttcaagatg ttgatgttga
     7741 cgtagaggtg gtgggacctg cgccaccttc ccacatcagc gagaagccgc aaggaaagag
     7801 cccatcccca acgtcatcgc gatcgctatc gcgatctccc tcccgatcgc ccagtagacg
     7861 cacctccacc aatttgaaca cgacctccac caacaccacc accaccactg agcacccgag
     7921 caccatcaag acaactaccc caaaaccgac ttcaactaac ccatccaaag acgaacccga
     7981 aatcataccc attgaatccc ttacagaaaa gagcatcacc accacttaca cgacgaacac
     8041 cactggacga aatgtggcta gccgcaggaa tgtgtttgaa cctgttcacg aaacacacgt
     8101 ggattccgag ccaactggtc gccgtccctc gtacatggat catacaaaga gttcccttga
     8161 gcacattcga cgcgattcgc tggagatcaa taagagtcac tattccagaa agtcctccat
     8221 ggaggatgac tctcctgtgg agccacgcaa tcccaactca tccgtgaaat tcgatgtgcc
     8281 aagaaagtct tcgtcaagag gtgcagatga gccaagaaag acatcgctga agggcaagga
     8341 tgaggactcc gatttggagc tggagatcga ggaaattttc gatttgcagc gactggagaa
     8401 gcttttggaa acagttgcca gctacgaaat gcgtcgccgc attcgtgcac aaatgcgtct
     8461 tatacgcaag aatatgatca atgcgggtac aactaccacc atcacgacca taaccacgag
     8521 cacaactccg ggcaagagtt caccactacc caagatcagg cgagatcaaa gtcctgcggg
     8581 cgctgctgaa gtaaagacca aggaagtgcg caccaccaca agtcgccgac agcaacaaca
     8641 gagagtggaa caggtggaca gcacaacacc aatagctccg ggaaaaacat cacctcatgg
     8701 taagccgcca gtaaagccga gggaaaggag tgccagtccc gcgcagaagc gccgcattag
     8761 tccgccggga aagcaatcgc ccggagacag gagcaccacc acaaccacta aggtcaccac
     8821 cacaagcacc acccgtggtg ctcccagcaa gccagctcaa ggacccattt gggcagatcg
     8881 ctcgaaagtg ttgaagggcc atgccaccgt tccacaaacg aatggaagca ctccccggaa
     8941 aggatccacc tctagtacca cttcgagcag tggcaagatc actcgcacaa tgaccagttc
     9001 cagcaccaca acgagctcca gcagcactac caacacacgc aacaagcagc gcgaagagga
     9061 ctcgatcacc tccagttatg gtgtgggtcc aacggacgag aacggactgc cgctcttcgg
     9121 aattcgggcg ctcaagaaga aggcgacgcc accggcagag gaaccctgtg agaccaagca
     9181 agaagtcaca ggttatgtga tcgaggagca gttctactca gataataagt cgccacctcg
     9241 tcacgagcgc aaggaactca tctactcaag caatgcggac gaactggccg ccataaaaca
     9301 gcagcttcag gatgaggatg atagctcacc gccgctattg gatgcccgcg tggtgcgtga
     9361 gttcaagaag gtggagtcac agcaatcgct gcccgaggat gctcgttatg tacgccgggg
     9421 atcggtgaag gagcttagcg agaagttcat tcgcaaggaa tcctcctcat ccacacattc
     9481 gacccactct tcaatcgccc aatcgctggt gaggcatgag gatgagaccg aggatgatag
     9541 cgaatccaat gacgtttgca gcgtgatcga ggcaccacag atgcgtcaga atcagagtca
     9601 caccatgagc accactcgat ccagcaacac tcgttcattc cttaacagca gtgccgatca
     9661 gcgacaggtg accagtgtgg acgatgtcct cgagcgcatg cgaaatgcgg ataatgtcga
     9721 ggagcccgga gattctagcg aggatcgcga ggcccgtgct ctgctcaaca aattcctggg
     9781 agccagtgtc attatgcagg gcgtggagag catgttgcca cccaccgcca cgggtcagcg
     9841 tctaaatagt caaggcgtga agaccacgcg gataaccaac acctacagca agtccggcaa
     9901 tagcaccagc accaccaata acaccagcaa caccaccaac aaggtcagca gctcctcagc
     9961 accagttaca cgtacaactt gtgacattga ggagatttgg gacgagcaag tgctcaagca
    10021 attgctggaa caggcgtcca cctacgagga gcgtcgcaag atccgtgcac gtctacgcga
    10081 acttatggcg gaacgcgaag cagcccagca caagagctcc tcgtccagcg aacagcgaac
    10141 ggaaaccaag tccaaggatg gcggtgccac tgtgaccaca acgacaacaa aggttaccac
    10201 acgcactgtc agcggaagtg ctgcctccaa aaacatctca cccttggcta aattcaagca
    10261 gctggacaaa caggcagccg cccagcaggc gcaaaaatca tctccaacaa caagcacacc
    10321 cacgaccccc ggcggttcgg cacaaccgtt gttcaaattt accgatccgg cattaaatgc
    10381 gcgcgccgcc actgtcaagg atcaacttct gcagtggtgt aagcataaaa cgcaggaata
    10441 tgagaacgtc caaataaaca actttagctc gagctggtca gacggcctgg ccttttgtgc
    10501 gctgatacac catttcttgc cagatgcatt tgattacacc acactaacca agcagacgcg
    10561 acgacacaac tttgagctgg ccttcagcgt tgctgatgag aaggctggca ttgcaccact
    10621 gctggatgtg gaggacatgg tggagatgag tcgtcctgac tggaaatgcg tattcgtcta
    10681 cgtgcagagc atatatcgcc gattccggaa ttgtcagtga gtagaattaa agcaaccatc
    10741 gaaatcagcg gcagcggaag cagcagcagc agcagcagca gaacagtcca aatccagatc
    10801 caaatccaag tcaactcaac ggcaacagca gaaactcatt aaattttaag cacacaccat
    10861 atacacatta tatacacatc ccctttggca ttttgtgttt agacacaccc catgaatcca
    10921 gcataaacca tttgaatcca caataatcta taataatcct acgcctatgt ggaagccact
    10981 caagaatccg tcatcctgcg aatatctatc taattgtata atcggcatta tatgtattgt
    11041 atattcttca actgaagcgc aaatggctcg tcgaacgtga acagccgact ctttgtacgt
    11101 ttttttgtgt ttgttttacc ccttttttac tattatatta cgattacgca ttattatttg
    11161 ttattatcga acttgaagca gaatctatgc aaataattgt tgaattatat atattttttt
    11221 tgtcataatt ttaattgcgg agaaagaaga tgcagagcag ttactaaccc ctgtccggcg
    11281 gtcatcgaca atggatttag tatataagat acgtcccgat gccgtcgctc acattgatga
    11341 actactgtaa taatacacat atattctatg tacgtacgtg taacgagaac gaaacattta
    11401 ctaaccgatg ccaaaataaa atggccgcaa agaaataaaa aaaaaacaaa caaacaaatg
    11461 aaaccc
//