Dbfetch

LOCUS       NM_001300341            9931 bp    mRNA    linear   INV 26-DEC-2023
DEFINITION  Drosophila melanogaster uncharacterized protein, transcript variant
            E (CG42795), mRNA.
ACCESSION   NM_001300341
VERSION     NM_001300341.1
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 9931)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 9931)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 9931)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 9931)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 9931)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 9931)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 9931)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 9931)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 9931)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 9931)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 9931)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 9931)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 9931)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 9931)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 9931)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 9931)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 9931)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 9931)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 9931)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 9931)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-OCT-2015) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   21 (bases 1 to 9931)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 9931)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   23 (bases 1 to 9931)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   24 (bases 1 to 9931)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NT_033777).
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..9931
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="3R"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..9931
                     /gene="CG42795"
                     /locus_tag="Dmel_CG42795"
                     /old_locus_tag="Dmel_CG34108"
                     /old_locus_tag="Dmel_CG3996"
                     /gene_synonym="CG12816; CG34108; CG3996; Dmel\CG42795"
                     /map="86A1-86A2"
                     /db_xref="FLYBASE:FBgn0261928"
                     /db_xref="GeneID:41252"
     CDS             65..9637
                     /gene="CG42795"
                     /locus_tag="Dmel_CG42795"
                     /old_locus_tag="Dmel_CG34108"
                     /old_locus_tag="Dmel_CG3996"
                     /gene_synonym="CG12816; CG34108; CG3996; Dmel\CG42795"
                     /note="CG42795 gene product from transcript CG42795-RE;
                     CG42795-PE"
                     /codon_start=1
                     /product="uncharacterized protein, isoform E"
                     /protein_id="NP_001287270.1"
                     /db_xref="FLYBASE:FBpp0311379"
                     /db_xref="GeneID:41252"
                     /db_xref="FLYBASE:FBgn0261928"
                     /translation="MSETKKLLDALLCDIYGRQEVLAKRIRRCCRDPVHKRSPKKKKG
                     QLAQLASGMVGDRSSLERLDVRKLIDVCAILKVEILMLGYLLERVLVARDRLQRHQEV
                     LCEFVTAVLIVESDDAQPKMRFSLSPPPQKAITSARLTSPTKNSLSSNNTITTPGSNN
                     RSPSSTKTTMLTRNGGGHGTSPTASGSGHAPSATAAADDEATSDYNQWLHAMKLVARL
                     PGGTPPEFRRKLWLSLADKYLKSKNVDWAQQREKCFCEEWREDDEELGIQIVKDLHRT
                     GSNLCTGPAGSINQAKLKRILLGYARYNPEVGYCQGFNMLGALILQVMDKEEEESMKV
                     MIYLVEGVLPTGYFYGSMGGLQADMGVFRELMQTRLPRLAKHLQRLQGPVENAFEPPL
                     TNVFTMQWFLTMFCTCLPMSCVLRVWDLVLIEGSDVLLRTALVLWSLLEERVISVRSA
                     DEFYGKMGSYSSELLNGHLVDSNGLIERVVKLGPIEDLRQLRDKHLYNIAPLRHKQGL
                     QLYYDEEDTHSDEERLAVATVFGLNWGRRGSVGPAAAGKQQVEQKDRLALDISLLKKQ
                     YDRLRERQKQAHVILTTACSTAARQGSGPASSSQPAVPVNQLLLGRPAIVTNKGKRVG
                     APLGAIPPARKPSLPAVLHTKPTSEKQLRRGETLLWRDTDPSRRRRDSLTWKEIKADR
                     AAMIREGVDVSSVRTQKLRTRFGKSDSSSYSEDSDGEQESGTGGGGGSSTDTSLCDDD
                     DPKSTEKSPKQKAKLARKLKEQKQLAGSRETSLERQRPKSWAPSSHEIPFMLMGTDSG
                     DEKEDKSTKEGPITGDQAEDSATESGHYEFDRELHLVSSKMEPLKLPFDPEFTGMTSV
                     SPIPTPREKSEAEEDLLDERKPFDVSDDGVTNQYFERVNSVERPNRLELTYSLNEEET
                     DTNAIYLEEREKVEGHSGDREYNSLPPFYPRENDDGVQGGGKVPQIRDDNIPGENKDD
                     YKELLSMTIEENTVYKPPTPTASTLSNASRKRRDPRRKTLTRSSTIEIEERYQALERR
                     ISQDQPSGDRQAKYIPSTAALEERFNTLEKQLSAEKQRKELSEMEAEYPIKSERIPST
                     ADLESRFNSLTKQMSSSESSSKTPIDLKDEDRPSGSSSKNQKDSEKTSKLHKSEEPES
                     NTKETTGETEASDSNDSKIGEKETEQPRIKKLPSTAELEDRFNALERKMSVQKSSPSK
                     NKKEPPDEEESKSTKEPEEPEESEKANEKTSGRQTPIAKKDSKDSDQKKSETKENQSP
                     TKNQDEKVKVKSPKSEEMIEKETSSNPKEDSHESEAATNKKVEGNRELSSEKGDHKIK
                     EKSEEAPGKAGKETAETKNANVKDSSKKGDSQKNEAAKTSVSQTESDLKPSSKENSTS
                     KDAEQEKTPRKSPPSTEELEKRFNALEKQMSTTNLETTKEPDQTKPATKSQSTSAEVK
                     TQKSMKSFDDKIKEVNVAIEKEQSRVEVEVNAEKKRKNVEEAPKNKEGDSQQPEESQH
                     KGKNQRRASEPPSTEDLEKRYETLKRRMSSKNQFSETVDEALERIQQEVISEAVEEKK
                     PPPSTEDLESRFEALHGDKKNVESKMDETKHVDVAIEAHIPSPPPPPPPPKERPVLAE
                     PVLHQQQALIEELQSKMRGQSPGEENLKPSEINPQRRQKKLLQRPTPMGDETSEAPAN
                     TAYYRAANHEQWQQRMVRRFSDLPSRADLENRLQFLERQLYKKFYKQRCASDSEVASR
                     VKLPPEDQPSTSRQARKQEAEGQLEQRVLALEKQLSENSLKLLEAMRERHRSADDSGS
                     PRRLSTETIDATGKELVRYTQNIGELEEVDAHKPINISINIKMMVNKDSESKQPKGES
                     KPTTEDLTRRLEQLEQQLLEERAKNGSIPPENEVLEEKPEKLEEKDSCKKQEKNCHNQ
                     HVKGDEVEKTEIPADRKIEPASAKETKTLENVEKAQTRAKVVDTEKSVKDQNAVTDEK
                     SVQDQNVVVDKKADRKILDKKDKSPAAGKSEDTKQTSGKKEKSEDIKQASEAPKAGAS
                     KETSTRGKPSETKLEKPTTKESVLKETFPKKENLESEKPKSKENEATKTETQKSKETP
                     TVAVSPKESKVSSKQMTEKKETIKDSSSKELPEKMVINSTDVGPMDPNGKTVVLLMDN
                     EHRASKVRRLTRANTEELEDLFQALEKQLNDRNLVKSEDGRLIRVDPKPSAEQVEQTQ
                     AISDLTKEIEDFTSAKPEEENPKEAAKEDKPEPEEPEDFDWGPNTVKHHLKRKTVYLP
                     STKELESRFRSLERQIKLLEDVEKIDVEQRLNEIERKIKLQYSLSHEKDLNKYLELCE
                     GKGLDDDEPVPVETPTKEAEITTARDRSRSPGRKALATKSPYTSPSRKATIKTPHTSP
                     TRKPIIKSPYTSPSRKSAKSPYTSPSRNRQRSPSPTRSPERKSKKSPYTSPARRKPHP
                     NDLPISDDLEYKYRVLDLVRSKSKENLAKRMNDPNRKPAIHPLEMILSPSPDADAIPT
                     TGELEHRIRVLDEKLKSPAKTRSKSRSRSPTIEDIKRQKMRDEKKPRTPVHNLERIVS
                     SPGRPEPPTAEELEERIRILEQEHKFDFKTQKDYKAFNQKLKDVISPSLSFDEFRAAK
                     SREQSPRRHGPTTPKSALRRDDFDEGYCGGTHTSTLYRPTSPKVIRFRDEDEDEDQFE
                     EAPRPKSRQTSDRMKTLADQPSATRSYASSTEGLDALGSRLMRETSPITRTGTHTGVP
                     LRTGENINDRLSSIKNSIKSIDTLCEEKPYQKEKCQRYIDSLFTDSLHFASKKSSLED
                     LSLSRSLSRSESRGRSIHRSGDYAPSIRVTSEHRSLGSADSRRSPLGNRDTSPLHHRS
                     HRDISRELSPRRRRLEEEDEERKDRESSRVRRDNLLPNYFADNRSELSSGSSLTGFNH
                     KVDRQLEETCAKYADDRRSACRTPLSHPYESRTTATRHSHTDPVQIPTNPAGSATATD
                     SFPRPVSPYRQPYDPYHRSPGGAGGTPLYQPGKLEIRHTTVTSTFYDRFLTEKQIERQ
                     THSRPPSRSPVVSPSVPAKSYVELCSTSGTSSTTATSTSTSSFMSSSYAGPSFSLPSA
                     SNFSYLNPGSGSGSGISSISPRASCSDLRSTTSGPTSTSTTSVTTSSYVPYNFTSSFT
                     SRLNDPIITSTTSAVSTSSLTHSTGVYNPMMSFTLREPLASSSLGGSSASPLLPFQFN
                     RTFTSNFDKEQNKQ"
ORIGIN      
        1 caatgcaagc atttaaatta aaagtaaacc accaactgtt gagtgttatc atatagtcag
       61 caaaatgagt gaaaccaaga agctactcga tgcattgcta tgcgatatct acggcaggca
      121 agaagtgttg gccaaacgaa taaggcgttg ttgccgcgat cctgtccata aacgatcccc
      181 gaagaagaag aaaggacaac tggctcaatt ggccagcggc atggtcggtg atcgatcctc
      241 cctagagcgt ttggatgtgc gcaagctcat cgatgtgtgc gccattctga aggtcgagat
      301 ccttatgctg ggctatctac tggagcgagt cctcgtggcc agggatcgtc tgcagaggca
      361 tcaggaggtc ctgtgcgagt tcgtcactgc tgtgctcatc gtggagagtg acgatgcaca
      421 gcctaaaatg cgttttagcc tctcaccacc gccgcagaaa gccattacca gtgcccgcct
      481 aacctcgccc actaaaaaca gcctcagcag caacaacacc atcaccaccc ccggcagcaa
      541 taatagaagc cccagcagca ccaaaaccac catgctgacc cgcaatggcg gaggccacgg
      601 gaccagcccc actgcctccg gatccggaca cgccccctcc gccaccgccg ctgccgacga
      661 tgaagccacc tcggattaca accagtggct gcatgccatg aaactggtgg cccgcctgcc
      721 cggaggcact ccacccgagt tccgacgcaa gttgtggctc tcgctggcgg acaagtatct
      781 caagtcgaag aacgtggact gggcgcagca gagggagaag tgcttttgcg aggagtggcg
      841 cgaggacgac gaggagctgg gcatacaaat cgtcaaggac ctgcatcgca ctggctcgaa
      901 tctgtgcacc ggtcccgcgg gctccataaa ccaggccaag ctgaagcgca tcctactggg
      961 ctatgcccgt tacaaccccg aggtgggcta ttgccaggga ttcaacatgc tgggcgccct
     1021 catattgcaa gtgatggaca aggaggagga ggagtccatg aaggtcatga tctacctggt
     1081 ggagggcgtt ctgcccacgg gctactttta cggctcgatg ggcggcctgc aggcggacat
     1141 gggcgtcttc cgggagctga tgcagacgcg actgccacga ttggcgaagc acctgcaacg
     1201 cctgcaggga cccgtcgaga atgcattcga accgccgcta accaatgtct tcacaatgca
     1261 gtggtttctc accatgttct gcacctgcct gcccatgtcg tgcgtcctgc gcgtctggga
     1321 cctggtcctt atcgagggca gtgatgtcct tcttcgcacc gccctcgtcc tttggagcct
     1381 gctagaagaa cgtgtgatca gtgtccgatc tgcagatgag ttctatggca agatgggatc
     1441 ttattccagt gaactcctca atggccatct tgtggactcc aatggtctca ttgagcgtgt
     1501 ggtaaagcta ggacccattg aggatttgcg acaactgcgg gataagcatc tctacaacat
     1561 tgccccactg cgtcacaaac aaggattgca gctctactac gacgaggagg acacccattc
     1621 ggatgaggag cgcttggcgg tggcaactgt atttggcttg aattggggta gacgtggatc
     1681 ggtgggtcct gcggcggcag gaaagcagca agtagaacag aaggatcgcc tcgctttaga
     1741 tatttccctg ctaaagaagc agtacgatag gctgcgcgag cgccagaagc aggcacatgt
     1801 catcctaacc accgcctgtt cgacggcagc gcgccaagga tcgggtccag cgagcagctc
     1861 ccagccggcg gttccagtga accagttact tctcggtcgc cctgcaatag taactaacaa
     1921 gggaaagcgg gttggagcac cattgggcgc cattccacct gctcgaaagc cctcacttcc
     1981 cgctgtcctc cataccaaac caacttcaga gaaacagttg cgaaggggag agactctgct
     2041 ctggagggac accgatccaa gccgcagaag gcgagatagt ttgacctgga aggagatcaa
     2101 agcagatcga gctgcaatga tacgagaggg cgttgatgtc agctccgtga gaacgcagaa
     2161 gctgcgcacg cgatttggaa agagcgatag ctcctcatac agcgaggata gtgatggtga
     2221 gcaggagtct gggacaggag gcggaggagg ttccagcacg gacaccagcc tgtgtgatga
     2281 cgatgatcca aagtccacgg agaagagtcc caagcagaag gctaaactag cacgcaagct
     2341 caaggagcag aagcaattgg ccggttccag ggagacgagc ttggagcggc agagacccaa
     2401 atcttgggct ccaagcagcc atgagattcc cttcatgctg atgggtacgg atagtggcga
     2461 cgagaaggag gataagtcta cgaaagaggg gccgattact ggagatcaag cggaggatag
     2521 tgccacggaa agcggtcact acgaatttga cagagagttg catttggtca gctccaagat
     2581 ggagccactc aagcttcctt tcgatcccga attcacaggc atgacttccg ttagtcccat
     2641 accgacgccc cgagaaaaga gtgaagctga agaggatttg ctggatgagc gaaagccatt
     2701 cgatgtgagc gatgatggag tgacaaatca gtatttcgag agggttaaca gcgtggagcg
     2761 gcctaatcgt ctggaattga cctactcgct taacgaagaa gaaacggata ctaatgccat
     2821 ttacctagag gagcgggaaa aggttgaggg gcatagtggt gatagggaat ataattccct
     2881 accccctttc tacccgagag agaacgatga tggtgtacag ggtggtggca aggtgccaca
     2941 gattcgtgat gataacattc caggtgagaa caaggacgat tacaaggagc tcctgagcat
     3001 gacgatagag gaaaatactg tatacaagcc accaaccccc acagccagca cattgagtaa
     3061 tgccagccga aagagacgag atcctcgccg aaaaactttg acccgctcat cgaccattga
     3121 aatcgaggaa cggtatcagg ccctggagcg caggatcagt caggatcagc caagtggaga
     3181 tcggcaagcc aagtatattc ctagcacagc tgctctagag gaacggttta acaccctaga
     3241 aaagcaattg agtgctgaaa agcaacgcaa ggagctgtcc gaaatggagg ctgaatatcc
     3301 aattaaatcc gagcgaattc catctaccgc cgacctggaa tcacgcttca attccctaac
     3361 aaaacaaatg agttccagcg aatctagttc caaaactccc attgatctca aggacgagga
     3421 ccgacccagt ggcagcagct ccaagaacca gaaggatagc gaaaaaacta gcaagctgca
     3481 caaatcagag gaacctgaat ctaatacgaa ggaaactaca ggggaaacag aagccagcga
     3541 tagcaatgat agtaaaattg gtgaaaagga aacggagcaa ccacgcatca agaaacttcc
     3601 atcaactgca gagctggaag atcgttttaa tgccttagaa cgcaaaatga gtgttcagaa
     3661 gagcagccca tccaagaaca agaaagaacc acccgatgaa gaagaatcga aatctacaaa
     3721 ggagccagaa gagccagagg agtccgaaaa agcaaatgag aaaacttcgg gcaggcaaac
     3781 acccattgct aaaaaggatt ccaaagatag tgatcagaaa aaatctgaaa ctaaggagaa
     3841 ccaatcacca actaaaaacc aagacgaaaa agtcaaggtc aaatccccaa aaagtgagga
     3901 gatgattgaa aaagaaacct cttcgaatcc gaaagaggat tcacacgaat cggaagccgc
     3961 tacaaataaa aaggtggaag gcaatcgcga gcttagctca gaaaaaggtg atcacaaaat
     4021 aaaagaaaag agtgaagaag ctccagggaa agccggaaaa gaaactgcgg aaaccaaaaa
     4081 tgcaaatgtt aaagactcgt caaaaaaagg tgattctcaa aaaaacgaag ccgctaagac
     4141 gtcggtttca caaactgagt ctgacttaaa accaagttct aaagaaaact caacttccaa
     4201 ggatgcggaa caagaaaaga cccctcgaaa atcaccaccc tccacagaag aattagaaaa
     4261 acgttttaat gccttagaaa agcagatgag taccaccaac ttggaaacaa ctaaagaacc
     4321 tgatcaaact aaaccagcaa ctaaaagtca atctacgagt gccgaagtaa agacgcagaa
     4381 atccatgaaa tcttttgacg acaagatcaa agaagttaat gtggctatcg aaaaggagca
     4441 gagcagagtt gaggtggagg ttaatgctga aaagaagcgc aaaaacgttg aagaagcccc
     4501 aaaaaacaaa gaaggggatt ctcagcaacc agaagaaagt caacacaagg gtaagaatca
     4561 gcgaagagca tctgagccac cgtcaactga ggatcttgag aagcgttatg aaaccttgaa
     4621 gcgtcgcatg agtagcaaga atcaatttag cgaaaccgtg gacgaagctc tcgagcgaat
     4681 ccagcaggag gtaatctcgg aagcagtgga ggaaaagaag ccaccgccat cgacggagga
     4741 tctggagagc cgctttgagg cactgcatgg ggataagaaa aatgtggaat ctaaaatgga
     4801 tgaaactaaa cacgtagatg tggccattga ggcgcatatt ccatccccac ctccgccgcc
     4861 accaccacca aaggaacgtc ctgtcctggc cgaaccagtt ctccaccagc aacaggctct
     4921 gatcgaggag ctgcagagca agatgcgagg ccaatcaccc ggcgaggaga acctgaagcc
     4981 cagcgagata aatccgcaga ggaggcagaa aaagctactg caacgaccca cgcccatggg
     5041 cgatgagacc tcggaagcac ctgcaaacac ggcttactac agagcggcaa accacgaaca
     5101 atggcagcag cgcatggtgc gtcggttctc tgatttaccc tcccgggccg atctggagaa
     5161 ccgattgcag ttcctggaga ggcaactcta caagaagttc tacaagcagc gctgtgcaag
     5221 tgattccgaa gttgcatcga gggtcaaact cccgcccgag gaccagccga gcacctctcg
     5281 gcaggccagg aaacaggaag ccgaaggaca gctggagcag cgtgtcctgg cgttggagaa
     5341 acagctgagc gagaacagtc tcaagttgct ggaagcgatg agggagcgcc acaggtcagc
     5401 agatgacagt ggctccccta ggcgattgag cacggagaca atcgacgcca ccggcaagga
     5461 acttgttaga tacacccaga acattgggga gctggaggaa gtcgacgccc acaagcccat
     5521 caacataagc atcaatatca agatgatggt caacaaagac agtgagtcca agcagccaaa
     5581 gggtgagtcc aagcccacaa cagaagatct gactcgccgc ctggagcaat tggagcagca
     5641 actgttggag gaacgggcca aaaatggctc gatcccacca gaaaatgaag tacttgagga
     5701 gaaaccagaa aagctcgaag aaaaggattc ctgcaaaaag caggagaaga actgccacaa
     5761 tcagcacgtc aaaggtgatg aagttgagaa aacagaaatt ccggctgata gaaaaattga
     5821 acctgcatcg gctaaggaga ctaaaaccct tgaaaatgta gagaaagctc aaacccgagc
     5881 gaaagttgtg gatactgaaa aatcggttaa agatcagaac gcagtgacgg atgaaaaatc
     5941 tgttcaagac cagaatgtag tggttgataa gaaagcagat aggaaaattt tggataaaaa
     6001 agataaatct cctgctgcgg ggaaatcaga agacacaaaa cagacgtcag ggaaaaagga
     6061 gaaatcagaa gacataaaac aggcgtcaga ggcccccaaa gctggagcct ctaaagaaac
     6121 atccacgcga gggaaacctt ccgaaactaa gttggaaaaa cccacaacta aagagtcggt
     6181 tcttaaggaa actttcccca aaaaggagaa tcttgaaagt gagaaaccca aatccaaaga
     6241 aaacgaagct acaaaaacgg aaacacaaaa atcaaaggaa acccctacag ttgcggtcag
     6301 ccctaaagaa agcaaagtat cgagcaaaca aatgacagag aaaaaagaaa caatcaaaga
     6361 ctcttcttcc aaagaactgc ccgagaaaat ggttatcaat tccacagatg tgggacccat
     6421 ggatccaaac ggaaaaacag tggtgctgtt aatggacaac gaacacagag cttcgaaggt
     6481 tagaagattg accagggcca acacggaaga actggaagac ctcttccagg ccttggagaa
     6541 acagctcaat gatcgaaatc ttgtcaaatc cgaagatggt cgcctgatac gagtagatcc
     6601 caagccaagc gctgagcaag tggagcagac tcaggctatt tctgatctca ccaaggaaat
     6661 cgaggacttt accagcgcca aaccggagga ggagaacccg aaggaggcgg ccaaagaaga
     6721 caaaccggag cccgaagagc cggaggactt cgactgggga cccaacactg tgaaacatca
     6781 cttgaaacgc aaaacagtct acctgccctc cacaaaagag ctggaatctc gcttccgctc
     6841 cttggaacgt cagataaaac tgcttgagga tgtggaaaag atcgatgtgg agcagcggct
     6901 gaatgaaatt gagcgtaaaa ttaaactgca gtactcgcta tcccacgaaa aagatttgaa
     6961 caagtacttg gaactgtgtg aaggtaaggg actagatgac gatgaacctg tgcctgtgga
     7021 aactccaacg aaggaggcgg aaattaccac agctagagat cgttctcgta gtcctgggcg
     7081 caaagcattg gctaccaaat ctccatatac ctctccatcg cgaaaggcca ccattaaaac
     7141 tccacatact tctcccaccc ggaagcctat cattaaatct ccctatacat ctccttcacg
     7201 gaagtcggcc aagtcccctt atacgtcgcc ctccagaaat cgacagagat caccttctcc
     7261 aactcgatcg ccggagagga agtcaaagaa aagtccctac acttcaccag cccgtcgcaa
     7321 gccgcatcct aacgatcttc ccatctcaga tgatttagaa tacaaatatc gagtacttga
     7381 tttggtaaga tccaaatcta aggagaactt ggccaagcga atgaacgatc ccaacagaaa
     7441 accggccatt catcccttgg aaatgattct cagtcccagt ccggatgcag atgcaatacc
     7501 tacaacagga gaactagagc atagaatacg tgtcctggac gaaaagctca aatcgccagc
     7561 caaaacccga tcaaagtcgc gctctcgatc gccaaccatc gaagacataa aacgtcaaaa
     7621 gatgagagat gagaaaaagc ccaggactcc tgttcacaat ctggaaagga ttgtcagttc
     7681 gcctggtaga ccagaaccac ccactgccga agagctcgag gagcgtatac gcatcctgga
     7741 gcaggagcat aagtttgact ttaagaccca gaaggactac aaggcattca atcaaaagct
     7801 caaggacgtc atctcaccct cgctctcctt cgacgaattc cgggcagcca agtcacgcga
     7861 gcaaagtccc cgccgccatg gacccactac gcccaagtct gccctgcgtc gcgatgattt
     7921 cgatgaaggc tactgtggcg gtacccacac atccactctc taccgcccca ccagcccgaa
     7981 ggtcattcgg ttccgggatg aggacgagga tgaggaccag tttgaggaag cacccagacc
     8041 gaagtcgcgt cagaccagcg atcgaatgaa gactcttgca gatcaaccat cggccactcg
     8101 cagctacgcc agcagcacgg agggtctgga tgccttgggt agtcgtctaa tgagggaaac
     8161 ctctccgatt acccgaaccg gaacccacac gggtgtacca ctgcgaacgg gggagaacat
     8221 caacgaccgc ctgagttcga tcaagaactc tatcaagtcg atcgacacac tgtgcgagga
     8281 gaagccgtac cagaaggaga agtgccagcg gtacattgac tcactgttca cggactcact
     8341 gcactttgcc agcaagaaga gctccctgga ggacctcagt ctcagcagga gtcttagtcg
     8401 cagcgagagc aggggaagga gtattcatag atctggagac tatgcacctt cgattagggt
     8461 cacctccgag caccgatctt tgggctcggc ggattctcga aggagtcctt tgggcaaccg
     8521 ggacaccagt cccttgcacc acagatcgca tagggatatt agccgggaac tgtcccctcg
     8581 acgaagacgc ttggaggagg aggatgagga gcgtaaggat cgggagagca gtagggtaag
     8641 acgtgataac ttgttgccaa attattttgc tgataatcgt agcgaactaa gtagcgggag
     8701 tagtttaacc gggtttaacc acaaagtaga tagacaacta gaagagacgt gcgccaaata
     8761 tgcggacgat cgacgctcgg cctgtcgcac accgttgagc catccgtacg agtcccgcac
     8821 cacagccaca cgccacagcc acacagatcc agtccagatc ccaaccaacc cagcaggatc
     8881 agctactgca acagatagtt tcccccggcc cgtgtcgcca tatcgccagc cgtacgatcc
     8941 ctaccatcgg tcccctggtg gtgccggtgg cacacccctc tatcagcccg gcaagctgga
     9001 gatccgccac accaccgtca cctcaacctt ctacgatcgg ttcctcactg agaagcagat
     9061 cgagcggcag acccactccc gtccgcccag tcgttcgcca gtggtttcac cctcggtgcc
     9121 agccaaaagc tatgtcgaat tgtgcagcac ctcaggcaca tccagcacca ccgctaccag
     9181 cacctccacc tcctcattca tgtccagcag ctatgcaggc ccctcctttt cgctgccctc
     9241 ggccagcaac ttttcctatt tgaatccggg ttcgggctcg ggttcgggca tctcgtcaat
     9301 ttcgccgcgc gccagttgct cggatcttcg ttccaccacc agcggcccta catccaccag
     9361 taccacctcc gtaaccacct cttcatatgt accctacaat ttcaccagct ctttcacatc
     9421 tcgcttaaac gatcccatta tcacttccac taccagtgca gttagcacta gttcccttac
     9481 ccattccacg ggggtctaca atcccatgat gtcgttcaca ctgagggaac cactagccag
     9541 cagttccctg ggtggttcaa gtgcctctcc attgctaccg tttcagttta acagaacttt
     9601 cacttccaac ttcgacaagg aacagaacaa acaatagaag gatgcacatc aaacattaac
     9661 caaaggttgt ccatacacat tacaaagctc ccattgaaga ttcaaaacat tgaaaagaaa
     9721 tcaagctcaa agcactgaga gaaaaatctg tctaaaacaa tttaattatt ctactcctat
     9781 tcgaattttt tcatttttgt ggttctgaaa atattaaata atcaacaatt atcttaaagt
     9841 taacaggttg ccactttccc aactgttctt ataccatgtt tgctgatttt agttaggttt
     9901 ttatctcagt cagcctaact ttcgctaagc a
//