LOCUS       NM_135247              11437 bp    mRNA    linear   INV 26-DEC-2023
DEFINITION  Drosophila melanogaster uninflatable (uif), transcript variant G,
            mRNA.
ACCESSION   NM_135247
VERSION     NM_135247.6
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 11437)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 11437)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 11437)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 11437)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 11437)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 11437)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 11437)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 11437)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 11437)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfield,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 11437)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 11437)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., Woodage,T.,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 11437)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 11437)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 11437)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 11437)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (10-NOV-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 11437)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 11437)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 11437)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 11437)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 11437)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   21 (bases 1 to 11437)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-OCT-2015) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   22 (bases 1 to 11437)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   23 (bases 1 to 11437)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   24 (bases 1 to 11437)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   25 (bases 1 to 11437)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NT_033779).
            
            On Jul 15, 2014 this sequence version replaced NM_135247.5.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..11437
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="2L"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..11437
                     /gene="uif"
                     /locus_tag="Dmel_CG9138"
                     /gene_synonym="CG9138; CT26172; Dmel\CG9138; poly-EGF;
                     Poly-EGF; sp1070; SP1070; Uif"
                     /note="uninflatable"
                     /map="27D1-27D3"
                     /db_xref="FLYBASE:FBgn0031879"
                     /db_xref="GeneID:33983"
     CDS             115..10788
                     /gene="uif"
                     /locus_tag="Dmel_CG9138"
                     /gene_synonym="CG9138; CT26172; Dmel\CG9138; poly-EGF;
                     Poly-EGF; sp1070; SP1070; Uif"
                     /note="CG9138 gene product from transcript CG9138-RG;
                     CG9138-PG; uif-PG; uninflatable; SP1070"
                     /codon_start=1
                     /product="uninflatable, isoform G"
                     /protein_id="NP_609091.4"
                     /db_xref="FLYBASE:FBpp0308723"
                     /db_xref="GeneID:33983"
                     /db_xref="FLYBASE:FBgn0031879"
                     /translation="MKQSTAANSHWLAALLSSLLLLNLLHLIGADEAFSCPNGWELRG
                     LNCYKYFNIKHSWDKSAELCRRYGAELVAIDSYAENNETLAIARASDPNQRASDKYWL
                     GLASLDDLRTNTLESASGALISQYSGYWSLHQPNAESGECVAAAFAGKSQSWDLGTCE
                     SLLPFMCRAQACPQGSLHCANGKCINQAFKCDGSDDCGDGTDELDCPAQCHYHMQSGG
                     DVIETPNYPHKYGALSKCKWTLEGPLGSNIILQFQDFETEKTFDTVQILVGGRTEDKS
                     VSLATLSGKQDLTTQPFVSASNFMIVKFTTDGSVERKGFRATWKTEAKNCGGTLKATL
                     QRQILTSPNYPKQYPGGLECLYVIKAQPGRIISIEVDDLDIADGRDFLMIRDGESPMS
                     RTIAKLTGKTAQNNRVIISTGNALYLYFKSSLGEAGKGFSLRYIQGCKATITARNGTV
                     TSPAFGLADYPKNQECYFTIRNNARAPLSLKFDKFTVHKSDNVQVFDGSSTSGLRLHS
                     GNGFTGPAAPKLTLTASSGEMLIKFTSDALHNAAGWSATFSADCPELQPGIGALASSR
                     DTAFGTLVSFTCPIGQEFATGKTRLVTECLRGGNWSVSYIPKCQEVYCGPVPQIDNGF
                     SIGSSNVTYRGIAMYQCYAGFAFASGAPIEKISCLPDGRWERQPHCMASQCAALPEVA
                     HANVTLLNGGGRSYGTIVQYECEPGYERNGHPVLTCMSNGTWSGDVPRCTRKRCFEFP
                     TIANGFVVDSTRAYLFGDEARVQCFKGYKLIGSNIMRCSEAQKFEQPPTCEDINECSS
                     SQCDLTTTECQNTNGSFHCQCRTGFTATTECRPVGDLGLGNGGIPDDSITTSVSEPGY
                     SKEQLRLNTNGWCGGSSEPGANWILIDLKAPTILRGFRTMSVQRPDGNVAFSSAVRLQ
                     YTNDLTDVFKDYANPDGTAVEFRILEPTLSILNLPLPIEARYIRFRIQDYVGAPCLRM
                     ELMGCTRLDCVDINECSKNNGGCDQKCINSPGGFACGCNTGYQLYTSNGTAGYHIERS
                     ESGERDGDTYQRNKTCVPLMCPELEAPENGQLLSDKNDYHFGDVVRFQCHFGYIMSGS
                     SAALCLSSGQWNASVPECNYAKCVSLPDDKLEGLTVARPDPESVLVPFRDNVTITCGS
                     PGRQLRATASSGFRQCVYDPKPGLPDYWLSGMQPSCPRVDCYSPMPTPGAEYGQFVDT
                     RYQSSFFFGCQNTFKLAGQTGRHDNVVRCGADGIWDFGDLRCEGPVCEDPGRPADGRQ
                     IARSYEQSSEVYFGCNRPGYILINPRPITCIREPECKVIKPLGLSSGRIPDSAINATS
                     ERPNYEAKNIRLNSATGWCGKQEAFTYVSVDLGQIYRVKAILVKGVVTNDIVGRPTEI
                     RFFYKQAESENYVVYFPNFNLTMRDPGNYGELAMITLPKFVQARFVILGIVSYMDNAC
                     LKFELMGCEEPKQEPLLGYDYGYSPCVDNEPPIFQNCPQQPIVVRRDENGGVLPVNFT
                     EPTAVDNSGSIARLEIKPQNFRTPSYIFKDTVVKYVAFDYDGNVAICEINITVPDVTP
                     PLLQCPQSYVIELVDRQDSYTVNFNDTRKRIKTSDDTGDVRLQFSPESANIKIGNFEN
                     VTVTATDKYNNRAACHFQVSVKASPCVDWELQPPANGAINCLPGDRGIECIATCKPGF
                     RFTDGEPLKTFSCETSRLWRPTSVVPDCVSENTEQAAYHVTASITYRANGAVAQSCLG
                     QYQEVLAQHYGGLNQLLSQRCSAVNVNMNVTFVKSVPMLLEENVVKMDFILSILPAVR
                     QPQLYDLCGSTLNLIFDLSVPYASAVIDDLLNIANIGNQCPPLRALKSQISRGFNCNV
                     GEVLNMDTSDVPRCLHCPAGTYVSEGQNSCTYCPRGYYQNRDRQGTCLRCPAGTYTKE
                     EGTKSQADCIPVCGYGTYSPTGLVPCLECPRNSFTAEPPTGGFKDCQACPAQSFTYQP
                     AASNKDLCRAKCAPGTYSATGLAPCSPCPLHHYQGAAGAQSCNECPSNMRTDSPASKG
                     REQCKPVVCGEGACQHGGLCVPMGHDIQCFCPAGFSGRRCEQDIDECASQPCYNGGQC
                     KDLPQGYRCECPAGYSGINCQEEASDCGNDTCPARAMCKNEPGYKNVTCLCRSGYTGD
                     QCDVTIDPCTANGNPCGNGASCQALEQGRYKCECVPGWEGIHCEQNINDCSENPCLLG
                     ANCTDLVNDFQCACPPGFTGKRCEQKIDLCLSEPCKHGTCVDRLFDHECVCHPGWTGS
                     ACDINIDDCENRPCANEGTCVDLVDGYSCNCEPGYTGKNCQHTIDDCASNPCQHGATC
                     VDQLDGFSCKCRPGYVGLSCEAEIDECLSDPCNPVGTERCLDLDNKFECVCRDGFKGP
                     LCATDIDDCEAQPCLNNGICRDRVGGFECGCEPGWSGMRCEQQVTTCGAQAPCQNDAS
                     CIDLFQDYFCVCPSGTDGKNCETAPERCIGDPCMHGGKCQDFGSGLNCSCPADYSGIG
                     CQYEYDACEEHVCQNGATCVDNGAGYSCQCPPGFTGRNCEQDIVDCKDNSCPPGATCV
                     DLTNGFYCQCPFNMTGDDCRKAIQVDYDLYFSDPSRSTAAQVVPFPTGEANSLTVAMW
                     VQFAQKDDRGIFFTLYGVQSARMTQQRRMLLQAHSSGVQVSLFEDQPDAFLSFGEYTS
                     VNDGQWHHVAVVWDGISGQLQLITEGLIASKMEYGAGGSLPGYLWAVLGLPQPYGLSN
                     ELAYSDSGFQGTITKAQVWARALDITSEIQKQVRDCRSEPVLYPGLILNWAGYEVTSG
                     GVERNVPSLCGQRKCPVGYTGANCQQLVVDKEPPVVEHCPGDLWVIAKNGSAVVSWDE
                     PHFSDNIGVTKIYERNGHRSGTTLLWGTYDITYIASDAAGNTASCSFKVSLLTDFCPA
                     LADPVGGSQVCKDWGAGGQFKVCEIACNAGLRFSEPVPEFYTCGAEGFWRPTREPSMP
                     LVYPSCSPSKPAQRVFRIKMLFPSDVLCNKAGQAVLRQKVTNSVNGLNRDWNFCSYAI
                     EGTRECKDIQIDVKCDHYRGTQNNRVRRQAKDGGVYVMEAELPVVNDPVVHTSTGERS
                     TVKQLLEKLILEDDQFAVQEILPNTVPDPASLELGSEYACPVGQVVMIPDCVPCAIGT
                     FYDSANKTCIACSRGTYQSEAGQLQCSKCPVIAGRPGVTAGPGARSAADCKERCPAGK
                     YFDAETGLCRSCGHGFYQPNEGSFSCELCGLGQTTRSTEATSRKECRDECSSGQQLGA
                     DGRCEPCPRGTYRLQGVQPSCAACPLGRTTPKVGASSVEECTLPVCSAGTYLNATQNM
                     CIECRKGYYQSESQQTSCLQCPPNHSTKITGATSKSECTNPCEHIAEGKPHCDVNAYC
                     IMVPETSDFKCECKPGFNGTGMACTDVCDGFCENSGACVKDLKGTPSCRCVGSFTGPH
                     CAERSEFAYIAGGIAGAVIFIIIIVLLIWMICVRSTKRRDPKKMLTPAIDQTGSQVNF
                     YYGAHTPYAESIAPSHHSTYAHYYDDEEDGWEMPNFYNETYMKDGLHGGKMSTLARSN
                     ASLYGTKEDLYDRLKRHAYTGKKEKSDSDSEVQ"
ORIGIN      
        1 gtatgccttc gactgtggcg cgttgtggac gttaagaggc tgcggcgaac ccgaactcta
       61 gacaagtatt ggcaagaaac caacgcgctt aactgggcat taaatcgggc caaaatgaaa
      121 caatcgacgg cagccaattc gcattggctg gccgctttgc tgtcgtcgtt gctgctccta
      181 aatttgctac acttaattgg agccgacgag gcgttttcgt gtcccaatgg ttgggaactg
      241 cgcggcttga attgttataa atatttcaat atcaagcact cgtgggataa aagcgccgaa
      301 ctgtgtcgaa gatacggcgc cgaactggta gccatcgaca gctatgcgga gaacaacgag
      361 accttggcca tcgcccgggc cagcgatccc aaccagaggg cttcggacaa gtactggctg
      421 ggattggcct ccctcgacga tctgcgcacc aatacgctgg agtccgcatc gggagcactg
      481 atctcgcaat actccggcta ctggtcactc catcagccga atgccgagtc cggagagtgt
      541 gtggctgctg cctttgccgg caaatcgcag agctgggatc tgggcacctg tgagtccctg
      601 ctgccgttca tgtgccgtgc ccaggcgtgt ccacagggat cactacactg tgccaacggc
      661 aagtgcatca atcaggcctt caagtgcgac ggcagtgatg attgtggcga tggcaccgat
      721 gaactggact gtccagcaca gtgccactac cacatgcagt ccggaggaga tgtgatcgag
      781 acgcccaact atccgcacaa atacggtgcg ctgagcaagt gcaagtggac gctggaggga
      841 ccgctgggca gcaacattat cctgcagttc caggacttcg aaacggagaa gacctttgac
      901 accgtgcaga ttctggttgg cggccgtacc gaggataagt ccgtgtcgct ggccacgctc
      961 agtggcaagc aggatctgac cacgcagccc ttcgtatccg cttccaactt catgatcgtc
     1021 aagttcacca cggatggcag tgtggagcgc aagggattcc gggccacgtg gaagacggag
     1081 gccaaaaact gcggcggcac cttgaaggcc acgcttcagc gacagatcct gaccagcccc
     1141 aactacccga agcaatatcc cggcggtctg gagtgtctct atgtgattaa agcacagccg
     1201 ggtcgcatca tctccatcga agtggacgac ttggacatcg ccgatggacg cgatttcctg
     1261 atgatccgcg atggcgaatc acctatgagt cgcaccatcg ccaaactgac tggaaagaca
     1321 gcccaaaaca accgggtgat catctcaacg ggcaacgctc tctacttgta tttcaagtcc
     1381 agtttgggtg aggccggcaa gggcttcagt ttgcggtaca tccagggctg caaggccacg
     1441 atcaccgcta gaaatggcac ggttacttca cccgcctttg gattggccga ctaccccaag
     1501 aaccaggagt gctacttcac cattcgcaac aatgcccgtg ctccgctgtc cctgaagttc
     1561 gacaagttca ccgttcacaa gagcgacaat gtccaggtgt tcgatggatc ctccacttcc
     1621 ggtctgcgcc tgcactccgg aaacggattc actggcccag cggcgcccaa actgaccctg
     1681 actgcttcat ccggtgagat gctcatcaag ttcacctcgg atgcactgca caatgctgct
     1741 ggatggtcgg ccacattctc ggccgattgc ccggagctgc aacccggaat tggagccttg
     1801 gcctccagtc gcgacaccgc tttcggtacg ctggtcagct ttacatgtcc cattggacag
     1861 gagtttgcca ccggcaagac gcgactggtt accgaatgtc tgcgcggtgg caactggagt
     1921 gtctcctaca tacccaagtg tcaggaggtc tactgcggtc ctgtgccaca aatcgacaac
     1981 ggtttctcca ttggctcctc gaacgtaacc tatcgcggta tagcaatgta ccagtgctac
     2041 gccggctttg ccttcgcctc gggtgctccg atcgagaaga tctcctgtct gccggatggc
     2101 cgttgggagc gacagcccca ctgcatggcc tcccagtgcg cagcgctgcc ggaagtggca
     2161 cacgccaacg tcaccctgct gaatggaggt ggtcgcagct acggcaccat tgtccagtat
     2221 gagtgtgagc cgggctacga gcgcaatggc catcccgtgc tgacctgtat gtcgaacggc
     2281 acctggagtg gtgatgtacc aagatgcacg cgcaagcggt gcttcgaatt cccgaccatt
     2341 gccaacggct ttgtggtgga ctcgacgcga gcctacctct tcggcgatga ggccagggtg
     2401 cagtgcttca agggctacaa actgatcggc agcaacatca tgcgctgcag cgaggcccag
     2461 aagttcgagc agccgccgac gtgcgaggac atcaacgagt gcagctcctc gcagtgcgac
     2521 ctaaccacca ccgagtgcca gaacacgaac ggctccttcc actgccagtg caggacggga
     2581 ttcacggcta ccaccgagtg tcggcccgtc ggtgatttgg gcttgggtaa tggaggcata
     2641 ccggatgaca gcatcaccac ctcggtcagt gagccgggct acagcaagga gcagctgcgc
     2701 ttgaacacga atggctggtg cggtggctct tcggagcctg gtgccaactg gatactcatc
     2761 gacctgaagg cacccaccat tctgcgtggc ttccgcacca tgtccgtgca gcgtcccgat
     2821 ggcaatgtgg ccttcagctc ggcggtgcgt ctgcagtaca ccaacgatct gacggatgtg
     2881 ttcaaggatt atgccaatcc cgacggcact gccgtcgaat tccgcatcct ggagcccacg
     2941 ctctccatct taaacctgcc cctgcccatc gaagctcgct atattcgctt ccgcatccag
     3001 gactacgtgg gtgcgccctg tctgcgcatg gagctgatgg gctgcacgcg cttggattgc
     3061 gtggacatca acgagtgcag caagaacaat ggcggctgtg accagaagtg catcaactca
     3121 ccgggcggat ttgcctgtgg ctgcaacact ggctaccagc tgtacacctc caacggcacg
     3181 gctggctatc acatcgaacg ctccgaatcc ggcgaacgtg atggtgacac ctatcagcgc
     3241 aacaagacct gtgttcctct catgtgtccc gaactggagg cgcccgagaa tggtcaactc
     3301 ctgagcgaca agaacgacta tcactttggc gatgtggtgc gcttccagtg ccactttggc
     3361 tacatcatga gcggcagctc ggcggccctg tgcctctcca gcggtcagtg gaacgccagc
     3421 gtaccggagt gcaattatgc caaatgcgtt tccctgcccg atgacaagtt ggagggtctg
     3481 actgtggccc gccccgatcc cgaatccgtt ctagtgccct tccgtgacaa tgtgaccatt
     3541 acgtgcggat cgccgggacg ccaactgaga gccaccgctt cctctggttt ccggcagtgc
     3601 gtgtacgatc ccaagcccgg tctgcccgat tactggctat ccggaatgca gccctcttgt
     3661 ccccgagtgg attgctactc acccatgcca acgcccggcg cagaatacgg acagtttgtg
     3721 gacactcgct atcagagcag cttcttcttt ggctgccaga acacctttaa gttggctgga
     3781 cagacgggtc gtcacgacaa tgtggttcgt tgtggagccg atggtatctg ggactttgga
     3841 gatcttcgct gtgagggacc tgtgtgcgag gatccgggaa gaccggcaga tggtcgccag
     3901 attgcacgca gctatgagca gagctcggag gtgtacttcg gctgcaatcg tcctggctac
     3961 atcctgatca atccgcgacc cattacatgc atacgcgagc cagagtgcaa ggtcatcaag
     4021 cctttgggat taagttccgg caggattccg gattcggcca tcaatgccac ctcggagcga
     4081 cccaattacg aggccaagaa catccgtctc aactcggcca ctggctggtg tggcaagcag
     4141 gaggccttca cctatgtgag cgtggatctg ggtcagatct atcgagtcaa ggcgattctg
     4201 gtgaagggtg tggttaccaa cgacattgtg ggcaggccca cggagattcg gttcttctac
     4261 aaacaagctg agagcgagaa ctacgtggtg tacttcccca atttcaatct gaccatgcga
     4321 gatccaggca actacggcga gctggccatg atcacgctgc ccaagttcgt gcaggctcgc
     4381 tttgtgatcc ttggaatagt gagctacatg gacaacgcct gtctgaagtt cgagttgatg
     4441 ggctgcgagg agccgaaaca ggaaccactc ctcggctacg actacggcta ctccccgtgc
     4501 gtggacaacg aaccacccat cttccaaaac tgcccgcagc aaccaattgt ggtgcgacgc
     4561 gatgagaatg gaggagtact acccgttaac ttcaccgaac ccacggcggt ggacaactcc
     4621 ggatcgattg cccgcctgga gatcaagcca cagaacttcc gcacacccag ctacattttc
     4681 aaggatacgg ttgtaaagta cgtggccttt gactacgatg gcaatgtggc catctgcgag
     4741 atcaacatca cggtgcccga tgtaacacca ccactgctgc agtgccccca gagctatgtg
     4801 attgagctag tggatcgcca ggacagctac actgtgaact tcaacgatac ccggaagagg
     4861 atcaagacct ccgacgacac aggagatgtg aggttgcagt tcagccccga gagtgccaac
     4921 atcaagatcg gaaacttcga gaacgtgacc gtcacggcaa cggataagta caacaaccgc
     4981 gccgcctgcc acttccaggt ctctgtgaag gcttcaccct gcgtggactg ggagctccag
     5041 ccgccggcga atggtgccat caattgcctg cctggtgatc gtggtatcga atgcattgcc
     5101 acgtgcaagc caggattccg tttcaccgac ggcgaaccac tgaagacctt ctcctgcgag
     5161 acatcacgtc tgtggcgtcc cacgtccgtg gtgcccgact gcgtgtcgga gaacacggag
     5221 caggccgcct accacgtgac cgcctccatt acctaccgcg ccaatggagc agtggcccaa
     5281 tcctgtctgg gtcagtacca ggaggtgctg gcacagcact atggcggact caaccagttg
     5341 ctctcgcagc gctgctccgc cgtgaatgtc aacatgaatg tgacctttgt gaagtctgtg
     5401 cccatgctgc tggaggagaa tgtggtcaag atggacttca tcctctccat tctgcccgct
     5461 gtgcgtcagc cgcagctgta cgacctgtgc ggctccacgc tgaacctgat ctttgatctg
     5521 agtgtaccct atgccagtgc cgtgatcgat gaccttttga acattgccaa catcggtaac
     5581 cagtgtcctc cgctacgcgc cctcaagtcg caaatctcgc gaggatttaa ctgcaatgtg
     5641 ggcgaggtac tgaacatgga caccagcgat gtgccgcgtt gcctgcactg tcccgccgga
     5701 acgtatgtgt cagagggtca gaacagctgc acctactgcc cgaggggcta ctaccagaac
     5761 cgtgaccgcc agggaacctg cctgcgctgc ccggccggaa cctacaccaa ggaggagggc
     5821 accaagtcgc aggcggactg cattcccgtc tgcggttatg gcacctactc acccaccgga
     5881 ctggtgccgt gcctggagtg tccgcgtaac tcattcactg ccgaaccacc aaccggtgga
     5941 ttcaaggatt gccaggcctg tccggcacag agcttcacct accagccggc tgcctcgaac
     6001 aaggatctgt gtcgcgccaa gtgtgcgccg ggaacgtact ccgccaccgg actggcaccc
     6061 tgctcgccct gcccactgca tcattaccag ggagccgcgg gtgcgcagag ctgcaacgag
     6121 tgtccgagta acatgagaac cgattcaccc gcctccaagg gacgcgaaca gtgcaagccg
     6181 gtggtatgtg gtgaaggtgc ttgccagcac ggcggactgt gtgtgcccat gggccatgac
     6241 atccagtgct tctgtccggc cggattctct ggacgtcgct gcgaacagga catcgacgag
     6301 tgcgcctccc agccctgcta caatggtggt cagtgcaagg atctgccgca gggctatcgc
     6361 tgtgagtgcc cggctggata ctcgggcatc aattgccagg aggaggccag tgactgtggc
     6421 aacgacacct gtccggccag ggccatgtgc aagaacgagc cgggctacaa gaacgtgacc
     6481 tgtctgtgcc gcagtggcta caccggcgat cagtgcgacg tgaccatcga tccgtgcacg
     6541 gcgaatggca atccgtgcgg aaacggagcc agctgccagg ccttggagca gggtcgctac
     6601 aagtgcgagt gtgtgcccgg atgggagggc atccactgtg agcagaatat caatgactgt
     6661 tcggagaatc cctgcctgtt gggcgccaac tgcacagatc tggtcaatga cttccagtgc
     6721 gcctgtccgc caggatttac gggcaagcga tgcgagcaaa agatcgatct ctgcctatcg
     6781 gaaccatgca agcatggcac ctgcgtggat cgtctgttcg atcacgagtg tgtttgccat
     6841 ccgggctgga cgggatccgc ctgcgacatc aacatcgacg actgcgagaa ccgaccctgc
     6901 gccaatgagg gaacctgcgt cgacctggtc gacggctata gctgcaactg tgaacccggc
     6961 tacacgggca agaattgcca gcacaccatc gacgactgcg cctcgaatcc ctgccagcac
     7021 ggcgccacct gtgtggacca gctggatggc ttcagctgca aatgccgccc tggctacgtg
     7081 ggtctctcct gcgaggccga gatcgacgag tgtctgagcg acccctgcaa tccggtgggc
     7141 acggagcgct gcctcgatct ggacaacaaa ttcgagtgcg tgtgccggga cggattcaag
     7201 ggacccctgt gcgccacgga catcgatgac tgcgaggcgc agccgtgtct gaacaacggc
     7261 atctgtcggg atcgcgtcgg tggctttgag tgcggctgcg agccaggatg gagtggcatg
     7321 cgctgcgagc agcaggtgac cacgtgcgga gctcaggcgc cgtgccagaa cgatgccagc
     7381 tgcatcgacc tgttccagga ctacttctgc gtgtgtccca gcggcaccga tggcaagaac
     7441 tgcgagaccg ctccggaacg ctgcatcggt gatccttgca tgcacggtgg caagtgccag
     7501 gactttggct ctggtcttaa ctgcagttgc cctgcggatt actcgggcat tgggtgtcag
     7561 tacgagtacg acgcatgcga ggagcatgtc tgtcagaatg gcgccacttg tgtggacaat
     7621 ggtgctggct acagctgcca gtgcccacct ggcttcaccg gtcgcaattg cgaacaggac
     7681 atcgtggact gcaaggacaa ctcttgccca ccgggcgcca cgtgcgtgga tctaaccaac
     7741 ggcttctact gtcagtgccc cttcaatatg accggagacg attgccgcaa ggccatccaa
     7801 gtggactacg atctgtactt cagcgatcca tcgcgatcca ccgccgccca ggtggtgccc
     7861 ttccccacgg gagaggcgaa cagcctgact gtcgccatgt gggtgcagtt tgcccagaag
     7921 gacgatcgcg gcatcttctt caccctctac ggcgtgcaat ccgctcgcat gacccagcag
     7981 cgccgcatgc tgctccaggc gcactccagt ggagtccagg tttcactgtt tgaggaccaa
     8041 cccgatgcct tcctgagctt tggggagtac acttccgtca acgacggcca gtggcatcat
     8101 gtagccgtgg tctgggacgg aatctccggg cagcttcaat tgatcacaga gggactgatt
     8161 gccagcaaga tggagtacgg agccggcggc tctctgcccg gttatctctg ggcagtgctg
     8221 ggactcccac agccgtatgg acttagtaat gagctggcct actcggattc cggattccag
     8281 ggcacaataa ccaaggctca agtgtgggcc agagccctag acatcacgtc agagatccag
     8341 aagcaggtgc gcgactgccg ttctgaaccg gttctctatc ccggcctcat cctcaactgg
     8401 gcgggatacg aggtgacctc aggcggagtg gagcgcaatg tgccctccct atgcggacaa
     8461 cgcaagtgcc cagtgggcta cacgggcgcc aattgccagc aactggtcgt ggacaaggaa
     8521 ccacctgtgg tggagcactg ccccggagat ctgtgggtga ttgccaagaa cggttccgcg
     8581 gtggtctcct gggatgagcc gcacttcagc gacaacattg gcgtgaccaa gatctacgag
     8641 cgaaatggac accgatctgg aactacattg ctatggggca cctacgacat cacctacatt
     8701 gcatccgatg cagctggaaa tactgcatcg tgcagcttca aggtttctct gctgaccgac
     8761 ttctgtccag cgttggctga tcccgttggt ggatcacagg tttgcaagga ttggggtgcc
     8821 ggtggtcagt tcaaggtctg cgagatcgcc tgtaatgcgg gtcttcgatt ctcggagccg
     8881 gtgcctgagt tctatacctg cggagccgaa ggcttttggc gaccaactag ggaaccctcg
     8941 atgccactcg tctacccatc ctgctcacca tcgaagcccg cccagcgggt gttccgcatc
     9001 aagatgctct tcccctcgga cgtgctgtgc aacaaggctg gtcaggcggt gctccgtcag
     9061 aaggtgacca actcggttaa tggcctgaac agggactgga acttctgctc ctatgccatc
     9121 gagggaacaa gggaatgcaa ggacattcag atcgatgtga aatgcgacca ctaccgaggt
     9181 acgcagaaca atcgtgtgcg tcgtcaggcc aaggatggcg gagtctatgt gatggaggcc
     9241 gaattgccag tggtcaatga tcccgtggtg cacacatcga cgggcgaacg aagcactgtc
     9301 aagcagctgc tggagaagct catcctcgag gacgatcagt tcgccgtgca ggagattctg
     9361 cccaacacag tgcctgatcc ggcttccctg gaactgggct cggagtacgc ctgtcccgtg
     9421 ggccaggtgg tgatgatacc cgactgtgta ccctgtgcca tcggcacctt ctacgacagc
     9481 gccaacaaga cgtgcatagc ctgctcgcgc ggaacctacc agtcggaggc gggtcagctg
     9541 cagtgcagca agtgcccggt gattgctgga agaccaggag tgactgccgg tccgggagca
     9601 cgctccgcgg cggactgcaa ggagcgctgc ccagctggca agtactttga cgcggaaacg
     9661 ggtctgtgcc gctcctgcgg ccatggattc taccagccca acgagggttc ctttagctgt
     9721 gagctatgcg gtctgggaca gacaacgcgc tccacggagg ccacgtcacg caaggagtgt
     9781 cgcgatgagt gcagctctgg ccagcaactg ggtgccgatg gacgctgcga gccctgccca
     9841 cgtggaacat accgcctgca gggcgtgcag ccatcctgcg ccgcctgtcc gctgggcagg
     9901 acgacgccca aggtgggcgc cagttcggtg gaggagtgca cactgcccgt ctgctcggcg
     9961 ggtacgtacc tgaatgccac acagaatatg tgcatcgagt gccgcaaggg atactaccaa
    10021 tcggagtcgc agcagacctc ctgtctgcag tgcccaccga accacagtac caagatcact
    10081 ggcgccacct cgaagagcga gtgcaccaat ccgtgcgagc acattgcaga gggcaagccg
    10141 cactgcgatg tcaatgccta ctgcatcatg gtgccggaga cgtcggactt taagtgcgaa
    10201 tgcaagccag gattcaatgg aacgggcatg gcctgcacgg atgtgtgcga tggcttctgc
    10261 gagaactctg gtgcgtgtgt caaggacttg aagggcacac catcttgccg ctgtgtgggc
    10321 tcctttacgg gtccccactg tgcggaacgc tcggagtttg cctacatcgc cggtggcatt
    10381 gccggagcgg tgatctttat catcatcatt gtcctgctca tctggatgat ttgcgtgcgc
    10441 tccacgaagc gcagggatcc caagaagatg ctaacacctg cgattgacca gaccggctcg
    10501 caggtgaact tctactacgg cgcccacacg ccctacgcgg agtccatcgc gccatcgcat
    10561 cacagcacat atgcgcacta ctacgacgac gaggaggatg gctgggagat gcccaacttc
    10621 tacaatgaaa cgtacatgaa ggatggtctg catggcggta agatgagcac gttggccaga
    10681 tcgaatgcct cgctctatgg aactaaagaa gacttatacg accgactgaa acgtcacgcc
    10741 tacacgggca agaaggagaa gagtgatagt gatagcgaag tgcagtagaa cgacgataaa
    10801 ctacagataa ccagctgctt taatgtgtaa aatgtggtca taaataacga atgggttgca
    10861 gcagctaact cactgtcaga acagtgacgc cgcccactgc ccgcgccgaa gaatactcac
    10921 tggagagcct tgcattgaat caacgtaata ttcggcgatc catctacgat ccatctatag
    10981 tcccccatct cggatattac gtgacttaat gcaaggcttt tggcgattaa agtcaagcgg
    11041 agatgagatg gcttttatac gagtaatagt aactcccata tgtgctcttt aggtatgcaa
    11101 actacatgaa attaagtcga acttatgcgt aatttaataa atgaaatatt gttttaatct
    11161 taagtattat ttacctatac aagaaactcg aacttaccat gcttgacgcg gtaccaaatt
    11221 gcaatacata tatgctaata tatatatata tatataaaga tagatttttc caaaccattt
    11281 agtttgtcca agcttgttga acaaatgcgc gagtgctgta aaaagcaaat caaatatagt
    11341 cgttattttg taatttaaat aaaagcaatt taattattat gtatgacaat taattttata
    11401 aattttcttg tataaaataa acgaaacaaa caaatcc
//