Dbfetch

LOCUS       NM_079895              10287 bp    mRNA    linear   INV 26-DEC-2023
DEFINITION  Drosophila melanogaster apolipophorin (apolpp), transcript variant
            A, mRNA.
ACCESSION   NM_079895
VERSION     NM_079895.3
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 10287)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 10287)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 10287)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 10287)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 10287)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 10287)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 10287)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 10287)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 10287)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 10287)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 10287)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 10287)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 10287)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 10287)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 10287)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 10287)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 10287)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 10287)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 10287)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 10287)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   21 (bases 1 to 10287)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 10287)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   23 (bases 1 to 10287)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NC_004353).
            
            On Jan 16, 2013 this sequence version replaced NM_079895.2.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..10287
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="4"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..10287
                     /gene="apolpp"
                     /locus_tag="Dmel_CG11064"
                     /gene_synonym="ApoL1; ApoL2; apoLI; ApoLI; apoLII; ApoLII;
                     apoLpp; apoLPP; Apolpp; ApoLpp; CG11064;
                     chr4:1088013..1088173; Dmel\CG11064; DRBP; lipophorin;
                     Lipophorin; Lpp; LPP; rfabg; Rfabg; RFABG; Rfabp; RfaBp;
                     RfaBP; RFABP; RFBP; T3"
                     /note="apolipophorin"
                     /map="102F7-102F8"
                     /db_xref="FLYBASE:FBgn0087002"
                     /db_xref="GeneID:43827"
     CDS             142..10197
                     /gene="apolpp"
                     /locus_tag="Dmel_CG11064"
                     /gene_synonym="ApoL1; ApoL2; apoLI; ApoLI; apoLII; ApoLII;
                     apoLpp; apoLPP; Apolpp; ApoLpp; CG11064;
                     chr4:1088013..1088173; Dmel\CG11064; DRBP; lipophorin;
                     Lipophorin; Lpp; LPP; rfabg; Rfabg; RFABG; Rfabp; RfaBp;
                     RfaBP; RFABP; RFBP; T3"
                     /note="CG11064 gene product from transcript CG11064-RA;
                     CG11064-PA; apolpp-PA; retinoid-fatty acid binding
                     protein; Retinoid- and fatty acid-binding glycoprotein;
                     retinoid- and fatty acid-binding gene; Retinoid-fatty
                     acid-binding glycoprotein; retinoid and fatty acid binding
                     protein; Retinoid- and fatty acid-binding protein;
                     Retinoid- and fatty-acid binding protein; apolipophorin
                     II; lipophorin; RfaBp; retinoid and fatty acid binding
                     glycoprotein; retinoid and fatty-acid binding protein;
                     apolipophorin-2"
                     /codon_start=1
                     /product="apolipophorin, isoform A"
                     /protein_id="NP_524634.2"
                     /db_xref="FLYBASE:FBpp0088252"
                     /db_xref="GeneID:43827"
                     /db_xref="FLYBASE:FBgn0087002"
                     /translation="MARMKYNIALIGILASVLLTIAVNAENACNLGCPKSDNGLLKYI
                     PGNYYDYSFDSILTIGASSDVPNDSDDTSLKVSGSAKIFAKGNCGYTLQLSSVKVTNT
                     KESVEKKILNSIQKPVQFTLVSGILEPQICSDSSDLDYSLNIKRAVVSLLQSGIEAEH
                     EVDVFGMCPTHTSTSKVGNANIITKARNLNSCSHREQINSGLVSGKVNEKAGITSSLL
                     LQANYIKESRIVNHLIENVQLTETYKFIGNTKRNSDISAKVVTILKLKNPSGTKANSP
                     GTGSTVRSLIFQRPETYTSKNINALKTILSDLVDSTGDYVKKETAKKFVEFIRLLRQS
                     DSETLLELAAFPHPNKVLARKVYLDGLFRTSTAESARVILKQLSKFDEKEKLLAILSL
                     NIVKSVDKETLNQAASQLLPNAPKELYIAVGNLVAKYCLKNYCQGPEIDAISKKFSDG
                     LKHCKPNTKREEERIVYILKGLGNAKSLSGNTVAALSECASTGRSNRIRVAALHAFSK
                     VKCEETLQSKSLELLKNRNEDSELRIEAYLSAISCPNAEVANQISEIVNSETVNQVGG
                     FISSNLKAIRDSTDVSRDQQKYHLANIRVTKTFPVDYRRYSFNNEVSYKLESLGVGAS
                     TDYQIIYSQHGFLPRSSRINVTTEFFGTNYNVFEASVRQENVEDVLEYYLGPKGLVNK
                     DFDEIVKLIEVGNNGVAAGGRARRSIVDDVSKISKKYKMYGVKNVQDLNLDVSLKLFG
                     SELAFLSLGDNIPSSLDDIINYFSTSFEKAKQELSSFEKQFSSHHLFLDTDLAYPTSI
                     GVPLELVAQGFAATKVDLAVSLDINAILEQNWQKAKYRLKFVPSVDINANVQIGFNAQ
                     VLSTGLRVVSSAHSATGSDITVAVISDGEGFNVDLELPREKLELINFNVDTELYVAEQ
                     DKQKAIALKGNKKNKNSQPSEICFNQLELVGLNICIKSSTSLSEVQAGNGNVAERGLS
                     VSEKFHLSRPFNFAVYLTTERKFTFKGIHTQEAFSQKWKLDYSTPGSKVSHDTTVVYE
                     LGNKPKTFSRLSFDNSQCHFAVEGGINNDKNELVVYGQYEQDKEIKKSKIGFSKNGNE
                     YKPLIEIQDNNGISNSINGYHADGKIVVKKNSNNIERYNFENFQVSNSNNAHVAVNGW
                     SDVGTNSLTSELRISLDHQTFLIKENLKLENGLYEAGFFINDEHSPENIYGSSIHLTI
                     ADQSYALKTNGKAAAWSIGSDGSFNFQKLADSNSARAGSLVENVEIQYKNKQVGGIKI
                     MSNFDVNKMDVDVEISREQKIGSIIVKYESNQRHAQDYSLEASAKINKHSIDVISKCD
                     FNGNVYVVDNSLVTSWGTLLSAKGEIGQRYSAQDININIQGNVQISGKDKVTQWILKV
                     IGTPDKTNSDFRISRDTSELIKLTSESQHPQDKISFAKLNLIVKNQLTAKGEFRVAKN
                     GKGDFTASIDTLKTEPKHKLEIESKFHIQSPKYDIDASLTLDGKRKVHLKSENTIEKL
                     KFSTKNIGEANDKIIAFEANGSLKGELRGNGEIQGTFIFNAPDGRVIDGSINRKISTN
                     AKSGLSQGNIDAQLSDTPFGSNKKRSISLIGKLDRLNTKTKEFSANSNLVYTAFNGEK
                     SEISYQIKQQPNGDAKNIDFSLKAYGNPLPQPFEIAFALGDYSAQHAVVSITSKYGEI
                     FSVSANGNYNNNQALEYGLQANIEIPKSTLKSLEINSHGKVLKSLIGNENAAYNVEFF
                     LDSKTSLGQYARVNTVWNGTANDGSYDFEAQTNNMESPLKFNGKYHRKQTGNIKDGDL
                     TGKQTYVLNAQYGAQYVKMDASLGYGAEKVDIAYVIDSSFDSVKDIKVNIRTFKPLDD
                     STYVVTALFKQTDKSYGLDTTFYHSAHKKGVDIRLDLLKEKPIIISSIAELLGDRKGK
                     VLFEILNLADLDIKINSEASYVSIDEFYIIVNWSSKKLKLDGYELEARAQSKNIKIQL
                     KNENGIIFSGTATYALKKELNKTIIDGQGKVQYQGKALSGNFKLTRQHFDFGTDREVG
                     FSYTFMGNLGSKNGLGTLKITNKEFNTKFSVCEEKRQCTNLIVQSIVSIDEQKLDAVE
                     HTTLIIVDLRDFGYPYEFELKSQNTRQGLKYQYHLDSFIITGNNFKYQFTANVQPTSS
                     TIKLALPKRQILFETTQKIPADGSLFGRYEQTASFFIDKLQKPDDVARFSAIVDVTGT
                     ERVAFNANGKLKFEHPTIRPLSISGQLNGDVNQQIASAEVIFDIFRLPEQKVVGNSEL
                     RNSRSQNGFNIAYITTVKSAGLQFQYQINSNAAVDIEAHEYNIGLELNNGEIDVKAIS
                     FLNKEKFEISLSESNKHIIYIVGDFSKQNHYAKLNTKVQILDKNPIEITSEVQPNSAK
                     IILKRQDFIDGTAEVKLGKEFKVDVIGSGKQLFNGRVALDATNFLQTNYFINEDHLNG
                     FWHIVESEINKDSEYISENIKERLKKSRQVTDKIVKLAKEAGPDFSKLQGKLLDYKND
                     IVQELEADQSIAPIIDGIRTLFKKIAGIVDDINKAISEILEKAQKSIVDIYDKLQALW
                     KDSLLKAWEDFIITVQKLISTLKTEFIKICTQSFKDLLSALEKYGPALKNYGKAIGEI
                     VKPINDAAQEVIKIVVNAAEGVTHEFKQYVASLPSFESIRNEFNDKVKVLKLFEKATE
                     LTNSLFDQINILPQTPETSEFLQKLHDYLIAKLKQEHIDNEKYIEELGQLLIKAVRSI
                     WVSIRSTYPGSSDHVIDFQSWIGSLTHSFDSLAVLPSILSFRSSILNCLLNENWDVVF
                     NKKLLYSWIFFNDFELRGHVVDGKHIFTFDGLNFAYPGNCKYILAQDSVDNNFTIIGQ
                     LTNGKLKSITLIDREGSYFEVADNLALKLNGNLVEYPQHLSGLHAWRRFYTIHLYSEY
                     GVGIVCTSDLKVCHININGFYTSKTRGLLGNGNAEPYDDFLLIDGTLAENSAALGNDY
                     GVGKCTAIEFDNNQFKSSKRQEMCSELFGIESTLAFNFITLDSRPYRKACDIALAKVA
                     EKEKEATACTFALAYGSAVKQINKWVLLPPRCIKCAGPAGQHDFGDEFTVKLPNNKVD
                     VVFVVDINVTPGVLSNLIAPAINDIRESLRSRGFSDVQVGVIVFEETKRYPALLTSDG
                     GKINYKGNVADVKLAGIKSFCDNCVEQIITEKRILDIYNSLKEIVKGIAPQADEKAFQ
                     LALDYPFRAGAAKSIIGVRSDSLEYKNWWKFVRAQLTGSITKFDGALIHLIAPVKGLS
                     LEGVLSEKLIGFNSRLVATVDGKDSKKRTKLQFDNDMGIDFVLNNGGWVFATQNFEKL
                     KASDQKKMLNQITSSLADTLFKTEIVSDCRCLPIHGLHGQHKCVIKSSTFVANKKAKS
                     A"
ORIGIN      
        1 cggcttaatt cgtagacaac gttcattgta ttcgaatcgg tttaattcgc ggatggtctg
       61 tgtgtttctt aatctaagaa gtatatacaa agtgaaattg gaccagtgtt ggtaaaggct
      121 atccctaagg ggcccagcgt aatggccagg atgaaataca atatcgcttt gattggaatc
      181 ctagcttctg tgcttttaac aattgctgta aatgctgaaa acgcttgcaa tctaggatgt
      241 ccaaaatctg acaatggact tttgaagtac atacccggca actactatga ctattccttc
      301 gacagtattt taactattgg agcgagtagc gacgttccga acgactcaga tgatacaagt
      361 cttaaagtgt ccggatctgc aaaaattttt gcgaaaggaa actgtgggta cacattacag
      421 ttaagttccg ttaaggtaac taatacaaaa gagtctgtgg agaaaaagat attgaacagc
      481 attcagaagc cagttcagtt tacactggta agtggaattt tagagccaca aatttgctca
      541 gactccagcg acttggacta ctctttgaat attaagcgcg cagttgtatc attgcttcaa
      601 tcgggaatag aagcggaaca tgaggttgac gttttcggca tgtgccctac acatacatca
      661 acatcgaaag tgggtaacgc aaatataatt acgaaggcgc ggaacttaaa cagctgttcg
      721 catcgcgagc agataaacag tggcttggtg tccggcaaag ttaacgaaaa agctggcatt
      781 acgtctagcc tgctgttgca ggcaaactat ataaaggagt ccaggattgt aaaccactta
      841 attgaaaatg ttcagctgac agaaacatat aagtttattg gaaatactaa aagaaactct
      901 gatatcagtg caaaagtagt cacaatatta aaactgaaaa atccaagcgg taccaaggca
      961 aactcaccag gaactggttc tactgtcaga agcctaatat ttcagagacc agaaacctat
     1021 acctccaaaa atattaacgc cctcaaaacg attctatctg atctcgtaga ttcaactggc
     1081 gactatgtga aaaaagaaac agctaaaaag tttgttgagt ttattaggtt gctacgccaa
     1141 tctgatagtg agactttatt agaactggct gcgtttccac atccaaacaa agtcttagcc
     1201 cgcaaggtat atttggatgg attatttcgg accagtacag ctgagtcagc tagagttata
     1261 ttaaaacagc tttccaaatt tgatgagaag gagaaattac ttgcaatact gtccttaaac
     1321 atagtgaaaa gtgttgacaa agagaccctc aatcaagcgg cttctcagct cttacctaac
     1381 gcgcctaaag aactatatat agctgtgggt aacttagttg ctaaatattg tttaaaaaat
     1441 tattgtcaag gaccggaaat tgatgccata tctaaaaagt tttctgatgg ccttaagcac
     1501 tgtaagccaa atactaagcg ggaagaagag cgcattgtgt acattttaaa gggactagga
     1561 aatgcaaaga gtttaagtgg caatacggta gctgcactaa gtgagtgcgc ttccacagga
     1621 cgctccaacc gcatacgtgt cgcagccttg cacgcttttt caaaagttaa atgcgaagaa
     1681 accttgcaat ctaaatcctt ggaacttctt aagaatcgta acgaagattc agagttgcgt
     1741 attgaagctt atttgtccgc gatttcatgc cctaacgcag aggttgctaa tcaaatttcg
     1801 gaaatagtta attccgaaac tgttaaccag gttggaggat ttatatcatc taatttaaaa
     1861 gctatccgag actctacaga cgtcagcaga gaccagcaaa aataccattt ggctaacatt
     1921 agggttacaa aaacatttcc cgttgactac agacgataca gttttaataa tgaggtttcc
     1981 tataagcttg aatcccttgg tgttggcgcc agcactgatt accaaataat atattcgcaa
     2041 catggattct tgccccgatc atctcgtatc aatgtaacaa cagaattttt tggcacgaac
     2101 tacaatgtat ttgaggcaag cgtccgacaa gaaaatgttg aagacgtatt ggaatactat
     2161 ttagggccga aaggattagt aaacaaagat tttgacgaaa ttgttaaact cattgaagtt
     2221 ggcaataatg gtgttgctgc tggtggccga gcccgacgat caattgttga tgacgtttct
     2281 aaaatttcga aaaagtataa aatgtatgga gtcaaaaatg tgcaagatct caatttagat
     2341 gtgtcattaa aactgtttgg ttcggaattg gcatttttga gcctaggcga caacatacca
     2401 agttccctag atgacattat aaactacttc tcaacttctt tcgaaaaagc aaaacaggaa
     2461 ttatcttcgt ttgaaaagca attttccagt caccacttat tccttgatac agacctcgct
     2521 tatccaacta gtattggagt acctttggag cttgtagccc aaggttttgc agctaccaaa
     2581 gttgatcttg ccgttagtct tgatattaat gcaattctag aacaaaactg gcaaaaggct
     2641 aagtatagac taaagtttgt tccgagtgtc gatattaatg ctaacgttca gataggcttt
     2701 aatgcacaag tactatccac tggactccgc gtggtttcgt cagcccattc tgccaccggt
     2761 agcgatatta ctgtagctgt tattagcgat ggagagggct ttaacgttga cctcgagcta
     2821 ccacgcgaaa aacttgagct tattaatttt aatgttgaca cagaactata tgtagccgaa
     2881 caagacaaac aaaaggcaat tgccctaaag ggtaacaaaa aaaataagaa ttctcaacct
     2941 agtgagatat gctttaatca attggaactt gttggattaa atatttgcat taagagttca
     3001 acaagcttaa gtgaggttca agctggtaac ggcaatgttg cagagagagg actttctgtt
     3061 tcggagaaat ttcatttatc tagaccattt aatttcgccg tttacttaac gactgaacgc
     3121 aaatttacat tcaaaggaat tcacacacag gaagctttct cccaaaagtg gaaattagat
     3181 tattccactc ctggatccaa agtttctcac gatacaactg ttgtatatga actcggaaat
     3241 aagcctaaaa catttagtag attatccttt gataactctc aatgccactt tgcagtggaa
     3301 ggtggaataa acaacgacaa gaatgaattg gttgtatatg gtcaatacga acaagacaaa
     3361 gaaataaaaa aaagtaaaat tggctttagc aaaaacggaa atgaatacaa gccattaatt
     3421 gaaatccaag acaataatgg aatctcaaat agcataaatg gttaccacgc agacggaaaa
     3481 atagttgtaa aaaaaaacag taataatatt gaaagatata attttgaaaa cttccaagtg
     3541 tcaaattcta ataacgccca cgtagcagta aacggatggt cggatgttgg gacaaattca
     3601 ttaacctctg agcttcgaat ttctttagat caccaaacct tcttaataaa agaaaattta
     3661 aaattggaaa atggtctgta tgaggctggt ttcttcataa atgatgaaca ttctcccgaa
     3721 aatatttatg gaagttcgat acatttaaca atagctgacc agtcttatgc tcttaaaact
     3781 aacgggaaag cagctgcttg gtccattggc agtgatggta gttttaattt ccaaaaactt
     3841 gccgattcca actctgctcg tgcaggctca cttgtggaaa atgtagaaat tcagtataaa
     3901 aataagcagg tcggaggaat aaagattatg tcgaattttg atgttaataa aatggatgtc
     3961 gatgtggaaa tttcgcgtga gcaaaaaatt gggtccataa tcgttaaata cgaaagtaat
     4021 cagcgacacg cacaggatta ttccctggag gcaagtgcaa agattaacaa gcactccata
     4081 gatgttatat ctaagtgtga ttttaacgga aacgtctatg ttgtcgataa ctctttagta
     4141 actagttggg gaactttact atctgcaaaa ggagaaatag gacaacgtta ctcggcccaa
     4201 gacattaata ttaatattca aggcaatgtt caaattagcg gaaaggacaa agttactcaa
     4261 tggatactta aagtaatcgg cacacctgac aaaaccaaca gcgactttag gattagccga
     4321 gacacctcag agcttattaa actaactagc gagtctcaac atccgcaaga taaaatatcc
     4381 tttgccaaac ttaacttaat tgtcaaaaat caattaactg ccaaaggcga gtttcgtgta
     4441 gccaaaaatg gaaagggtga ctttacagcc agtattgaca ctctgaagac tgaaccgaag
     4501 cacaaactag aaatagaatc taaatttcac atacagtctc caaagtacga tattgatgct
     4561 tccctaacac ttgatggaaa aaggaaagtg cacttgaagt cagaaaacac tatagaaaag
     4621 ttaaaattct caactaaaaa cattggtgag gcaaacgaca aaataattgc tttcgaagct
     4681 aatggaagtt taaaaggcga attgcgagga aatggggaaa tacaaggtac ctttatattt
     4741 aacgctcccg atggtcgagt catcgacggc agtatcaatc gtaagatttc cacaaatgca
     4801 aaaagtggct tatctcaagg caacattgac gcgcagctta gtgatacacc ttttggcagt
     4861 aataagaaac gctcaatttc actaatcgga aagcttgatc ggctcaacac aaaaaccaaa
     4921 gaattttctg caaacagtaa tttagtctac acggcattta atggagaaaa gtcggagata
     4981 agttatcaaa ttaagcagca accaaacggg gatgctaaaa acatcgattt tagccttaag
     5041 gcctatggaa atccactacc ccaacctttt gagatcgctt ttgccttagg agattacagt
     5101 gcacagcatg ccgtagttag tatcacaagt aagtacggtg agattttttc tgtaagtgca
     5161 aatggaaact acaacaataa tcaagcactt gaatatgggc ttcaggctaa tattgaaatt
     5221 ccgaaatcca ccttgaagtc cttagaaata aacagtcatg gaaaggtcct aaaatcttta
     5281 atcggaaatg aaaacgctgc atacaatgta gaattcttcc tggattctaa aaccagtcta
     5341 gggcaatatg ctcgtgtcaa tacagtatgg aacggcacgg caaatgatgg cagctatgat
     5401 tttgaagctc aaactaacaa tatggagtct ccattaaagt tcaatggtaa atatcatcgg
     5461 aaacaaacag gtaatataaa agatggtgac cttaccggta aacaaacgta cgtacttaac
     5521 gcacagtacg gagcacaata cgtaaaaatg gatgcttcat taggttatgg agccgagaaa
     5581 gtagatatag catatgttat agattccagc tttgattctg taaaagatat caaagttaat
     5641 attcgcactt ttaaaccttt ggatgattcg acatatgtag tcactgcact attcaaacag
     5701 actgacaagt catacggatt agacacaaca ttttatcatt cggcccacaa aaaaggtgtt
     5761 gatattcgtt tagatctgtt gaaagaaaag ccaattatca tatcgagcat tgctgaactc
     5821 ttaggagacc gaaaaggaaa ggtactcttt gaaattctaa acttggccga tttggacatt
     5881 aaaataaata gtgaagcttc atacgtcagc attgatgagt tttatataat tgtcaactgg
     5941 agctcgaaga aattaaagct cgatggctac gaacttgaag cacgagctca aagtaagaac
     6001 attaaaattc aacttaagaa tgaaaatggc ataattttct caggtacagc cacttatgct
     6061 cttaaaaaag aactaaacaa aactattatc gatggacaag gcaaagtgca gtatcaagga
     6121 aaggcgctca gtggcaattt taagcttacc cggcaacact ttgattttgg tactgatagg
     6181 gaagttggtt tctcttacac ttttatgggt aatttaggat cgaaaaacgg attaggcact
     6241 ttaaaaatca caaacaagga attcaacacc aaattttccg tttgtgaaga aaagagacag
     6301 tgtacaaatt taatagtaca atcaattgtg agcattgatg aacaaaaact ggacgctgtg
     6361 gaacatacca cacttattat tgttgacctg agagattttg gatatccata cgagtttgaa
     6421 ttaaagtctc aaaatacacg tcaaggtctt aagtatcagt accatttaga tagctttatt
     6481 ataacaggaa ataatttcaa gtaccaattt acagccaacg ttcagcccac gtcgtccaca
     6541 attaaattag cactcccgaa acgtcaaatt ttgttcgaaa ctactcagaa aataccagca
     6601 gacggaagcc tttttggccg ctacgagcaa acagcttcat tctttattga caagttgcaa
     6661 aagcctgacg acgttgcccg cttttctgct attgtagatg taacgggcac cgaacgcgtt
     6721 gcattcaacg ccaacgggaa acttaagttt gaacatccaa ctattcgtcc attaagtatt
     6781 tctggacaat tgaatggaga cgtgaatcaa cagatcgcaa gcgccgaagt aatatttgat
     6841 attttccgac tgccggaaca aaaagtggtt ggaaacagtg aattgcgcaa ctcccgttct
     6901 caaaatggtt tcaatattgc atatattaca actgttaagt ctgctggcct tcagtttcag
     6961 taccaaataa atagtaacgc tgcagtggat attgaagccc atgagtataa cattggcttg
     7021 gaattaaata acggtgaaat cgatgttaag gcgatttcgt ttttaaataa ggaaaaattt
     7081 gaaatctctt taagtgaatc aaataaacat atcatttata tagtcggaga cttttctaag
     7141 caaaaccatt atgcaaagct taatacaaag gttcaaattc ttgataaaaa tccaattgaa
     7201 attacatcgg aggttcaacc taattctgca aagattatac tgaagcgtca agatttcatc
     7261 gatggcactg cagaagttaa gctgggcaag gaatttaaag tagatgtcat tggaagtggg
     7321 aaacaattat ttaatggccg agtggcatta gatgcaacaa actttttaca aacaaactac
     7381 tttataaatg aagatcatct aaatggtttc tggcatattg ttgaatcgga aataaataaa
     7441 gatagtgaat atatttctga aaatatcaag gaacgcttaa aaaaatctcg tcaagttacg
     7501 gataaaattg ttaaattggc taaagaagct ggaccagatt tttccaaact gcagggtaaa
     7561 ttacttgatt ataaaaatga tattgttcaa gagcttgagg ctgatcaatc aattgcaccg
     7621 attattgatg gcatacgcac cctatttaaa aaaatagccg gtattgttga tgacataaac
     7681 aaggcaatat cagaaatatt ggaaaaggct cagaagtcca ttgttgatat ttacgataaa
     7741 ttgcaagccc tttggaagga ttcccttctt aaagcctggg aagatttcat tataactgta
     7801 cagaaactaa taagtacact caaaactgag tttatcaaga tttgtacaca gtcatttaaa
     7861 gatctccttt ctgcgcttga gaaatatgga ccagctctga aaaactatgg aaaagctatt
     7921 ggagaaattg taaaacctat aaacgacgct gctcaggaag tcattaaaat tgtggttaat
     7981 gccgctgaag gtgtaaccca tgaattcaag caatatgtcg caagccttcc gtcatttgaa
     8041 agtattcgta atgagtttaa tgacaaagtt aaagttctga aactatttga aaaggcaaca
     8101 gagttaacaa atagtctatt tgatcagatt aacattttac cacagacccc agagacttcc
     8161 gagtttctac aaaaactcca cgattactta atcgctaaat taaagcaaga acatattgat
     8221 aatgaaaaat acatcgaaga acttggacag cttttaataa aggctgttcg ttccatttgg
     8281 gtctcgatta gaagcactta tccaggctca tcggatcatg taattgattt tcaatcttgg
     8341 attggttctc taacgcattc ttttgactca ttggctgttc ttccgagcat actttcattc
     8401 cgttcaagta tcttaaactg tttattaaat gagaattggg atgtagtatt taacaaaaag
     8461 ctattatact catggatatt ctttaacgac ttcgaactac gaggccatgt cgtcgatggg
     8521 aagcatatat tcactttcga cggcctgaat ttcgcgtatc cgggcaactg caaatatatt
     8581 ctcgctcagg acagtgttga caataacttc acaataattg gtcagcttac aaatggaaaa
     8641 cttaagagca ttacgcttat agatcgggag ggtagctatt tcgaagtagc agacaacctt
     8701 gccctgaagc taaatggaaa cctagttgaa tacccacaac acttgtctgg tcttcacgcg
     8761 tggcgccgat tttacactat tcacttatat tctgagtacg gagtaggcat agtatgtaca
     8821 tcagacctga aggtttgcca catcaacata aacggatttt acacaagcaa aaccagagga
     8881 cttctcggca acggcaacgc agaaccatat gatgacttct tgctaatcga tggtacctta
     8941 gccgaaaatt cggccgccct aggcaacgac tatggagtag gaaaatgcac agcaattgaa
     9001 tttgacaata accagtttaa aagctccaaa agacaggaaa tgtgtagcga actgtttggt
     9061 attgaatcga ctctggcttt caatttcatt accttggatt cgagacctta ccgcaaggca
     9121 tgtgatattg cacttgcaaa agtggctgaa aaggaaaagg aggccactgc gtgcacgttt
     9181 gctttagctt atggttcggc agttaagcag attaataagt gggtgctatt accaccgcgt
     9241 tgcattaagt gcgcaggacc tgctggacaa cacgactttg gagatgagtt tactgttaag
     9301 ttaccaaaca acaaggttga cgttgtattt gtcgttgata tcaatgtaac acccggagtg
     9361 ctgtctaatt taatagcgcc ggccatcaat gatattcgtg aatcattaag aagccgggga
     9421 ttttcagatg tacaagtcgg agtgattgtt ttcgaggaaa ccaaacgata tcccgctcta
     9481 ctcacaagtg atggtggcaa aattaactac aagggaaacg ttgctgatgt taagctggct
     9541 ggaataaaga gcttctgtga caattgtgtg gagcaaatta taactgaaaa acgaatttta
     9601 gatatttaca attctttaaa agaaatagta aagggcattg caccccaagc agatgaaaag
     9661 gcgttccaat tggctttaga ttatcccttc cgcgctgggg cggcaaaaag cattatcggt
     9721 gtgcgaagtg attctttaga atataagaat tggtggaaat ttgttcgagc tcagttaact
     9781 ggatccatca ctaagtttga tggtgcactt attcatctta ttgcaccagt aaaggggctc
     9841 tcattagaag gagtgttaag cgaaaaacta attggcttca attctcgatt ggtggctact
     9901 gtagatggga aagacagcaa aaaacggact aagctgcaat tcgataacga catgggcatt
     9961 gactttgtcc ttaataacgg tggctgggtg tttgccacac aaaactttga aaaattaaaa
    10021 gcttcggacc aaaagaaaat gttgaatcag attacatcat cgctggcaga tactctcttt
    10081 aagacggaaa tcgtcagtga ctgtcgctgt cttccgattc atggattgca cggccaacat
    10141 aagtgcgtca ttaagtcatc aacatttgtt gcgaacaaaa aggcaaaatc tgcctaaact
    10201 gggcttacat gttttatcga atcccttcgt gaggaaaaat ctttttttaa actaagtaaa
    10261 acgaaaaatt aataaaaact taatatc
//