Dbfetch
LOCUS NM_135247 11437 bp mRNA linear INV 26-DEC-2023
DEFINITION Drosophila melanogaster uninflatable (uif), transcript variant G,
mRNA.
ACCESSION NM_135247
VERSION NM_135247.6
DBLINK BioProject: PRJNA164
BioSample: SAMN02803731
KEYWORDS RefSeq.
SOURCE Drosophila melanogaster (fruit fly)
ORGANISM Drosophila melanogaster
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE 1 (bases 1 to 11437)
AUTHORS Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
Strelets,V., Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: Impact of
High-Throughput Data
JOURNAL G3 (Bethesda) 5 (8), 1721-1736 (2015)
PUBMED 26109357
REMARK Publication Status: Online-Only
REFERENCE 2 (bases 1 to 11437)
AUTHORS Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: The
Rule-Benders
JOURNAL G3 (Bethesda) 5 (8), 1737-1749 (2015)
PUBMED 26109356
REMARK Publication Status: Online-Only
REFERENCE 3 (bases 1 to 11437)
AUTHORS Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
TITLE The Release 6 reference sequence of the Drosophila melanogaster
genome
JOURNAL Genome Res 25 (3), 445-458 (2015)
PUBMED 25589440
REFERENCE 4 (bases 1 to 11437)
AUTHORS Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
TITLE Sequence finishing and mapping of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1625-1628 (2007)
PUBMED 17569867
REFERENCE 5 (bases 1 to 11437)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
TITLE The Release 5.1 annotation of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1586-1591 (2007)
PUBMED 17569856
REMARK Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE 6 (bases 1 to 11437)
AUTHORS Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
Ashburner,M. and Anxolabehere,D.
TITLE Combined evidence annotation of transposable elements in genome
sequences
JOURNAL PLoS Comput Biol 1 (2), 166-175 (2005)
PUBMED 16110336
REFERENCE 7 (bases 1 to 11437)
AUTHORS Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
Celniker,S.E., Rubin,G.M. and Karpen,G.H.
TITLE Heterochromatic sequences in a Drosophila whole-genome shotgun
assembly
JOURNAL Genome Biol 3 (12), RESEARCH0085 (2002)
PUBMED 12537574
REFERENCE 8 (bases 1 to 11437)
AUTHORS Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
Rubin,G.M., Ashburner,M. and Celniker,S.E.
TITLE The transposable elements of the Drosophila melanogaster
euchromatin: a genomics perspective
JOURNAL Genome Biol 3 (12), RESEARCH0084 (2002)
PUBMED 12537573
REFERENCE 9 (bases 1 to 11437)
AUTHORS Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
Rubin,G.M. and Lewis,S.E.
TITLE Annotation of the Drosophila melanogaster euchromatic genome: a
systematic review
JOURNAL Genome Biol 3 (12), RESEARCH0083 (2002)
PUBMED 12537572
REFERENCE 10 (bases 1 to 11437)
AUTHORS Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
Gibbs,R.A. and Rubin,G.M.
TITLE Finishing a whole-genome shotgun: release 3 of the Drosophila
melanogaster euchromatic genome sequence
JOURNAL Genome Biol 3 (12), RESEARCH0079 (2002)
PUBMED 12537568
REFERENCE 11 (bases 1 to 11437)
AUTHORS Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
TITLE The genome sequence of Drosophila melanogaster
JOURNAL Science 287 (5461), 2185-2195 (2000)
PUBMED 10731132
REFERENCE 12 (bases 1 to 11437)
AUTHORS Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
Smith,E., Yu,C. and Rubin,G.
CONSRTM Berkeley Drosophila Genome Project
TITLE Drosophila melanogaster release 4 sequence
JOURNAL Unpublished
REFERENCE 13 (bases 1 to 11437)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (20-DEC-2023) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 14 (bases 1 to 11437)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 15 (bases 1 to 11437)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (10-NOV-2022) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 16 (bases 1 to 11437)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 17 (bases 1 to 11437)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (20-APR-2020) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 18 (bases 1 to 11437)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (22-APR-2019) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 19 (bases 1 to 11437)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 20 (bases 1 to 11437)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 21 (bases 1 to 11437)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (07-OCT-2015) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 22 (bases 1 to 11437)
AUTHORS Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
Park,S., Svirskas,R. and Karpen,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 23 (bases 1 to 11437)
AUTHORS Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
Svirskas,R. and Rubin,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 24 (bases 1 to 11437)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
CONSRTM Drosophila Heterochromatin Genome Project
TITLE Direct Submission
JOURNAL Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE 25 (bases 1 to 11437)
AUTHORS Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
TITLE Direct Submission
JOURNAL Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
Rockville, MD 20850, USA
COMMENT REVIEWED REFSEQ: This record has been curated by FlyBase. This
record is derived from an annotated genomic sequence (NT_033779).
On Jul 15, 2014 this sequence version replaced NM_135247.5.
##Genome-Annotation-Data-START##
Annotation Provider :: FlyBase
Annotation Status :: Full annotation
Annotation Version :: Release 6.54
URL :: http://flybase.org
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..11437
/organism="Drosophila melanogaster"
/mol_type="mRNA"
/db_xref="taxon:7227"
/chromosome="2L"
/genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
gene 1..11437
/gene="uif"
/locus_tag="Dmel_CG9138"
/gene_synonym="CG9138; CT26172; Dmel\CG9138; poly-EGF;
Poly-EGF; sp1070; SP1070; Uif"
/note="uninflatable"
/map="27D1-27D3"
/db_xref="FLYBASE:FBgn0031879"
/db_xref="GeneID:33983"
CDS 115..10788
/gene="uif"
/locus_tag="Dmel_CG9138"
/gene_synonym="CG9138; CT26172; Dmel\CG9138; poly-EGF;
Poly-EGF; sp1070; SP1070; Uif"
/note="CG9138 gene product from transcript CG9138-RG;
CG9138-PG; uif-PG; uninflatable; SP1070"
/codon_start=1
/product="uninflatable, isoform G"
/protein_id="NP_609091.4"
/db_xref="FLYBASE:FBpp0308723"
/db_xref="GeneID:33983"
/db_xref="FLYBASE:FBgn0031879"
/translation="MKQSTAANSHWLAALLSSLLLLNLLHLIGADEAFSCPNGWELRG
LNCYKYFNIKHSWDKSAELCRRYGAELVAIDSYAENNETLAIARASDPNQRASDKYWL
GLASLDDLRTNTLESASGALISQYSGYWSLHQPNAESGECVAAAFAGKSQSWDLGTCE
SLLPFMCRAQACPQGSLHCANGKCINQAFKCDGSDDCGDGTDELDCPAQCHYHMQSGG
DVIETPNYPHKYGALSKCKWTLEGPLGSNIILQFQDFETEKTFDTVQILVGGRTEDKS
VSLATLSGKQDLTTQPFVSASNFMIVKFTTDGSVERKGFRATWKTEAKNCGGTLKATL
QRQILTSPNYPKQYPGGLECLYVIKAQPGRIISIEVDDLDIADGRDFLMIRDGESPMS
RTIAKLTGKTAQNNRVIISTGNALYLYFKSSLGEAGKGFSLRYIQGCKATITARNGTV
TSPAFGLADYPKNQECYFTIRNNARAPLSLKFDKFTVHKSDNVQVFDGSSTSGLRLHS
GNGFTGPAAPKLTLTASSGEMLIKFTSDALHNAAGWSATFSADCPELQPGIGALASSR
DTAFGTLVSFTCPIGQEFATGKTRLVTECLRGGNWSVSYIPKCQEVYCGPVPQIDNGF
SIGSSNVTYRGIAMYQCYAGFAFASGAPIEKISCLPDGRWERQPHCMASQCAALPEVA
HANVTLLNGGGRSYGTIVQYECEPGYERNGHPVLTCMSNGTWSGDVPRCTRKRCFEFP
TIANGFVVDSTRAYLFGDEARVQCFKGYKLIGSNIMRCSEAQKFEQPPTCEDINECSS
SQCDLTTTECQNTNGSFHCQCRTGFTATTECRPVGDLGLGNGGIPDDSITTSVSEPGY
SKEQLRLNTNGWCGGSSEPGANWILIDLKAPTILRGFRTMSVQRPDGNVAFSSAVRLQ
YTNDLTDVFKDYANPDGTAVEFRILEPTLSILNLPLPIEARYIRFRIQDYVGAPCLRM
ELMGCTRLDCVDINECSKNNGGCDQKCINSPGGFACGCNTGYQLYTSNGTAGYHIERS
ESGERDGDTYQRNKTCVPLMCPELEAPENGQLLSDKNDYHFGDVVRFQCHFGYIMSGS
SAALCLSSGQWNASVPECNYAKCVSLPDDKLEGLTVARPDPESVLVPFRDNVTITCGS
PGRQLRATASSGFRQCVYDPKPGLPDYWLSGMQPSCPRVDCYSPMPTPGAEYGQFVDT
RYQSSFFFGCQNTFKLAGQTGRHDNVVRCGADGIWDFGDLRCEGPVCEDPGRPADGRQ
IARSYEQSSEVYFGCNRPGYILINPRPITCIREPECKVIKPLGLSSGRIPDSAINATS
ERPNYEAKNIRLNSATGWCGKQEAFTYVSVDLGQIYRVKAILVKGVVTNDIVGRPTEI
RFFYKQAESENYVVYFPNFNLTMRDPGNYGELAMITLPKFVQARFVILGIVSYMDNAC
LKFELMGCEEPKQEPLLGYDYGYSPCVDNEPPIFQNCPQQPIVVRRDENGGVLPVNFT
EPTAVDNSGSIARLEIKPQNFRTPSYIFKDTVVKYVAFDYDGNVAICEINITVPDVTP
PLLQCPQSYVIELVDRQDSYTVNFNDTRKRIKTSDDTGDVRLQFSPESANIKIGNFEN
VTVTATDKYNNRAACHFQVSVKASPCVDWELQPPANGAINCLPGDRGIECIATCKPGF
RFTDGEPLKTFSCETSRLWRPTSVVPDCVSENTEQAAYHVTASITYRANGAVAQSCLG
QYQEVLAQHYGGLNQLLSQRCSAVNVNMNVTFVKSVPMLLEENVVKMDFILSILPAVR
QPQLYDLCGSTLNLIFDLSVPYASAVIDDLLNIANIGNQCPPLRALKSQISRGFNCNV
GEVLNMDTSDVPRCLHCPAGTYVSEGQNSCTYCPRGYYQNRDRQGTCLRCPAGTYTKE
EGTKSQADCIPVCGYGTYSPTGLVPCLECPRNSFTAEPPTGGFKDCQACPAQSFTYQP
AASNKDLCRAKCAPGTYSATGLAPCSPCPLHHYQGAAGAQSCNECPSNMRTDSPASKG
REQCKPVVCGEGACQHGGLCVPMGHDIQCFCPAGFSGRRCEQDIDECASQPCYNGGQC
KDLPQGYRCECPAGYSGINCQEEASDCGNDTCPARAMCKNEPGYKNVTCLCRSGYTGD
QCDVTIDPCTANGNPCGNGASCQALEQGRYKCECVPGWEGIHCEQNINDCSENPCLLG
ANCTDLVNDFQCACPPGFTGKRCEQKIDLCLSEPCKHGTCVDRLFDHECVCHPGWTGS
ACDINIDDCENRPCANEGTCVDLVDGYSCNCEPGYTGKNCQHTIDDCASNPCQHGATC
VDQLDGFSCKCRPGYVGLSCEAEIDECLSDPCNPVGTERCLDLDNKFECVCRDGFKGP
LCATDIDDCEAQPCLNNGICRDRVGGFECGCEPGWSGMRCEQQVTTCGAQAPCQNDAS
CIDLFQDYFCVCPSGTDGKNCETAPERCIGDPCMHGGKCQDFGSGLNCSCPADYSGIG
CQYEYDACEEHVCQNGATCVDNGAGYSCQCPPGFTGRNCEQDIVDCKDNSCPPGATCV
DLTNGFYCQCPFNMTGDDCRKAIQVDYDLYFSDPSRSTAAQVVPFPTGEANSLTVAMW
VQFAQKDDRGIFFTLYGVQSARMTQQRRMLLQAHSSGVQVSLFEDQPDAFLSFGEYTS
VNDGQWHHVAVVWDGISGQLQLITEGLIASKMEYGAGGSLPGYLWAVLGLPQPYGLSN
ELAYSDSGFQGTITKAQVWARALDITSEIQKQVRDCRSEPVLYPGLILNWAGYEVTSG
GVERNVPSLCGQRKCPVGYTGANCQQLVVDKEPPVVEHCPGDLWVIAKNGSAVVSWDE
PHFSDNIGVTKIYERNGHRSGTTLLWGTYDITYIASDAAGNTASCSFKVSLLTDFCPA
LADPVGGSQVCKDWGAGGQFKVCEIACNAGLRFSEPVPEFYTCGAEGFWRPTREPSMP
LVYPSCSPSKPAQRVFRIKMLFPSDVLCNKAGQAVLRQKVTNSVNGLNRDWNFCSYAI
EGTRECKDIQIDVKCDHYRGTQNNRVRRQAKDGGVYVMEAELPVVNDPVVHTSTGERS
TVKQLLEKLILEDDQFAVQEILPNTVPDPASLELGSEYACPVGQVVMIPDCVPCAIGT
FYDSANKTCIACSRGTYQSEAGQLQCSKCPVIAGRPGVTAGPGARSAADCKERCPAGK
YFDAETGLCRSCGHGFYQPNEGSFSCELCGLGQTTRSTEATSRKECRDECSSGQQLGA
DGRCEPCPRGTYRLQGVQPSCAACPLGRTTPKVGASSVEECTLPVCSAGTYLNATQNM
CIECRKGYYQSESQQTSCLQCPPNHSTKITGATSKSECTNPCEHIAEGKPHCDVNAYC
IMVPETSDFKCECKPGFNGTGMACTDVCDGFCENSGACVKDLKGTPSCRCVGSFTGPH
CAERSEFAYIAGGIAGAVIFIIIIVLLIWMICVRSTKRRDPKKMLTPAIDQTGSQVNF
YYGAHTPYAESIAPSHHSTYAHYYDDEEDGWEMPNFYNETYMKDGLHGGKMSTLARSN
ASLYGTKEDLYDRLKRHAYTGKKEKSDSDSEVQ"
ORIGIN
1 gtatgccttc gactgtggcg cgttgtggac gttaagaggc tgcggcgaac ccgaactcta
61 gacaagtatt ggcaagaaac caacgcgctt aactgggcat taaatcgggc caaaatgaaa
121 caatcgacgg cagccaattc gcattggctg gccgctttgc tgtcgtcgtt gctgctccta
181 aatttgctac acttaattgg agccgacgag gcgttttcgt gtcccaatgg ttgggaactg
241 cgcggcttga attgttataa atatttcaat atcaagcact cgtgggataa aagcgccgaa
301 ctgtgtcgaa gatacggcgc cgaactggta gccatcgaca gctatgcgga gaacaacgag
361 accttggcca tcgcccgggc cagcgatccc aaccagaggg cttcggacaa gtactggctg
421 ggattggcct ccctcgacga tctgcgcacc aatacgctgg agtccgcatc gggagcactg
481 atctcgcaat actccggcta ctggtcactc catcagccga atgccgagtc cggagagtgt
541 gtggctgctg cctttgccgg caaatcgcag agctgggatc tgggcacctg tgagtccctg
601 ctgccgttca tgtgccgtgc ccaggcgtgt ccacagggat cactacactg tgccaacggc
661 aagtgcatca atcaggcctt caagtgcgac ggcagtgatg attgtggcga tggcaccgat
721 gaactggact gtccagcaca gtgccactac cacatgcagt ccggaggaga tgtgatcgag
781 acgcccaact atccgcacaa atacggtgcg ctgagcaagt gcaagtggac gctggaggga
841 ccgctgggca gcaacattat cctgcagttc caggacttcg aaacggagaa gacctttgac
901 accgtgcaga ttctggttgg cggccgtacc gaggataagt ccgtgtcgct ggccacgctc
961 agtggcaagc aggatctgac cacgcagccc ttcgtatccg cttccaactt catgatcgtc
1021 aagttcacca cggatggcag tgtggagcgc aagggattcc gggccacgtg gaagacggag
1081 gccaaaaact gcggcggcac cttgaaggcc acgcttcagc gacagatcct gaccagcccc
1141 aactacccga agcaatatcc cggcggtctg gagtgtctct atgtgattaa agcacagccg
1201 ggtcgcatca tctccatcga agtggacgac ttggacatcg ccgatggacg cgatttcctg
1261 atgatccgcg atggcgaatc acctatgagt cgcaccatcg ccaaactgac tggaaagaca
1321 gcccaaaaca accgggtgat catctcaacg ggcaacgctc tctacttgta tttcaagtcc
1381 agtttgggtg aggccggcaa gggcttcagt ttgcggtaca tccagggctg caaggccacg
1441 atcaccgcta gaaatggcac ggttacttca cccgcctttg gattggccga ctaccccaag
1501 aaccaggagt gctacttcac cattcgcaac aatgcccgtg ctccgctgtc cctgaagttc
1561 gacaagttca ccgttcacaa gagcgacaat gtccaggtgt tcgatggatc ctccacttcc
1621 ggtctgcgcc tgcactccgg aaacggattc actggcccag cggcgcccaa actgaccctg
1681 actgcttcat ccggtgagat gctcatcaag ttcacctcgg atgcactgca caatgctgct
1741 ggatggtcgg ccacattctc ggccgattgc ccggagctgc aacccggaat tggagccttg
1801 gcctccagtc gcgacaccgc tttcggtacg ctggtcagct ttacatgtcc cattggacag
1861 gagtttgcca ccggcaagac gcgactggtt accgaatgtc tgcgcggtgg caactggagt
1921 gtctcctaca tacccaagtg tcaggaggtc tactgcggtc ctgtgccaca aatcgacaac
1981 ggtttctcca ttggctcctc gaacgtaacc tatcgcggta tagcaatgta ccagtgctac
2041 gccggctttg ccttcgcctc gggtgctccg atcgagaaga tctcctgtct gccggatggc
2101 cgttgggagc gacagcccca ctgcatggcc tcccagtgcg cagcgctgcc ggaagtggca
2161 cacgccaacg tcaccctgct gaatggaggt ggtcgcagct acggcaccat tgtccagtat
2221 gagtgtgagc cgggctacga gcgcaatggc catcccgtgc tgacctgtat gtcgaacggc
2281 acctggagtg gtgatgtacc aagatgcacg cgcaagcggt gcttcgaatt cccgaccatt
2341 gccaacggct ttgtggtgga ctcgacgcga gcctacctct tcggcgatga ggccagggtg
2401 cagtgcttca agggctacaa actgatcggc agcaacatca tgcgctgcag cgaggcccag
2461 aagttcgagc agccgccgac gtgcgaggac atcaacgagt gcagctcctc gcagtgcgac
2521 ctaaccacca ccgagtgcca gaacacgaac ggctccttcc actgccagtg caggacggga
2581 ttcacggcta ccaccgagtg tcggcccgtc ggtgatttgg gcttgggtaa tggaggcata
2641 ccggatgaca gcatcaccac ctcggtcagt gagccgggct acagcaagga gcagctgcgc
2701 ttgaacacga atggctggtg cggtggctct tcggagcctg gtgccaactg gatactcatc
2761 gacctgaagg cacccaccat tctgcgtggc ttccgcacca tgtccgtgca gcgtcccgat
2821 ggcaatgtgg ccttcagctc ggcggtgcgt ctgcagtaca ccaacgatct gacggatgtg
2881 ttcaaggatt atgccaatcc cgacggcact gccgtcgaat tccgcatcct ggagcccacg
2941 ctctccatct taaacctgcc cctgcccatc gaagctcgct atattcgctt ccgcatccag
3001 gactacgtgg gtgcgccctg tctgcgcatg gagctgatgg gctgcacgcg cttggattgc
3061 gtggacatca acgagtgcag caagaacaat ggcggctgtg accagaagtg catcaactca
3121 ccgggcggat ttgcctgtgg ctgcaacact ggctaccagc tgtacacctc caacggcacg
3181 gctggctatc acatcgaacg ctccgaatcc ggcgaacgtg atggtgacac ctatcagcgc
3241 aacaagacct gtgttcctct catgtgtccc gaactggagg cgcccgagaa tggtcaactc
3301 ctgagcgaca agaacgacta tcactttggc gatgtggtgc gcttccagtg ccactttggc
3361 tacatcatga gcggcagctc ggcggccctg tgcctctcca gcggtcagtg gaacgccagc
3421 gtaccggagt gcaattatgc caaatgcgtt tccctgcccg atgacaagtt ggagggtctg
3481 actgtggccc gccccgatcc cgaatccgtt ctagtgccct tccgtgacaa tgtgaccatt
3541 acgtgcggat cgccgggacg ccaactgaga gccaccgctt cctctggttt ccggcagtgc
3601 gtgtacgatc ccaagcccgg tctgcccgat tactggctat ccggaatgca gccctcttgt
3661 ccccgagtgg attgctactc acccatgcca acgcccggcg cagaatacgg acagtttgtg
3721 gacactcgct atcagagcag cttcttcttt ggctgccaga acacctttaa gttggctgga
3781 cagacgggtc gtcacgacaa tgtggttcgt tgtggagccg atggtatctg ggactttgga
3841 gatcttcgct gtgagggacc tgtgtgcgag gatccgggaa gaccggcaga tggtcgccag
3901 attgcacgca gctatgagca gagctcggag gtgtacttcg gctgcaatcg tcctggctac
3961 atcctgatca atccgcgacc cattacatgc atacgcgagc cagagtgcaa ggtcatcaag
4021 cctttgggat taagttccgg caggattccg gattcggcca tcaatgccac ctcggagcga
4081 cccaattacg aggccaagaa catccgtctc aactcggcca ctggctggtg tggcaagcag
4141 gaggccttca cctatgtgag cgtggatctg ggtcagatct atcgagtcaa ggcgattctg
4201 gtgaagggtg tggttaccaa cgacattgtg ggcaggccca cggagattcg gttcttctac
4261 aaacaagctg agagcgagaa ctacgtggtg tacttcccca atttcaatct gaccatgcga
4321 gatccaggca actacggcga gctggccatg atcacgctgc ccaagttcgt gcaggctcgc
4381 tttgtgatcc ttggaatagt gagctacatg gacaacgcct gtctgaagtt cgagttgatg
4441 ggctgcgagg agccgaaaca ggaaccactc ctcggctacg actacggcta ctccccgtgc
4501 gtggacaacg aaccacccat cttccaaaac tgcccgcagc aaccaattgt ggtgcgacgc
4561 gatgagaatg gaggagtact acccgttaac ttcaccgaac ccacggcggt ggacaactcc
4621 ggatcgattg cccgcctgga gatcaagcca cagaacttcc gcacacccag ctacattttc
4681 aaggatacgg ttgtaaagta cgtggccttt gactacgatg gcaatgtggc catctgcgag
4741 atcaacatca cggtgcccga tgtaacacca ccactgctgc agtgccccca gagctatgtg
4801 attgagctag tggatcgcca ggacagctac actgtgaact tcaacgatac ccggaagagg
4861 atcaagacct ccgacgacac aggagatgtg aggttgcagt tcagccccga gagtgccaac
4921 atcaagatcg gaaacttcga gaacgtgacc gtcacggcaa cggataagta caacaaccgc
4981 gccgcctgcc acttccaggt ctctgtgaag gcttcaccct gcgtggactg ggagctccag
5041 ccgccggcga atggtgccat caattgcctg cctggtgatc gtggtatcga atgcattgcc
5101 acgtgcaagc caggattccg tttcaccgac ggcgaaccac tgaagacctt ctcctgcgag
5161 acatcacgtc tgtggcgtcc cacgtccgtg gtgcccgact gcgtgtcgga gaacacggag
5221 caggccgcct accacgtgac cgcctccatt acctaccgcg ccaatggagc agtggcccaa
5281 tcctgtctgg gtcagtacca ggaggtgctg gcacagcact atggcggact caaccagttg
5341 ctctcgcagc gctgctccgc cgtgaatgtc aacatgaatg tgacctttgt gaagtctgtg
5401 cccatgctgc tggaggagaa tgtggtcaag atggacttca tcctctccat tctgcccgct
5461 gtgcgtcagc cgcagctgta cgacctgtgc ggctccacgc tgaacctgat ctttgatctg
5521 agtgtaccct atgccagtgc cgtgatcgat gaccttttga acattgccaa catcggtaac
5581 cagtgtcctc cgctacgcgc cctcaagtcg caaatctcgc gaggatttaa ctgcaatgtg
5641 ggcgaggtac tgaacatgga caccagcgat gtgccgcgtt gcctgcactg tcccgccgga
5701 acgtatgtgt cagagggtca gaacagctgc acctactgcc cgaggggcta ctaccagaac
5761 cgtgaccgcc agggaacctg cctgcgctgc ccggccggaa cctacaccaa ggaggagggc
5821 accaagtcgc aggcggactg cattcccgtc tgcggttatg gcacctactc acccaccgga
5881 ctggtgccgt gcctggagtg tccgcgtaac tcattcactg ccgaaccacc aaccggtgga
5941 ttcaaggatt gccaggcctg tccggcacag agcttcacct accagccggc tgcctcgaac
6001 aaggatctgt gtcgcgccaa gtgtgcgccg ggaacgtact ccgccaccgg actggcaccc
6061 tgctcgccct gcccactgca tcattaccag ggagccgcgg gtgcgcagag ctgcaacgag
6121 tgtccgagta acatgagaac cgattcaccc gcctccaagg gacgcgaaca gtgcaagccg
6181 gtggtatgtg gtgaaggtgc ttgccagcac ggcggactgt gtgtgcccat gggccatgac
6241 atccagtgct tctgtccggc cggattctct ggacgtcgct gcgaacagga catcgacgag
6301 tgcgcctccc agccctgcta caatggtggt cagtgcaagg atctgccgca gggctatcgc
6361 tgtgagtgcc cggctggata ctcgggcatc aattgccagg aggaggccag tgactgtggc
6421 aacgacacct gtccggccag ggccatgtgc aagaacgagc cgggctacaa gaacgtgacc
6481 tgtctgtgcc gcagtggcta caccggcgat cagtgcgacg tgaccatcga tccgtgcacg
6541 gcgaatggca atccgtgcgg aaacggagcc agctgccagg ccttggagca gggtcgctac
6601 aagtgcgagt gtgtgcccgg atgggagggc atccactgtg agcagaatat caatgactgt
6661 tcggagaatc cctgcctgtt gggcgccaac tgcacagatc tggtcaatga cttccagtgc
6721 gcctgtccgc caggatttac gggcaagcga tgcgagcaaa agatcgatct ctgcctatcg
6781 gaaccatgca agcatggcac ctgcgtggat cgtctgttcg atcacgagtg tgtttgccat
6841 ccgggctgga cgggatccgc ctgcgacatc aacatcgacg actgcgagaa ccgaccctgc
6901 gccaatgagg gaacctgcgt cgacctggtc gacggctata gctgcaactg tgaacccggc
6961 tacacgggca agaattgcca gcacaccatc gacgactgcg cctcgaatcc ctgccagcac
7021 ggcgccacct gtgtggacca gctggatggc ttcagctgca aatgccgccc tggctacgtg
7081 ggtctctcct gcgaggccga gatcgacgag tgtctgagcg acccctgcaa tccggtgggc
7141 acggagcgct gcctcgatct ggacaacaaa ttcgagtgcg tgtgccggga cggattcaag
7201 ggacccctgt gcgccacgga catcgatgac tgcgaggcgc agccgtgtct gaacaacggc
7261 atctgtcggg atcgcgtcgg tggctttgag tgcggctgcg agccaggatg gagtggcatg
7321 cgctgcgagc agcaggtgac cacgtgcgga gctcaggcgc cgtgccagaa cgatgccagc
7381 tgcatcgacc tgttccagga ctacttctgc gtgtgtccca gcggcaccga tggcaagaac
7441 tgcgagaccg ctccggaacg ctgcatcggt gatccttgca tgcacggtgg caagtgccag
7501 gactttggct ctggtcttaa ctgcagttgc cctgcggatt actcgggcat tgggtgtcag
7561 tacgagtacg acgcatgcga ggagcatgtc tgtcagaatg gcgccacttg tgtggacaat
7621 ggtgctggct acagctgcca gtgcccacct ggcttcaccg gtcgcaattg cgaacaggac
7681 atcgtggact gcaaggacaa ctcttgccca ccgggcgcca cgtgcgtgga tctaaccaac
7741 ggcttctact gtcagtgccc cttcaatatg accggagacg attgccgcaa ggccatccaa
7801 gtggactacg atctgtactt cagcgatcca tcgcgatcca ccgccgccca ggtggtgccc
7861 ttccccacgg gagaggcgaa cagcctgact gtcgccatgt gggtgcagtt tgcccagaag
7921 gacgatcgcg gcatcttctt caccctctac ggcgtgcaat ccgctcgcat gacccagcag
7981 cgccgcatgc tgctccaggc gcactccagt ggagtccagg tttcactgtt tgaggaccaa
8041 cccgatgcct tcctgagctt tggggagtac acttccgtca acgacggcca gtggcatcat
8101 gtagccgtgg tctgggacgg aatctccggg cagcttcaat tgatcacaga gggactgatt
8161 gccagcaaga tggagtacgg agccggcggc tctctgcccg gttatctctg ggcagtgctg
8221 ggactcccac agccgtatgg acttagtaat gagctggcct actcggattc cggattccag
8281 ggcacaataa ccaaggctca agtgtgggcc agagccctag acatcacgtc agagatccag
8341 aagcaggtgc gcgactgccg ttctgaaccg gttctctatc ccggcctcat cctcaactgg
8401 gcgggatacg aggtgacctc aggcggagtg gagcgcaatg tgccctccct atgcggacaa
8461 cgcaagtgcc cagtgggcta cacgggcgcc aattgccagc aactggtcgt ggacaaggaa
8521 ccacctgtgg tggagcactg ccccggagat ctgtgggtga ttgccaagaa cggttccgcg
8581 gtggtctcct gggatgagcc gcacttcagc gacaacattg gcgtgaccaa gatctacgag
8641 cgaaatggac accgatctgg aactacattg ctatggggca cctacgacat cacctacatt
8701 gcatccgatg cagctggaaa tactgcatcg tgcagcttca aggtttctct gctgaccgac
8761 ttctgtccag cgttggctga tcccgttggt ggatcacagg tttgcaagga ttggggtgcc
8821 ggtggtcagt tcaaggtctg cgagatcgcc tgtaatgcgg gtcttcgatt ctcggagccg
8881 gtgcctgagt tctatacctg cggagccgaa ggcttttggc gaccaactag ggaaccctcg
8941 atgccactcg tctacccatc ctgctcacca tcgaagcccg cccagcgggt gttccgcatc
9001 aagatgctct tcccctcgga cgtgctgtgc aacaaggctg gtcaggcggt gctccgtcag
9061 aaggtgacca actcggttaa tggcctgaac agggactgga acttctgctc ctatgccatc
9121 gagggaacaa gggaatgcaa ggacattcag atcgatgtga aatgcgacca ctaccgaggt
9181 acgcagaaca atcgtgtgcg tcgtcaggcc aaggatggcg gagtctatgt gatggaggcc
9241 gaattgccag tggtcaatga tcccgtggtg cacacatcga cgggcgaacg aagcactgtc
9301 aagcagctgc tggagaagct catcctcgag gacgatcagt tcgccgtgca ggagattctg
9361 cccaacacag tgcctgatcc ggcttccctg gaactgggct cggagtacgc ctgtcccgtg
9421 ggccaggtgg tgatgatacc cgactgtgta ccctgtgcca tcggcacctt ctacgacagc
9481 gccaacaaga cgtgcatagc ctgctcgcgc ggaacctacc agtcggaggc gggtcagctg
9541 cagtgcagca agtgcccggt gattgctgga agaccaggag tgactgccgg tccgggagca
9601 cgctccgcgg cggactgcaa ggagcgctgc ccagctggca agtactttga cgcggaaacg
9661 ggtctgtgcc gctcctgcgg ccatggattc taccagccca acgagggttc ctttagctgt
9721 gagctatgcg gtctgggaca gacaacgcgc tccacggagg ccacgtcacg caaggagtgt
9781 cgcgatgagt gcagctctgg ccagcaactg ggtgccgatg gacgctgcga gccctgccca
9841 cgtggaacat accgcctgca gggcgtgcag ccatcctgcg ccgcctgtcc gctgggcagg
9901 acgacgccca aggtgggcgc cagttcggtg gaggagtgca cactgcccgt ctgctcggcg
9961 ggtacgtacc tgaatgccac acagaatatg tgcatcgagt gccgcaaggg atactaccaa
10021 tcggagtcgc agcagacctc ctgtctgcag tgcccaccga accacagtac caagatcact
10081 ggcgccacct cgaagagcga gtgcaccaat ccgtgcgagc acattgcaga gggcaagccg
10141 cactgcgatg tcaatgccta ctgcatcatg gtgccggaga cgtcggactt taagtgcgaa
10201 tgcaagccag gattcaatgg aacgggcatg gcctgcacgg atgtgtgcga tggcttctgc
10261 gagaactctg gtgcgtgtgt caaggacttg aagggcacac catcttgccg ctgtgtgggc
10321 tcctttacgg gtccccactg tgcggaacgc tcggagtttg cctacatcgc cggtggcatt
10381 gccggagcgg tgatctttat catcatcatt gtcctgctca tctggatgat ttgcgtgcgc
10441 tccacgaagc gcagggatcc caagaagatg ctaacacctg cgattgacca gaccggctcg
10501 caggtgaact tctactacgg cgcccacacg ccctacgcgg agtccatcgc gccatcgcat
10561 cacagcacat atgcgcacta ctacgacgac gaggaggatg gctgggagat gcccaacttc
10621 tacaatgaaa cgtacatgaa ggatggtctg catggcggta agatgagcac gttggccaga
10681 tcgaatgcct cgctctatgg aactaaagaa gacttatacg accgactgaa acgtcacgcc
10741 tacacgggca agaaggagaa gagtgatagt gatagcgaag tgcagtagaa cgacgataaa
10801 ctacagataa ccagctgctt taatgtgtaa aatgtggtca taaataacga atgggttgca
10861 gcagctaact cactgtcaga acagtgacgc cgcccactgc ccgcgccgaa gaatactcac
10921 tggagagcct tgcattgaat caacgtaata ttcggcgatc catctacgat ccatctatag
10981 tcccccatct cggatattac gtgacttaat gcaaggcttt tggcgattaa agtcaagcgg
11041 agatgagatg gcttttatac gagtaatagt aactcccata tgtgctcttt aggtatgcaa
11101 actacatgaa attaagtcga acttatgcgt aatttaataa atgaaatatt gttttaatct
11161 taagtattat ttacctatac aagaaactcg aacttaccat gcttgacgcg gtaccaaatt
11221 gcaatacata tatgctaata tatatatata tatataaaga tagatttttc caaaccattt
11281 agtttgtcca agcttgttga acaaatgcgc gagtgctgta aaaagcaaat caaatatagt
11341 cgttattttg taatttaaat aaaagcaatt taattattat gtatgacaat taattttata
11401 aattttcttg tataaaataa acgaaacaaa caaatcc
//