Dbfetch
LOCUS NM_143496 11469 bp mRNA linear INV 26-DEC-2023
DEFINITION Drosophila melanogaster vacuolar protein sorting 13B, transcript
variant A (Vps13B), mRNA.
ACCESSION NM_143496
VERSION NM_143496.4
DBLINK BioProject: PRJNA164
BioSample: SAMN02803731
KEYWORDS RefSeq.
SOURCE Drosophila melanogaster (fruit fly)
ORGANISM Drosophila melanogaster
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE 1 (bases 1 to 11469)
AUTHORS Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
Strelets,V., Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: Impact of
High-Throughput Data
JOURNAL G3 (Bethesda) 5 (8), 1721-1736 (2015)
PUBMED 26109357
REMARK Publication Status: Online-Only
REFERENCE 2 (bases 1 to 11469)
AUTHORS Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: The
Rule-Benders
JOURNAL G3 (Bethesda) 5 (8), 1737-1749 (2015)
PUBMED 26109356
REMARK Publication Status: Online-Only
REFERENCE 3 (bases 1 to 11469)
AUTHORS Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
TITLE The Release 6 reference sequence of the Drosophila melanogaster
genome
JOURNAL Genome Res 25 (3), 445-458 (2015)
PUBMED 25589440
REFERENCE 4 (bases 1 to 11469)
AUTHORS Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
TITLE Sequence finishing and mapping of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1625-1628 (2007)
PUBMED 17569867
REFERENCE 5 (bases 1 to 11469)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
TITLE The Release 5.1 annotation of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1586-1591 (2007)
PUBMED 17569856
REMARK Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE 6 (bases 1 to 11469)
AUTHORS Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
Ashburner,M. and Anxolabehere,D.
TITLE Combined evidence annotation of transposable elements in genome
sequences
JOURNAL PLoS Comput Biol 1 (2), 166-175 (2005)
PUBMED 16110336
REFERENCE 7 (bases 1 to 11469)
AUTHORS Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
Celniker,S.E., Rubin,G.M. and Karpen,G.H.
TITLE Heterochromatic sequences in a Drosophila whole-genome shotgun
assembly
JOURNAL Genome Biol 3 (12), RESEARCH0085 (2002)
PUBMED 12537574
REFERENCE 8 (bases 1 to 11469)
AUTHORS Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
Rubin,G.M., Ashburner,M. and Celniker,S.E.
TITLE The transposable elements of the Drosophila melanogaster
euchromatin: a genomics perspective
JOURNAL Genome Biol 3 (12), RESEARCH0084 (2002)
PUBMED 12537573
REFERENCE 9 (bases 1 to 11469)
AUTHORS Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
Rubin,G.M. and Lewis,S.E.
TITLE Annotation of the Drosophila melanogaster euchromatic genome: a
systematic review
JOURNAL Genome Biol 3 (12), RESEARCH0083 (2002)
PUBMED 12537572
REFERENCE 10 (bases 1 to 11469)
AUTHORS Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
Gibbs,R.A. and Rubin,G.M.
TITLE Finishing a whole-genome shotgun: release 3 of the Drosophila
melanogaster euchromatic genome sequence
JOURNAL Genome Biol 3 (12), RESEARCH0079 (2002)
PUBMED 12537568
REFERENCE 11 (bases 1 to 11469)
AUTHORS Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
TITLE The genome sequence of Drosophila melanogaster
JOURNAL Science 287 (5461), 2185-2195 (2000)
PUBMED 10731132
REFERENCE 12 (bases 1 to 11469)
AUTHORS Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
Smith,E., Yu,C. and Rubin,G.
CONSRTM Berkeley Drosophila Genome Project
TITLE Drosophila melanogaster release 4 sequence
JOURNAL Unpublished
REFERENCE 13 (bases 1 to 11469)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (20-DEC-2023) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 14 (bases 1 to 11469)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 15 (bases 1 to 11469)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 16 (bases 1 to 11469)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (20-APR-2020) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 17 (bases 1 to 11469)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (22-APR-2019) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 18 (bases 1 to 11469)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 19 (bases 1 to 11469)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 20 (bases 1 to 11469)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (07-OCT-2015) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 21 (bases 1 to 11469)
AUTHORS Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
Park,S., Svirskas,R. and Karpen,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 22 (bases 1 to 11469)
AUTHORS Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
Svirskas,R. and Rubin,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 23 (bases 1 to 11469)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
CONSRTM Drosophila Heterochromatin Genome Project
TITLE Direct Submission
JOURNAL Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE 24 (bases 1 to 11469)
AUTHORS Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
TITLE Direct Submission
JOURNAL Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
Rockville, MD 20850, USA
COMMENT REVIEWED REFSEQ: This record has been curated by FlyBase. This
record is derived from an annotated genomic sequence (NT_033777).
On Dec 17, 2009 this sequence version replaced NM_143496.3.
##Genome-Annotation-Data-START##
Annotation Provider :: FlyBase
Annotation Status :: Full annotation
Annotation Version :: Release 6.54
URL :: http://flybase.org
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..11469
/organism="Drosophila melanogaster"
/mol_type="mRNA"
/db_xref="taxon:7227"
/chromosome="3R"
/genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
gene 1..11469
/gene="Vps13B"
/locus_tag="Dmel_CG15523"
/gene_synonym="CG15523; Dmel\CG15523"
/note="Vacuolar protein sorting 13B"
/map="99C7-99C7"
/db_xref="FLYBASE:FBgn0039727"
/db_xref="GeneID:43550"
CDS 197..11392
/gene="Vps13B"
/locus_tag="Dmel_CG15523"
/gene_synonym="CG15523; Dmel\CG15523"
/note="CG15523 gene product from transcript CG15523-RA;
CG15523-PA; Vps13B-PA"
/codon_start=1
/product="vacuolar protein sorting 13B, isoform A"
/protein_id="NP_651753.2"
/db_xref="FLYBASE:FBpp0084871"
/db_xref="GeneID:43550"
/db_xref="FLYBASE:FBgn0039727"
/translation="MFKLESYITPILLSYVAKYVKNFRDEDAQVSLWEGEVTFQNLDL
RLEVLEEELNLPVELVSGHIHELSILVPWTKLMSEPVKIVINTIEFVAKLPDSESKQR
RASFQREQRRKSKRESVEQPDQTKSPGPASSSSVVNKIINNINLQCHNIILKYVEDDI
VVSMNVQTLNFSSAGEDWKPTMVDIHPVSVVMRKLLQVSDLTICLDKRNTAGRIEVCQ
EPVLYRCTLECRVLRKYNANTVSTTSTTRIGVFTKSLDINVSSLQFPMVMRLVKMLLE
LKPAEFEDDPQNPEDQEAVAEGSESQQGTESAGRSVFWWAWSLLPSFDTEAPSSSCDT
PTGHAFDLGVYAEELNFQLKNSEYFTDQSMGGIKRIRYTPILRISLGGLYYERTILKE
CDWANVKAGLSSMCMEPLGAYRSDDLVDRNLVNTQEFQQERSFIDKSLFDENYMFAER
SWCSYNYADYVARNTDEYMLFRSPVLAFDVIEHRVPKPSSHPVAESQLRDLGVRIQYR
LLSAGITFHFSQSFVQVKKVISDLIRPYDYPGYRSESMKDEDRGVPEPENKNSEMTIP
DLEYLMGLVPTCNYKIELRNIVVQLYPRQQQDESSAINQHQLTTTVRQSLLPYLQLKI
SLVEGTMCGPVNPVRLVQLITHLEEKPRDVVNACYNCFHFNVKNLALMIMNTTPDNGR
AKLLNIPRVQVNWNRLLAPHLWRQNEAPLETSEIKSEIITLEFSKRELIIAKRLVPLI
SSFSGQDLCDLAHIVANVNVNSDVIKLQSVGTKLNLVYHKYHTHLAAVGSLQGVHTDA
VHTKMNIRNVVLSTSKNADNKWLEMQCQFPLEEVHSEQEKIPGTVVCLWLEPFRITTD
IYLLQFLNFSDNSGRQITKDKEVETDLESVNQSEPPITQSVSNYSTISASAIAFNDFQ
PRRISRNNSRKISVPEETVHLSSERDERHTQAEPLTIPVANKPQPSNFDTTDFVKRLT
KMVVLVEIAQAKIDVCEVMMRKTPGHTDVEYTSIRLPHMKVKSGNCEAIQRGNIRGVI
PVASNTELLNWVFDLNEFNVCYRRANVTEMIIAPVRTTITLALSFKTSDKVDGSKGKE
DKCANTCSINVHVDMSAINIFAQRVKLLHEHCLIISKMYGSLCPDEPASKWQTGKMKP
RLEVYTGSAQNYPVIKEFLELDPNFEAPKVKSAPKSSVILFCQWTIPRISFEMENSND
FKHSKIVMSLEDLLLNIDKHSDFNKITTKIENFGLNYYEEKHEEWEKIDDLHIKMLGD
STNLPLISVVITTVSLEDLYAKIGARNPHNKQHTISEVLIEVQPIEMVLNLDQITEFM
VPICEVLEIIGNSASAQSANKPSAPVPSKITTVQDLPMVHLNSKGIYVYLPLSTGKKS
CSVLLLRIESIQLTPSLENPLVRQLVRPDIYQKAAELNMLSTPGALVEDRQYELSVRR
VSLSTGNWKQTQEYRIEKTLASKHDNPAFEWNNQSQSTELDHMEIFKDFDFIMIYAPA
ISYDRFLVCGQSVEYNCATDFVASLNTHQISLISCIIVRAQRLGCIIDQVCVKNAELP
KARSHKTLGNISKYNSVDNSIYFPADPKVFKKDTWIPKSSKKLVSERLSYTNASTSFT
DAKGSSDYFYGSCQSDSGIHLGTRMKGYSRSLAAEEINRSGRQPFVLSYPSSVSFVAG
IFVLKIYDVLELEELERPVMKPLFLMTVSQPSFMSTQNFKAAVTQASIFNVNISVAKA
DVDGKVDSERFNESVFATLPGQLGSSGIPPPLLTIKTQYDRLHQKEVDVELRKPILLR
ICENTIKQLASDVFRIYTVLHETPCFNVRSPRPVPEGSQLRQLKMNCYNADRFHLSCD
RITAKFYDEERSYKCSFVWLDVSTRIKFSSRPQKAVIRSNIGSFYVQSGEKMLLHPLL
MRLSVDLVSEPWSEELLVNATLKLNYLHIDAGVFSILQLQKAQQGVNRIREYADREWQ
QFLRSRPVLGTPKDVSPCNLIKCTPSHESIERIKRPLKSNAEFYQDDLRAGAFQFVYL
NSESVLPMPYQVQIIKKNYGIICWRYPQPRQMTSIHIYPVPMPVSRPIHIRCRMEYFS
ETHETFLHYCDFMLSETSTKQLTPPDRPICATIWRVVILQSLISVDGTCFEGSEDEDL
SIEYLQSDFKVDENNDFILHPKVLVGCMRIDTVFRAEKVPKLQLLLCCQDIELNFVNQ
TDNTGELPKQLNKYRLRQTPNITQTFLTVHVEDLQMHSRIYSRNDFSLQTEMRARVKC
LDYGFLNMLDVLEPMKFSSYVRLTDYPRSLVQANFLVDKLRLNCGPHVIHTLLSSKYH
WQEVLQQKDVRHVLMPKLVIVNRIQTPISFGQTNTVEKIPVNPGEVQLYHFRSCSHVQ
ELTFFITNPKTLLVDSSDSVHIALKFEDEEKIHNLRVGEYCITLKLAKLSTTQIYILV
KGQIELVSMVPFNLVTEFREEEINYDESTPPEHLLLSKERSSYYQNVKRNADINMRLK
LTAGLGRGRTGDIPLKTNNNLPWLVKVPTQSAQQFISIWVRILREDIKMDKPAEDFQP
QKILICIWPIFEICNMLSCAVKATESTTGESSTIEAMGGRWALNTATTHATEHNIGFT
FSGILRGSSKDEYSLMLRTMDWHKFFYYDPTVWTIENALHKLGKSSKRKWPLNDPEEL
RVSRVLDFENAFDVKYQTKATREFSCCQGLDVTAWGMFINGTGLYVAIFMIQEQARLD
LKANSLQMMPKLKSAFTIDVRFDNGWKSSTPICLEDFTQKSSSNAGRSLYLRYNSFVD
IVIVERDEVLRLILEYRQEDDGRRLFKLRSKFVFTNFTDVVLNALTLAMDHKENSTRE
EVEAYSICKKARSLVSSESTKNCSGNAMDVFYDLNAHKAKHNCDTAFVYFVCFSVGGS
REISIPIPLAVPFTRRCFSLQAGPESIPLMITLIEKDNVHYLNVFRDTAPAIVISNNT
NVKFIVAQTSASENSNVSCTNSEFVGRNFEWNQLVLPHSNCYYTPPQMYANFPDVEFT
MCNLSLAVYKDKLCAKNKIAWSKPIRTDKSWQKFLHVPHHGDVKVVVCDKHRAIRLNI
YYIAQQLEFSVKDLRSRLTKPENVLMPSQEQAAPRPDESSEQENDTKRVTLPNMVTCA
HFNEECESKTQTRIKIFIKSFVFSLQTDNRENDFVKTEVCNIYADDSMLIYDDDDDQR
ELRLQLPNLQVDNQLYSSGKYDFPVLLCAEKLYRRNCSLPQVYDLDHVYKQQAQRTPV
SLFTFIFYQDEMQLQNVRCQIPPFRVYIEDAYLNQLLEALVECEPSNCVYSPPVEQDQ
ILLPAGMTLLPDQVVAQSLYISEPLRLNSFTVEPLNLLLSVHTSSRLYIALDHSPLSF
SRYERQQILTVPLRFGQSLGLHYLSGAIFGAGWVVGSLEILGSPSGLARSFSTGLRDF
VSMPVQGLFRGPWGFLVGVTQGSASLLRNVTAGTVNSVTKLAGSVARNLDRLTLDAEH
IELTEARRRARPQGFADGLTQGLTGLGISLLGAVGGLAHHTLEARSSVGVIAGLTKGI
VGALTKPISGAAEMLALTGQGVLHTVGFNAMPQQVEPSFIRNVALHGSSNRIWSYLPE
ELSRDQMLFFSEIMLQISTQLRPALLFLTSSVLAIMQSDSEDLAFASPISKVDVLADR
EDPSKLYLSLKSERRDDFEGQLNYTNERIMKFLNASRLQSIANDSLNDLLQLHELDVD
DRIQRQSQCIFYIKPNIGEHLIHYLKVISRIQH"
ORIGIN
1 cggggctgaa acgtaaacaa tccaaaattc tatggctatg tttgttttgt taaacatatt
61 cggcccagaa ttgtagcaaa attcacagca tgtaaattca agttaaccgc cggcaacgca
121 ttaaatgaag agtgtcatat aagtgagcag tcaacagcgc tgggatcgag gcctgagcct
181 ccggcgtcgc cacaccatgt tcaaactgga gtcctacatc actccgattc tgctgagcta
241 cgtggccaag tacgtgaaga acttccgcga cgaggatgcc caagtgtcgc tgtgggaagg
301 cgaggtcacc ttccagaatc tggacctgcg cctggaggtg ctcgaggaag agctgaacct
361 cccagtagaa ctagtttccg gccacattca tgaactcagc atcctggtgc cgtggaccaa
421 gcttatgtcc gagcccgtga agatcgtcat caataccata gaattcgtgg ccaagttgcc
481 ggatagtgag agcaagcagc gcagggcctc cttccagcgg gagcagcgaa ggaaatcgaa
541 gcgggagtcc gtggagcagc cggatcagac gaagagtcct ggaccggcaa gctcttccag
601 cgtggtcaac aaaatcatca acaacatcaa cctgcagtgc cacaacatca tcctgaagta
661 tgtggaggac gacatcgtgg tctctatgaa tgtgcagacc cttaacttta gctccgccgg
721 cgaggactgg aagcccacta tggtggacat acacccggtg tccgtcgtca tgcgcaagtt
781 gctccaggtc tcagatttga ccatctgctt ggacaagcgc aacacagctg gtaggataga
841 ggtgtgccag gagccggttc tctaccgatg tactctggag tgccgggttc tccgcaagta
901 caacgccaac acggtgtcca caacaagcac aacacgcatt ggagttttca cgaagtcgct
961 ggatatcaac gtgtcatctc tgcagttccc catggtgatg aggctggtca aaatgctgct
1021 cgagcttaag ccagccgagt ttgaggatga cccccagaat ccagaagatc aggaggcggt
1081 agctgagggc agcgagtcgc aacaggggac cgaatccgca ggacgaagcg tcttttggtg
1141 ggcctggagc ctactgccca gtttcgatac tgaggcgcct tcctcgtcct gcgacactcc
1201 cactggtcac gccttcgatc tgggagtcta cgccgaggag ctgaacttcc agcttaaaaa
1261 ctcggagtac tttaccgacc agagcatggg tggcatcaag cgaatacgat acaccccaat
1321 attgcgaatt agtttgggcg gactttatta cgagcggaca attctaaagg agtgcgactg
1381 ggccaacgtc aaggcgggcc tttctagcat gtgcatggag ccattaggtg cttaccgtag
1441 cgacgatctc gtggatcgca acttggtcaa tacccaggaa tttcagcaag agcgatcctt
1501 tattgacaag agtctgtttg acgagaacta catgttcgcc gaacggtcct ggtgcagcta
1561 caactacgcg gattacgtgg ccaggaacac ggatgaatac atgctgtttc gcagtcctgt
1621 tttggccttc gacgtcatcg agcaccgggt gcctaagccc agcagccatc ccgtggcgga
1681 gagccagctc agagatctgg gagttcgcat ccagtatcgt ctattgtccg ccggaatcac
1741 gttccacttt tcgcagtcgt tcgtccaggt taaaaaggta attagtgatt taattcgtcc
1801 ttacgattac ccgggatatc gttcagaatc gatgaaggat gaggatcgcg gtgtccccga
1861 gccggaaaac aagaactctg aaatgactat acctgatctg gagtatctga tgggattggt
1921 gcccacttgt aactacaaaa ttgagttgcg taacattgtg gttcagttgt atccccgcca
1981 gcagcaagat gagtcatccg cgatcaatca acaccaactg acgaccacag tcagacaatc
2041 tttgctacca tatttgcagc ttaagatctc cctagtcgag ggcacaatgt gtggaccagt
2101 gaatcctgtt cgcctggtcc aactgattac tcatctagaa gagaagcctc gggacgtggt
2161 taatgcctgc tacaactgtt ttcacttcaa tgtgaaaaac ttggcgctta tgattatgaa
2221 tacaactccg gataatggac gtgccaagct tttgaatatt ccacgcgttc aagtgaactg
2281 gaaccgcctt ttggcgcctc atctttggcg tcaaaacgag gcgcctttag agacttcgga
2341 aataaaatca gagatcataa ctttagagtt ctcaaagcgc gagctaatca tcgcaaagcg
2401 cttggtgccg ttaatttcga gtttcagtgg ccaagatctt tgtgatctgg cccacattgt
2461 agccaatgtc aacgtcaatt ctgatgtgat caagctgcag agtgtgggca ccaaactaaa
2521 cctggtctat cacaagtacc atacacattt ggctgcagtg gggtcacttc aaggtgtcca
2581 cactgatgcc gtccatacca aaatgaatat acgcaatgtg gtgttgtcca cttcaaagaa
2641 tgcagacaat aagtggcttg agatgcagtg tcagtttccc ctggaggagg tccattcgga
2701 gcaggaaaaa attcccggca cagtggtgtg cctttggttg gaaccatttc gcatcactac
2761 ggacatatat ttgctgcagt tcctaaattt ttcggacaac agtggaaggc aaattacaaa
2821 agacaaggaa gtagaaactg atttggagtc tgtgaaccag tcggagccac caattacaca
2881 atcggtttcg aattattcca cgatttctgc atctgcgatc gccttcaatg attttcaacc
2941 gcgtcgaatc agcagaaaca atagtcgaaa gatctcggtc cccgaggaga cagttcatct
3001 aagttccgag cgagatgaga ggcacaccca ggcagaacct ttgaccatac cagtggccaa
3061 taagccacaa ccaagtaatt tcgatacaac cgattttgtg aagcggctga caaaaatggt
3121 agtgctggta gagatcgctc aggctaagat tgacgtatgt gaagtaatga tgcgaaaaac
3181 tcctggacat accgatgtgg agtatacttc cattcgcctg ccgcacatga aagtaaaatc
3241 gggcaattgc gaggccatcc aacgaggaaa catcagagga gttattccag ttgcgagtaa
3301 cactgagcta ttgaattggg tcttcgacct aaacgagttt aacgtttgct atcgcagagc
3361 taatgtgacc gagatgatta tagctccagt gaggaccacg ataaccttag ctctgtcttt
3421 caagacatct gataaagttg acggaagtaa aggaaaagag gataagtgtg ctaacacgtg
3481 ctctataaat gtgcatgtgg atatgtctgc cattaacatt tttgcccagc gggtgaaatt
3541 gctacatgag cactgcttga ttatttccaa gatgtacggt tcgctgtgcc cggacgaacc
3601 tgcatccaag tggcagacag gaaaaatgaa acccagactg gaagtgtaca ctgggagtgc
3661 gcaaaactat ccggttatta aggagttcct ggagctagat ccaaacttcg aagcaccgaa
3721 agtaaaaagt gctcccaaaa gctcagttat actattttgc caatggacaa ttccgcgtat
3781 cagttttgaa atggaaaact ccaatgattt caaacacagc aaaatcgtta tgagtcttga
3841 ggacttactc ctgaatatcg acaagcatag tgacttcaac aaaattacca ccaaaataga
3901 aaacttcggc cttaactact atgaggaaaa gcatgaggaa tgggaaaaga tcgacgactt
3961 gcatatcaaa atgctgggag atagcacaaa cttgccattg attagtgtgg tgatcaccac
4021 tgtgagcctt gaggatcttt acgcaaagat tggggcaaga aaccctcaca acaagcagca
4081 cactatctct gaggtcctca tcgaggtgca accgatagaa atggtgctga acctggacca
4141 aattaccgag tttatggtcc caatatgcga ggtccttgaa attattggaa attctgcctc
4201 tgcacagtca gcgaacaaac cgagtgctcc agttccaagc aagatcacaa cagttcaaga
4261 tctaccaatg gtgcatttaa acagcaaggg aatctatgtt tatcttcctc tgtctacagg
4321 caaaaagagt tgcagtgtcc tgctgctaag gattgaaagt attcaactta ctccgagtct
4381 cgaaaatcct ctagttcgtc agcttgtccg accagatatc taccaaaaag cagcagagct
4441 gaacatgctc agcactccgg gagcactggt ggaggaccgc cagtatgagc ttagtgttcg
4501 gagggtatct ctgtccaccg gaaactggaa gcaaactcaa gagtatcgca tagagaagac
4561 tctggcaagc aaacatgaca atcccgcatt cgagtggaac aatcaaagtc aatccaccga
4621 gctggatcat atggaaatat tcaaagactt tgattttatc atgatttacg caccggctat
4681 aagctacgat agatttttgg tttgtggaca gagtgtggag tataattgcg cgaccgattt
4741 tgtggcttcc ttgaacactc accaaatcag tctgattagc tgcatcatag tgcgcgcgca
4801 gcgattgggt tgtataatcg accaggtctg tgttaaaaat gctgagttac caaaagcgag
4861 atcacataaa acgctgggaa acatttccaa atacaacagc gtggacaact cgatatattt
4921 tccagcagat ccaaaagttt ttaagaagga tacttggatc ccaaaatcat ctaaaaagct
4981 ggtcagcgag aggttaagtt acacgaatgc aagcacaagt tttactgatg ccaaaggcag
5041 tagtgactac ttctacggtt cctgccaaag cgattctgga attcatttgg gcactcggat
5101 gaaaggatat tcaagatcgc ttgcagcgga agagattaat cggagtggac gccaaccgtt
5161 tgtcctgagt taccccagta gcgtatcgtt tgtggccggt atattcgttc ttaagatcta
5221 cgacgttttg gagttggaag aactggaaag acctgtcatg aaaccacttt ttttaatgac
5281 tgtatctcag cccagtttca tgtctaccca aaatttcaaa gcagccgtaa cacaagcctc
5341 gatttttaac gtaaacataa gcgtagccaa ggcagatgtt gatggaaaag tagattcgga
5401 acgatttaat gagtcggtat ttgcaacatt acccggacaa ctgggaagct cgggtattcc
5461 gcctccgttg cttactatta agactcaata tgaccgcctc catcagaagg aagtagatgt
5521 ggagcttcgg aaacccatcc ttttaaggat ttgcgaaaac actatcaaac aattggcaag
5581 tgacgttttc cgtatctaca ctgttcttca cgagactccc tgctttaatg tccgtagtcc
5641 gcgtcctgtg cccgaaggat cccaactccg ccagttgaag atgaattgct acaacgcgga
5701 taggtttcac ttgtcatgtg atcgtataac tgccaagttc tacgatgagg aaaggagcta
5761 taaatgcagc ttcgtctggc tggatgtcag cacccgcatc aagtttagta gtcgccctca
5821 gaaggcggtt ataaggtcca acataggatc attctatgtg cagtccggag aaaagatgct
5881 gctccacccg ttgcttatgc gtttgtccgt agatttggtc agtgagccat ggtctgagga
5941 gctattggtc aatgccacac tcaagctaaa ttaccttcac atcgacgcgg gtgttttcag
6001 catattgcaa ctgcaaaagg cacagcaagg agtgaaccgg atacgcgaat atgccgatcg
6061 ggaatggcag cagttcctgc gcagtcgtcc tgttctgggc actccgaaag atgtctcacc
6121 ctgcaatctg attaagtgca ctccctcaca cgaatcgata gagcggatta aacgaccgtt
6181 gaaatcgaac gcggaatttt atcaggatga cctgagagct ggcgccttcc agttcgtgta
6241 tctgaactcc gagtctgtac tgccaatgcc ctaccaggtg cagatcatta agaaaaacta
6301 tggcatcatc tgctggcgat atccgcagcc gcgccagatg acaagcatac acatatatcc
6361 ggttcctatg ccagtaagca gacccataca tatcagatgt cgcatggagt actttagtga
6421 gacgcatgaa actttcctgc attactgcga ttttatgctc tcggagacct ctactaagca
6481 attgactcca ccagatcgtc cgatctgtgc aaccatttgg agggttgtaa tactgcaatc
6541 gttgatatct gtagacggta catgttttga aggaagtgag gatgaagacc tctcgatcga
6601 gtacctccag tcggacttca aagtggacga aaacaatgat tttattttgc atccaaaagt
6661 tcttgtgggt tgcatgcgaa tcgataccgt ttttagggca gaaaaagtgc caaaactgca
6721 gttgttgctc tgctgccagg acatcgaact gaattttgtg aaccagacgg ataacactgg
6781 tgaactgcct aaacaattga acaagtaccg ccttaggcag acccctaata ttacgcaaac
6841 ctttcttact gtgcacgtgg aggatctaca gatgcattcg aggatatatt ccagaaacga
6901 ttttagtttg cagaccgaaa tgcgcgctcg agttaagtgc ttggactatg gattcctaaa
6961 catgctagat gttctagaac caatgaagtt ctcaagctat gtaaggctga ccgactatcc
7021 acgctctcta gtgcaagcta atttcctagt ggataagctt cggttgaact gcggacccca
7081 tgtgatccac actttgctga gttccaaata ccactggcag gaagttttgc agcagaagga
7141 tgtgcggcac gttttgatgc caaagttggt tatagttaac cggatacaga cgcccataag
7201 ctttgggcaa acgaacactg tcgaaaaaat ccccgtcaat cctggtgaag tgcagctcta
7261 tcacttccgc agctgctccc acgtccagga actaaccttc ttcattacaa atcccaaaac
7321 tcttctggtg gattcaagtg attcagtaca tatagcactt aagttcgagg acgaagaaaa
7381 aatccataat ttgcgggtcg gagagtattg tattactctc aagttagcaa aactctccac
7441 cacccaaatt tatattctgg tcaagggaca aatagagttg gttagcatgg ttcccttcaa
7501 tctggtcact gagtttcgcg aggaggaaat aaattacgac gaaagcaccc caccggagca
7561 tctcctcctg tccaaagaaa gatcatccta ctaccaaaat gtcaagcgga acgccgatat
7621 caacatgcgg cttaagctga ccgccggact tggaagagga cgcactggcg acatcccact
7681 gaaaactaac aacaacttgc cttggctggt caaggttcct acgcaatctg ctcagcagtt
7741 tatcagcatt tgggtgcgaa tcttgcgtga ggacattaaa atggataagc ccgccgagga
7801 ctttcagccg cagaagattc tgatctgcat atggccgatc tttgagatat gcaatatgct
7861 cagctgtgcg gtcaaggcta ccgaaagtac taccggggaa agttccacaa tcgaagccat
7921 gggaggcaga tgggcactga acactgctac aacccatgca acggagcata acattggctt
7981 tacattctcg ggtattttgc gaggatcttc aaaagatgaa tattcgctta tgttaaggac
8041 aatggactgg cacaagttct tttattatga tcccacagtg tggaccatcg aaaatgctct
8101 gcataagctt ggcaagtcat cgaagcgcaa atggccactg aacgatcctg aagaattgcg
8161 ggttagccgc gtgctagatt ttgaaaatgc ctttgatgta aagtaccaga ctaaggcgac
8221 gagggagttc tcctgctgcc aaggactgga cgtgactgcc tggggcatgt tcatcaatgg
8281 aacgggctta tatgtagcca tttttatgat ccaagagcag gcgcgccttg atcttaaagc
8341 aaacagtttg caaatgatgc caaaactcaa aagtgctttc acaattgatg tccgatttga
8401 caacgggtgg aaatcctcaa ctccgatctg tttggaagac tttacccaaa aatcgtccag
8461 caacgccggg cgatcattgt atctgcggta caattcattc gtggatatag ttatagtaga
8521 aagggatgaa gtattgcgcc taatactgga atacaggcag gaagacgacg gacgtcgttt
8581 gttcaaattg cgatcaaaat tcgtattcac taattttacg gatgtggtcc ttaatgccct
8641 gacccttgct atggaccaca aggagaactc tacgcgggaa gaagtggagg catattcaat
8701 ttgcaaaaag gcccgcagtc ttgtttcatc cgaatcaacc aagaattgct cgggtaatgc
8761 aatggacgtc ttctacgatc ttaacgccca taaggctaag cataactgtg atacagcatt
8821 cgtctatttc gtttgcttta gtgttggtgg tagtcgggag atctccatac ccattccctt
8881 ggccgtaccc ttcaccagac gctgcttcag cttgcaagcg ggacccgagt ccatcccatt
8941 gatgatcacc ctaatagaga aggataatgt ccactactta aatgtgttcc gggacacagc
9001 acctgccatt gttattagca acaacacgaa tgtcaagttt attgtggccc aaacgagtgc
9061 ttcggagaac agcaatgtga gctgcacaaa ctcggaattt gtgggaagga acttcgagtg
9121 gaatcagctg gtcctgcctc acagcaattg ctattatacc cctccgcaga tgtacgctaa
9181 ctttccagat gtggaattca caatgtgtaa tttatccttg gcagtttaca aagataagct
9241 ttgtgccaag aacaagatcg cttggtcaaa acccattcgc accgacaagt cttggcagaa
9301 attcctacat gtgcctcacc acggcgacgt gaaagtggtt gtctgtgaca agcatcgagc
9361 tatacgactg aacatctact atatagctca gcagctggag ttttcggtta aagacttgcg
9421 atcacgtttg acaaaacccg agaatgtttt gatgccatcg caggagcagg ctgctccgag
9481 accagatgag agctcggagc aagagaacga cactaaaagg gtaactttac cgaatatggt
9541 gacgtgtgcc catttcaacg aagagtgtga aagcaaaacg cagacaagga tcaaaatttt
9601 catcaagagc tttgtgttca gcctgcagac agacaaccgg gaaaatgact tcgtgaagac
9661 ggaggtgtgc aatatctatg ctgatgactc catgcttatc tacgacgacg acgacgatca
9721 gcgagagctg cgattgcagt tgccaaacct tcaggtagat aaccagttgt actccagtgg
9781 aaagtatgat tttcccgtcc tgctttgtgc ggaaaaactg tacagaagga attgtagtct
9841 gccacaggtc tacgatttgg accatgtata taagcaacag gcacaacgaa ctcccgtctc
9901 tctgttcacg ttcattttct accaagatga aatgcaactg cagaatgtgc gatgtcagat
9961 tccacccttc agggtgtaca tcgaagatgc ctacctgaat caactgctgg aagcattggt
10021 agagtgtgaa ccgtccaatt gtgtctatag tccccctgtg gagcaggatc aaattcttct
10081 gccagccggg atgacccttt tgcccgatca agttgtggcc caatcattgt acatatcgga
10141 acccttgcgt cttaatagtt ttacggtaga gcccctgaat ctattgctat ccgtccatac
10201 atcttcgagg ctctatatag cactagatca ctctccccta tccttttcac gctacgagcg
10261 tcagcaaatc cttacagttc ccctccgatt cggacaatct ctaggactgc actacctcag
10321 cggagcgatt ttcggagccg gctgggtcgt tggttcccta gagatactgg gaagtcccag
10381 tgggctggcg cgctcgttca gcaccggtct gcgagacttt gtgtccatgc ccgttcaggg
10441 tctgttccgc ggaccttggg gcttcctcgt gggagtgacg caaggttctg cctctctgct
10501 gcgaaatgtg acagcaggaa ccgtgaattc cgtgaccaaa ctggctggtt cagtggctcg
10561 gaatctcgac cggctgactt tggacgctga gcacatcgaa ctgaccgagg cgaggcgtag
10621 agctcggcca cagggcttcg ccgatggact gacccaaggt ctgactggat tgggcattag
10681 tttacttggc gctgttggcg gattggccca tcacaccctg gaggcacgaa gttccgttgg
10741 tgttatcgct ggtctcacca agggcattgt tggagcacta acgaaaccaa ttagcggagc
10801 agctgaaatg ctagccctca cgggccaggg tgttctgcat acggtgggct ttaatgctat
10861 gccccaacag gttgagccca gtttcattcg aaacgttgcc ctgcatggaa gctccaatcg
10921 catatggagc tatttgcccg aggagttgag ccgcgatcag atgctgttct ttagtgagat
10981 catgctgcag atcagcacgc agttgcgccc agccttgctg tttctaacct cgtcagtatt
11041 ggccatcatg cagtcggata gcgaagacct ggccttcgcc tctcccattt ccaaggtgga
11101 tgtgttggcg gaccgcgaag atccctccaa actgtacttg agtctgaaat ccgagaggag
11161 ggatgacttt gaagggcaac tcaactacac caatgagcgc atcatgaagt tcctgaatgc
11221 ctctcgactt caaagcatcg ccaacgattc gctgaatgat ctgctccagc tacacgaact
11281 ggatgtggac gaccgaatcc agcggcaatc gcagtgcatc ttctacatta agcccaatat
11341 tggtgaacac ctgatccact acctaaaggt aattagtcgg attcaacatt gatagtaaat
11401 atgcatctgt gcgtacgttt cggattagtc caaagtattc ctagaatcct gtgtatttta
11461 tgccaaaaa
//