Dbfetch
LOCUS NM_078670 12442 bp mRNA linear INV 26-DEC-2023
DEFINITION Drosophila melanogaster dynein heavy chain at 16F (Dhc16F), mRNA.
ACCESSION NM_078670
VERSION NM_078670.3
DBLINK BioProject: PRJNA164
BioSample: SAMN02803731
KEYWORDS RefSeq.
SOURCE Drosophila melanogaster (fruit fly)
ORGANISM Drosophila melanogaster
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE 1 (bases 1 to 12442)
AUTHORS Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
Strelets,V., Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: Impact of
High-Throughput Data
JOURNAL G3 (Bethesda) 5 (8), 1721-1736 (2015)
PUBMED 26109357
REMARK Publication Status: Online-Only
REFERENCE 2 (bases 1 to 12442)
AUTHORS Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: The
Rule-Benders
JOURNAL G3 (Bethesda) 5 (8), 1737-1749 (2015)
PUBMED 26109356
REMARK Publication Status: Online-Only
REFERENCE 3 (bases 1 to 12442)
AUTHORS Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
TITLE The Release 6 reference sequence of the Drosophila melanogaster
genome
JOURNAL Genome Res 25 (3), 445-458 (2015)
PUBMED 25589440
REFERENCE 4 (bases 1 to 12442)
AUTHORS Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
TITLE Sequence finishing and mapping of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1625-1628 (2007)
PUBMED 17569867
REFERENCE 5 (bases 1 to 12442)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
TITLE The Release 5.1 annotation of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1586-1591 (2007)
PUBMED 17569856
REMARK Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE 6 (bases 1 to 12442)
AUTHORS Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
Ashburner,M. and Anxolabehere,D.
TITLE Combined evidence annotation of transposable elements in genome
sequences
JOURNAL PLoS Comput Biol 1 (2), 166-175 (2005)
PUBMED 16110336
REFERENCE 7 (bases 1 to 12442)
AUTHORS Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
Celniker,S.E., Rubin,G.M. and Karpen,G.H.
TITLE Heterochromatic sequences in a Drosophila whole-genome shotgun
assembly
JOURNAL Genome Biol 3 (12), RESEARCH0085 (2002)
PUBMED 12537574
REFERENCE 8 (bases 1 to 12442)
AUTHORS Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
Rubin,G.M., Ashburner,M. and Celniker,S.E.
TITLE The transposable elements of the Drosophila melanogaster
euchromatin: a genomics perspective
JOURNAL Genome Biol 3 (12), RESEARCH0084 (2002)
PUBMED 12537573
REFERENCE 9 (bases 1 to 12442)
AUTHORS Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
Rubin,G.M. and Lewis,S.E.
TITLE Annotation of the Drosophila melanogaster euchromatic genome: a
systematic review
JOURNAL Genome Biol 3 (12), RESEARCH0083 (2002)
PUBMED 12537572
REFERENCE 10 (bases 1 to 12442)
AUTHORS Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
Gibbs,R.A. and Rubin,G.M.
TITLE Finishing a whole-genome shotgun: release 3 of the Drosophila
melanogaster euchromatic genome sequence
JOURNAL Genome Biol 3 (12), RESEARCH0079 (2002)
PUBMED 12537568
REFERENCE 11 (bases 1 to 12442)
AUTHORS Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
TITLE The genome sequence of Drosophila melanogaster
JOURNAL Science 287 (5461), 2185-2195 (2000)
PUBMED 10731132
REFERENCE 12 (bases 1 to 12442)
AUTHORS Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
Smith,E., Yu,C. and Rubin,G.
CONSRTM Berkeley Drosophila Genome Project
TITLE Drosophila melanogaster release 4 sequence
JOURNAL Unpublished
REFERENCE 13 (bases 1 to 12442)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (20-DEC-2023) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 14 (bases 1 to 12442)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 15 (bases 1 to 12442)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 16 (bases 1 to 12442)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (20-APR-2020) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 17 (bases 1 to 12442)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (22-APR-2019) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 18 (bases 1 to 12442)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 19 (bases 1 to 12442)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 20 (bases 1 to 12442)
AUTHORS Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
Park,S., Svirskas,R. and Karpen,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 21 (bases 1 to 12442)
AUTHORS Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
Svirskas,R. and Rubin,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 22 (bases 1 to 12442)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
CONSRTM Drosophila Heterochromatin Genome Project
TITLE Direct Submission
JOURNAL Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE 23 (bases 1 to 12442)
AUTHORS Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
TITLE Direct Submission
JOURNAL Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
Rockville, MD 20850, USA
COMMENT REVIEWED REFSEQ: This record has been curated by FlyBase. This
record is derived from an annotated genomic sequence (NC_004354).
On Jul 15, 2014 this sequence version replaced NM_078670.2.
##Genome-Annotation-Data-START##
Annotation Provider :: FlyBase
Annotation Status :: Full annotation
Annotation Version :: Release 6.54
URL :: http://flybase.org
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..12442
/organism="Drosophila melanogaster"
/mol_type="mRNA"
/db_xref="taxon:7227"
/chromosome="X"
/genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
gene 1..12442
/gene="Dhc16F"
/locus_tag="Dmel_CG7092"
/gene_synonym="atoll; CG7092; cvl-6; dhc; Dhc; DHC; DHC9;
Dmel\CG7092"
/note="Dynein heavy chain at 16F"
/map="16F6-16F6"
/db_xref="FLYBASE:FBgn0283476"
/db_xref="GeneID:32785"
CDS 120..12365
/gene="Dhc16F"
/locus_tag="Dmel_CG7092"
/gene_synonym="atoll; CG7092; cvl-6; dhc; Dhc; DHC; DHC9;
Dmel\CG7092"
/note="CG7092 gene product from transcript CG7092-RA;
CG7092-PA; Dhc16F-PA; dynein-related heavy chain
polypeptide; crossveinless-like 6"
/codon_start=1
/product="dynein heavy chain at 16F"
/protein_id="NP_523394.1"
/db_xref="FLYBASE:FBpp0074283"
/db_xref="GeneID:32785"
/db_xref="FLYBASE:FBgn0283476"
/translation="MSGMNRHLISKGKDRSKIKIKGKEKEKERKKDEDDDGSRKARER
GRSRRQSAGGGDHSVDMGFKRVTGEDRPSSMSMDRLDSDFAAFTDRASDANDGSITGP
ADGNGKKAQTGGKQKRASKHRLNFSTSKSAGSGTGNKSRSASTTKITTGASLIPDLNL
IINRMRNMPDDSFVYMDYMLPHDSEYFTPYSLREVEYKDLNTYEPFYTVTRHGVTFWH
CSENFFTPLDQWQQEFKQFLSIIQIRSFSIFRLWKGFKVWEKTIKWRKLNEARDYLQN
NLFIVIPQLAKAILRMRSDIVQLQRLNFVNVSNIENWHPFYFLETHMRIYEQLHKTFT
DFREFIAKTIFRACTDAIQARGFYPDDEVNYYPSLKKMREAHSFMDRARKRAFCKTLT
NFLTYCDMMVYQMLYRITKKSFEDLATSFEVHDEVGPSEADINKHDRVDKRIEKQRPQ
DKPQSPFFLAMLRLLPDRIDIEPSEDIIRIIFQRITGLILETVLEIHPFTTDPFFTQY
TQPSIMGRQEEVLYEGAPDLHYLLRADIRFQYNRKNMFVLIRKAYERARLYTQRFHQI
RENFEIDNSTDPTVLNTERDLQILRAYCDRYCNNVRALDGILEYVFLGLLKLTQTNFK
DTVTPVCSRLQNVLATYLPKLAEEETTRLYEEAQDFHGRICYEPHETLEIVAHIRFLD
KCSTELDGIFDGIDYVHDLLLIIKDFGIPIDDDSKEDYMDTEDYLNRTRETLEEIREK
RQDFINRLEDAMQDDIAALKEDIHEVAIEALQPWLLDANSNRLSVTNKLDSMLERLNK
CRETADEFLGYQKEFQIDLTMYDEMASGFYDIRMRQNLYRTWSDWEESLAEWIVSDFN
TLNVVDMVELNSKTIKNCMQFQKYLPENNIVPVLQKSAEAFKEKLPVIGYLRNPNLRA
RHWAEIEDLLNRKFFQEKDILIQTYEDVHAFDDVAIGEALMQISSQATGEVQLENMLK
GIETTWKETELSIVPHHDAKDVFILAGTEELQAVLDDSNVNINTIAASKFVGPIKSKV
DEWINAMDQFAKTFESWMDCQGAWIYLEAIFASADIQRQLPHEAKMFFTVDKSFKETV
RQAKKVALALPTMSSVDVHKVLVENNRLLDLISRGLEAYLEVKRVVFPRFYFLSNDEL
LEILAQTRIPQAVQPHLRKCFDAIYRLEFGSKEGGDGKMVATNDIVAFLSPEGEKLQF
GKGLKARGAVEEWLSKVEEAMFVSCKRYMRFGYQCYPAKEREDWFQDHPNQVVLTVSQ
VQWAADIHRIYEGKERNPLNILEKMAKFEIKCLKDLGALAALTRKNISSLLRKILCAL
ITIDVHAKDSVRMLIEKEVCKASDFNWLKMLRFYWADETETVYSRMAAANIPYYYEYL
GAGGVLVLTPLTDRCYLCLMGAFQMDLGGAPAGPAGTGKTETTKDLAKALAKQCVVFN
CSDGLDYKMMGRFFSGLAQCGAWCCFDEFNRIDIEVLSVIAQQLITIRTAKAMRVKRF
IFEGREIKINRSCCVFITMNPGYAGRTELPDNLKALFRPISMMVPDYALISEVILYSE
GFEDPKILARKMVQMYQLCSQQLSQQNHYDFGMRAVKSVLVMAGALKRASPNQREDIT
LIAALRDSNIPKFLADDAVLFRGILSDLFPGVELPDSQHPHLEASLRLGLRQKNLQAV
PTTIRKCLQLYETMCVRWGVMLVGPTGGGKSVVLHALEFALSHLFENEVQDPNFRPVV
IQTMNPKAVTMNELYGYVDLKTLEWQDGLLGLAVRTATTVEDEIHQWIMCDGPVDAVW
IENLNTVLDDNKMLCLANSERIKLTAWIHMLFEVQDLLQASPATVSRCGMVYVDPGDL
GWIPLIDTWREVDMKHKLPAPLAEFCYQLFVGYFDKALKIERKRAVYTIHQVLGSKVR
LCCELNSAQFEAVKWSAMSEEQGKELVTKIFAWAVLWAIASNLKDAEKVSFEEQWSKA
IAQHPNMTLPNFTLWNYRIDLEKMDWGSWIDIMAKFVFDPETSYYDMQVPTVDTTKYG
YVSDLLFKRGMPVMVTGDTGVGKTVLAISCMKRLSQGNVIPVILNFSAQTSSNRTQEM
IEGPLEKRKKTQLGAPVGKTVIVFIDDVNMPKLDTYGASPAIELLRQFLDFKGFYDRE
KLYWKEILDVVLGCACAPPGGGRNPLTPRFVRHFALFSLPKPNEETLTQIFNGILRGF
LQTFSSAVRALSEPMVNACVDVYMRVATVMLPTPDKSHYIFNLRDLSKCIQGILQASN
LHYNQENQILRLFYHETTRVFHDRLINIEDKNIFKALMKEVCMDHFNRPVINDNEPPI
LFGDFMVFGKPKNERIYDEIRDHTKLESVLNDYIADYNSVAVGKQMKLILFQDAMEHT
VRLARLLRSDRGNGLLVGVAGMGKQSLTRLASHVNEYNCWQIEMRRNYDLNAFHEDLR
VLYRIAGIDNQPVTFLLIDSQIVEEEFLEDINNILNSGEVPNLFEGDEFEKIILDARD
GCNENRKDDPCTRDDIYKFFINRVRNNLHVVMSMSPVGDAFRRRCRMFPSLVNCTTID
WFTSWPTEALYSVALGLLTKIAPKMEDRISLASTTVFMHKTVEDASVKFYKEMKRHYY
TTPSSYLELLKLYQNLLKIKNMEIIAKRKRIANGLNKLLETNEVIAVMGKELEVMVPQ
LDEKSAMMKSLVDNLTKETKQADAVKQSVLEDEMNAKEKAAVAQAISEDAGKDLEIAM
PALREAEEALKGLTKADINELKSFTTPPALVQFCMEAVCILLGVKPTWASAKAIMADI
NFIKRLFEYDKEHMKEDTLKKVKKYIDHKDFVPAKFEKVSKVAKSMSMWVISMDKFSK
VYKVVEPKIKRKEAAEAELKEVMTVLRQKQKELAAVEAKIQGLRDSLEEKQREFQVIQ
DNVDLTYGRINRAGRLTSALSDEQVRWRETVKSLTGDLACVPGDVLVAAACVAYLGAF
SHEYRRDMSALWVSKCREHKIPSSPEFNLLKVLGDPYEMRQWNVDGLPKDNISIENGI
YATRALRWALMIDPQEQANRWIRNMERANNLQVIKMTDSTMMRVLENAVRQGYPVLLE
EINETIDPSLRPILQRETYRFEGRTYLKLGDMVIDYDDNFKLYMTTKLPNPHYLPEVC
INVTLVNFLVTESGLEDQLLADIVAIELPAMEIQRNDLVVKINSDKQQLLALEDKVLK
LLFNSEGNILDDEELVETLNDAKETSLIIAARLIDTEETEKVITASRERYRILASRGA
ILYFVVAGLAEIDPMYQYSLKYFTQVFCNVLRLDHPPQSVEVRISTLMTDELRAIFDN
ISRGLFENHKIIFSFLLALSVERQEGRVTEEEFLFLSRGPVGNIRTKIQPAKIKMSQI
EWDSCIFLEDNFSSFFSGLTDELDKPFFIQMQENKEVFDFAQTNQPPTDKWNKRLRVF
HKLMFISAFRKPRFLLNVVCYLQSTVGKYFTEASGGTQLSSVYLDTSAVTPLIFVLST
GSDPMSGFLKFTTQMQFTDKYYSISLGQGQGPLAENLIEKSLRLGHWVFLQNCHLATS
FMQTLETIVRNLTLGITKAHVDFRLYLSSMPIQTFPISVLQNSVKITNEPPKGIKANV
FGALTDLKQDFFEQHIQNGNWRAIVFGLCMFHAVLLERRKFGPLGWNITYEFSESDRE
CGLKTLDFFIDREVLDEIPWEAILYINGDITWGGRVTDYWDLRCLRTILTIFSSKRII
QPDYKYCRGDSYYRDPRKKTLTEYSAYVQGFPVLEDPEIFGMNQNANIVFQTKETAFF
INTLLLGQPRSAADEGQAMENEIAQQTIARIQKALATKIKREPIHDTLSVLDAKGQVP
SLTIVLVQEIDRFNIALGIIHDSLVNLSKAIKGLVVMSEELENVFKALLSNQVPASWA
KRSFLSIKPLPSYISDFQRRIDFIQQWAENGAPRSYWISGFFFPQSFLTGVLQTYARR
RVLPIDSLKIDFDVFERELVQQDFFEMHTNNMSDQKLYGNLPECTDAIINVHGIFIEA
ARWDLSKGGLCDANFGELFSRMPVVRFKPCLEISPTVRYEAPLYKTQQRSGVLSTTGH
STNFILAVLLRSHNDPEFWIMRGTALVSAVLENIC"
ORIGIN
1 cggatgtctg acaacagtag acatggcgac ggaataacaa tccggagcaa ccaacagctg
61 tcagtcggtg tgaagccgat caaaacctag ccacatatag ccggattctc tatagcccga
121 tgtccggaat gaaccgccac ctgatctcca agggcaagga taggagcaaa atcaagatca
181 agggcaagga gaaggagaag gaaaggaaga aggatgagga cgacgatggg tcgcggaagg
241 ctcgggaacg gggaagaagt aggcggcagt cggcaggagg cggagatcat tccgtagata
301 tgggttttaa gcgcgtaacc ggcgaagacc ggccgagtag catgtccatg gatcgacttg
361 acagcgattt cgccgccttc acggatcgcg ccagtgatgc caacgacggc agtatcacgg
421 gtccggcgga tggcaatggg aagaaggcac aaaccggcgg gaagcaaaag agggcctcca
481 agcaccgtct gaacttttcc acttcgaaat cagccggctc gggtaccggt aacaaatcgc
541 gcagtgccag caccacaaag atcaccactg gagcatcgct cattccggat ctgaatttga
601 tcatcaaccg gatgcgcaac atgccggatg attcgttcgt ctatatggac tacatgctgc
661 cccacgacag cgagtacttc acaccgtaca gcctgcggga ggtggagtac aaggatctga
721 acacatacga acccttctat accgtcaccc gacacggcgt caccttctgg cactgctccg
781 agaatttctt cacgccactg gaccagtggc agcaggagtt caagcagttc ctcagcatca
841 tccagatacg atccttctct atcttccggc tgtggaaggg tttcaaggta tgggaaaaga
901 cgatcaagtg gcggaaactg aacgaggcac gggactatct gcagaacaac ctctttatcg
961 tgattcccca gctggccaaa gcgatcctac gcatgcgcag cgatattgtg cagttacagc
1021 gattgaattt cgtcaatgtg tccaacattg agaactggca tccattctac ttcctggaga
1081 cgcacatgcg catctatgag cagctacaca agaccttcac cgacttccgg gagtttattg
1141 ccaagacgat atttcgagcc tgcacggatg ccatccaggc acgtggcttc tatccggatg
1201 acgaggtcaa ctactatcca tcgctgaaaa agatgcgtga agcgcatagc tttatggacc
1261 gagcacgaaa gagagccttc tgcaagaccc tgaccaattt cctcacctat tgcgacatga
1321 tggtgtacca aatgctgtac cgcatcacga agaaaagttt tgaggatttg gccacttcgt
1381 tcgaggtgca cgacgaagtt ggtccttcgg aggcggatat caataaacat gatcgggtgg
1441 acaagcggat cgagaagcag cgaccacagg acaagccgca aagtccattc tttttggcca
1501 tgttgcgtct actgcccgat cgcattgata tcgagccgtc ggaggatatt atcaggatta
1561 tatttcagcg catcacagga ctcattctgg agacggtcct cgaaatccac ccgttcacca
1621 cggatccctt cttcacgcag tacactcaac cctcgattat gggacgccag gaggaggtgc
1681 tatacgaagg tgcccccgat ttgcactatc tactgcgcgc cgatatacga tttcagtata
1741 accgcaaaaa tatgttcgta ctaatacgaa aagcgtatga aagggcgcga ctttatacgc
1801 agaggttcca ccaaattcgc gaaaactttg agatcgacaa tagtacagat cccacggttc
1861 tcaatacgga aagagatctt cagatactgc gagcatactg cgatagatac tgcaacaatg
1921 tgagggcttt ggatggcatc ttggaatatg tattcctggg actactaaag ctcacccaga
1981 cgaacttcaa ggacacggtg acgccggtct gttcccgact ccagaacgtc ctggccacat
2041 acctgcccaa attggccgag gaggagacga caaggctgta cgaggaggca caggatttcc
2101 atggcaggat ctgctacgaa ccgcacgaga ccctggagat tgtggcccac attcgatttc
2161 tggacaagtg ctccaccgaa ctggacggga tattcgatgg catcgactat gtgcacgatc
2221 tgttgctcat aatcaaggac tttggcatac ccatcgatga cgattccaag gaggattaca
2281 tggacaccga agactatttg aataggacgc gggagacgct ggaggagatc cgagagaagc
2341 ggcaggactt tatcaatcga ctcgaggatg ccatgcagga tgacatagcg gcgctcaagg
2401 aggatatcca tgaggtggcc atcgaagccc tgcagccctg gctgctggat gccaattcca
2461 accgattgtc ggtcaccaac aaattggatt ccatgctgga acgcctgaac aaatgccgcg
2521 agacagccga cgagttcctt ggctatcaga aggagttcca gattgacttg accatgtacg
2581 atgagatggc cagcggcttc tatgacatac ggatgcgaca gaatctctat cgcacgtggt
2641 ccgactggga ggaatcgctg gccgagtgga tagtttccga tttcaacacc ctaaatgtgg
2701 tggatatggt tgaactgaac tccaagacga tcaagaactg catgcagttc caaaagtatc
2761 tgcccgaaaa caacatagtc ccagttctac agaaatccgc agaggcattt aaggagaaac
2821 taccagtgat cggttacctg cgtaatccca atttgagagc ccgtcattgg gcggagatcg
2881 aagatctgct gaatcgcaag ttcttccaag agaaggacat cctaatccag acatacgagg
2941 atgtgcacgc attcgacgat gtcgccatcg gtgaggctct gatgcagata tcctcgcagg
3001 ccaccggcga ggtgcagctg gagaatatgc taaagggcat cgaaaccacc tggaaggaaa
3061 cggaactgtc cattgtaccg caccatgacg ccaaggacgt cttcatactg gcgggcacag
3121 aggagttaca agccgttctc gatgattcca atgtgaatat taacacgata gccgcatcca
3181 agtttgtggg tcccatcaag agcaaggtgg acgagtggat caatgccatg gaccagtttg
3241 ccaagacctt cgaaagctgg atggactgcc agggagcttg gatctacctg gaggccatct
3301 ttgcctcggc ggatattcag cgacagctgc cccacgaggc caagatgttc ttcacggtgg
3361 acaagagttt caaggagacg gtccgccagg caaagaaggt ggctctggcc ctgcccacca
3421 tgtctagcgt ggatgtgcac aaggtgctgg tggagaacaa ccgtttgctg gatctcatat
3481 cacgcggatt ggaagcatat ttggaggtga aacgggtggt cttccccaga ttctactttc
3541 tgtccaacga cgaactgctg gagatcctgg cacaaactcg aattccccag gcagtgcaac
3601 cgcatctgcg caagtgcttc gatgccatct accgattgga atttggatcg aaggaaggcg
3661 gcgacggcaa gatggtggcc accaacgaca tcgttgcctt cctctcaccc gaaggcgaaa
3721 agctgcagtt tggcaagggt ctgaaggctc gtggagccgt ggaggaatgg ctcagcaagg
3781 tggaggaggc catgtttgtg tcctgcaagc ggtacatgcg attcggttac cagtgctatc
3841 cggccaagga gcgcgaggac tggttccagg atcatccaaa ccaggtggta ctcaccgtct
3901 ctcaagtgca gtgggccgcc gacattcatc gcatctacga gggcaaggag cgcaatcccc
3961 ttaacatttt agagaagatg gccaagtttg agatcaaatg cctgaaggat ctgggcgccc
4021 tcgctgcgtt gaccaggaag aacattagtt cgctgctgcg caagatcctg tgcgccctga
4081 tcaccatcga tgtgcacgcc aaagattcgg tgcgcatgct catcgaaaag gaggtctgca
4141 aagcatcgga cttcaactgg ctaaagatgc tgcgattcta ctgggccgat gagacggaaa
4201 cggtctactc ccgcatggcc gccgcaaata ttccatacta ttatgagtac ttgggtgccg
4261 gcggtgtcct ggtgctcacc ccgctgacgg atcgctgcta cctctgtctg atgggcgcct
4321 ttcagatgga tctgggcgga gctccggccg gaccagccgg aactggcaag accgagacca
4381 caaaggatct ggccaaggcg ttggccaaac agtgcgtcgt ctttaattgc tcagatggcc
4441 tggactacaa gatgatggga cgtttcttct cgggattggc ccaatgtggc gcctggtgct
4501 gcttcgacga gttcaaccga attgacatcg aagtgctgtc cgtgattgcc cagcagttga
4561 tcaccatccg aacggccaag gccatgaggg tgaaacgctt tatcttcgag ggtcgggaga
4621 tcaagatcaa tcgttcctgc tgcgtcttca ttaccatgaa tcccggctac gccggtcgta
4681 ctgagcttcc cgataatctg aaggccctgt tccgacccat ctcgatgatg gtgcccgact
4741 acgcactcat ttcggaggtg attctgtact cggagggctt cgaggatcct aagattctgg
4801 cccgcaagat ggtgcagatg taccagctgt gcagtcagca gctcagccag cagaatcact
4861 atgatttcgg tatgcgagcc gtgaagtctg tcctggtgat ggctggagct ctcaagagag
4921 cctcacccaa tcaacgggag gatatcaccc tgattgcggc attaagggac tccaatatac
4981 ccaagtttct ggctgacgat gcggtcctgt ttcgtggcat ccttagcgat ctgtttccgg
5041 gcgtcgaact gccggactca cagcatccgc atctggaggc tagcttgcga ttgggtttgc
5101 ggcagaagaa tctccaggcg gtgccgacca ccatccgaaa gtgtctgcaa ttgtacgaga
5161 cgatgtgcgt ccgctggggt gtcatgttgg tgggtcccac cggtggcggt aaatccgtgg
5221 tgctgcatgc cctcgaattc gctctctcac atctcttcga gaacgaagtg caggacccga
5281 actttcggcc tgtggtcatc cagacaatga atcccaaggc ggtgaccatg aacgagctgt
5341 acggctatgt ggatctcaag acgctggagt ggcaggatgg cttactgggc ctggcggtgc
5401 gaactgccac aacagtcgaa gatgagattc accaatggat catgtgcgat ggtcctgtag
5461 atgcagtatg gatcgagaac ttgaacacgg tgctggatga taacaagatg ctgtgtctgg
5521 ccaactcgga gcgcatcaag ctgaccgctt ggattcacat gctcttcgag gtgcaggatc
5581 ttctgcaggc ttcgcctgcc accgtttccc gttgcggcat ggtctatgtg gatcccggcg
5641 atctcggttg gattccactg atcgacacgt ggcgagaggt ggacatgaag cacaagctgc
5701 ccgcaccgct ggccgagttc tgctaccagc tctttgtggg ctacttcgac aaggctctga
5761 agatcgaacg gaagcgggcg gtgtacacca tccaccaggt gctcggatcc aaggtacgac
5821 tgtgctgtga gctgaactct gcccagttcg aggcggtcaa gtggtcggca atgagtgaag
5881 agcagggcaa ggaactggtg accaagatct tcgcctgggc ggtgctctgg gccattgcct
5941 ctaatctcaa ggatgccgag aaggtctcct ttgaggagca gtggagcaag gccattgccc
6001 agcatccgaa tatgaccctg cccaacttca ccttgtggaa ctatcgcatc gacctcgaga
6061 aaatggactg gggcagctgg atagatatca tggcaaagtt tgtcttcgat ccggaaacct
6121 cttactacga catgcaggtg ccaacggtgg acaccaccaa atacggctat gtgtccgatc
6181 tcctgttcaa acggggtatg cccgtgatgg tgactggaga tacgggcgtg ggcaaaacag
6241 tcctggccat cagttgcatg aaacggctgt cgcagggcaa tgtcatccca gtgatcttga
6301 acttctccgc ccagacgagc agcaatcgca cccaggagat gatcgagggt ccgctggaga
6361 agcgcaagaa gacccagctg ggtgctccgg tgggcaagac cgtaatcgtt ttcattgacg
6421 acgtaaatat gcccaagctg gatacctacg gagcctcgcc ggccattgag ttgctgcgtc
6481 aattccttga ctttaagggc ttctacgatc gggagaaact gtactggaag gagattttgg
6541 acgtggtcct gggctgcgcc tgtgctccgc ctggaggagg tcgaaatccg ctcactccgc
6601 gctttgtccg tcactttgcc ttgttctcgt tgccgaaacc gaacgaggag accctaaccc
6661 agatcttcaa tggcatcctg cggggtttcc tgcaaacgtt ctcctccgca gtaagagctc
6721 tctccgagcc catggtgaat gcctgtgtgg atgtctatat gcgagtggct actgtaatgt
6781 tgcccacacc ggataaatcg cattatatct tcaatctgcg cgatctatcc aagtgtatac
6841 agggtatcct ccaggcgagc aatctgcact acaaccagga gaaccagata ctccgactgt
6901 tctaccacga gaccactcgt gtgtttcacg acagattgat taacatagag gacaagaaca
6961 ttttcaaggc cctgatgaag gaagtatgca tggaccactt caatcggccg gtgatcaatg
7021 acaatgagcc accgatcctc ttcggtgact ttatggtctt tggcaagccg aagaatgagc
7081 gtatctacga tgagattagg gatcatacga agctggaaag cgttctcaac gattacattg
7141 cggactacaa ctcggtggcg gtgggtaagc aaatgaagct aatcctcttc caagacgcca
7201 tggagcacac ggtccgattg gcgcgacttt tgcgcagcga tcgcggcaat ggactgctgg
7261 tgggcgtggc cggaatggga aaacaatccc tcacccggct ggcatcgcat gtcaacgagt
7321 ataactgctg gcagatcgag atgcgacgca actacgacct aaatgctttc catgaagatc
7381 tgagagtcct ctaccgcatc gctggaatcg ataatcaacc ggtgaccttc ctattgatag
7441 acagtcagat cgtggaggaa gagttcctcg aggatatcaa caacatcctc aattccggcg
7501 aagtgcccaa tctattcgag ggtgatgagt tcgagaagat catcttggac gcccgtgacg
7561 gatgcaatga gaacagaaaa gatgatccct gcacacgaga tgacatctac aagttcttca
7621 tcaaccgggt aaggaacaat ctgcatgtgg tcatgtcgat gagtccggtg ggcgatgcct
7681 ttcggcgcag atgccgcatg ttcccctcgc tagtcaactg caccacgatc gattggttca
7741 ccagctggcc aacggaggcc ctatattcgg tggccctcgg gttgctcaca aagattgcac
7801 ccaaaatgga ggatcgcatc tcgctggcga gcacaaccgt ctttatgcat aagactgttg
7861 aggatgcctc agtgaagttc tacaaggaaa tgaagaggca ctactatacc acacccagta
7921 gctatctgga gttgttaaag ctgtaccaga atctgctgaa gatcaaaaac atggagatca
7981 tagccaagag aaagcgaatc gccaatggcc taaacaagct tctggagacc aatgaagtga
8041 tcgctgtgat gggaaaagag ctagaggtaa tggtgcccca gttggatgag aaatccgcta
8101 tgatgaagtc cctggtggac aatctgacca aggagaccaa gcaggcggat gccgtcaagc
8161 agagcgtgtt agaggacgag atgaatgcca aggagaaggc ggcggtggcc caggcgatat
8221 ccgaggatgc cggcaaggat ctggagattg ccatgccagc gctgcgcgag gcggaggaag
8281 cacttaaggg tctgaccaag gcggacatca atgaactgaa gtccttcacc acgccgccgg
8341 ctctggtgca gttctgcatg gaggccgtct gcattctgct cggtgtcaaa cccacttggg
8401 catctgccaa ggccatcatg gcggacataa acttcatcaa gcggctgttt gagtatgaca
8461 aggagcacat gaaggaggat actctgaaga aggtcaagaa gtacatcgac cacaaggact
8521 tcgtgccggc taaattcgag aaggtctcca aggtggccaa gtcgatgagc atgtgggtaa
8581 tatccatgga caagttctcc aaggtgtaca aagtggtgga gccgaagatc aagcgaaaag
8641 aggccgccga agcggagctc aaggaggtga tgaccgtgct gaggcagaag caaaaggagc
8701 tggccgccgt ggaggccaag atccaaggac ttcgcgacag tctcgaagag aagcagcgcg
8761 agttccaagt gatccaggac aacgtggacc taacctacgg taggatcaat cgggcgggtc
8821 gcttgacctc cgccctttcc gacgagcagg tgcgctggcg cgaaacggtc aagtccctaa
8881 ccggagatct agcctgtgtg cccggtgatg tcctggtggc tgccgcctgt gtcgcctatt
8941 tgggagcctt ctcccatgaa tatcgtcggg atatgagtgc tctgtgggtg tccaagtgcc
9001 gtgagcacaa gattccctcc agtcccgagt tcaacttgct caaggtgctt ggagatccgt
9061 acgagatgcg acaatggaat gtggacggcc taccgaagga taacatatcc attgagaatg
9121 gcatatatgc caccagagct ctgcgttggg ccctcatgat cgatccgcaa gagcaggcca
9181 atcgctggat ccgtaacatg gagcgggcca acaatctgca ggtgatcaag atgaccgact
9241 cgaccatgat gcgggtgctg gagaacgctg tacgccaggg atatccggtg ctgctcgagg
9301 agatcaacga gaccatcgat cccagtttgc ggccgatcct tcagcgcgaa acgtaccgat
9361 tcgagggtcg tacctatttg aaactgggcg acatggtgat tgactacgat gacaatttta
9421 agctttacat gaccacaaag ttgcccaatc ctcactacct gcccgaggtg tgcatcaacg
9481 tgacattggt caactttctg gtcacagaga gcggtctgga ggatcaactg ttggctgata
9541 tcgtggccat cgagctgccc gcgatggaga tccagcgtaa cgatctggtg gtcaagatca
9601 actcggacaa acagcaattg ctagcgctgg aggacaaggt gttgaagttg ctattcaact
9661 ccgaaggaaa cattctcgac gacgaagagc tggtggagac actgaacgat gccaaggaaa
9721 catcgctgat cattgccgcc cgactgattg acaccgagga gacggagaag gtcatcactg
9781 cttcgcggga acgctaccgc atcctggcct cccgaggagc catactctac tttgtggtcg
9841 caggcctagc cgagatcgat cccatgtacc agtacagcct gaagtacttc acccaggtat
9901 tttgcaatgt actgcgtttg gatcatccac ctcagtccgt ggaggtgcgc atttccactc
9961 tgatgacgga cgaactgagg gcaattttcg ataacatatc ccgcggcttg ttcgaaaacc
10021 ataagatcat ctttagcttc ctgctggctt tgtcggtgga gcggcaagag ggtcgagtta
10081 ccgaagagga gttccttttc ctttctcgcg gccctgtggg taacattcgg acgaagattc
10141 agccggctaa gatcaagatg agccagatcg aatgggattc gtgcatcttt ttggaggata
10201 acttcagtag cttcttcagc ggacttaccg acgaactgga caagcccttc ttcatccaga
10261 tgcaggagaa caaggaggtc tttgacttcg cccagacgaa tcagccacca acggataagt
10321 ggaacaaacg actgagggtt ttccacaagc tgatgttcat ctccgctttc cgaaaaccga
10381 gattcctgct caacgtcgtc tgctatctgc agtccactgt gggcaagtac ttcaccgagg
10441 catcaggtgg tacccagttg agctctgtct atttggacac ctcagctgtg acgccgttga
10501 tctttgtgct gtccacggga tcggatccaa tgtctggctt ccttaagttc accacccaga
10561 tgcagttcac cgacaagtac tactccattt cgctgggtca gggccaggga ccactggcgg
10621 agaacctgat cgaaaagagc ttgcgactgg gtcactgggt gttcctgcag aattgccatt
10681 tggccacctc gtttatgcag acgctggaga cgattgtgcg aaacctgacg ctgggcatca
10741 ccaaggcgca tgtcgatttc cggctctatc tctcctccat gcccatacaa acgttcccga
10801 tcagtgtact ccagaattcg gtgaagatca ccaatgaacc gccgaagggc attaaggcga
10861 atgtgtttgg tgcactaacg gatctaaagc aggacttctt cgagcagcac atccagaatg
10921 gcaattggcg ggccattgtc tttggtctgt gcatgttcca tgcggttttg ctggagcgca
10981 ggaaattcgg accccttgga tggaacatca cctacgagtt cagcgaaagc gatcgggaat
11041 gcggccttaa gaccctcgac ttcttcatcg atcgggaggt tctggatgaa ataccctggg
11101 aggccatact ctatatcaac ggtgacatca cgtggggcgg ccgggtgact gactactggg
11161 atctgcggtg tctgcgcacc attctgacca tcttttcctc gaagcgaatc atccagcccg
11221 actacaagta ctgtcgtggt gatagctact atcgggatcc gcgcaaaaaa acgctcaccg
11281 aatactcagc ctatgtgcag ggctttccgg tcctcgaaga tcccgagatc ttcggcatga
11341 accagaatgc caacatcgtg ttccagacca aggagacggc cttcttcatc aacaccctgc
11401 tcctgggcca gccgagatcc gccgccgacg agggccaggc tatggagaat gaaatcgccc
11461 agcagacaat cgcacgcatc cagaaagcgc tggccaccaa gattaagagg gaacccatcc
11521 acgacacgct ctctgtgctt gacgccaagg gtcaagtgcc atcgctgacc attgtgctgg
11581 tccaagagat cgatcgattc aacatcgcat tgggcatcat tcacgacagt ttggtcaatt
11641 tgtccaaggc catcaagggc ctggtggtaa tgtccgagga gctggagaac gtgttcaagg
11701 cgctgctatc caaccaggtg cccgcgtcct gggccaaacg gagcttcctc tcgatcaagc
11761 cgctgcccag ctatatatcc gatttccagc ggcgcatcga tttcatccaa cagtgggcgg
11821 agaatggagc acctcgctcc tactggatca gtggcttctt cttcccgcaa tcctttctca
11881 ccggcgtcct gcaaacgtac gcccgtcgcc gtgtcctacc cattgattcg ctgaagatcg
11941 acttcgatgt gttcgaacgg gaactggtgc agcaggactt ctttgagatg cacaccaaca
12001 acatgagtga ccaaaagcta tatggcaacc tgccggaatg caccgatgcc atcatcaatg
12061 tgcacggaat cttcatcgaa gcagctcgct gggatttgag caaaggcggc ctgtgcgatg
12121 ccaactttgg cgagctcttc tcccgaatgc cggtggtgag gttcaagccg tgcctggaga
12181 tttctccgac ggtgcgatat gaggcgccgc tgtacaagac ccagcagaga tccggtgtcc
12241 tctccaccac gggccattcg acgaacttca tcttggccgt ccttttgcgg agccacaacg
12301 atccggaatt ttggattatg cggggtacag ctctggtttc cgctgtgctc gaaaatattt
12361 gctaggtcta gatacaaatt ggaacttatt gttttacttt ttatgctgca tacttaaagc
12421 cgtctttatt tcgtttcatg tt
//