LOCUS       NM_001259599           11632 bp    mRNA    linear   INV 26-DEC-2023
DEFINITION  Drosophila melanogaster rhinoceros (rno), transcript variant D,
            mRNA.
ACCESSION   NM_001259599
VERSION     NM_001259599.2
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 11632)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 11632)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 11632)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 11632)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 11632)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 11632)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 11632)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 11632)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 11632)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfield,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 11632)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 11632)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., Woodage,T.,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 11632)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 11632)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 11632)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 11632)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (10-NOV-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 11632)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 11632)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 11632)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 11632)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 11632)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   21 (bases 1 to 11632)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-OCT-2015) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   22 (bases 1 to 11632)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   23 (bases 1 to 11632)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   24 (bases 1 to 11632)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   25 (bases 1 to 11632)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NT_037436).
            
            On Jan 16, 2013 this sequence version replaced NM_001259599.1.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..11632
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="3L"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..11632
                     /gene="rno"
                     /locus_tag="Dmel_CG7036"
                     /gene_synonym="CG7036; Dmel\CG7036"
                     /note="rhinoceros"
                     /map="61B2-61B2"
                     /db_xref="FLYBASE:FBgn0035106"
                     /db_xref="GeneID:38027"
     CDS             565..10290
                     /gene="rno"
                     /locus_tag="Dmel_CG7036"
                     /gene_synonym="CG7036; Dmel\CG7036"
                     /note="CG7036 gene product from transcript CG7036-RD;
                     CG7036-PD; rno-PD; rhinoceros"
                     /codon_start=1
                     /product="rhinoceros, isoform D"
                     /protein_id="NP_001246528.1"
                     /db_xref="FLYBASE:FBpp0294000"
                     /db_xref="GeneID:38027"
                     /db_xref="FLYBASE:FBgn0035106"
                     /translation="MSQRGKRGNQQHHQSHHPPPQQHQRKDVEPQPPPTKRRKGRPPN
                     GATTAAVAEVTGSGPATGSERVPVLPLCKSKHEEPGAEAGGGGQGRAAAGATSTSKSK
                     STKLAKSASKCKSQGASSSSSWQARSVADIKMSSIYNRSSTEAPAELYRKDLISAMKL
                     PDSEPLANYEYLIVTDPWKQEWEKGVQVPVNPDSLPEPCVYVLPEPVVSPAHDFKLPK
                     NRYLRITKDEHYSPDLHYLTNVVALAENTCAYDIDPIDEAWLRLYNSDRAQCGAFPIN
                     ATQFERVIEELEVRCWEQIQVILKLEEGLGIEFDENVICDVCRSPDSEEANEMVFCDN
                     CNICVHQACYGITAIPSGQWLCRTCSMGIKPDCVLCPNKGGAMKSNKSGKHWAHVSCA
                     LWIPEVSIGCVDRMEPITKISSIPQSRWSLICVLCRKRVGSCIQCSVKPCKTAYHVTC
                     AFQHGLEMRAIIEEGNAEDGVKLRSYCQKHSMSKGKKENAGSHGGGSASVASAMQKAN
                     RYGSGAGGGADDGNNACGTTGEDPRRRKNHRKTELTSEERNQARAQRLQEVEAEFDKH
                     VNFNDISCHLFDVDDDAIVAIYNYWKLKRKSRHNRELIPPKSEDVEMIARKQEQQDME
                     NHKLVVHLRQDLERVRNLCYMVSRREKLSRSLFKLREQVFYKQLGVLDEMRLEKQQTK
                     QEQQQPVMDLNAVIYANDGPTLYDRFYSSVGGQTVPAQYQDLKYILEQLMGKLQSGKQ
                     GRGRASQSPNKRKQPAKASPNKKLNNGILSSRTSSPEKTVAGSKVGTTTSKVRSPPGK
                     NPTGRRASKSSAAAATSTHNKSQFHSNIRSSTTSHSSSGTISSGNSSSANGTSSSDSS
                     SGSDSGSESGSSSAGSGVSKRKSSSGSPLKKQSYARSVEQRQKQRQRRQNEAVAGASA
                     TYPDSRSASSSSDGEDERCRNRQEPERGARRGPIQSKSVPNRSQASRSKPTTEADVGE
                     GTGASARRKLSTTTRGLAQMDKDADESVSSDESEELLPLRGERQRESTTTSGLATTGS
                     AIGRNLGQHIYSDSESSSSEQEKDQEEQATVESNVSDSQNQQTIRTKAAMKEFVPGTA
                     ATTSSTSQAASSTSKAKNTREGKEGAASIGNSTKTKPNPNAKLYPADLLVVPQRQAAK
                     KASENMRSTNLATTLQPDVSDRVREPDINSISGTAKSKVKDSSSRVSNEADKSSLEKV
                     RPKEHLQKTVGKTSESAPAERGKRGRPPKVPKDARPPSITENDKPALPTHTQSKPPSV
                     VATPVSAKSNFAVSLVPQRQAAKKAAEQLKSSKPVLESFSTGNDISDKETVTSATISG
                     SGSSVPAASTPVKPTRRSSIKEAPITPKEPLSGRRKSKEDLLATPIKTTPLVKRRVVV
                     PNLSSSSSGDSESSSSSSSSGSSSSSGGSDSDSESQASNSENPSSREPPVAPAKVPSD
                     SSLVPKRSPRKSMDKPSALTIAPASVNVLNIPSTRSRQNSTTKSTKVALQKAVQSVED
                     DVKCTPKTNRLQGSMDECGKQVQLEQATKRATRGSKSRPPSPTAKSSPEKTVSRCKSR
                     AEESPKKVANLEQEISQRKVASGKGTSSLDKLLNKKQQQMNHSAQATPPPISPTPPAS
                     ETRIVKDQCDLKPDEVSIQQINLGADAQPEPDLDPESAAEAGELPMDIDEELTTAPTR
                     TQLSASASKLADIIDDERPPAAPLPASPTPTPTSNDEMSDAGSDLSERRRMRWRSRRR
                     RRRRSHEPDEEHTHHTQHLLNEMEMARELEEERKNELLANASKYSASTSSPAVTVIPP
                     DPPEIIELDSNSAEQQQQHLHDQPLPPPLVVQSPAADVVPTVMQQQLLPSQRPLIEQL
                     PVEHLPIVETILEMEDSKFANNFASNLASVLNPPNQMSLIGSSIDRSKQISEEDSIQA
                     TRNLLEKLRKTKRKAQDDCSSKEAVDLLPPTPAIPSVFPFHNAADPEDIIHAQKEQQH
                     QQQQQLQSSQTCIYGNSSGPNSVASLTIKDSPMTANSGSYANSLTNTPNATPTNATMN
                     NLGYQVNFPNSQPPPTLGLFLEKSPHQKGACPLSSNGGANVGQPAPTPDFVDLAAAAV
                     KNTLGSFRGAATVPTQSGTGVNAKINDYDESTRMQSPFGGMPWNESDLIAERRSSSPS
                     SVSESNDPPQPPPVVTATATTARSLAQLESCKNFFNSYPSGNAGPGTAANATAPFNHP
                     PMVNGIDSIPMFNNTNTTQHQPTTPAHQQQQQRTPNNQYNGTIYPQLAGIMHPQTTPT
                     EPPSSLYGNGGVGGAVQSTTLPPPAQVNQYPGTPYSATTLGMISVQQPALSTVPVQTA
                     TTPNNPFTLTSPIDGKMPTYPAQLLSSCAEAVVASMMPPTPPVTATAKDSPSKRTSVS
                     GSNLSKKQTHKSPQLPQGKSPGKSPRQPLQPPTPPAPVPVVALPPTKYDPQTHTLQGK
                     PRQRAPRGSGGSGAPGRGRGRGRGRGRGGGVTSGMAMVLPPPMSDYGSNTHIVNNLVG
                     TPFEFNNEFDDMAGPGVENLQSLRDRRRSFELRAPRVQNKPTTTPTTATTTNPLLHPV
                     LPGPVDMRTYNLGFEAPHSTASQEAYQNNLLGAFDSGTADQTLSEFNEEDERQFQSAL
                     RATGTGTSPSKQHSGPTALVAPPTGPNPTPAPNLLLHCTEANQMAPNVAATGAATHLV
                     EGSLVEASLEATSEEVSIDSDSTIPHSKTSTSDARSQIKLKIKSPMAYPEHYNAMTNS
                     SSLTLTSTLVQSSNVVQTTVSTSTVVSASSAVSGNSRRMRKKELLSLYVVQKDNHNDD
                     SSCGLPAASDTLPLENLRKSEEEDELSGGNGTKRFKKNSSSRELRALDANLALVEEQL
                     LSSGAGACGGGSSGDGRRRSACSSGSNNDNNGKTGAASSAGKRRGRSKTLESSEDDHQ
                     APKLKIKIRGLTANETPSGVSSVDEGQNYSYEMTRRACPPKKRLTSNFSTLTLEEIKR
                     DSMNYRKKVMQDFVKGEDSNKRGVVVKDGESLIMPQPPTKRPKSSKPKKEKKEKKRQK
                     QQQLILSSSTTTMTTTLIENTASASPGDKPKLILRFGKRKAETTTRTASLEQPPTLEA
                     PAPLRFKIARNSSGGGYIIGTKAEKKDESTADNTSPITELPLISPLREASPQGRLLNS
                     FTPHSQNANTSPALLGKDTGTPSPPCLVIDSSKSADVHDSTSLPESGEAAMGVQSSLV
                     NATTPLCVNVGNYENSNNSLPSASGTGSASSNSCNSNSINNNGSGGGRASGEGGLLPL
                     KKDCEVR"
ORIGIN      
        1 gagaaaaata atttttgctc aaaaataaaa taaataatca aaagaattgt aattcctgcg
       61 aacgcaaatg cgattttggg aaacgtgagt ttgatacgga cgcttcacga aatgcgaaga
      121 ctgtgagcgg tcttcgacta gcgaatgctc gtcgagaatc caaaccagtg agccaaacgc
      181 caaagaaaaa cttgcaaaag tctgtggata aacaacgaac aataacaaaa ggcagcaagc
      241 ggaaggtggc agtggaagtg gactaactcg cccgtggaga agtgcggagc acacaggact
      301 gcaaagtggc gtcgctactg gatattaggg ctggggcagc agcggcgtcc atgtccacgg
      361 ggcacaaagg gtggaccacc aagaatcacg accgcaccac atccaggaga gcgcagcagc
      421 agcagccggt gtggtgtcgc caactgtggc cttaagcatg tagtcttgca gcaaagaacc
      481 cgttctctaa caagcacaag acacaaggga caaggtacca atcgaagcaa aaacacacaa
      541 atatccaatc aacaccaatg aaaaatgtca caaagaggta agcgcggtaa tcagcagcat
      601 caccagtcgc accatccgcc gccgcaacag catcagcgca aggatgttga gccgcagccg
      661 ccgcccacca agcggcgcaa aggtaggcca cctaatggag ccactacggc agcagtagct
      721 gaagtgactg gatcgggtcc agctacagga tcggagcgtg tgccagttct gcctttgtgc
      781 aaaagcaaac atgaagagcc aggggcggaa gcggggggag gaggacaagg aagagccgcg
      841 gcaggcgcta ccagcactag caaatcgaag tcaacgaaac tggcaaagtc ggcaagcaag
      901 tgcaagtcgc agggggcgag ttctagtagc tcctggcagg cgcggtctgt agcggacatc
      961 aagatgtcga gcatctacaa tcgaagctca acggaagctc cagctgaact ttaccgcaag
     1021 gatctcatta gcgcaatgaa attgccagac tctgagccgt tggccaacta cgagtattta
     1081 atagtaacag acccatggaa gcaggaatgg gaaaagggtg tgcaagtgcc cgttaaccca
     1141 gactcccttc ctgagccatg cgtctatgtt ctgcccgagc ctgttgtgtc tccagcgcac
     1201 gactttaagc tcccgaaaaa tcgctatctg cgcattacca aagacgagca ctactcgccg
     1261 gatctgcatt acctgacgaa tgttgttgcc ttggcggaga acacatgtgc ctatgatata
     1321 gatcctattg acgaggcctg gttgcgtctc tacaacagcg atcgtgccca atgtggtgct
     1381 tttcccataa acgcgacgca gtttgagcgc gtcattgagg agctagaggt tcgttgctgg
     1441 gaacagattc aagtcatact taaactggaa gaaggcttgg gcatcgaatt cgatgagaat
     1501 gtcatctgcg atgtttgccg atcgcccgac tccgaggagg cgaacgaaat ggtattctgt
     1561 gataattgca atatctgtgt gcaccaggcg tgttatggca ttacagcaat tccatcagga
     1621 caatggctat gccgcacttg ttcgatggga attaagccgg actgtgtcct ctgtccaaat
     1681 aagggcggcg ctatgaaatc caacaagtcg ggtaagcact gggcacacgt ctcttgcgct
     1741 ctgtggatac ccgaggtcag cattgggtgt gtggatcgca tggaacccat cacgaaaatc
     1801 tctagcattc ctcagtcgcg ttggtccctg atctgtgtac tctgccgaaa gcgagttggc
     1861 agctgcatcc agtgctcagt aaagccctgc aaaacagcat accatgtgac ttgcgcgttt
     1921 cagcatggcc tggaaatgcg tgcaatcata gaggaaggta atgctgagga cggcgtgaag
     1981 ctgcgctctt attgtcagaa acacagtatg agcaagggta agaaggaaaa tgctggtagt
     2041 catgggggag gcagtgcctc agtcgcaagc gccatgcaga aagcaaacag atacggcagt
     2101 ggggctggtg gaggagccga cgacggcaac aacgcgtgcg ggacaactgg agaggatcca
     2161 cgcaggcgga aaaatcaccg taaaaccgaa ctgacctccg aggaacgtaa ccaagccaga
     2221 gctcagcgcc tccaggaggt ggaagctgag ttcgataagc atgttaactt taatgacatc
     2281 agttgccatc tatttgatgt cgacgacgat gctattgttg ccatatacaa ttattggaag
     2341 cttaagagaa agtctcgaca caatcgcgag cttatcccgc ccaagtccga agatgtggag
     2401 atgatagccc gcaagcagga gcaacaggac atggaaaatc ataagttggt ggtgcatttg
     2461 cgacaggatt tggagcgagt tcgtaatctc tgctatatgg ttagtcgaag agaaaagctt
     2521 tcgcgctctc tctttaagct acgtgagcag gtattctaca agcagctggg agttctggat
     2581 gagatgcgct tggagaagca gcagaccaaa caagagcagc aacaacctgt gatggatctg
     2641 aacgctgtaa tctacgccaa cgacggacct actttgtatg accgtttcta cagctctgtg
     2701 ggaggacaaa ctgtgccggc gcagtaccag gatttgaaat acatccttga acagcttatg
     2761 ggtaagctgc agagtggtaa acagggccgt ggccgtgcct cgcagtctcc taacaaacgc
     2821 aagcagcctg ctaaagcttc gccaaacaag aagcttaata atggcattct tagttcacgc
     2881 acgtcatcgc ctgagaagac tgtggcaggg agtaaggtgg gaacgactac atccaaggta
     2941 cggtctccgc cagggaagaa tcctacaggg aggcgtgcct cgaagagcag cgcggcggcg
     3001 gcgacgtcga cccacaacaa aagtcaattc cactcaaata tccgcagtag cacgacctcc
     3061 cattcgtcaa gcggcactat atcatcaggt aattccagct cggcgaatgg taccagcagc
     3121 tccgatagtt cctcaggaag tgattcgggt agtgaaagcg gtagctctag cgcgggtagc
     3181 ggagtctcca agcgaaagtc ttcctcagga agtcccctta aaaagcaaag ctacgctcgg
     3241 tccgttgagc agcggcaaaa acagcgacag cgacgtcaaa atgaagcagt tgcgggtgcg
     3301 tctgcaacgt atccggattc caggtctgca agcagctcca gcgacggtga agacgagcga
     3361 tgtagaaacc gccaagaacc agagcgcggt gcgagacggg ggccaattca aagcaaatca
     3421 gtacctaata ggagccaggc aagtaggtca aaacctacaa cggaagcaga cgttggagag
     3481 ggaactggag cctctgcaag acgaaagttg tctacgacaa cacgaggact tgcccaaatg
     3541 gataaagatg ccgatgagag cgtatcgagc gacgaaagtg aggaattgtt gcctctaaga
     3601 ggggaacgac agcgtgagag cacaacgact tccggactag ccacgactgg ttcggccatt
     3661 ggtagaaact tgggacagca tatctactcc gactctgaga gcagttcctc tgaacaagaa
     3721 aaagaccagg aggagcaggc caccgtggaa agtaatgtta gtgactcaca aaaccaacag
     3781 acgattagaa caaaggcagc tatgaaagag tttgtgccag gaacagcagc cacaacatcc
     3841 tctacttccc aagcagcgtc ctcaactagt aaagcgaaga acacgcggga aggaaaggaa
     3901 ggggctgcca gtatcggcaa cagtacaaaa accaaaccga acccgaatgc caagttgtat
     3961 ccggcagatc ttttggtggt gccacagcgc caggcagcca aaaaggcttc tgagaacatg
     4021 cgatccacaa atctcgccac gacgctacag ccagatgtgt ccgacagagt cagagagcca
     4081 gacataaact ccatttcagg aactgcaaag agcaaggtga aggattcaag ctctcgtgtg
     4141 tcgaatgaag cggataaatc aagtcttgaa aaagtacgac caaaagaaca cttacagaag
     4201 actgttggaa aaacgtccga aagcgcacct gctgaacgtg gaaagcgcgg aaggccacca
     4261 aaggtcccaa aggatgcacg tccgccatct attacggaaa atgacaagcc tgccctccca
     4321 acgcatactc aaagcaagcc cccatctgtc gtcgctaccc cagtctcagc caagtcgaac
     4381 tttgcagtct cattggttcc tcaacggcaa gcggctaaga aggcagcgga gcaattaaag
     4441 agtagcaaac ccgttctaga atctttctca acggggaacg acatttcgga caaagaaaca
     4501 gtgacgtcgg caacgatatc cggatcagga tcatctgttc cagcggctag cacgccagtt
     4561 aagcctacca gacggtcctc aattaaagaa gcaccaatta ctccaaagga acccttaagt
     4621 ggcagaagaa aatcaaaaga agatttgctg gcgacaccta taaaaacaac tcccctggtt
     4681 aagcgccgcg tagttgtccc aaatctttca agttctagtt ctggcgacag tgaaagttcc
     4741 agctcctcta gcagctcagg aagtagttcc agcagtggcg gcagcgactc ggacagcgag
     4801 tctcaggcca gcaattcgga aaacccatct agtagggagc ctcctgtagc ccctgcgaaa
     4861 gtgccatctg attcctctct tgtgccaaaa cgttcacccc gcaagtctat ggataagcct
     4921 tcagcattaa ctattgcgcc agcatcggtc aatgtattga acataccgtc tacgcggtct
     4981 cgtcagaact ctacaacaaa atcgacaaag gtagcactgc agaaggcagt gcaatcagtt
     5041 gaggatgacg tcaaatgtac gccgaaaaca aatcgtctgc agggctctat ggacgaatgt
     5101 ggaaagcagg tccagttgga acaggctacc aaaagagcga ctcgcggttc caagtcgcga
     5161 ccgccctcgc ctactgctaa atcgtctcca gaaaagacag tatccagatg taaatctcga
     5221 gcggaagaat ctcctaagaa ggttgcaaat ttggaacaag aaataagcca gagaaaagta
     5281 gctagcggaa aagggactag ttcgttggac aagctgttaa ataagaaaca gcagcagatg
     5341 aaccattctg ctcaagctac acccccacca atttcaccaa ctccacctgc ttctgaaaca
     5401 cgtatcgtaa aagatcaatg tgatcttaaa cccgatgagg tgtccatcca acagattaac
     5461 ttgggagcag atgcgcaacc cgaacccgat ctggacccag agtctgcggc agaagctggt
     5521 gaactaccaa tggatattga tgaggagctt accactgctc caacgcgaac ccaactatct
     5581 gcgagcgcga gtaaactggc agacataatc gatgatgagc gtccaccggc ggctccgctt
     5641 cccgcctcgc ccactcctac ccctacctct aatgacgaga tgtctgacgc tggaagtgat
     5701 ctcagcgaac ggcggcgtat gcgctggcga tcccgacgga gaagaaggag acgtagtcac
     5761 gagccagacg aggaacacac tcatcacacc caacatctcc tcaacgagat ggaaatggct
     5821 agggagctgg aagaggaacg taaaaacgag ctgcttgcaa atgcgagtaa gtactcggcg
     5881 tctacatcct ctccagcagt cactgtgatt ccgcccgatc cgccggaaat aattgaactg
     5941 gactcgaatt ctgcagagca gcaacagcag catctacatg accaacccct gccgcctccg
     6001 cttgttgtac aatcccctgc tgcagatgtt gtaccaacag ttatgcagca acagctgctg
     6061 ccctcacaac ggcctctaat cgagcaactg cctgtcgagc acttacccat cgttgagaca
     6121 atacttgaga tggaggacag taagtttgct aacaacttcg cgtcaaactt agccagtgtg
     6181 ttaaatcctc ccaatcagat gagtctaatc gggtctagta tagacaggag taagcaaata
     6241 agtgaggagg atagcatcca agcaacccga aatctcttag agaaactgcg caaaacaaag
     6301 cgcaaggcgc aggatgactg cagtagcaag gaggcggtcg atctattacc tccaactcca
     6361 gccatcccat cggtttttcc ttttcacaat gcggcggacc cagaggatat cattcatgcg
     6421 caaaaagaac agcagcacca acagcaacaa cagctgcaat catctcaaac atgtatttat
     6481 ggaaactcat caggacctaa ctctgttgca tcattgacta tcaaggattc tcctatgaca
     6541 gccaacagtg gaagctatgc aaacagtctc accaacactc caaatgctac acccaccaac
     6601 gcaacaatga acaatcttgg gtatcaagta aattttccga actctcagcc acctccaacg
     6661 ttaggtctct tcctggaaaa atcaccccac caaaaagggg cttgtcccct atccagcaac
     6721 ggaggagcta atgtagggca acctgcaccc actcctgact ttgtagactt ggcagcagca
     6781 gcggtaaaga acactctcgg aagctttcgc ggagcagcaa ccgttcccac gcaatccgga
     6841 acgggagtca atgcgaagat caacgactac gatgagagca cacgaatgca gtcgccgttc
     6901 gggggaatgc cgtggaacga aagcgatctt atcgctgaga gaagaagcag ctcgcctagt
     6961 tcagtgtccg aatccaatga tcctcctcag ccacctccag tcgttacggc aacagcaaca
     7021 acggctcgat ccctcgctca gcttgagagc tgtaaaaact ttttcaacag ctatccaagc
     7081 ggtaatgccg ggcctggcac tgcagcaaac gctacagcgc cctttaacca tcctccaatg
     7141 gtgaacggta ttgattccat accaatgttc aacaatacta atacgactca gcatcaaccg
     7201 acgacgccag ctcatcaaca gcaacagcag agaactccaa acaatcagta caacggaact
     7261 atctaccccc aactagcggg tatcatgcat ccacagacaa ctcctacaga accgccttca
     7321 agcttatacg gcaatggggg agttggaggc gctgtgcagt ccactacact acctccaccg
     7381 gctcaggtca accagtaccc cggaacacct tattccgcga ctactttggg tatgatttct
     7441 gtacaacagc ctgcgctttc aacagttccc gttcaaactg caactacacc caataatccg
     7501 tttactctga cttcgccgat tgatggaaaa atgccgacat atccggctca gcttctaagc
     7561 agctgtgcag aagcagtcgt tgcgtcgatg atgccaccaa ccccaccggt tacagctaca
     7621 gcaaaggact cgccaagcaa gagaacaagc gtcagcggta gtaacttatc taaaaaacag
     7681 acgcacaaat cgccccaact tccgcaagga aaatcgccag gaaagtcgcc cagacagcct
     7741 ctgcagccac caacgcctcc tgcaccagtt cccgtggtgg cattgccgcc gactaaatat
     7801 gacccccaga cgcacacatt acagggaaag ccgcgccagc gtgcaccgcg cggtagcggc
     7861 ggctctggag caccaggcag gggcagggga cgaggaaggg gtagaggacg tggaggagga
     7921 gttaccagtg ggatggctat ggtactgcca ccgccaatgt cagattacgg gagcaatact
     7981 catatagtca ataatttggt cggaactccc tttgagttca acaacgagtt tgacgacatg
     8041 gcaggacctg gtgtggagaa tctgcaatcg ctaagggatc ggcgaagaag ttttgagctt
     8101 cgagctccac gagttcaaaa caagccaacc actacaccga caactgcaac aaccacaaat
     8161 cctctcctcc atccagtgct gccgggacct gtggacatga gaacatacaa tctcggattt
     8221 gaggcaccgc acagtacagc atctcaagag gcctaccaga ataatcttct gggtgcgttt
     8281 gactcgggaa ccgccgatca gacactcagc gagttcaacg aggaggacga acgccagttc
     8341 cagtcggcac tgcgagcaac tggcactgga acctcgccca gcaaacagca ttcaggacca
     8401 acggcactag ttgcacctcc aactggtccc aatcccacac ctgcaccaaa tcttcttcta
     8461 cattgcacgg aagcaaacca aatggcaccc aatgtggctg ctacaggtgc tgctacgcat
     8521 ttggtcgaag gctcgctggt tgaagcatct ctagaggcca cttcagagga ggtatcgatt
     8581 gactcagaca gcacgatacc acactccaag acctcgactt cagatgcccg aagtcagatt
     8641 aaacttaaga tcaagagccc gatggcctat ccagaacact ataacgccat gacgaacagc
     8701 agcagtctga ctcttaccag cactctggtg cagtcgtcaa atgtagtgca aaccaccgta
     8761 tccacgtcga ctgtagtcag tgcgtcatca gccgttagcg gcaactctcg tcgcatgcga
     8821 aagaaggaac ttcttagtct ttatgtggta cagaaagata atcacaatga cgatagctca
     8881 tgcggactgc cagccgcatc cgacactttg ccccttgaga atttgcgaaa gtctgaggag
     8941 gaagacgaac tttcaggtgg caacggtacg aaaaggttta agaagaactc tagtagcagg
     9001 gaattgcgtg ctcttgatgc gaacttagcc cttgtggagg aacaattgct ttcaagcggt
     9061 gcaggagcct gtggaggagg atcatcgggt gacggcagac ggcgtagcgc ttgtagctca
     9121 ggtagcaata acgacaacaa cgggaagact ggagcagcta gtagtgcagg taagaggcga
     9181 ggacgcagta agactctcga aagcagcgag gatgaccacc aggcccctaa gctgaagatc
     9241 aaaatcaggg gtttgacggc caacgagaca ccttcaggtg tttcaagcgt tgacgagggc
     9301 caaaactaca gttatgaaat gacccgaagg gcttgtccac ctaaaaagcg tttgacaagt
     9361 aattttagca ctcttacact agaggaaatc aagagggact cgatgaatta ccgcaagaaa
     9421 gtcatgcagg actttgtcaa gggcgaggac agcaacaaga gaggggtagt tgtcaaagat
     9481 ggcgagtcgc tcataatgcc ccagcctccc acaaaacgac ctaagtcttc caagccgaaa
     9541 aaggagaaga aggaaaagaa acgccaaaaa cagcaacagc ttatcttaag cagcagtaca
     9601 accaccatga ccactacgtt aattgagaac acggccagtg cgtcgccagg tgataagcct
     9661 aagctgatcc tacgttttgg caaacgaaag gcggagacca cgacaagaac cgccagtctg
     9721 gagcaacctc ctaccttgga ggctccagca cccctgcgat ttaaaatagc ccggaactca
     9781 tctggcggcg gatacataat cggcacgaag gcggaaaaaa aggatgagtc tacggcagac
     9841 aacacgtctc cgattacgga gctgccactt atatcacctc tcagagaggc gtcaccacag
     9901 ggcagactgc tcaacagttt tacaccgcac tcccagaacg ctaatacgtc accagccctg
     9961 cttggtaagg acacgggtac cccatccccg ccatgcttgg tgatagactc tagcaaaagt
    10021 gcagacgtgc acgactccac atccctgccc gagagcgggg aggctgcgat gggcgtgcag
    10081 tcgtcccttg tgaatgcgac cacacctttg tgcgtcaacg tgggaaacta cgagaatagc
    10141 aacaactcat tgccatcagc cagtggcacc ggatcggctt ccagcaactc ctgcaatagc
    10201 aactcgatca acaacaacgg cagtggagga ggacgcgcca gcggggaagg cggcttactt
    10261 ccattgaaaa aagactgtga ggttagatga taatcggaga acgcgaaaac gtcgcatgta
    10321 agcgtatcct gtaacttcgt tcattcaagc tttcgataat cccctccctt tgtgatttgg
    10381 ttccttgccc tcgcccctat taaagtgtta gcttttatag tacatgagcg aactgcaggt
    10441 ccttaatcga actttaaagt gtacagttag cttgtaagcc ccttaaccat tcttctatct
    10501 gtctgtaact ggtttgttgt aatcgcatca gcgtaaagaa gatattatct ctacacacct
    10561 atataatatt aatatgcata tagcacatta tgcgagaaac aaaaaatata aattgtacat
    10621 ttttagaagc agattgtaca tttaaggctt tgtaaaatat tgtattaaag atataaaaga
    10681 aacaaaaaag cagacacagt tgtgttgtct agcctaaaaa aaatccaaaa atgctctcgg
    10741 tttaggttta taaagtttcc attcacctaa aaatccttcc acgcctccta ctcactaagc
    10801 atgattgata aaccccagca tatataataa acactattta ttaataatta tttaataaaa
    10861 tgtgtaaaat aatcgtaaaa tattagcaaa aagtgttaaa tgcgtgcaaa aagtaacaac
    10921 aatttcaaca attgtagcta tagtcattaa ttgtagttct tagcgaaaat caaagtcggt
    10981 tgaccctgac cgcaatctaa gtgcatataa tagtccctgt tcccaaacat gtgcgtagag
    11041 agaaaaacaa aaattgaata actaaattcc ctatcatttc cgaaagggaa ttgtttgtat
    11101 tttagcgttt aagaatgcaa cgtatcgaag aactccagag gaaagttaat gattgttttg
    11161 cgttgtatga tgttaaagca atagtaacgt aggcaaaaga acactggttt tctataagca
    11221 aaggtgcctg cagcctcgag cgaaatcaca caaaacaggc cgaagcaaga tgccatctaa
    11281 aacagtactg tttggtaggt gtgattgcat tccagattcc gatcgcggaa cgatacacaa
    11341 cgccccggtc cacctagact ttctgtattc ttttgtcgcc aatagttcaa gaaattactt
    11401 cccatttagt tttcaattag cattgggcgt atctcgatta aaggggcaat acaaaattaa
    11461 ataaaaaatc atttaaggaa cccagcaaca aatatttatt ctaaaataaa tccgcgaaaa
    11521 ggcaaaaata gagaataact aatataattg tgtaaagaac caagcaagaa ataaatttag
    11581 aagttaacga cacttctgta attgtaaata tttgtgcgca aaaaatgttt tt
//