LOCUS NM_001259599 11632 bp mRNA linear INV 26-DEC-2023
DEFINITION Drosophila melanogaster rhinoceros (rno), transcript variant D,
mRNA.
ACCESSION NM_001259599
VERSION NM_001259599.2
DBLINK BioProject: PRJNA164
BioSample: SAMN02803731
KEYWORDS RefSeq.
SOURCE Drosophila melanogaster (fruit fly)
ORGANISM Drosophila melanogaster
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE 1 (bases 1 to 11632)
AUTHORS Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
Strelets,V., Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: Impact of
High-Throughput Data
JOURNAL G3 (Bethesda) 5 (8), 1721-1736 (2015)
PUBMED 26109357
REMARK Publication Status: Online-Only
REFERENCE 2 (bases 1 to 11632)
AUTHORS Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: The
Rule-Benders
JOURNAL G3 (Bethesda) 5 (8), 1737-1749 (2015)
PUBMED 26109356
REMARK Publication Status: Online-Only
REFERENCE 3 (bases 1 to 11632)
AUTHORS Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
TITLE The Release 6 reference sequence of the Drosophila melanogaster
genome
JOURNAL Genome Res 25 (3), 445-458 (2015)
PUBMED 25589440
REFERENCE 4 (bases 1 to 11632)
AUTHORS Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
TITLE Sequence finishing and mapping of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1625-1628 (2007)
PUBMED 17569867
REFERENCE 5 (bases 1 to 11632)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
TITLE The Release 5.1 annotation of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1586-1591 (2007)
PUBMED 17569856
REMARK Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE 6 (bases 1 to 11632)
AUTHORS Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
Ashburner,M. and Anxolabehere,D.
TITLE Combined evidence annotation of transposable elements in genome
sequences
JOURNAL PLoS Comput Biol 1 (2), 166-175 (2005)
PUBMED 16110336
REFERENCE 7 (bases 1 to 11632)
AUTHORS Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
Celniker,S.E., Rubin,G.M. and Karpen,G.H.
TITLE Heterochromatic sequences in a Drosophila whole-genome shotgun
assembly
JOURNAL Genome Biol 3 (12), RESEARCH0085 (2002)
PUBMED 12537574
REFERENCE 8 (bases 1 to 11632)
AUTHORS Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
Rubin,G.M., Ashburner,M. and Celniker,S.E.
TITLE The transposable elements of the Drosophila melanogaster
euchromatin: a genomics perspective
JOURNAL Genome Biol 3 (12), RESEARCH0084 (2002)
PUBMED 12537573
REFERENCE 9 (bases 1 to 11632)
AUTHORS Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
Rubin,G.M. and Lewis,S.E.
TITLE Annotation of the Drosophila melanogaster euchromatic genome: a
systematic review
JOURNAL Genome Biol 3 (12), RESEARCH0083 (2002)
PUBMED 12537572
REFERENCE 10 (bases 1 to 11632)
AUTHORS Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
Gibbs,R.A. and Rubin,G.M.
TITLE Finishing a whole-genome shotgun: release 3 of the Drosophila
melanogaster euchromatic genome sequence
JOURNAL Genome Biol 3 (12), RESEARCH0079 (2002)
PUBMED 12537568
REFERENCE 11 (bases 1 to 11632)
AUTHORS Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
Weinstock,G.M., Weissenbach,J., Williams,S.M., Woodage,T.,
Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
TITLE The genome sequence of Drosophila melanogaster
JOURNAL Science 287 (5461), 2185-2195 (2000)
PUBMED 10731132
REFERENCE 12 (bases 1 to 11632)
AUTHORS Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
Smith,E., Yu,C. and Rubin,G.
CONSRTM Berkeley Drosophila Genome Project
TITLE Drosophila melanogaster release 4 sequence
JOURNAL Unpublished
REFERENCE 13 (bases 1 to 11632)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (20-DEC-2023) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 14 (bases 1 to 11632)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 15 (bases 1 to 11632)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (10-NOV-2022) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 16 (bases 1 to 11632)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 17 (bases 1 to 11632)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (20-APR-2020) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 18 (bases 1 to 11632)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (22-APR-2019) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 19 (bases 1 to 11632)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 20 (bases 1 to 11632)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 21 (bases 1 to 11632)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (07-OCT-2015) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 22 (bases 1 to 11632)
AUTHORS Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
Park,S., Svirskas,R. and Karpen,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 23 (bases 1 to 11632)
AUTHORS Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
Svirskas,R. and Rubin,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 24 (bases 1 to 11632)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
CONSRTM Drosophila Heterochromatin Genome Project
TITLE Direct Submission
JOURNAL Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE 25 (bases 1 to 11632)
AUTHORS Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
TITLE Direct Submission
JOURNAL Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
Rockville, MD 20850, USA
COMMENT REVIEWED REFSEQ: This record has been curated by FlyBase. This
record is derived from an annotated genomic sequence (NT_037436).
On Jan 16, 2013 this sequence version replaced NM_001259599.1.
##Genome-Annotation-Data-START##
Annotation Provider :: FlyBase
Annotation Status :: Full annotation
Annotation Version :: Release 6.54
URL :: http://flybase.org
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..11632
/organism="Drosophila melanogaster"
/mol_type="mRNA"
/db_xref="taxon:7227"
/chromosome="3L"
/genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
gene 1..11632
/gene="rno"
/locus_tag="Dmel_CG7036"
/gene_synonym="CG7036; Dmel\CG7036"
/note="rhinoceros"
/map="61B2-61B2"
/db_xref="FLYBASE:FBgn0035106"
/db_xref="GeneID:38027"
CDS 565..10290
/gene="rno"
/locus_tag="Dmel_CG7036"
/gene_synonym="CG7036; Dmel\CG7036"
/note="CG7036 gene product from transcript CG7036-RD;
CG7036-PD; rno-PD; rhinoceros"
/codon_start=1
/product="rhinoceros, isoform D"
/protein_id="NP_001246528.1"
/db_xref="FLYBASE:FBpp0294000"
/db_xref="GeneID:38027"
/db_xref="FLYBASE:FBgn0035106"
/translation="MSQRGKRGNQQHHQSHHPPPQQHQRKDVEPQPPPTKRRKGRPPN
GATTAAVAEVTGSGPATGSERVPVLPLCKSKHEEPGAEAGGGGQGRAAAGATSTSKSK
STKLAKSASKCKSQGASSSSSWQARSVADIKMSSIYNRSSTEAPAELYRKDLISAMKL
PDSEPLANYEYLIVTDPWKQEWEKGVQVPVNPDSLPEPCVYVLPEPVVSPAHDFKLPK
NRYLRITKDEHYSPDLHYLTNVVALAENTCAYDIDPIDEAWLRLYNSDRAQCGAFPIN
ATQFERVIEELEVRCWEQIQVILKLEEGLGIEFDENVICDVCRSPDSEEANEMVFCDN
CNICVHQACYGITAIPSGQWLCRTCSMGIKPDCVLCPNKGGAMKSNKSGKHWAHVSCA
LWIPEVSIGCVDRMEPITKISSIPQSRWSLICVLCRKRVGSCIQCSVKPCKTAYHVTC
AFQHGLEMRAIIEEGNAEDGVKLRSYCQKHSMSKGKKENAGSHGGGSASVASAMQKAN
RYGSGAGGGADDGNNACGTTGEDPRRRKNHRKTELTSEERNQARAQRLQEVEAEFDKH
VNFNDISCHLFDVDDDAIVAIYNYWKLKRKSRHNRELIPPKSEDVEMIARKQEQQDME
NHKLVVHLRQDLERVRNLCYMVSRREKLSRSLFKLREQVFYKQLGVLDEMRLEKQQTK
QEQQQPVMDLNAVIYANDGPTLYDRFYSSVGGQTVPAQYQDLKYILEQLMGKLQSGKQ
GRGRASQSPNKRKQPAKASPNKKLNNGILSSRTSSPEKTVAGSKVGTTTSKVRSPPGK
NPTGRRASKSSAAAATSTHNKSQFHSNIRSSTTSHSSSGTISSGNSSSANGTSSSDSS
SGSDSGSESGSSSAGSGVSKRKSSSGSPLKKQSYARSVEQRQKQRQRRQNEAVAGASA
TYPDSRSASSSSDGEDERCRNRQEPERGARRGPIQSKSVPNRSQASRSKPTTEADVGE
GTGASARRKLSTTTRGLAQMDKDADESVSSDESEELLPLRGERQRESTTTSGLATTGS
AIGRNLGQHIYSDSESSSSEQEKDQEEQATVESNVSDSQNQQTIRTKAAMKEFVPGTA
ATTSSTSQAASSTSKAKNTREGKEGAASIGNSTKTKPNPNAKLYPADLLVVPQRQAAK
KASENMRSTNLATTLQPDVSDRVREPDINSISGTAKSKVKDSSSRVSNEADKSSLEKV
RPKEHLQKTVGKTSESAPAERGKRGRPPKVPKDARPPSITENDKPALPTHTQSKPPSV
VATPVSAKSNFAVSLVPQRQAAKKAAEQLKSSKPVLESFSTGNDISDKETVTSATISG
SGSSVPAASTPVKPTRRSSIKEAPITPKEPLSGRRKSKEDLLATPIKTTPLVKRRVVV
PNLSSSSSGDSESSSSSSSSGSSSSSGGSDSDSESQASNSENPSSREPPVAPAKVPSD
SSLVPKRSPRKSMDKPSALTIAPASVNVLNIPSTRSRQNSTTKSTKVALQKAVQSVED
DVKCTPKTNRLQGSMDECGKQVQLEQATKRATRGSKSRPPSPTAKSSPEKTVSRCKSR
AEESPKKVANLEQEISQRKVASGKGTSSLDKLLNKKQQQMNHSAQATPPPISPTPPAS
ETRIVKDQCDLKPDEVSIQQINLGADAQPEPDLDPESAAEAGELPMDIDEELTTAPTR
TQLSASASKLADIIDDERPPAAPLPASPTPTPTSNDEMSDAGSDLSERRRMRWRSRRR
RRRRSHEPDEEHTHHTQHLLNEMEMARELEEERKNELLANASKYSASTSSPAVTVIPP
DPPEIIELDSNSAEQQQQHLHDQPLPPPLVVQSPAADVVPTVMQQQLLPSQRPLIEQL
PVEHLPIVETILEMEDSKFANNFASNLASVLNPPNQMSLIGSSIDRSKQISEEDSIQA
TRNLLEKLRKTKRKAQDDCSSKEAVDLLPPTPAIPSVFPFHNAADPEDIIHAQKEQQH
QQQQQLQSSQTCIYGNSSGPNSVASLTIKDSPMTANSGSYANSLTNTPNATPTNATMN
NLGYQVNFPNSQPPPTLGLFLEKSPHQKGACPLSSNGGANVGQPAPTPDFVDLAAAAV
KNTLGSFRGAATVPTQSGTGVNAKINDYDESTRMQSPFGGMPWNESDLIAERRSSSPS
SVSESNDPPQPPPVVTATATTARSLAQLESCKNFFNSYPSGNAGPGTAANATAPFNHP
PMVNGIDSIPMFNNTNTTQHQPTTPAHQQQQQRTPNNQYNGTIYPQLAGIMHPQTTPT
EPPSSLYGNGGVGGAVQSTTLPPPAQVNQYPGTPYSATTLGMISVQQPALSTVPVQTA
TTPNNPFTLTSPIDGKMPTYPAQLLSSCAEAVVASMMPPTPPVTATAKDSPSKRTSVS
GSNLSKKQTHKSPQLPQGKSPGKSPRQPLQPPTPPAPVPVVALPPTKYDPQTHTLQGK
PRQRAPRGSGGSGAPGRGRGRGRGRGRGGGVTSGMAMVLPPPMSDYGSNTHIVNNLVG
TPFEFNNEFDDMAGPGVENLQSLRDRRRSFELRAPRVQNKPTTTPTTATTTNPLLHPV
LPGPVDMRTYNLGFEAPHSTASQEAYQNNLLGAFDSGTADQTLSEFNEEDERQFQSAL
RATGTGTSPSKQHSGPTALVAPPTGPNPTPAPNLLLHCTEANQMAPNVAATGAATHLV
EGSLVEASLEATSEEVSIDSDSTIPHSKTSTSDARSQIKLKIKSPMAYPEHYNAMTNS
SSLTLTSTLVQSSNVVQTTVSTSTVVSASSAVSGNSRRMRKKELLSLYVVQKDNHNDD
SSCGLPAASDTLPLENLRKSEEEDELSGGNGTKRFKKNSSSRELRALDANLALVEEQL
LSSGAGACGGGSSGDGRRRSACSSGSNNDNNGKTGAASSAGKRRGRSKTLESSEDDHQ
APKLKIKIRGLTANETPSGVSSVDEGQNYSYEMTRRACPPKKRLTSNFSTLTLEEIKR
DSMNYRKKVMQDFVKGEDSNKRGVVVKDGESLIMPQPPTKRPKSSKPKKEKKEKKRQK
QQQLILSSSTTTMTTTLIENTASASPGDKPKLILRFGKRKAETTTRTASLEQPPTLEA
PAPLRFKIARNSSGGGYIIGTKAEKKDESTADNTSPITELPLISPLREASPQGRLLNS
FTPHSQNANTSPALLGKDTGTPSPPCLVIDSSKSADVHDSTSLPESGEAAMGVQSSLV
NATTPLCVNVGNYENSNNSLPSASGTGSASSNSCNSNSINNNGSGGGRASGEGGLLPL
KKDCEVR"
ORIGIN
1 gagaaaaata atttttgctc aaaaataaaa taaataatca aaagaattgt aattcctgcg
61 aacgcaaatg cgattttggg aaacgtgagt ttgatacgga cgcttcacga aatgcgaaga
121 ctgtgagcgg tcttcgacta gcgaatgctc gtcgagaatc caaaccagtg agccaaacgc
181 caaagaaaaa cttgcaaaag tctgtggata aacaacgaac aataacaaaa ggcagcaagc
241 ggaaggtggc agtggaagtg gactaactcg cccgtggaga agtgcggagc acacaggact
301 gcaaagtggc gtcgctactg gatattaggg ctggggcagc agcggcgtcc atgtccacgg
361 ggcacaaagg gtggaccacc aagaatcacg accgcaccac atccaggaga gcgcagcagc
421 agcagccggt gtggtgtcgc caactgtggc cttaagcatg tagtcttgca gcaaagaacc
481 cgttctctaa caagcacaag acacaaggga caaggtacca atcgaagcaa aaacacacaa
541 atatccaatc aacaccaatg aaaaatgtca caaagaggta agcgcggtaa tcagcagcat
601 caccagtcgc accatccgcc gccgcaacag catcagcgca aggatgttga gccgcagccg
661 ccgcccacca agcggcgcaa aggtaggcca cctaatggag ccactacggc agcagtagct
721 gaagtgactg gatcgggtcc agctacagga tcggagcgtg tgccagttct gcctttgtgc
781 aaaagcaaac atgaagagcc aggggcggaa gcggggggag gaggacaagg aagagccgcg
841 gcaggcgcta ccagcactag caaatcgaag tcaacgaaac tggcaaagtc ggcaagcaag
901 tgcaagtcgc agggggcgag ttctagtagc tcctggcagg cgcggtctgt agcggacatc
961 aagatgtcga gcatctacaa tcgaagctca acggaagctc cagctgaact ttaccgcaag
1021 gatctcatta gcgcaatgaa attgccagac tctgagccgt tggccaacta cgagtattta
1081 atagtaacag acccatggaa gcaggaatgg gaaaagggtg tgcaagtgcc cgttaaccca
1141 gactcccttc ctgagccatg cgtctatgtt ctgcccgagc ctgttgtgtc tccagcgcac
1201 gactttaagc tcccgaaaaa tcgctatctg cgcattacca aagacgagca ctactcgccg
1261 gatctgcatt acctgacgaa tgttgttgcc ttggcggaga acacatgtgc ctatgatata
1321 gatcctattg acgaggcctg gttgcgtctc tacaacagcg atcgtgccca atgtggtgct
1381 tttcccataa acgcgacgca gtttgagcgc gtcattgagg agctagaggt tcgttgctgg
1441 gaacagattc aagtcatact taaactggaa gaaggcttgg gcatcgaatt cgatgagaat
1501 gtcatctgcg atgtttgccg atcgcccgac tccgaggagg cgaacgaaat ggtattctgt
1561 gataattgca atatctgtgt gcaccaggcg tgttatggca ttacagcaat tccatcagga
1621 caatggctat gccgcacttg ttcgatggga attaagccgg actgtgtcct ctgtccaaat
1681 aagggcggcg ctatgaaatc caacaagtcg ggtaagcact gggcacacgt ctcttgcgct
1741 ctgtggatac ccgaggtcag cattgggtgt gtggatcgca tggaacccat cacgaaaatc
1801 tctagcattc ctcagtcgcg ttggtccctg atctgtgtac tctgccgaaa gcgagttggc
1861 agctgcatcc agtgctcagt aaagccctgc aaaacagcat accatgtgac ttgcgcgttt
1921 cagcatggcc tggaaatgcg tgcaatcata gaggaaggta atgctgagga cggcgtgaag
1981 ctgcgctctt attgtcagaa acacagtatg agcaagggta agaaggaaaa tgctggtagt
2041 catgggggag gcagtgcctc agtcgcaagc gccatgcaga aagcaaacag atacggcagt
2101 ggggctggtg gaggagccga cgacggcaac aacgcgtgcg ggacaactgg agaggatcca
2161 cgcaggcgga aaaatcaccg taaaaccgaa ctgacctccg aggaacgtaa ccaagccaga
2221 gctcagcgcc tccaggaggt ggaagctgag ttcgataagc atgttaactt taatgacatc
2281 agttgccatc tatttgatgt cgacgacgat gctattgttg ccatatacaa ttattggaag
2341 cttaagagaa agtctcgaca caatcgcgag cttatcccgc ccaagtccga agatgtggag
2401 atgatagccc gcaagcagga gcaacaggac atggaaaatc ataagttggt ggtgcatttg
2461 cgacaggatt tggagcgagt tcgtaatctc tgctatatgg ttagtcgaag agaaaagctt
2521 tcgcgctctc tctttaagct acgtgagcag gtattctaca agcagctggg agttctggat
2581 gagatgcgct tggagaagca gcagaccaaa caagagcagc aacaacctgt gatggatctg
2641 aacgctgtaa tctacgccaa cgacggacct actttgtatg accgtttcta cagctctgtg
2701 ggaggacaaa ctgtgccggc gcagtaccag gatttgaaat acatccttga acagcttatg
2761 ggtaagctgc agagtggtaa acagggccgt ggccgtgcct cgcagtctcc taacaaacgc
2821 aagcagcctg ctaaagcttc gccaaacaag aagcttaata atggcattct tagttcacgc
2881 acgtcatcgc ctgagaagac tgtggcaggg agtaaggtgg gaacgactac atccaaggta
2941 cggtctccgc cagggaagaa tcctacaggg aggcgtgcct cgaagagcag cgcggcggcg
3001 gcgacgtcga cccacaacaa aagtcaattc cactcaaata tccgcagtag cacgacctcc
3061 cattcgtcaa gcggcactat atcatcaggt aattccagct cggcgaatgg taccagcagc
3121 tccgatagtt cctcaggaag tgattcgggt agtgaaagcg gtagctctag cgcgggtagc
3181 ggagtctcca agcgaaagtc ttcctcagga agtcccctta aaaagcaaag ctacgctcgg
3241 tccgttgagc agcggcaaaa acagcgacag cgacgtcaaa atgaagcagt tgcgggtgcg
3301 tctgcaacgt atccggattc caggtctgca agcagctcca gcgacggtga agacgagcga
3361 tgtagaaacc gccaagaacc agagcgcggt gcgagacggg ggccaattca aagcaaatca
3421 gtacctaata ggagccaggc aagtaggtca aaacctacaa cggaagcaga cgttggagag
3481 ggaactggag cctctgcaag acgaaagttg tctacgacaa cacgaggact tgcccaaatg
3541 gataaagatg ccgatgagag cgtatcgagc gacgaaagtg aggaattgtt gcctctaaga
3601 ggggaacgac agcgtgagag cacaacgact tccggactag ccacgactgg ttcggccatt
3661 ggtagaaact tgggacagca tatctactcc gactctgaga gcagttcctc tgaacaagaa
3721 aaagaccagg aggagcaggc caccgtggaa agtaatgtta gtgactcaca aaaccaacag
3781 acgattagaa caaaggcagc tatgaaagag tttgtgccag gaacagcagc cacaacatcc
3841 tctacttccc aagcagcgtc ctcaactagt aaagcgaaga acacgcggga aggaaaggaa
3901 ggggctgcca gtatcggcaa cagtacaaaa accaaaccga acccgaatgc caagttgtat
3961 ccggcagatc ttttggtggt gccacagcgc caggcagcca aaaaggcttc tgagaacatg
4021 cgatccacaa atctcgccac gacgctacag ccagatgtgt ccgacagagt cagagagcca
4081 gacataaact ccatttcagg aactgcaaag agcaaggtga aggattcaag ctctcgtgtg
4141 tcgaatgaag cggataaatc aagtcttgaa aaagtacgac caaaagaaca cttacagaag
4201 actgttggaa aaacgtccga aagcgcacct gctgaacgtg gaaagcgcgg aaggccacca
4261 aaggtcccaa aggatgcacg tccgccatct attacggaaa atgacaagcc tgccctccca
4321 acgcatactc aaagcaagcc cccatctgtc gtcgctaccc cagtctcagc caagtcgaac
4381 tttgcagtct cattggttcc tcaacggcaa gcggctaaga aggcagcgga gcaattaaag
4441 agtagcaaac ccgttctaga atctttctca acggggaacg acatttcgga caaagaaaca
4501 gtgacgtcgg caacgatatc cggatcagga tcatctgttc cagcggctag cacgccagtt
4561 aagcctacca gacggtcctc aattaaagaa gcaccaatta ctccaaagga acccttaagt
4621 ggcagaagaa aatcaaaaga agatttgctg gcgacaccta taaaaacaac tcccctggtt
4681 aagcgccgcg tagttgtccc aaatctttca agttctagtt ctggcgacag tgaaagttcc
4741 agctcctcta gcagctcagg aagtagttcc agcagtggcg gcagcgactc ggacagcgag
4801 tctcaggcca gcaattcgga aaacccatct agtagggagc ctcctgtagc ccctgcgaaa
4861 gtgccatctg attcctctct tgtgccaaaa cgttcacccc gcaagtctat ggataagcct
4921 tcagcattaa ctattgcgcc agcatcggtc aatgtattga acataccgtc tacgcggtct
4981 cgtcagaact ctacaacaaa atcgacaaag gtagcactgc agaaggcagt gcaatcagtt
5041 gaggatgacg tcaaatgtac gccgaaaaca aatcgtctgc agggctctat ggacgaatgt
5101 ggaaagcagg tccagttgga acaggctacc aaaagagcga ctcgcggttc caagtcgcga
5161 ccgccctcgc ctactgctaa atcgtctcca gaaaagacag tatccagatg taaatctcga
5221 gcggaagaat ctcctaagaa ggttgcaaat ttggaacaag aaataagcca gagaaaagta
5281 gctagcggaa aagggactag ttcgttggac aagctgttaa ataagaaaca gcagcagatg
5341 aaccattctg ctcaagctac acccccacca atttcaccaa ctccacctgc ttctgaaaca
5401 cgtatcgtaa aagatcaatg tgatcttaaa cccgatgagg tgtccatcca acagattaac
5461 ttgggagcag atgcgcaacc cgaacccgat ctggacccag agtctgcggc agaagctggt
5521 gaactaccaa tggatattga tgaggagctt accactgctc caacgcgaac ccaactatct
5581 gcgagcgcga gtaaactggc agacataatc gatgatgagc gtccaccggc ggctccgctt
5641 cccgcctcgc ccactcctac ccctacctct aatgacgaga tgtctgacgc tggaagtgat
5701 ctcagcgaac ggcggcgtat gcgctggcga tcccgacgga gaagaaggag acgtagtcac
5761 gagccagacg aggaacacac tcatcacacc caacatctcc tcaacgagat ggaaatggct
5821 agggagctgg aagaggaacg taaaaacgag ctgcttgcaa atgcgagtaa gtactcggcg
5881 tctacatcct ctccagcagt cactgtgatt ccgcccgatc cgccggaaat aattgaactg
5941 gactcgaatt ctgcagagca gcaacagcag catctacatg accaacccct gccgcctccg
6001 cttgttgtac aatcccctgc tgcagatgtt gtaccaacag ttatgcagca acagctgctg
6061 ccctcacaac ggcctctaat cgagcaactg cctgtcgagc acttacccat cgttgagaca
6121 atacttgaga tggaggacag taagtttgct aacaacttcg cgtcaaactt agccagtgtg
6181 ttaaatcctc ccaatcagat gagtctaatc gggtctagta tagacaggag taagcaaata
6241 agtgaggagg atagcatcca agcaacccga aatctcttag agaaactgcg caaaacaaag
6301 cgcaaggcgc aggatgactg cagtagcaag gaggcggtcg atctattacc tccaactcca
6361 gccatcccat cggtttttcc ttttcacaat gcggcggacc cagaggatat cattcatgcg
6421 caaaaagaac agcagcacca acagcaacaa cagctgcaat catctcaaac atgtatttat
6481 ggaaactcat caggacctaa ctctgttgca tcattgacta tcaaggattc tcctatgaca
6541 gccaacagtg gaagctatgc aaacagtctc accaacactc caaatgctac acccaccaac
6601 gcaacaatga acaatcttgg gtatcaagta aattttccga actctcagcc acctccaacg
6661 ttaggtctct tcctggaaaa atcaccccac caaaaagggg cttgtcccct atccagcaac
6721 ggaggagcta atgtagggca acctgcaccc actcctgact ttgtagactt ggcagcagca
6781 gcggtaaaga acactctcgg aagctttcgc ggagcagcaa ccgttcccac gcaatccgga
6841 acgggagtca atgcgaagat caacgactac gatgagagca cacgaatgca gtcgccgttc
6901 gggggaatgc cgtggaacga aagcgatctt atcgctgaga gaagaagcag ctcgcctagt
6961 tcagtgtccg aatccaatga tcctcctcag ccacctccag tcgttacggc aacagcaaca
7021 acggctcgat ccctcgctca gcttgagagc tgtaaaaact ttttcaacag ctatccaagc
7081 ggtaatgccg ggcctggcac tgcagcaaac gctacagcgc cctttaacca tcctccaatg
7141 gtgaacggta ttgattccat accaatgttc aacaatacta atacgactca gcatcaaccg
7201 acgacgccag ctcatcaaca gcaacagcag agaactccaa acaatcagta caacggaact
7261 atctaccccc aactagcggg tatcatgcat ccacagacaa ctcctacaga accgccttca
7321 agcttatacg gcaatggggg agttggaggc gctgtgcagt ccactacact acctccaccg
7381 gctcaggtca accagtaccc cggaacacct tattccgcga ctactttggg tatgatttct
7441 gtacaacagc ctgcgctttc aacagttccc gttcaaactg caactacacc caataatccg
7501 tttactctga cttcgccgat tgatggaaaa atgccgacat atccggctca gcttctaagc
7561 agctgtgcag aagcagtcgt tgcgtcgatg atgccaccaa ccccaccggt tacagctaca
7621 gcaaaggact cgccaagcaa gagaacaagc gtcagcggta gtaacttatc taaaaaacag
7681 acgcacaaat cgccccaact tccgcaagga aaatcgccag gaaagtcgcc cagacagcct
7741 ctgcagccac caacgcctcc tgcaccagtt cccgtggtgg cattgccgcc gactaaatat
7801 gacccccaga cgcacacatt acagggaaag ccgcgccagc gtgcaccgcg cggtagcggc
7861 ggctctggag caccaggcag gggcagggga cgaggaaggg gtagaggacg tggaggagga
7921 gttaccagtg ggatggctat ggtactgcca ccgccaatgt cagattacgg gagcaatact
7981 catatagtca ataatttggt cggaactccc tttgagttca acaacgagtt tgacgacatg
8041 gcaggacctg gtgtggagaa tctgcaatcg ctaagggatc ggcgaagaag ttttgagctt
8101 cgagctccac gagttcaaaa caagccaacc actacaccga caactgcaac aaccacaaat
8161 cctctcctcc atccagtgct gccgggacct gtggacatga gaacatacaa tctcggattt
8221 gaggcaccgc acagtacagc atctcaagag gcctaccaga ataatcttct gggtgcgttt
8281 gactcgggaa ccgccgatca gacactcagc gagttcaacg aggaggacga acgccagttc
8341 cagtcggcac tgcgagcaac tggcactgga acctcgccca gcaaacagca ttcaggacca
8401 acggcactag ttgcacctcc aactggtccc aatcccacac ctgcaccaaa tcttcttcta
8461 cattgcacgg aagcaaacca aatggcaccc aatgtggctg ctacaggtgc tgctacgcat
8521 ttggtcgaag gctcgctggt tgaagcatct ctagaggcca cttcagagga ggtatcgatt
8581 gactcagaca gcacgatacc acactccaag acctcgactt cagatgcccg aagtcagatt
8641 aaacttaaga tcaagagccc gatggcctat ccagaacact ataacgccat gacgaacagc
8701 agcagtctga ctcttaccag cactctggtg cagtcgtcaa atgtagtgca aaccaccgta
8761 tccacgtcga ctgtagtcag tgcgtcatca gccgttagcg gcaactctcg tcgcatgcga
8821 aagaaggaac ttcttagtct ttatgtggta cagaaagata atcacaatga cgatagctca
8881 tgcggactgc cagccgcatc cgacactttg ccccttgaga atttgcgaaa gtctgaggag
8941 gaagacgaac tttcaggtgg caacggtacg aaaaggttta agaagaactc tagtagcagg
9001 gaattgcgtg ctcttgatgc gaacttagcc cttgtggagg aacaattgct ttcaagcggt
9061 gcaggagcct gtggaggagg atcatcgggt gacggcagac ggcgtagcgc ttgtagctca
9121 ggtagcaata acgacaacaa cgggaagact ggagcagcta gtagtgcagg taagaggcga
9181 ggacgcagta agactctcga aagcagcgag gatgaccacc aggcccctaa gctgaagatc
9241 aaaatcaggg gtttgacggc caacgagaca ccttcaggtg tttcaagcgt tgacgagggc
9301 caaaactaca gttatgaaat gacccgaagg gcttgtccac ctaaaaagcg tttgacaagt
9361 aattttagca ctcttacact agaggaaatc aagagggact cgatgaatta ccgcaagaaa
9421 gtcatgcagg actttgtcaa gggcgaggac agcaacaaga gaggggtagt tgtcaaagat
9481 ggcgagtcgc tcataatgcc ccagcctccc acaaaacgac ctaagtcttc caagccgaaa
9541 aaggagaaga aggaaaagaa acgccaaaaa cagcaacagc ttatcttaag cagcagtaca
9601 accaccatga ccactacgtt aattgagaac acggccagtg cgtcgccagg tgataagcct
9661 aagctgatcc tacgttttgg caaacgaaag gcggagacca cgacaagaac cgccagtctg
9721 gagcaacctc ctaccttgga ggctccagca cccctgcgat ttaaaatagc ccggaactca
9781 tctggcggcg gatacataat cggcacgaag gcggaaaaaa aggatgagtc tacggcagac
9841 aacacgtctc cgattacgga gctgccactt atatcacctc tcagagaggc gtcaccacag
9901 ggcagactgc tcaacagttt tacaccgcac tcccagaacg ctaatacgtc accagccctg
9961 cttggtaagg acacgggtac cccatccccg ccatgcttgg tgatagactc tagcaaaagt
10021 gcagacgtgc acgactccac atccctgccc gagagcgggg aggctgcgat gggcgtgcag
10081 tcgtcccttg tgaatgcgac cacacctttg tgcgtcaacg tgggaaacta cgagaatagc
10141 aacaactcat tgccatcagc cagtggcacc ggatcggctt ccagcaactc ctgcaatagc
10201 aactcgatca acaacaacgg cagtggagga ggacgcgcca gcggggaagg cggcttactt
10261 ccattgaaaa aagactgtga ggttagatga taatcggaga acgcgaaaac gtcgcatgta
10321 agcgtatcct gtaacttcgt tcattcaagc tttcgataat cccctccctt tgtgatttgg
10381 ttccttgccc tcgcccctat taaagtgtta gcttttatag tacatgagcg aactgcaggt
10441 ccttaatcga actttaaagt gtacagttag cttgtaagcc ccttaaccat tcttctatct
10501 gtctgtaact ggtttgttgt aatcgcatca gcgtaaagaa gatattatct ctacacacct
10561 atataatatt aatatgcata tagcacatta tgcgagaaac aaaaaatata aattgtacat
10621 ttttagaagc agattgtaca tttaaggctt tgtaaaatat tgtattaaag atataaaaga
10681 aacaaaaaag cagacacagt tgtgttgtct agcctaaaaa aaatccaaaa atgctctcgg
10741 tttaggttta taaagtttcc attcacctaa aaatccttcc acgcctccta ctcactaagc
10801 atgattgata aaccccagca tatataataa acactattta ttaataatta tttaataaaa
10861 tgtgtaaaat aatcgtaaaa tattagcaaa aagtgttaaa tgcgtgcaaa aagtaacaac
10921 aatttcaaca attgtagcta tagtcattaa ttgtagttct tagcgaaaat caaagtcggt
10981 tgaccctgac cgcaatctaa gtgcatataa tagtccctgt tcccaaacat gtgcgtagag
11041 agaaaaacaa aaattgaata actaaattcc ctatcatttc cgaaagggaa ttgtttgtat
11101 tttagcgttt aagaatgcaa cgtatcgaag aactccagag gaaagttaat gattgttttg
11161 cgttgtatga tgttaaagca atagtaacgt aggcaaaaga acactggttt tctataagca
11221 aaggtgcctg cagcctcgag cgaaatcaca caaaacaggc cgaagcaaga tgccatctaa
11281 aacagtactg tttggtaggt gtgattgcat tccagattcc gatcgcggaa cgatacacaa
11341 cgccccggtc cacctagact ttctgtattc ttttgtcgcc aatagttcaa gaaattactt
11401 cccatttagt tttcaattag cattgggcgt atctcgatta aaggggcaat acaaaattaa
11461 ataaaaaatc atttaaggaa cccagcaaca aatatttatt ctaaaataaa tccgcgaaaa
11521 ggcaaaaata gagaataact aatataattg tgtaaagaac caagcaagaa ataaatttag
11581 aagttaacga cacttctgta attgtaaata tttgtgcgca aaaaatgttt tt
//