Dbfetch

LOCUS       NM_078977              10470 bp    mRNA    linear   INV 26-DEC-2023
DEFINITION  Drosophila melanogaster toutatis (tou), transcript variant A, mRNA.
ACCESSION   NM_078977
VERSION     NM_078977.5
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 10470)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 10470)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 10470)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 10470)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 10470)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 10470)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 10470)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 10470)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 10470)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 10470)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 10470)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 10470)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 10470)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 10470)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 10470)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 10470)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 10470)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 10470)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 10470)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 10470)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-OCT-2015) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   21 (bases 1 to 10470)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 10470)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   23 (bases 1 to 10470)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   24 (bases 1 to 10470)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NT_033778).
            
            On Jan 16, 2013 this sequence version replaced NM_078977.4.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..10470
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="2R"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..10470
                     /gene="tou"
                     /locus_tag="Dmel_CG10897"
                     /gene_synonym="anon-48Ad; CG10897; Dm Tou; Dmel\CG10897;
                     EP(2)0622; EP0622; EP622; gene VI; TIP5; Tou; TOU; VI"
                     /note="toutatis"
                     /map="48A1-48A3"
                     /db_xref="FLYBASE:FBgn0033636"
                     /db_xref="GeneID:36241"
     CDS             524..9523
                     /gene="tou"
                     /locus_tag="Dmel_CG10897"
                     /gene_synonym="anon-48Ad; CG10897; Dm Tou; Dmel\CG10897;
                     EP(2)0622; EP0622; EP622; gene VI; TIP5; Tou; TOU; VI"
                     /note="CG10897 gene product from transcript CG10897-RA;
                     CG10897-PA; tou-PA; transcript group VI; toutatis"
                     /codon_start=1
                     /product="toutatis, isoform A"
                     /protein_id="NP_523701.3"
                     /db_xref="FLYBASE:FBpp0087193"
                     /db_xref="GeneID:36241"
                     /db_xref="FLYBASE:FBgn0033636"
                     /translation="MNKNAGDGSDGKNSNKNSNAGGGGGAGGPHDPTGLLDAASLFAY
                     WGRDPTGAAAAAASNPLFNSQFNAAAAAGLGLLPQAGGASANDRYSMAAAAAAAAGAH
                     HHQNTMAVAASQAASLAGLHPASWWSMAQLAAQDYFSRLQASGLSPFPHPDLAAAFGP
                     AGMGMGGGAGGAGAGGGGAGVLGQGGGGNGGSGNGGGGSSSGSSGSKSNKSRKEKRAA
                     QQQQQQQQNLANSLNAAAAAAAAAAANNPAAALASMHGFGVGVPGTSIPSVSTGSGSI
                     STNSGGHGSYKSTASAYGKPSTMTSSSMANNPGSAYDPVTLHKELLAMQAVAAAASGS
                     GSSGSSGKKSGGGGSSSSSMHSLGMGIGGSGGMMSSGKASTSVSASNSSLGGMHPSLT
                     SSNPLMSPHAGMGSSSSSSSKDRDKNNPSLNALNSLSQFGALGMTPQQSMQAAMNAFA
                     ASTGVSPSATVTSSPHHSSQQQQQMGGNSSTSGSGKSSSKDYMMGTGSEHPSLLGVRL
                     PPDTEIIKYTSSIVGPKIPGTTSRGRKKTISETEQQTTQQQQKQQHQAEQHLLQQQQQ
                     AQKELDSTKNAISSLLAFPGLSPAKRARLEMEYAAMAAAAQQQHQQAQQQHQAQQQLH
                     GMLGAAGIPGMAGLVGLPGMSGNPLDQLSVSKASSSTAPTTSTSSSSAGSNLLNQSNS
                     DRVEVIKLPPTITSNGAYNLSSKGKEVHDLTTDMATNSGGVNLSLKSNAGSSALTPSG
                     AVGSASNPITIDDFDAPLNLSMKPSDKSNSSSSNAAAGGSSSSSAALANLASDYQAAS
                     SGQSGNSLQSLSSITAALGGTGGMPGGSISGSGGTSPAPAGAGSGATGGGSGSGGSGG
                     GSSSYKEGRPRNLGRGVSKPKKNTVASLLAQSRAVGLKPMLATQQLLQQGADIEKIRL
                     ALSEANAHMETSTDSESVAAESGLSESESEDANILNVAELRVPLELGWKRETVIRGLT
                     KQGQIRGEVTYYAPGSTTPLKSNGQVFAILEQQPSNLSRENFSFSARAIVGSFLQPAP
                     PPYANDGEYIRMTDEDVAKRLEDLKVFTRQTLNVEQRIEIAKQQQAMRDAKKLQKEEL
                     ARNKEKARQEKNSKLEQQRKDKELKNQQAVEERKKRQEELDRLKQEELLKKQQEKEKR
                     RQEAILAKEQELQKQKEMLLAAEMERERRRQHMSLIRMLELRRKFEDREKKKHQLVLD
                     RLLLRERRMAERKRDAEILQLIRRPNEDSEMPQELVIPELDRIAGNRLPGQAMADLLM
                     VFEFLHNFGETLGFDMESLPSLQNLHDALMSDSNADAEEELLSVMTHLLVCAIEDPGV
                     PNPGRHTTLLGQSLRNADITNSNVSEILRIYLYATATGEVRQMHGITVDRERERRVPD
                     HHQLDSDTTTHSHSVKNQEYYKLLHENDTWKLSQSLKDRPFVALNPTRKAQMLAHLCN
                     DLLMNKAVLRQIDGSLETCAQMRKEKYMTDMKVRKYKALHMRKARIEAYERAQAEREA
                     AMQALMAQQKLDAERLKAEEEAKAAAAEEAAAAAGTDGEATKGGSPNGEKPEDGDQNE
                     EGAAKEPQQQQQQPMEVDGVVDEASLVSPAKTIIQTDNSLTPSKQDMPTPTYQINGSS
                     TPTTSGVTGGDMNVLLQAKKSGARNSINDEHHHDVSIIDDDLSDLDSEITNVEEDEDN
                     RLSADELQKKLDKIVRASLNCKEALEKSTNQLRAACFGQDRFWRRYWKLPKAGGIFIE
                     ALESAQNDICDYHEALEAMDDKKDANDEKENSENEKDVAAESSEQPMEVDESITKLED
                     GVPASDVGMPESNQQNAHQDEEDDDDDVTEINKVEPEIVDLGDDDDDAAPPLPKIEPP
                     RPEIKVKSEMELMGPPPTTMISTKTDFEAEIKIPSMPGILMPPTLNNNNTNNNNNNNG
                     SDNCDKLETGLGLGQQQQNFSQSVIKTEDVKKEDDCIIVSTSSVDDTPKWFSIVRREV
                     PLISELPAEEGGVVGQELQISYANQNCSAQLQLQGHPWDLINNMQYYSIPMDECKVDT
                     SKLGNECIFSLSGLDEKQMLAKVEEYKAHKVESKNGLGSPHRHHETKDDEEQAKLKLD
                     KEIDTEMETDADDLAGKEKFFRLRSDVPPDTGGGVSEGTDVKPKIELRLDEALSQAYY
                     HNIANMSLSSVQTYIPIDIPLPLSMTPDEHRLLEQVKLAGFPERVHGVYVPRRQRYGW
                     WQLDDEQKLRQLLKTLNPSGLRERELQENLQRFLGLEQPLGVNYKLKSDIDFPEEFLM
                     PDKKGDWNPKVAKRVELALIEQLESLEDKVASASMQLKNWQLPNRVESELTLDSQEDV
                     TEEDFVSIIPMIRERIIDLEANIERRYLKPPLGSQTGDAHLAVIAQNQHTTTQTQNSA
                     SAAAYLLQMQQQQQQQQLAQQQQQQQQGSGAGNSLNPSSFNERTMALAAAAAASGPGN
                     ATGVANSAVVAGATPCESGSGEPNSGNASPASNCDSDRDEKVEQIPKGLVQWRDAVSR
                     SHTTAQLAMALYVLESCVAWDKSIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKP
                     KMDNIPDGDWYCYECVNKATNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPL
                     LKVPRGKWYCHGCISRAPPPKKRSAGGTSGSSSKSRRDRDRESGGSAKRRSDNSKTPA
                     MEHMQQQQMPLAGGDSHHHHHQQPPSLNSSHDESMNSLPAAPLSPAHSVVSATNYDDQ
                     HHANNSVDGSSRFHAHLIPPSNNGTAALLEDVPGGANVMPGVYPVYTPVAAGNFSAGL
                     INQAPVQPAMPFANVVAMSPRAVTPTRTRTPTPTPAPTPPPPPPTPLLMQASPTATAL
                     HVNACQSPPQQQHAQLMTMPSPPAIGVGTATNQMSPPPINIHAIQEAKEKLKQEKKEK
                     HATKKLMKELAVCKTLLGEMELHEDSWPFLLPVNTKQFPTYRKIIKTPMDLSTIKKKL
                     QDLSYKTREDFCVDVRQIFDNCEMFNEDDSPVGKAGHGMRKFFESRWGELTDKHS"
ORIGIN      
        1 atcggtttct ctatcgaagc gttcggacgt cgaaaagata aaccccaatt taaatctaat
       61 ccatctgcga gcaaattaaa caacaataat gatagattct taagcggcgc gcacatacgt
      121 gtcttattaa tcgattttga tgtttaaaaa ataatataga aagttgaagt gaacttgcaa
      181 acctcgcgaa tagctggaaa tttgcgcgtg tgcgggtgga acagtgacac tcgtgtgaaa
      241 atcatcgcta aggaaaaggc ggataatata aaggaaaaac gcccgcggcg aaaattattc
      301 cattggtgtg agtgtgcaaa aaaaaggaag aaatatcaag agcaaagaag ttcgagaaat
      361 tgctaaaaat taagtgtgtt tgtgcttgtg tgagtgggtg acaaatgaat gggcagcaaa
      421 aagacgagaa gtcaatcagc gataagcagt ttaaacaaac aaaaaagact cccaacgcat
      481 gttacgcaca atagttggtc ctaagaacta ggggattttg aacatgaaca aaaatgccgg
      541 cgatggcagc gatggcaaaa actcaaacaa aaactcgaat gcgggagggg gtggtggagc
      601 cggggggcca cacgacccca ccggcctttt agatgcagct tccctgttcg cttactgggg
      661 acgtgatcca actggagcgg cggctgcagc ggcttccaat ccgctcttca actcgcagtt
      721 caatgccgcc gctgctgccg gattgggttt attgccacaa gctggaggcg cctctgccaa
      781 tgaccgttat tccatggcag cagcagcggc ggctgcggcg ggagcccatc accaccagaa
      841 cacgatggca gtggccgctt cccaggccgc cagtttggcc ggtttgcatc cagcaagctg
      901 gtggtcaatg gcccagttag ctgcccagga ttacttcagt cgcctgcaag cctcgggtct
      961 ttcccccttt ccacatcccg atctggcggc tgcctttgga ccagctggga tgggaatggg
     1021 cggcggtgct ggtggagcgg gtgctggagg tggtggagca ggtgtactcg gccagggcgg
     1081 cggtggaaat ggcggaagtg gcaacggtgg tggtggatcc tcatcgggtt cttccggcag
     1141 caagtcaaac aagtcgcgca aggagaagcg agccgcccag cagcaacaac agcaacagca
     1201 aaatctggcc aacagtctaa atgcggcagc tgcggcggca gcagcagcag cagccaataa
     1261 tccagcagcc gcgctggcca gcatgcatgg ttttggtgta ggagttccgg gcacgagcat
     1321 tccgtcggtg agcacaggta gcggatcaat atccacaaat agtggaggtc atggaagtta
     1381 caagagcacg gccagcgctt atggtaagcc ctcgactatg acgagcagca gcatggctaa
     1441 taatccgggc tctgcctacg atccggtgac cctgcacaag gagctgctgg ccatgcaggc
     1501 cgtggcagcc gctgcttccg gctcaggatc aagcggttcc tcaggcaaga agtccggtgg
     1561 gggtggctcc tcatcctcct cgatgcacag ccttggaatg ggaattggtg gcagcggtgg
     1621 catgatgtcc agtggcaaag cttcgaccag tgtgagcgca agcaattcat cattgggagg
     1681 aatgcacccc agtttgacta gcagcaatcc gctgatgtcg ccccatgcgg gcatgggctc
     1741 ctcgtcgtca tcatcgagca aggatcgtga caaaaacaat ccctcgctca acgctcttaa
     1801 ctcgctctca cagtttggag ccctgggaat gacgccacag cagagcatgc aggcggcaat
     1861 gaatgctttt gcagccagca caggagtttc gccatccgcc acggtaacgt cctcgccgca
     1921 ccactcttcc cagcagcaac agcaaatggg agggaacagt tccacgtcgg gatcgggcaa
     1981 gtccagttcc aaggattaca tgatgggtac tggaagtgag cacccatctc tgcttggagt
     2041 tcgtttacca ccggacacgg agataatcaa gtacacgtcc tcgattgtgg gacccaagat
     2101 tcctggtact acctcgcgcg gtcgcaaaaa gactatttcc gaaacagaac agcaaacaac
     2161 acaacaacaa cagaaacaac aacaccaagc agaacaacac ctactccaac aacagcaaca
     2221 agcacaaaaa gagctcgaca gcaccaaaaa tgccatttcc tctctgcttg catttccagg
     2281 tctcagtccc gcgaagcggg ctagactcga aatggagtac gcggcgatgg ctgcggccgc
     2341 tcaacagcag catcagcagg cacagcagca acaccaggcg cagcagcaac tccacgggat
     2401 gctcggagca gccggtatcc cgggaatggc tggactggtc ggtctgcctg gcatgagtgg
     2461 caatcccctc gatcagttga gtgtaagcaa ggccagttct tcaacggcgc ccacaaccag
     2521 taccagttcc tcctcagcag ggagcaacct gctcaaccag agcaatagtg atcgcgtgga
     2581 ggtaatcaaa ctgccaccaa caattacttc aaacggagcc tataatctct caagtaaggg
     2641 aaaagaggtg cacgacctca ccaccgacat ggccaccaac tctggcggtg tgaatctcag
     2701 cctaaagtcc aatgctggct ccagcgcctt gacgcccagc ggtgccgtcg gatcggccag
     2761 caatcccatt accatcgatg actttgacgc accacttaat ctttccatga agccctcgga
     2821 taagagcaat agtagctcgt caaatgcagc agcagggggc tcctcctctt ccagcgcagc
     2881 gttggctaac ttggcgagcg attatcaggc agcgtccagt gggcaaagcg gtaatagtct
     2941 gcagagtttg agctctatta cagctgcttt gggtggaaca ggaggtatgc ctggtggttc
     3001 catttccggc agcggaggaa cctctccagc tccagctggt gccggatcag gagccacggg
     3061 tggtggttcg ggatctggtg gatccggcgg tggatcgtcc tcttacaaag agggaagacc
     3121 tcgtaacctg ggacgcggcg tatccaagcc caagaagaac acggtagctt cgctgttggc
     3181 tcaatcccga gctgtgggcc ttaaacctat gctggccacg caacaacttc ttcaacaggg
     3241 tgctgatatt gagaaaatcc gcctggcact gagcgaagcg aatgcccata tggaaacttc
     3301 tacagattcc gaaagcgtgg cagccgaaag tggtctttcg gagtctgaga gtgaggatgc
     3361 caacattctg aatgtagcgg agctaagggt gccacttgaa ttgggctgga aacgggagac
     3421 ggtgattcgt ggactgacta agcaagggca gatccgtgga gaggtcacat attatgcacc
     3481 aggaagcacg acgccgctaa agagcaacgg acaggttttt gctatcctgg agcagcagcc
     3541 atcgaatcta agtcgtgaaa actttagttt ttccgcgcga gccattgttg gttcattcct
     3601 gcagcctgca ccaccgccgt acgccaatga tggcgagtac atacggatga cggacgagga
     3661 cgtggccaag cggttagagg atctgaaggt cttcacccgc cagactctta acgtggaaca
     3721 gcgaattgag atcgccaagc aacagcaggc gatgcgtgat gctaagaagc tgcaaaagga
     3781 agaattggcc agaaataagg agaaggcgcg acaggagaag aactccaagt tggagcagca
     3841 gcgcaaggac aaggagctca agaatcagca ggctgttgag gagcgcaaaa agcgtcagga
     3901 ggagctagat cgccttaagc aagaggaatt gcttaagaag caacaggaga aagaaaagcg
     3961 tcgtcaggag gccattttag ctaaggaaca ggaactacag aaacaaaaag agatgctgtt
     4021 ggctgccgaa atggaacgag aacgtcgccg ccaacacatg agcctcattc gaatgttgga
     4081 gcttcgtcgg aaattcgagg atcgtgaaaa gaagaaacac cagctggttc tagaccgttt
     4141 gcttctgcgt gagcgtcgta tggcggagcg aaaacgagat gcggagattc tgcagttgat
     4201 aaggcgtccc aatgaggact ccgaaatgcc acaagaattg gtaatccctg aactggatag
     4261 gattgccggc aaccgacttc ccggccaggc aatggccgat ttgctcatgg tctttgaatt
     4321 cctacataac tttggggaga ctctaggctt tgacatggaa tcactgccgt cgcttcagaa
     4381 cctgcacgat gctttaatga gcgatagcaa cgcggatgcg gaagaggagt tgttgtcggt
     4441 gatgacgcat ctattggttt gtgccattga ggatccaggt gttcccaatc ccggcagaca
     4501 caccacttta ctgggacaat cgctacgcaa tgcggacatc accaactcga atgtatccga
     4561 aatcttgagg atttatctgt atgctacggc cacaggtgaa gtgcgtcaaa tgcacggaat
     4621 cactgtggat cgtgagcgag agagaagggt gccggatcat catcaactag atagcgatac
     4681 cacaacccac tcacattcgg taaagaacca ggagtactat aagcttctcc atgaaaacga
     4741 cacctggaag ttgtcgcaat cgctgaaaga tcgccccttc gttgcgttaa atcccacccg
     4801 taaagcccaa atgttggccc atctatgcaa cgacctgctc atgaacaagg ccgtgctccg
     4861 ccagattgat ggaagtctag aaacgtgcgc ccagatgcga aaggagaaat acatgacgga
     4921 catgaaggtg cgcaagtaca aggctctcca catgcgcaag gctcgaattg aggcctatga
     4981 acgagcacag gcggagcgtg aggcggctat gcaggcgttg atggcacagc agaagctgga
     5041 tgcagaacgt cttaaggcgg aggaggaagc caaggccgcg gcggcagagg aagcagcagc
     5101 tgcagcgggc acagatggag aggcaaccaa aggtggctca cccaatggcg agaagccgga
     5161 agacggcgat cagaatgaag agggagccgc caaggaaccc cagcagcagc aacagcagcc
     5221 aatggaagtg gatggcgtgg tcgatgaggc atctcttgta agtccggcca aaaccatcat
     5281 tcaaacggac aatagtctga cgcccagtaa acaggacatg cccacaccca cctaccaaat
     5341 taatggatcc agcacaccta ccacaagtgg tgtcactgga ggcgacatga atgttctgct
     5401 gcaggccaag aaaagcgggg cccgaaactc catcaacgat gaacaccacc atgatgttag
     5461 tattatcgac gatgatcttt ccgacttgga ttcggagatc acgaatgtcg aggaggacga
     5521 ggacaatcgc ttgagtgccg atgagttgca aaagaagctc gacaagattg tcagagcttc
     5581 cctaaactgc aaggaggcac tggagaagag cacaaatcag ttgagagctg cgtgctttgg
     5641 gcaggatcgc ttctggcgtc gctactggaa gttgcccaag gctggaggta ttttcattga
     5701 agcacttgag tcggcccaaa atgatatttg cgattatcat gaggcattgg aggccatgga
     5761 tgacaaaaag gatgcaaacg acgagaaaga gaactccgaa aatgagaagg atgttgcagc
     5821 ggagtccagt gagcagccaa tggaagtgga tgagtccatt acaaaactgg aagatggtgt
     5881 accagcttcg gacgtcggga tgcctgaaag taatcaacag aacgcacatc aagatgagga
     5941 agatgacgat gatgatgtga cagaaatcaa caaggtggaa ccggaaattg ttgatttggg
     6001 tgatgatgac gatgatgcag ctcccccgct gcccaagata gaacctccca ggccagagat
     6061 taaagttaaa tcagagatgg agctgatggg tccaccaccc acgactatga tatccactaa
     6121 gacagacttt gaagctgaaa ttaaaatacc ctctatgcct ggcatcttga tgccgcctac
     6181 cctcaacaac aacaacacca ataataacaa taacaacaat ggaagcgaca actgtgataa
     6241 gttggaaact ggtttgggac taggacagca gcagcagaac tttagccagt ctgtgatcaa
     6301 gacggaggat gtgaaaaaag aggatgattg cataatagtt tctacatcca gtgttgacga
     6361 tacaccaaag tggttctcca ttgtccggcg ggaggttcca ttgatcagcg aactgccagc
     6421 ggaggaggga ggcgtggtgg gccaggaact tcagatcagc tatgcaaatc agaactgctc
     6481 cgctcagctg cagctgcaag gtcatccgtg ggatctgatc aacaacatgc agtactactc
     6541 gattccgatg gacgaatgca aggtggacac cagcaagctg ggcaacgagt gcatcttctc
     6601 gctgagtggc ctcgatgaaa agcaaatgct agccaaggtg gaggagtaca aggcccacaa
     6661 ggtggagtca aaaaacggtc tgggctctcc tcatcgccac cacgagacta aagatgatga
     6721 ggagcaggcc aagctgaagt tggacaagga gattgacacc gaaatggaaa cagatgcaga
     6781 tgacctagct ggcaaggaaa agttttttcg cttgcgctct gatgtacccc cggatacggg
     6841 aggaggggtc agcgagggaa ccgatgtaaa gcccaagatt gagcttcgtt tggatgaggc
     6901 gttgtcacag gcatattacc acaacatagc caacatgtca ttgagcagtg ttcaaacgta
     6961 tatacccatc gatatccctc tgccgctttc catgaccccc gatgagcatc gtctgctgga
     7021 gcaggtcaag ctggcaggtt ttccggaacg agttcacgga gtttacgtgc cccgcagaca
     7081 gcgctacggc tggtggcagt tggacgatga acagaagctg cgccagctgc tcaaaactct
     7141 caatccgtcg ggattgaggg agcgtgaact gcaggagaat cttcaacgat tcctcggact
     7201 ggaacagcca ttgggtgtga attataagct gaaatcggac atcgattttc cagaggaatt
     7261 cctaatgcct gacaagaagg gcgattggaa tcccaaggtg gccaaacgtg tggagcttgc
     7321 ccttatcgaa caactagaat cactggagga caaggtggcc agcgcttcta tgcaattaaa
     7381 gaactggcaa ttgcccaacc gcgtggagag tgaactcacc ttggactcgc aggaggatgt
     7441 caccgaagag gacttcgtca gcatcatacc catgatccga gagagaatca tcgacttgga
     7501 ggcaaatatc gaaagacgtt acttgaagcc gcctttgggc tcgcagacag gcgatgccca
     7561 tttggcggtt attgctcaga accagcatac caccacccag acgcagaact ctgcatcggc
     7621 agcagcatat ctcctgcaga tgcaacagca gcagcagcaa cagcagttgg cccagcagca
     7681 gcagcaacag caacagggtt cgggtgcggg aaatagcctg aatccctcat ccttcaacga
     7741 gcgtactatg gcattggcgg cggcagcagc tgcttcgggg ccaggaaacg caaccggagt
     7801 agccaactcg gcagtcgtgg caggagccac gccctgcgaa tcgggcagcg gagaaccgaa
     7861 ttccggcaac gcgtcaccgg ccagcaactg tgacagcgat cgggacgaga aggtggagca
     7921 gatacccaag ggattggtgc agtggcggga cgcagtatcc cgatcgcaca ccaccgccca
     7981 gctggcgatg gccctgtacg ttctggaatc ctgcgtggct tgggacaaga gcattatgaa
     8041 agcgaattgc cagttctgca cgtccggcga aaacgaggac aagttgctac tgtgcgacgg
     8101 ctgtgacaag ggctatcaca cctattgctt caagcccaag atggacaaca ttcccgatgg
     8161 cgattggtac tgctacgagt gcgtgaacaa ggccaccaat gagcgcaagt gcatcgtttg
     8221 cggcggtcat cgtccgtcgc ccgtggggaa gatgatctac tgtgacttgt gtcctcgtgc
     8281 ctaccacgcc gattgctata taccgccgct gctaaaggta ccgcgtggca agtggtactg
     8341 ccacggttgc atctcccggg cgcctccgcc gaagaaaaga agtgcaggtg gaacatccgg
     8401 cagcagcagc aaatctagga gggacaggga tagggagtcg ggcggatcgg ccaagcggcg
     8461 tagcgacaac agcaagacac cagccatgga gcacatgcag cagcagcaga tgccactggc
     8521 aggcggggat tcccaccatc atcatcatca gcagccgccg tcgctgaact cctcgcacga
     8581 tgagtcgatg aattctctgc cggcggctcc tctgagtccg gcgcactcgg tggtctcggc
     8641 cacgaactac gatgaccagc accatgccaa caactcggtc gatggcagca gtcgcttcca
     8701 cgcgcatctc attcccccgt caaataatgg cactgcggct ctgctggaag atgtgccggg
     8761 tggcgccaat gtgatgcccg gcgtctatcc agtttatacg ccagttgcgg ctggcaactt
     8821 ctcagccgga ttgattaacc aagcgccggt gcagccagcg atgccgtttg ccaacgtggt
     8881 cgccatgtcg ccacgggctg tcacgcccac ccgcacccga acgcccacac cgacgccagc
     8941 acccactccg ccaccaccac cgccgacacc gctgctaatg caggcctcgc ccacagccac
     9001 cgccctccat gtaaacgcct gccagtcacc gccccaacaa cagcatgcgc agctgatgac
     9061 catgccctcg ccaccagcca ttggagtcgg aacggccact aaccaaatgt cgccaccgcc
     9121 catcaacata catgccattc aggaagccaa ggagaagctg aagcaggaga agaaggagaa
     9181 gcacgccacc aagaagctca tgaaggagct ggctgtctgc aagacgctat tgggggaaat
     9241 ggagcttcat gaggactcgt ggccatttct tttgccagta aacaccaagc agtttcccac
     9301 atatcgaaaa ataatcaaaa ctcccatgga cttgtccaca atcaaaaaga aattgcaaga
     9361 tttgagctac aaaacccgtg aggacttctg cgtggacgtg cgtcagatct tcgataattg
     9421 cgagatgttt aacgaggacg attcgcccgt tggcaaggca ggccacggca tgcgcaagtt
     9481 cttcgagtcc cgctggggtg aactgactga caagcactcc tgatcctgca ccactagcca
     9541 gccacagatc cagatggcta ccagcaccag cagcacgatc accattggca ccaccgtcac
     9601 accgaacaac atggccgtca cagcaacgca tcccattgaa atggtcaccc tggtcgcggc
     9661 aacgcaggag gatgaggagg cagccgtgga gacgacaaca gcattaactg aagcagcaca
     9721 attttggtca agagattagt ttagattaag cagagattgt acataaatgg gaggaatgag
     9781 gagagggtcg agaagcacag catacacatt aatatatata tatatatatt tatatatagt
     9841 agatatataa acacacacgc gagagtattc aacgtcctac gtcggtaaag accttggaaa
     9901 ggggtagctt tgattaaatt aaatcaccag ccaaggcgaa caacagcgca agtatgcaag
     9961 tgaaaaagtg tattcagatt ttaagtatcc aaggcggcag cggcagcagc agtatataaa
    10021 caataactat tagccgacat aaactagcag ttagaaatcg tatcgcataa gaagccatct
    10081 acagtagacg cccggactgg atgaacgctg acaaatgaca aatgaacacg tacaaaaact
    10141 gtctaccaat ttccgatccg atcgatcgat ccatcgtttg aacagtattt tgtgcacaaa
    10201 gccgcgtcgc tttattttct acacttacaa ttcgaaccct aattttaaca actatttttt
    10261 aagcaccaac gaagcgctca tttaagtttt aaagtgtatt ttgtacttat agcaactccg
    10321 aaatctgtat ttaaacgtat tataacaagt gctaacagtt gcccaggcaa aaatgcgaac
    10381 aattatactc aaaaaggcgg tgcgaggagg acttgtaaat gtaaatgaaa gcaaattaaa
    10441 attaaatgta ttaaattatt tattctgtgt
//