Dbfetch

LOCUS       NM_080338               5946 bp    mRNA    linear   INV 26-DEC-2023
DEFINITION  Drosophila melanogaster ovo, transcript variant A (ovo), mRNA.
ACCESSION   NM_080338
VERSION     NM_080338.5
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 5946)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 5946)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 5946)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 5946)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 5946)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 5946)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 5946)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 5946)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 5946)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 5946)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 5946)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 5946)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 5946)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 5946)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 5946)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 5946)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 5946)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 5946)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 5946)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 5946)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   21 (bases 1 to 5946)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 5946)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   23 (bases 1 to 5946)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NC_004354).
            
            On Jul 15, 2014 this sequence version replaced NM_080338.4.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..5946
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="X"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..5946
                     /gene="ovo"
                     /locus_tag="Dmel_CG6824"
                     /gene_synonym="CG15467; CG6824; Dmel\CG6824; Fs(1)K1103;
                     fs(1)K1237; Fs(1)K1237; Fs(1)K155; fs(1)M1; fs(1)M38; Ovo;
                     OVO; Ovo-D; ovo/shavenbaby; ovo/svb; ovoD; Shv; svb; Svb;
                     svb/ovo; Svb/Ovo"
                     /map="4E2-4E2"
                     /db_xref="FLYBASE:FBgn0003028"
                     /db_xref="GeneID:31429"
     CDS             1170..4097
                     /gene="ovo"
                     /locus_tag="Dmel_CG6824"
                     /gene_synonym="CG15467; CG6824; Dmel\CG6824; Fs(1)K1103;
                     fs(1)K1237; Fs(1)K1237; Fs(1)K155; fs(1)M1; fs(1)M38; Ovo;
                     OVO; Ovo-D; ovo/shavenbaby; ovo/svb; ovoD; Shv; svb; Svb;
                     svb/ovo; Svb/Ovo"
                     /note="CG6824 gene product from transcript CG6824-RA;
                     CG6824-PA; ovo-PA; ovo/shaven-baby; shaven baby;
                     shavenbaby"
                     /codon_start=1
                     /product="ovo, isoform A"
                     /protein_id="NP_525077.2"
                     /db_xref="FLYBASE:FBpp0070708"
                     /db_xref="GeneID:31429"
                     /db_xref="FLYBASE:FBgn0003028"
                     /translation="MGGGRDGRGNYGPNSPPTGALPPFYESLKSGQQSTASNNTGQSP
                     GANHSHFNANPANFLQNAAAAAYIMSAGSGGGGCTGNGGGGASGPGGGPSANSGGGGG
                     GGGGNGYINCGGVGGPNNSLDAYGIILKDEPDIEYDEAKIDIGTFAQNIIQATMGSSG
                     QFNASAYEDAIMSDLASSGQCPNGAVDPLQFTATLMLSSQTDHLLEQLSDAVDLSSFL
                     QRSCVDDEESTSPRQDFELVSTPSLTPDSVTPVEQHNTNTTQLDVLHENLLTQLTHNI
                     VRGGSNQQQQHHQQHGVQQQQQQQHSVQQQQQHNVQQQHGVQQQHVQQQPPPSYQHAT
                     RGLMMQQQPQHGGYQQQAAIMSQQQQQLLSQQQQQSHHQQQQQQQHAAAYQQHNIYAQ
                     QQQQQQQQHHQQQQQQQHHHFHHQQQQQPQPQSHHSHHHGHGHDNSNMSLPSPTAAAA
                     AAAAAAAAAAAAAAHLQRPMSSSSSSGGTNSSNSSGGSSNSPLLDANAAAAAAAALLD
                     TKPLIQSVSNPIGQPLNTQSQQQKQGQQITLMKTTRYTEFVEMVSMDVTVKPELFSEL
                     KPEMTEITAEELTLEAETTAAAAAAAAAAAAATTTSATEGTQVLAAAPAPLSSGRKLR
                     GRAKAVAYGSTMITLISTLKSSPEVPATKTVHRTTLRSLATAAAATAAGLLAPSPTVS
                     VLNESKVLQRRLGLPPDLQLEFVNGGHGIKNPLAVENAHGGHHRIRNIDCIDDLSKHG
                     HHSQHQQQQGSPQQQNMQQSVQQQSVQQQQSLQQQQQQQHHQHHSNSSASSNASSHGS
                     AEALCMGSSGGANEDSSSGNNKFVCRVCMKTFSLQRLLNRHMKCHSDIKRYLCTFCGK
                     GFNDTFDLKRHTRTHTGVRPYKCNLCEKSFTQRCSLESHCQKVHSVQHQYAYKERRAK
                     MYVCEECGHTTCEPEVHYLHLKNNHPFSPALLKFYDKRHFKFTNSQFANNLLGQLPMP
                     VHN"
ORIGIN      
        1 acagttacat agcaatcgtc cgagcgaacg gacagacaaa tttctgagaa tcgcacttct
       61 ttgcttctct cattttcgaa aactttgccg ccgagttgct gcagcgtttg acaccaaaca
      121 ctgccaccac tgcacaaaat aatattgtta acaatttggt taataatagc agagccgcca
      181 cctccgtttt agccactaaa gactgtacaa ttgaaaattc cccaattagc ataccaaaaa
      241 atcagcgagc agaggacgag gaggagcaag aagaccagga aaaagagaag cccgcagagc
      301 gggagagaga gaagtccgat gagaggacag aacaggtaga aaaggaggag cgggtggagc
      361 gcgaagagga ggaggacgac gaggtggatg tcggcgtaga ggcgccacgc ccacgattct
      421 ataataccgg cgtggtctta acccaggccc aacgcaagga gtatccccaa gagcccaagg
      481 acttatccct aaccatcgcc aaatcctcgc cagcttcacc gcatatccac agtgactccg
      541 aatccgattc ggactctgat gggggctgca agctaattgt ggacgagaag ccaccattgc
      601 cggtgatcaa accattgtcc ctgcgcttgc gcagcacacc gccaccggcg gatcaacgac
      661 caagtccgcc gcctccgcga gatcccgcgc ccgctgtccg ctgctcggtg atccaacggg
      721 cgccgcaatc ccaattgccc accagccggg ctggcttcct attgcccccc ttggatcagc
      781 taggacccga gcagcaggag cccatcgatt atcacgtgcc gaagcgaagg agcccctctt
      841 atgactccga tgaggagctg aacgccaggc gcctggaacg agcgcggcag gtgcgcgagg
      901 caagacgacg cagcaccatt ttggccgcac gagttctgct cgcccagtcc caaaggctta
      961 atccgcgctt ggtgcgatcc ttgcccggta ttttggctgc ggcagccgga cacggacgga
     1021 atagcagcag ttccagcggt gccgccgggc agggctttca atcgtccggt tttggcagcc
     1081 aaaacagcgg cagcggctcg tccagtggca atcagaacgc cggcagcggc gctggttcac
     1141 ctggatctgg tgccggtggc ggtggtggta tgggcggtgg tcgcgacggc cgagggaact
     1201 acggacccaa ttcaccgcca acgggagcat tgcctccgtt ttatgagagc ctcaagagcg
     1261 gccaacagag cacggccagc aacaatacgg gccaaagtcc cggcgctaat cattcgcatt
     1321 tcaacgcaaa tccagcgaat tttctgcaaa acgcagcagc ggctgcgtac atcatgtcgg
     1381 caggttccgg tggaggaggc tgcaccggaa acggaggtgg tggagcatcg gggccaggag
     1441 gtggcccatc ggcaaatagt ggtggtggtg gtggtggtgg cggcggcaat gggtacatca
     1501 actgtggtgg tgttggtggt ccaaacaata gtctcgacgc ctacggcata atactcaagg
     1561 atgaaccgga cattgagtac gacgaggcca agatcgatat tggcaccttt gcgcagaaca
     1621 ttatccaggc aacgatgggc agctccggtc agttcaatgc cagcgcctat gaggatgcta
     1681 taatgtcgga cctggccagt tcgggtcagt gtcccaatgg agccgtcgat ccgcttcagt
     1741 tcacagccac tctgatgctg agttcgcaga ccgatcattt actggagcag ctgtccgatg
     1801 ccgtggactt gagttcattc ctgcaaagga gctgcgtgga cgacgaggag tccaccagtc
     1861 cgcggcagga tttcgagctg gtgtccacgc cctcgctaac gccggattca gtgacgccgg
     1921 tggagcagca caataccaat acaacccagc tggatgtcct gcacgagaat ctgttgacgc
     1981 agctgaccca caatatagtc cggggcggca gcaatcagca gcagcagcac catcaacagc
     2041 atggtgtgca gcagcaacag cagcagcagc atagtgtgca gcagcaacag cagcacaatg
     2101 tgcaacagca gcatggcgtg cagcagcaac atgtccagca gcaaccaccg ccctcgtatc
     2161 agcatgccac gcgtggcctg atgatgcaac agcagccgca gcacggcggc tatcagcagc
     2221 aagctgccat catgtcgcaa cagcagcagc aattgctcag ccagcagcag cagcagtcgc
     2281 accaccagca gcagcagcag cagcaacatg ccgctgccta ccagcagcac aacatctatg
     2341 cacagcagca acagcagcag cagcagcagc atcatcaaca gcagcaacag cagcaacatc
     2401 atcacttcca ccaccagcaa caacagcagc cacagccaca gtcgcatcat tcgcaccatc
     2461 atggccatgg tcatgacaac agcaacatgt cgcttccctc gccaactgcg gctgcagcgg
     2521 cggcggccgc tgctgctgct gctgcggcag ccgcagccgc tcatctgcag cgtccgatga
     2581 gcagcagcag cagctccggc ggcaccaata gcagcaacag tagcggtggc agcagcaaca
     2641 gtccgctcct agatgcaaac gctgctgcgg cagctgcagc tgctttgttg gacacaaagc
     2701 cactcatcca aagcgtaagt aatccaattg gtcagccact aaatacgcaa tcacagcagc
     2761 aaaagcaggg acaacaaatc acattgatga agaccactag gtacacggaa ttcgtcgaga
     2821 tggtgtccat ggacgtgacg gtcaagccag agctctttag cgagctcaag ccagaaatga
     2881 ccgagatcac cgccgaggag ttgactttgg aagcagagac aacagcagca gcggcggcag
     2941 cagcagcagc agcagcagct gcaacaacaa catcagcaac agagggaact caggttctgg
     3001 cagcagcgcc ggcaccattg tcatccggca gaaagctgcg tggacgcgcc aaggctgtag
     3061 cctatggctc cacaatgatt acgctgatat ccacgctgaa atcgtcgcca gaagtgccgg
     3121 ccaccaagac ggtgcaccgc accactttgc gatctttggc tacggcggca gcggccactg
     3181 cagccggttt gctggcgcca tcgcccaccg tttccgtttt gaatgagagc aaagtcttgc
     3241 agcggcgctt gggcttgccg ccggatctgc agcttgagtt tgtgaacggc ggccatggca
     3301 ttaagaaccc gctggccgtg gagaatgccc acggtggcca tcaccgaatt cgcaacatcg
     3361 attgcattga tgatctcagc aagcatggcc accactcgca gcaccaacag cagcagggct
     3421 cgccgcagca gcaaaatatg caacaatcgg tgcaacagca gtcggtgcaa cagcagcaat
     3481 ccctgcagca gcaacagcag cagcagcacc atcagcacca cagtaacagc agtgcaagca
     3541 gcaatgccag cagtcatgga tccgccgaag ccctttgcat gggctcatcc ggcggagcca
     3601 atgaggattc gtcgtcgggc aacaataagt tcgtctgccg cgtctgcatg aagacattct
     3661 cgctgcagcg cctgctcaat cggcacatga agtgtcactc ggacatcaag cggtatctgt
     3721 gcacgttctg cggcaaggga ttcaacgaca ccttcgacct caagcgccac acgcgcaccc
     3781 acacgggcgt ccggccgtac aagtgcaatc tctgcgagaa gagcttcacg cagcgctgtt
     3841 ccctcgagtc gcactgccag aaggtgcata gcgttcagca ccagtatgcc tacaaggagc
     3901 gcagagccaa gatgtacgtg tgcgaggagt gcggacacac cacctgcgag ccggaggtcc
     3961 actatctgca cctgaagaac aaccatccgt tctcgccggc gctgcttaag ttctacgaca
     4021 agcggcactt taagttcacc aactcgcagt tcgccaacaa tctgctcggc cagctgccca
     4081 tgccagtcca caattagttg atggatttta gttatataga ataggaacca ttcgatcgat
     4141 tcgaatcatt cgaataactc gaatcactcg aatcactcga atcacttaag caaaccaaca
     4201 ttgtgacttt gaagtttttt gaaaactgcc cccccctcct tttgcggagg atacggaatt
     4261 attggagatt atgcttatta tgtatatttt tgcgtagtcc taagaaaatt agttactagt
     4321 tactagctga attagtaccc actgtatcat gctaccccac ttcactttgc tgtgcaaatc
     4381 tctctcactt gagaaaactc gagttccctt ttctttattg ttaatcttga attttcgttt
     4441 atttcaagta tggatctttc gaaaaaaaaa actcgttgtt agtcaaaaag aaaggtatcg
     4501 ttaggacgta agaaatgttt gaaaatgctt tgagcaacac tcacacaaat tatgaatgat
     4561 cgcgtataca tatattagaa gaaaatatat attaaatatt tagctttaag atgttaagag
     4621 ctaccaaccg aacaagctgc aaacgttggc atttatttgt cttcaacaaa acagaacaca
     4681 aatcaaaaaa caaaaacttc gaaaaaaaag aaaaaaaaaa acagaacgcg tagagcaatg
     4741 agcgcggttc ggataacata tgttatatac atacacacca ccaaccatac gcattaatat
     4801 atttattact taatcacaaa accacattgc aaagaggcaa atcattcgaa taaactaggc
     4861 taggatcctt aaaataaact aaagctgttc ataaatgaag tcaaagccca acaaaaaatc
     4921 caacgtgttt tggaaaatcg aatcgcgaga atatgatcta ctgaatatat agtttgcaat
     4981 aatcttagct tttaggcaat accaccaatt gtactgatct ttaatgaaca aacaaaatac
     5041 taaacacaac ataaacataa acataacata aataacatta tgtatctgat ggtataattt
     5101 ctaagaccta ggtgtaatga actttgaatt tggattcgaa tttgagtttg aatttgttta
     5161 cgattgcagg gaaactacga tttccaattc gtaacattaa actgcattat tatttctgtt
     5221 ttagcatagc gctttttttt ttcttttaat ttgttgtaag caacaacggg ttttttgtgc
     5281 aatcgcttgg tgttggttaa atggaaaata aggatgaaat gacatgacaa atctaattta
     5341 agcatcgtac tttaaggttt atgcgggggc tgtccattgt ggcgccacgc ccaccgcccg
     5401 ctgtgcagcc cccgtggaaa atggaggaga cgtcgaatga acgagggagc aaatcgaagc
     5461 ttggcaatga acgagtcctt gtttccatat cctccctcaa ctaacttagt tcttaactag
     5521 aaaactttca ctcacacaca taactacaca catatataaa tatcagttct gcacacaaaa
     5581 acacacacac acacacacaa cgcacacctt aatatatgta cttaaattaa gttagttagt
     5641 ttaggcaata tcttgaactt gaaaacggat ggcagcagaa aatttgatca atgaaaatct
     5701 gagacttcca tgaaaaattc caaatttttt ttcattatca tatttatcga attttgtgcg
     5761 gaaatgaacg aggaattaca aattattaca aatatattaa atatataacg aaaattgaac
     5821 ggaaggcaaa aaagcttagt ttataaaatg atacagtttt ttttttttaa tctatccatt
     5881 acgtatacta tatataaaga ttatgagaag cgaattaatt aataaataat acatatgaca
     5941 aaaatc
//