Dbfetch

LOCUS       NM_001276122           12064 bp    mRNA    linear   INV 26-DEC-2023
DEFINITION  Drosophila melanogaster BRD4 interacting chromatin remodeling
            complex associated protein (Bicra), transcript variant B, mRNA.
ACCESSION   NM_001276122
VERSION     NM_001276122.1
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 12064)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 12064)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 12064)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 12064)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 12064)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 12064)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 12064)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 12064)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 12064)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 12064)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 12064)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 12064)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 12064)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 12064)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 12064)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 12064)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 12064)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 12064)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 12064)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 12064)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-OCT-2015) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   21 (bases 1 to 12064)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 12064)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   23 (bases 1 to 12064)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   24 (bases 1 to 12064)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NT_033777).
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..12064
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="3R"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..12064
                     /gene="Bicra"
                     /locus_tag="Dmel_CG11873"
                     /gene_synonym="anon-WO0140519.162; CG11873; Dmel\CG11873"
                     /note="BRD4 interacting chromatin remodeling complex
                     associated protein"
                     /map="98F6-98F10"
                     /db_xref="FLYBASE:FBgn0039633"
                     /db_xref="GeneID:43435"
     CDS             888..9899
                     /gene="Bicra"
                     /locus_tag="Dmel_CG11873"
                     /gene_synonym="anon-WO0140519.162; CG11873; Dmel\CG11873"
                     /note="CG11873 gene product from transcript CG11873-RB;
                     CG11873-PB; Bicra-PB"
                     /codon_start=1
                     /transl_except=(pos:9816..9818,aa:OTHER)
                     /product="BRD4 interacting chromatin remodeling complex
                     associated protein, isoform B"
                     /protein_id="NP_001263051.1"
                     /db_xref="FLYBASE:FBpp0303331"
                     /db_xref="GeneID:43435"
                     /db_xref="FLYBASE:FBgn0039633"
                     /translation="MSRRQSLDRPGILSPPGPFLTPSPSPSISASPRLQPSPSLSPFP
                     QDVVLVAGGSQVNANSNPQEHERKAATPGRRQSIHQFTNGSPNAGTVVFQPSPSQSPA
                     AATTVIGVTSGGLIATAAGGTGAGLSTGVGAGSSSGVGIGVALHGNAIGNELNATLVG
                     SNNIQLNKKGTKRATQQQQQQQQQLGHQIFSSTVAPTSISSATAVAGGGGGYGQFNAT
                     AISTPTSFATAGSVVSSSAKGPTKGQKGQQQQQQHQQFQHYQSSSPLIADSPIPSPSG
                     AIAAAAIKPPTPQPQQMQMLPQQFTISQQPQQQQQAQQVFQFIQGPQGQLIATTPQQQ
                     QPIQQQQQQQPQPVQQQQQHRFVSNAGVTGMSTGKTGKAPQQILPKPQQELQQQKKGT
                     NGGAQISQSAQQTGQANNPTQQQQQQQQQQILLPATTNQPQQLLLNQMPVLVQQNPQG
                     VQLILRPPTPQLTTTPSLVIQQNARGQPQLQTQPAQQQLLRIVGTNGATMQLAAAPTF
                     IVSSQANLIHQTAAGQFQSAIKSGTQLTGLHAALAQAQAQAQAPRSQPFATTATLNTQ
                     LLGQSVAAQLQNLQLAAAQIQMPNGLTAISQLPAHLQQSLGGATINLNQLNGAHIQQI
                     ANAFQAPQAPGTNSGGANSTTELFTQSSPAHVPLAVTPEPLRQPTPVPMMQQQQQAAM
                     ATIQLQQQQQQQQQAQIQQLQHQLQQTQSQVQQAQQSVQATTIPEPKKKPRPRKKKQP
                     PAPTPPAASPTVAPAEVRPPTPQKIAIAATPAPQPTVRAKSPSPPPTTIQRSSNGKLD
                     LGDVMKFCGITGDDDDDEDYGMGLETGLASESEPAQAHAPATGSAAATTGSSGDIMIS
                     IPNQNGSDGLPFTLTIPAPQPQSSTGSQAAGNESATEIPNILIKIDPSAEAAVPPFGL
                     STPRLPTAEEIQKQAQQHLAQQAAAAAAAAKATTATPTINISLPSMTLQPQVAPAPVV
                     TPTPAPSSNQVSTTVVVKPRRKPAVRRNAKKPDVTSSASNMISSSSGTTTTISSSTST
                     ILNNSQPTTVNTITLTTPTPVTSSSNSTLMLTHGTPTAVPSQIGNIQISQVHNHHPSA
                     LPVSSAVENKIQIMPILPPGATVMGGTPGTGGHAPTTQSTQLVFQPAAPSVAPTITSD
                     STGFKLSSDGRQMQLVAIPQATPPSPAASSAQPPVPTPTPALTAGGATILPPQLTGMP
                     ANVSVSVGSPASAAALVPQLTGSLTLTVSEQCERLILRHDPNNPQDHQSQLILQALLK
                     GALPNVTIINEPTRVDPNKQTTPMLQPQQQVIQIPQPLTQPQQILQQQPKVSTAPKIE
                     VKVANNRKLSGQVTHPASSMLGMTSTTTTMSTMKPPANLPAKQNEVVSFPQLPLNSQV
                     LIQQQPMISAQLQPQLQPVTVPVSLPTQPVPIPSPAPAPQLANNGQQRYIALPPIDPT
                     TQQLFCLNSVTNQITAMSAGQTAASIGPTERLLIAPAGINAQQLAQCLQLGQLHFNDV
                     NPLPAQQQQQQVATTSSSMTLSMPLRPQPPQQQIVHPIQPITNGLAITTTASSTLTTT
                     TSTTSLTTVTKQVANVAPIIDQSKAKLEPAKAATSGAGGGAVVKKKPVRKPKATPTNA
                     SELANLKNASVVKPMPKLDPLSQKPNNNPVQIVQPNVASGKQMIVVGSSSCTTTTTTT
                     MSASIQPFQTTSGTKLVGVTVPMQQQQPTGTAAILQQQQQQRTSFSLTATTVATPAPM
                     PSPTVASGVSQLQMQQLPSQQAQQQLQPQQQQQLHAPKQQQPQPNTVSRVQTIQLTPQ
                     MQQIFKQVQMQIQFLTIKLQNKSTFLPVSPEIDSATIAAYNKPMTDAEINLALQRLFA
                     EQHRILASGKEVPTPEGMLPTANGTGGFSLMPQPVQQQQQQQQQQLQSTVPEPLKTIV
                     PANVVQPNNHSSTPAGAATSGATTAANIQQNITQRIHIYPMQHQQQQQQQQQQQQQQP
                     QHQQQQPQHQLQQTGAKPKQAQPKPPPQQEQQSQLQIKPKEPANRSKVTSQQLQQNPQ
                     PPNGTINMLPQKVSMMPVVQQQQQQQPQQHQQQPMLNKSGPPPLIFASSTNMLSNSNS
                     NNLHNSNSAMQVNNMLPLPPLQPQQQPQPQLQQPPTPILPPVAPTMAMGVAPGPMGNL
                     VPSLPTIPSLPLYMPSKMDEMPTQKPKIARLSLFVRQLEVDQESCLKPDYVKPFRTKD
                     EAVKRLIRYHCMHENDVELPSDEDEEFESTALEFQDKFRQLNGKFQEILMQESTLPHR
                     TSELLQMQQLMIDDLKGEINEIRTAEKELEQQLKEEQTSEKSTAESDVLSEAKVKEEI
                     KQEPIDKSAQPDGVVDKFDLLKNSADVNKAFSKSQPQQTTTVKQEESNESGEPSVNGS
                     VKSEGHDKESSKYIKKDYDNFDIESELTTSFIMKKVENKAALAKSAENRYQERNMMDQ
                     QQQQQHIKANGGDPVDQDDGWYCLQKELNLMNNDHMQQSQQQLNGNQQHLPINRQPAP
                     QQSHFLDNSNSNNMMGNSNHMSDLFPNGCIDKSNQSTTSSSNSSCVSESLPGSNPVKT
                     VSSCSGQREQPVVMDADQQNEHPAISEFFSSDSDVQKSVETRLEAMFGESPVHLDVKS
                     SNDANDIESNLDEIFGDSKSPAANHKSKMSMWVPDASFMQQAPQQQTSQQQQQYAGPQ
                     QPSQQQHHIELNCNNNPRWMQSMEAHYSDFLSTGTSNGELVNGESRKRSWDSQLMGSG
                     SDMEDDNSSKRLCTASSSSSSSPMSSQQQHVQVAQHGLLDSDMFPQMLMDQQPHQQQQ
                     QVQQQHNFAYANIMDQQQQQQHQQQHFQHNLQQQIQQQQQMHVSHMGGAAVVAGGEYD
                     DDISRHVASAIDSILNLQNSGDSLQFSLGSILGDSMLDDQRQAGANLPSCQQEQAIQR
                     RRHLGEEMNDCLISGGGSAVSGVADNSSNILLEHHHQQHQLMQSHQQQPHLQHHHHQN
                     MSQAQAQHMGQLNDFSCVAGGLDDPVKSIMTSXLGAPSSTSAAENASNSLILEFNATT
                     L"
ORIGIN      
        1 atcgatttcg ggcagtcgat aaaacaacag ttcggagtgc agcaaaagtg cgggaaatct
       61 gcggaaatgt tattaatcga acattaaggg atacctcgcc cagccgcgct ctagcattca
      121 ccatttccct ttgtttgcgg cggcggcgct cgttgtacgg ccgtgcgaat cgagagaaaa
      181 acgcataaaa tcggggaaag tgcactagcc ccgtgctgca atgtaaataa acaaggctct
      241 gatgagcagt tgccgatcaa gtgcttcgat tagccagcaa aatgtgcgta caaggcgaga
      301 atcagtgacg aaaacaacaa ctacgtgctg ctgcgttcat tgacgtgacg cgcgagagtg
      361 aatgtgtgcg tgcgtgtgtg tttgtgtgtg tgtgctggct gctggcgagt gagtgtgtgc
      421 gtgctcccca tatgtgtgtt gtcgaaaaag cgtttttcaa accacaagcc agcaacaaac
      481 aaatcacaac agcacgtcac aaattgcaat agaagcagaa gcagcaacac ctcaaatcaa
      541 aaaaaggccc cacaacaaca acatctgtcg agttctcaga gagtgagaaa aagaaagagc
      601 gcgttagagt gagagagggg caaaaagaaa gtcaacagtc gcgtgtgaaa tgaaattcga
      661 aagatgtccg cgtgtgtgtg tgttgagtgt atagaattga gtaagaagca gcgcggcaaa
      721 aggaaataag caaaagcaat agccaaaagg aaccaacaac cagcagcaac aaattaccca
      781 atgaatcaat caaaattggc ataatttgca ttcgccgacg ttattgtcta taaaaccccc
      841 ctgaaaaaaa caccacacaa cagaagagca gcacaaatct tttgccaatg agtcgcagac
      901 agtcattgga tcgaccagga atattgtctc ctccgggtcc gtttctgacg cccagtccat
      961 cgccatcgat ctcagctagc cctcgtctgc aaccatcgcc atcgctctcg ccgttcccac
     1021 aggatgtggt gctcgtagcc ggtggcagcc aagtaaacgc caacagcaat ccccaggaac
     1081 acgagcgcaa ggcggcgacg cctggcagga gacagtccat ccaccagttc accaatggca
     1141 gtcccaatgc tggaactgtc gtcttccagc cgagcccctc acaatcgccg gccgctgcca
     1201 cgacggtcat cggcgtgacc agtggtggcc tcatagccac cgccgccgga ggaactggag
     1261 ccggtctgag taccggagta ggagccggga gcagttccgg agtaggaatt ggcgtcgcgc
     1321 tgcacggaaa tgccatcggc aacgagctga atgccactct ggtgggatcg aataatatcc
     1381 agctcaataa gaagggcaca aagcgggcaa cccagcagca gcagcaacaa cagcaacagc
     1441 tgggacacca gatcttcagc agcacagttg cacccacctc gatcagcagt gcgactgcgg
     1501 tggcaggcgg cggcgggggc tatggacagt tcaatgccac cgccatcagc acgcccacct
     1561 cgttcgccac ggccggcagt gtggtcagca gcagcgccaa aggtccgacg aagggtcaga
     1621 aggggcagca gcagcagcag cagcatcagc agttccagca ctaccagagc agcagtcccc
     1681 tgatagcgga cagtcccatc ccgagtccca gcggcgccat tgcagcggcg gcgattaagc
     1741 ctccaacccc gcagccccaa caaatgcaga tgctgcccca gcagttcacc atctcccagc
     1801 agccgcagca gcagcagcag gcgcagcagg tcttccagtt tatccaggga ccgcagggcc
     1861 agctcatagc caccactccc cagcaacagc agcctatcca gcagcagcaa cagcagcagc
     1921 ctcagccagt ccagcagcag cagcagcatc gattcgttag caatgccgga gtcacgggaa
     1981 tgtccaccgg caagaccggc aaggcaccgc agcagatact gcccaagccg caacaggagc
     2041 tccagcagca gaagaagggc acgaacggcg gagcacagat ctcgcaatcg gcgcagcaaa
     2101 ctggccaagc caacaatccc acccaacagc agcaacaaca acaacagcaa cagatcctgc
     2161 tacctgccac caccaaccag ccgcaacagt tgctccttaa ccaaatgccc gtgctggtgc
     2221 aacagaatcc gcagggcgtg cagctaattc tgcgaccacc tacgcctcag ctgacgacca
     2281 caccgtccct cgtcatccag cagaatgccc gcgggcaacc ccaactgcag acgcagccgg
     2341 cacagcaaca actgctgcgg atagtgggca ccaatggagc gacaatgcag ctggctgccg
     2401 cgcccacctt cattgtgtcc tcgcaggcga acctcatcca tcagacggcg gcaggtcagt
     2461 tccagagcgc catcaagtct ggcacccagc tcacgggtct gcatgcggct ctagcacagg
     2521 cacaggccca ggcgcaggcg ccacgatctc agccgtttgc cacgacggcc acgttgaaca
     2581 cccagctcct tggccagagt gtagccgccc agctgcagaa cttacagctg gctgcggccc
     2641 aaattcagat gcccaacggc ctcaccgcca tatcccaact gccagctcat ttgcagcaga
     2701 gcctaggtgg agccaccatc aatctgaatc agctgaatgg ggcgcatatc cagcaaatag
     2761 ctaacgcttt tcaggcgcca caggctccag gaacgaatag cggaggagct aacagcacga
     2821 cggagctctt tacacaatcc tcgccggcac atgtgccgct ggccgtcacc ccggagccat
     2881 tgcgtcagcc cacgcccgtg cccatgatgc agcaacaaca acaggctgcg atggccacca
     2941 ttcagttgca gcagcagcag caacaacagc agcaggcgca gatacagcag ctgcagcatc
     3001 aactgcagca aacacagtcc caggtgcagc aagcccagca atctgtccag gcgacaacta
     3061 taccggagcc caagaagaaa ccgcgacccc gcaaaaagaa gcaacctcca gctccaactc
     3121 caccagcagc ctctccgaca gtagcacccg ctgaagtgag acccccgacg ccacaaaaga
     3181 tagccatcgc cgccacgccc gcgccacagc caactgtacg cgccaagtct ccatcgccac
     3241 ccccaacgac gatccagcga agtagcaatg gaaagctgga tctcggcgat gtgatgaagt
     3301 tttgcggtat caccggagac gatgacgacg acgaggacta tggaatgggc ctagaaacag
     3361 ggctagcatc agaatcagag ccagctcaag cccacgcgcc tgccacaggg agtgctgcag
     3421 cgacgacggg cagcagcggc gacatcatga tctccatacc gaaccaaaat ggcagcgatg
     3481 gcctgccttt taccttgacc atacccgctc cgcagcccca gtcatcgacg ggatctcagg
     3541 cggcgggaaa cgaatcagcc accgaaatac caaatatcct catcaaaatc gatccaagtg
     3601 ccgaggcagc ggtgccgcca tttggtctat ccacgccacg gttgcccaca gcagaggaaa
     3661 ttcagaagca ggcgcagcaa catttggcac aacaggcggc cgctgcagca gcggcggcaa
     3721 aggcaaccac agccacgccc accataaaca ttagcctgcc tagcatgaca ttgcagccgc
     3781 aggtagcgcc ggcaccagtt gtgacgccaa ctcctgcccc atcctcgaac caggtgtcca
     3841 cgacagttgt tgtcaaacca agaaggaagc ccgcggtgcg aaggaatgcc aagaaacctg
     3901 acgtgaccag cagcgcaagc aatatgatct cctcgagcag tggcactacc actacaatca
     3961 gcagcagcac tagtaccatc ctaaacaaca gtcaacccac cactgtaaac accatcactt
     4021 taaccacgcc cacgccagtg acaagcagca gcaattccac cctgatgctg acccacggaa
     4081 ctccgacagc agtgcccagt cagattggca acattcagat ctcacaggtg cataatcatc
     4141 acccatccgc attgccggtg agcagtgcgg tggagaacaa gatccagata atgccgatcc
     4201 tgccaccggg agccacagtg atgggcggaa caccaggaac cggaggtcat gcccccacca
     4261 cccaatcaac ccagttggtc tttcagccag cggcaccatc ggtggcgcct acgattactt
     4321 cagactccac tgggtttaag ctctccagcg atggtcggca aatgcagcta gtggccatac
     4381 cacaggcgac tcctccatcg ccagcagcca gttctgccca gccaccggtt cccacgccca
     4441 ctcctgcgtt aactgccgga ggagctacaa ttctgccacc tcagttgaca ggaatgcccg
     4501 ccaatgtatc ggtttctgtg ggatcccctg cctcggctgc ggctctggtg ccacaactca
     4561 ctggaagtct cacactcacc gtgtcggagc agtgcgagcg attgattctg cgacatgatc
     4621 cgaacaatcc ccaggaccat caatcgcagc tcatcctgca ggctctgctc aagggcgctc
     4681 tgcccaatgt taccataatc aatgagccga cgcgcgtgga tccaaataaa cagacaactc
     4741 cgatgttgca gccgcaacag caagttatcc aaatacccca gccactaact caaccccagc
     4801 aaattctgca gcagcaacca aaggtttcaa cagctcccaa aatagaagtt aaagtggcta
     4861 acaacagaaa gttgtcaggt caggtaacac atcctgccag cagtatgtta ggaatgacca
     4921 gcactacaac taccatgtca acgatgaaac caccggccaa tctacccgcc aagcagaacg
     4981 aagtcgtatc tttcccgcag ctaccactga actcgcaggt tctcattcag cagcagccta
     5041 tgatttctgc gcagctacag cctcagttgc aacccgtcac tgtgcccgtt agcttgccca
     5101 ctcagccggt accgattcca tctccggcgc ctgctccaca actggccaac aatggccagc
     5161 agcggtacat tgcactgccg cccattgatc ccaccaccca gcagttgttc tgcctgaaca
     5221 gtgtcacgaa tcagataact gcgatgagtg cgggccagac ggctgcttcg attggtccca
     5281 ccgagcggtt gcttattgct ccggctggta tcaatgctca gcagctggct caatgtctgc
     5341 agttggggca acttcatttc aatgatgtga atccgctgcc ggctcagcag caacagcagc
     5401 aggtggctac cacctcgtct tcgatgacac tttcaatgcc actgcgtcct cagccaccgc
     5461 agcaacaaat agtgcatccc atccagccaa tcacaaatgg actggccatc accacaacgg
     5521 caagtagcac gctgaccacc accactagca ccacttcgtt aacaacggtg accaagcagg
     5581 tggccaatgt ggcgccgatt atagatcaaa gcaaagcgaa gctggagccc gctaaggcag
     5641 ccaccagtgg agccggtggc ggagcggtgg tcaaaaagaa gccggtgagg aagcccaagg
     5701 caacgccgac caatgcctcc gaactggcca atctaaagaa cgccagcgta gtcaagccaa
     5761 tgccaaagct cgatccccta tcgcaaaagc ccaacaataa cccggtgcag atcgtgcagc
     5821 ctaatgttgc cagtggaaag cagatgattg tggtgggcag ctccagttgc acaacaacta
     5881 ccacgacaac gatgagcgcc agcatccagc ccttccagac aacgagtggc accaaattgg
     5941 tgggcgttac ggtgcccatg cagcagcaac aacccactgg cacggcagct atattgcaac
     6001 agcagcagca acagaggacg agcttcagct tgacggcaac cacagtggcg acacctgctc
     6061 cgatgccatc tccaacggta gcgtctgggg tcagccaact ccaaatgcag caacttccat
     6121 cccagcaggc gcagcaacag ttgcagccgc agcagcagca acagttgcac gctcctaagc
     6181 aacaacaacc gcaaccaaat actgtgtccc gggtgcaaac catccaactt acgcctcaga
     6241 tgcaacagat cttcaagcag gtccaaatgc aaattcagtt cctcactatt aagttacaaa
     6301 ataaatcaac cttcttgccc gtctcgccgg aaattgattc ggcaactatt gccgcctaca
     6361 acaaaccgat gacagatgcg gagataaatc tggctctgca gcgtctcttc gcagagcagc
     6421 accggattct ggcctccggc aaagaggtac ccactccaga gggtatgttg ccaacagcaa
     6481 atggaacggg cggcttcagt ctcatgccac aaccagttca acaacaacaa cagcagcagc
     6541 agcaacagtt gcaatcaact gttcctgagc cactgaaaac aatcgttccg gccaatgtgg
     6601 ttcagccaaa caaccactca tccactcctg caggagcggc cactagtggt gccacaactg
     6661 ctgccaatat ccagcagaat ataacacaga ggatacacat ctaccctatg cagcaccaac
     6721 aacaacaaca gcagcaacaa cagcagcagc aacaacagcc gcagcatcag caacaacagc
     6781 cccagcatca gctacaacag accggagcca aaccaaagca ggcacaacca aagccgccgc
     6841 cacagcagga gcaacaatcg caactccaaa taaaacccaa ggaaccggcc aacaggagca
     6901 aagtgactag ccagcagctg cagcaaaatc ctcagccacc caatggaacc atcaatatgt
     6961 tgccacaaaa agttagcatg atgccggttg tccagcagca acagcagcag cagccgcaac
     7021 aacatcagca gcaaccaatg ctcaacaaaa gcgggccacc tccactgatc ttcgccagct
     7081 ctacaaatat gctgagcaac agcaatagca acaacctcca caacagcaat tctgcaatgc
     7141 aagtaaacaa catgttgccc ctgccaccat tgcaacccca acagcagcct cagccacagt
     7201 tacagcagcc accaactcca atcctgccgc cagtagcacc aacaatggcc atgggagtgg
     7261 cgccaggacc gatgggaaat ctagtgccat ccctcccgac cataccgagc ctgcccctgt
     7321 atatgccgag caaaatggac gaaatgccaa cacagaaacc aaaaattgca cgcctctcct
     7381 tgttcgttcg ccaactggag gtcgatcagg agagctgtct gaagccggat tacgtgaagc
     7441 ctttccgcac aaaggatgag gcggttaaaa ggctaatcag gtatcattgc atgcatgaga
     7501 acgacgttga gttgccttct gacgaagacg aggaatttga gtccactgct ctggaattcc
     7561 aggacaaatt ccgccaactg aacggcaagt tccaggaaat tctgatgcag gagtcaacgc
     7621 tgccacaccg aacttctgag ttgctacaaa tgcagcagct gatgatcgat gatctcaaag
     7681 gcgagatcaa tgagattcga accgctgaaa aggagttaga gcagcagcta aaggaggagc
     7741 agactagcga aaaatcgacg gcggaaagcg atgtactttc ggaggcgaaa gttaaggagg
     7801 agattaaaca ggaaccaata gataagtctg ctcagccaga cggcgttgtt gacaaatttg
     7861 atttgctgaa aaacagcgct gatgtcaata aagcattcag caagtcacag ccacagcaga
     7921 ctactacagt aaagcaagag gagagcaacg aatctgggga accttctgta aatggatctg
     7981 taaagagcga aggccacgat aaggagtcgt cgaaatacat taagaaggac tacgacaact
     8041 tcgacattga gtctgagctc accactagct tcatcatgaa gaaggtggag aacaaagcgg
     8101 ctttggcaaa gagcgctgag aatcgctacc aggaaagaaa tatgatggac caacaacagc
     8161 agcagcaaca tatcaaagcc aatggaggag atcccgtaga ccaggatgat ggctggtact
     8221 gcctgcaaaa ggagcttaac ctgatgaata acgatcatat gcagcaatcg cagcaacaac
     8281 taaatggaaa ccaacagcat ctgcccatca atcgtcaacc ggcgccacag caatcacact
     8341 tcctggacaa ttcaaattcg aacaacatga tgggcaatag caaccacatg tcggatctgt
     8401 tccccaatgg ttgcattgac aagagcaatc aaagcaccac cagcagcagc aatagcagtt
     8461 gcgttagcga aagccttcct ggttctaatc ccgtgaagac tgtctcctca tgcagcggcc
     8521 agcgcgaaca gccggtggtc atggatgctg atcagcaaaa cgagcatccc gccatcagtg
     8581 agttctttag cagcgattcg gacgtccaga aatccgtgga gacacgactc gaggcaatgt
     8641 tcggggagtc gcccgtccat ttggacgtta agagcagcaa cgatgccaac gacattgagt
     8701 ccaacctgga cgagattttt ggcgactcga aatcgccagc tgccaaccac aaaagtaaaa
     8761 tgagcatgtg ggtgccagac gcatcattta tgcagcaggc gccgcagcaa cagacgtcgc
     8821 aacagcagca acaatacgca gggccacaac aaccatcaca gcagcaacat cacatcgaac
     8881 tgaactgcaa caacaatccg cgctggatgc aaagcatgga ggcgcactac agcgactttc
     8941 tttccaccgg gactagcaat ggagagctgg tgaacggaga atcgcgaaaa agaagctggg
     9001 atagccagct catgggctct ggcagtgata tggaagatga caacagcagc aagcgtctgt
     9061 gcacggcttc atcgtcctcc tcttcgtcgc ccatgtcgtc gcagcagcag cacgttcagg
     9121 tggcacagca tggtctcttg gactcggaca tgttccccca gatgttgatg gatcagcagc
     9181 cgcatcagca acaacagcag gtacagcagc aacacaactt tgcctacgcc aacataatgg
     9241 atcaacagca gcagcaacag caccagcagc aacatttcca gcataacctg cagcaacaga
     9301 tacaacagca gcagcagatg catgttagcc acatgggcgg agcagcagta gtggccggcg
     9361 gagagtacga cgatgacatc agtcggcatg tggccagcgc catcgacagc atcctgaact
     9421 tgcagaacag cggcgactcg ctgcagttct ccctgggttc gatcctcggt gacagcatgc
     9481 tggacgatca acggcaagca ggagccaatt tgcccagctg ccagcaggag caggcgatcc
     9541 aacgacgccg ccacctgggc gaggagatga acgactgcct gattagcggc ggtggttccg
     9601 cggttagcgg agtggcggac aacagtagta acatactgct agagcaccac caccagcagc
     9661 atcagttgat gcagagccac caacagcagc cgcatctgca gcatcaccac catcagaaca
     9721 tgtcgcaggc ccaggcgcag cacatgggac agctgaacga ctttagctgt gtagccggtg
     9781 gcttggacga tcctgtcaag tcgataatga cgtcctgact tggagcccca agcagcacat
     9841 cagcggcgga aaatgcttcg aacagtctga tactagagtt taatgccacc accttgtgaa
     9901 aaacggatga taataatgac ttcctgattg tatagatacc cgtcttagtt gtccttggtt
     9961 attgttttac ccccgagtac aacctttaat tagatctaag ttgaatccgt gtagcataaa
    10021 ttgtttttac ttattgtttg tattgtttag ttcaatgatt tgtaccatat ttgaacagcc
    10081 ctcagataac caattactta atgcaagggg acagaacgaa gagagcgagg tgaaagtaat
    10141 ggcatctaca tacaatacgg ctgagcccca ataaagctac gaaagcaatt gacgtggtga
    10201 attgaaaact aaacgacgaa tagcatcaaa tgcttctcac agaggtacac taaaatggaa
    10261 cataatataa aaaattatat agctacagta actaagagaa agcccctaga caatttatac
    10321 aaatgaaacc aagaaagtac gtaagataaa gggaagaagg taacgatggg aataaacacg
    10381 cgcaaagtga gagaaatgaa aggaaaacga attggttgta taacttataa ttaagttact
    10441 tgaagcaatt ctgagaaaca attaaaatca cgttgccaat tttgttgtcg tttgaatatc
    10501 aggcagagca atttatccgc atccaatcga tttttaaaag cccagataaa atatatttat
    10561 ttacgcacta cgttgcgaag gagttttgta aatagttaga acatgcagca gtttcagtat
    10621 atataaatat atagctaagc atttttaaaa ggatatcatt taaagcgctt caagtacgac
    10681 tgcttctgat attacagaca aaccaacaaa ccaaccaacc aaccaatcaa cctattgacg
    10741 tgctcaatat gagagtaaag aatatttcgg catagctacg ggtgtcattg gaaagcaaat
    10801 gcaaagatct tagggattaa cgagtaccaa ctatggcaaa ctacgaagca aaccaaaagc
    10861 agttcaaaaa tcacaatttt agccaattta gtttaaacat ttataaatta taaattattg
    10921 aatctaatgt aatatcttat acacttaaac cgatcgattg aaacgagaag ctagcagttc
    10981 tgagatgagg aaccccagga aaacaaataa caacaaatta acgaatcgta tcaattgtat
    11041 ctcgatcttg cgacaaatct gacttgtgat ttgtgccacg ttttaaatat agatgtctga
    11101 ttagcttgta aacatacata ataaatacga ttaaaaagtc tatacaaaag cccacactca
    11161 gtaagccact acgaagttta tccaaaatga atcgcgactc gcggtagcaa ttgtaaatgt
    11221 ttgacgtggg gagggcgggc gccacgcatc aattacatac caattacaat actaactact
    11281 actttaatac caccactaac acacctacta cgatacgtac taccgtggac ggattgcttg
    11341 tattacgtgt agtctgtaag ggaaattcga gaggtaccat attttaagat gtgcttcgtt
    11401 tgttcaagtc cacccatcca tgacccatga cccatgaacc ctgaccctga actcctgtac
    11461 caaaacccga atcataacca taaccatcca ctcccacaaa agcaaagtct cccacctgcc
    11521 tagcgtcccg tgtgagttgc atgatcttgt atttctattt ggaaactttc tagttattat
    11581 tcaaagacaa ccactttcgt gtaaaaatgt atatgtaata gcataattta agttggcaac
    11641 ccaaaactga ataaacctaa ttctactgat agataactaa tgcgaattgt tttaagttgt
    11701 atttagaacg aaatcgaaat cctcagaact caaatcggtc gcccctcccg caaccaactt
    11761 tccacaacaa caaagcgcag tgtcgtcact tgaattgttc agctaaatgt gtctaaatgt
    11821 aattaaacat aaagagaaaa ctaaagattt ttaaacaaac attgtgagtt ttattaaaca
    11881 tcatcagcat agggagagga gaggacagca gcaaagcaaa ggaaaccaac atctttaata
    11941 tttgatagtt tattaagccc atccattcat agtcgtcata aacattaagt acataaaaga
    12001 aaaacccaac aaaagattca atataaatct cactgaaacg caacaaacaa ccgaaaaccc
    12061 aact
//