Dbfetch
LOCUS NM_001103523 5683 bp mRNA linear INV 26-DEC-2023
DEFINITION Drosophila melanogaster clathrin heavy chain, transcript variant E
(Chc), mRNA.
ACCESSION NM_001103523
VERSION NM_001103523.2
DBLINK BioProject: PRJNA164
BioSample: SAMN02803731
KEYWORDS RefSeq.
SOURCE Drosophila melanogaster (fruit fly)
ORGANISM Drosophila melanogaster
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE 1 (bases 1 to 5683)
AUTHORS Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
Strelets,V., Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: Impact of
High-Throughput Data
JOURNAL G3 (Bethesda) 5 (8), 1721-1736 (2015)
PUBMED 26109357
REMARK Publication Status: Online-Only
REFERENCE 2 (bases 1 to 5683)
AUTHORS Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: The
Rule-Benders
JOURNAL G3 (Bethesda) 5 (8), 1737-1749 (2015)
PUBMED 26109356
REMARK Publication Status: Online-Only
REFERENCE 3 (bases 1 to 5683)
AUTHORS Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
TITLE The Release 6 reference sequence of the Drosophila melanogaster
genome
JOURNAL Genome Res 25 (3), 445-458 (2015)
PUBMED 25589440
REFERENCE 4 (bases 1 to 5683)
AUTHORS Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
TITLE Sequence finishing and mapping of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1625-1628 (2007)
PUBMED 17569867
REFERENCE 5 (bases 1 to 5683)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
TITLE The Release 5.1 annotation of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1586-1591 (2007)
PUBMED 17569856
REMARK Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE 6 (bases 1 to 5683)
AUTHORS Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
Ashburner,M. and Anxolabehere,D.
TITLE Combined evidence annotation of transposable elements in genome
sequences
JOURNAL PLoS Comput Biol 1 (2), 166-175 (2005)
PUBMED 16110336
REFERENCE 7 (bases 1 to 5683)
AUTHORS Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
Celniker,S.E., Rubin,G.M. and Karpen,G.H.
TITLE Heterochromatic sequences in a Drosophila whole-genome shotgun
assembly
JOURNAL Genome Biol 3 (12), RESEARCH0085 (2002)
PUBMED 12537574
REFERENCE 8 (bases 1 to 5683)
AUTHORS Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
Rubin,G.M., Ashburner,M. and Celniker,S.E.
TITLE The transposable elements of the Drosophila melanogaster
euchromatin: a genomics perspective
JOURNAL Genome Biol 3 (12), RESEARCH0084 (2002)
PUBMED 12537573
REFERENCE 9 (bases 1 to 5683)
AUTHORS Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
Rubin,G.M. and Lewis,S.E.
TITLE Annotation of the Drosophila melanogaster euchromatic genome: a
systematic review
JOURNAL Genome Biol 3 (12), RESEARCH0083 (2002)
PUBMED 12537572
REFERENCE 10 (bases 1 to 5683)
AUTHORS Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
Gibbs,R.A. and Rubin,G.M.
TITLE Finishing a whole-genome shotgun: release 3 of the Drosophila
melanogaster euchromatic genome sequence
JOURNAL Genome Biol 3 (12), RESEARCH0079 (2002)
PUBMED 12537568
REFERENCE 11 (bases 1 to 5683)
AUTHORS Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
TITLE The genome sequence of Drosophila melanogaster
JOURNAL Science 287 (5461), 2185-2195 (2000)
PUBMED 10731132
REFERENCE 12 (bases 1 to 5683)
AUTHORS Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
Smith,E., Yu,C. and Rubin,G.
CONSRTM Berkeley Drosophila Genome Project
TITLE Drosophila melanogaster release 4 sequence
JOURNAL Unpublished
REFERENCE 13 (bases 1 to 5683)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (20-DEC-2023) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 14 (bases 1 to 5683)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 15 (bases 1 to 5683)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 16 (bases 1 to 5683)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (20-APR-2020) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 17 (bases 1 to 5683)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (22-APR-2019) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 18 (bases 1 to 5683)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 19 (bases 1 to 5683)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 20 (bases 1 to 5683)
AUTHORS Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
Park,S., Svirskas,R. and Karpen,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 21 (bases 1 to 5683)
AUTHORS Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
Svirskas,R. and Rubin,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 22 (bases 1 to 5683)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
CONSRTM Drosophila Heterochromatin Genome Project
TITLE Direct Submission
JOURNAL Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE 23 (bases 1 to 5683)
AUTHORS Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
TITLE Direct Submission
JOURNAL Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
Rockville, MD 20850, USA
COMMENT REVIEWED REFSEQ: This record has been curated by FlyBase. This
record is derived from an annotated genomic sequence (NC_004354).
On Jul 15, 2014 this sequence version replaced NM_001103523.1.
##Genome-Annotation-Data-START##
Annotation Provider :: FlyBase
Annotation Status :: Full annotation
Annotation Version :: Release 6.54
URL :: http://flybase.org
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..5683
/organism="Drosophila melanogaster"
/mol_type="mRNA"
/db_xref="taxon:7227"
/chromosome="X"
/genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
gene 1..5683
/gene="Chc"
/locus_tag="Dmel_CG9012"
/gene_synonym="CG9012; chc; CHC; chc1; Cla; Clh; CLH;
D-Chc; dCHC; Dmel\CG9012; l(1)13Fb; l(1)G0438; l(1)VI"
/note="Clathrin heavy chain"
/map="13F5-13F7"
/db_xref="FLYBASE:FBgn0000319"
/db_xref="GeneID:32537"
CDS 361..5397
/gene="Chc"
/locus_tag="Dmel_CG9012"
/gene_synonym="CG9012; chc; CHC; chc1; Cla; Clh; CLH;
D-Chc; dCHC; Dmel\CG9012; l(1)13Fb; l(1)G0438; l(1)VI"
/note="CG9012 gene product from transcript CG9012-RE;
CG9012-PE; Chc-PE; clathrin heavy-chain; clathrin heavy
chain; clathrin hc"
/codon_start=1
/product="clathrin heavy chain, isoform E"
/protein_id="NP_001096993.1"
/db_xref="FLYBASE:FBpp0111709"
/db_xref="GeneID:32537"
/db_xref="FLYBASE:FBgn0000319"
/translation="MTQPLPIRFQEHLQLTNVGINANSFSFSTLTMESDKFICVREKV
NDTAQVVIIDMNDATNPTRRPISADSAIMNPASKVIALKAQKTLQIFNIEMKSKMKAH
TMNEDVVFWKWISLNTLALVTETSVFHWSMEGDSMPQKMFDRHSSLNGCQIINYRCNA
SQQWLLLVGISALPSRVAGAMQLYSVERKVSQAIEGHAASFATFKIDANKEPTTLFCF
AVRTATGGKLHIIEVGAPPNGNQPFAKKAVDVFFPPEAQNDFPVAMQVSAKYDTIYLI
TKYGYIHLYDMETATCIYMNRISADTIFVTAPHEASGGIIGVNRKGQVLSVTVDEEQI
IPYINTVLQNPDLALRMAVRNNLAGAEDLFVRKFNKLFTAGQYAEAAKVAALAPKAIL
RTPQTIQRFQQVQTPAGSTTPPLLQYFGILLDQGKLNKFESLELCRPVLLQGKKQLCE
KWLKEEKLECSEELGDLVKASDLTLALSIYLRANVPNKVIQCFAETGQFQKIVLYAKK
VNYTPDYVFLLRSVMRSNPEQGAGFASMLVAEEEPLADINQIVDIFMEHSMVQQCTAF
LLDALKHNRPAEGALQTRLLEMNLMSAPQVADAILGNAMFTHYDRAHIAQLCEKAGLL
QRALEHYTDLYDIKRAVVHTHMLNAEWLVSFFGTLSVEDSLECLKAMLTANLRQNLQI
CVQIATKYHEQLTNKALIDLFEGFKSYDGLFYFLSSIVNFSQDPEVHFKYIQAACKTN
QIKEVERICRESNCYNPERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNLQ
KYIEIYVQKVNPSRLPVVVGGLLDVDCSEDIIKNLILVVKGQFSTDELVEEVEKRNRL
KLLLPWLESRVHEGCVEPATHNALAKIYIDSNNNPERYLKENQYYDSRVVGRYCEKRD
PHLACVAYERGLCDRELIAVCNENSLFKSEARYLVGRRDAELWAEVLSESNPYKRQLI
DQVVQTALSETQDPDDISVTVKAFMTADLPNELIELLEKIILDSSVFSDHRNLQNLLI
LTAIKADRTRVMDYINRLENYDAPDIANIAISNQLYEEAFAIFKKFDVNTSAIQVLID
QVNNLERANEFAERCNEPAVWSQLAKAQLQQGLVKEAIDSYIKADDPSAYVDVVDVAS
KVESWDDLVRYLQMARKKARESYIESELIYAYARTGRLADLEEFISGPNHADIQKIGN
RCFSDGMYDAAKLLYNNVSNFARLAITLVYLKEFQGAVDSARKANSTRTWKEVCFACV
DAEEFRLAQMCGLHIVVHADELEDLINYYQNRGYFDELIALLESALGLERAHMGMFTE
LAILYSKFKPSKMREHLELFWSRVNIPKVLRAAESAHLWSELVFLYDKYEEYDNAVLA
MMAHPTEAWREGHFKDIITKVANIELYYKAIEFYLDFKPLLLNDMLLVLAPRMDHTRA
VSYFSKTGYLPLVKPYLRSVQSLNNKAINEALNGLLIDEEDYQGLRNSIDGFDNFDNI
ALAQKLEKHELTEFRRIAAYLYKGNNRWKQSVELCKKDKLYKDAMEYAAESCKQDIAE
ELLGWFLERDAYDCFAACLYQCYDLLRPDVILELAWKHKIVDFAMPYLIQVLREYTTK
VDKLELNEAQREKEDDSTEHKNIIQMEPQLMITAGPAMGIPPQYAQNYPPGAATVTAA
GGRNMGYPYL"
ORIGIN
1 atatcggcat taattgtata actttcacac ctctaccttg tcttgcagtc agtatttagt
61 caaacaaaag aaaaataaac tacatttgtt agagcttgat aagagctcag tttgtcttca
121 gtgtgtgtgc ccgtcatcta gcaacaaacc gacctaaacg ttccaatgga aaccgaatct
181 taaaccagaa accgactaaa ctcgaacgca attaaacaac aataaatcct cgaaacaaat
241 caaatacaaa ctgttaccaa gtgtgcaatt caagaaaaca tcgtattggc attgttttcg
301 caaagataca tttacgttgt aaaggtctgt caataactag acctcactgt agtagtaaag
361 atgacgcaac cactgcccat ccgctttcag gagcatttac agctcacaaa tgtcgggatc
421 aacgccaatt cattttcatt cagcacgctc acgatggaat cggataagtt tatttgcgtg
481 cgggagaagg tgaatgatac cgcccaagtg gtcatcattg acatgaacga tgccaccaat
541 cccacgcggc gtcccatctc agcggattcg gctatcatga atccagctag caaggtcatt
601 gcgctcaaag cgcaaaagac gttgcagatc ttcaacattg agatgaagtc gaagatgaag
661 gcgcatacca tgaacgagga tgtggtgttc tggaagtgga tctccctcaa tacgctagct
721 ctggtcacag aaacgagtgt gttccattgg tccatggagg gcgattcgat gccgcaaaag
781 atgtttgacc ggcattcatc gctgaacggt tgtcaaatca tcaactaccg ctgcaatgcc
841 tcccagcagt ggctactctt ggtcggcatt tcggcactgc caagtcgcgt tgccggtgcc
901 atgcagttgt attcggtgga gcgtaaggtg tcccaggcga tcgaggggca tgcggccagt
961 tttgccacgt ttaaaatcga tgccaacaag gagccgacca cgctgttctg ctttgcagtt
1021 cgtacggcca ctgggggcaa gcttcatatt atcgaagttg gtgctccgcc gaacggcaat
1081 cagccgtttg ccaagaaggc tgtcgatgtc ttctttccgc cggaagcaca aaatgatttc
1141 cctgtggcta tgcaagtctc tgccaagtac gacaccatct acttgataac caagtatgga
1201 tacatacatc tgtatgacat ggagacggcc acgtgcatat acatgaatcg tatatcggcc
1261 gatacgatct ttgttaccgc accgcatgag gcaagtggtg gcatcattgg cgtcaatcgc
1321 aagggacagg tcctctccgt gaccgtcgac gaggagcaga tcattcccta catcaacacc
1381 gttctgcaga atcccgattt ggcccttcgc atggccgtgc gcaacaattt ggctggtgcc
1441 gaagatctct ttgtgcgaaa gttcaacaag ctctttacag ccggccagta tgctgaagcg
1501 gctaaagttg ctgccctggc acccaaggcc attctgcgta cgccacagac gatccagcgt
1561 ttccaacagg tgcagacacc agctggctcc acgactccgc cgctgctgca atactttggc
1621 attctcctcg accagggcaa gctgaacaag ttcgagtctc tcgagctgtg ccgtcccgtc
1681 ttgctgcagg gcaagaagca gctgtgcgag aagtggctga aggaggagaa gttggaatgc
1741 agcgaggagt tgggtgatct ggtcaaggcc tccgatctta cacttgccct gtccatctat
1801 ctgcgcgcaa atgtgcccaa caaggttatc caatgctttg ctgagactgg gcagttccag
1861 aagattgtac tctacgccaa gaaggtcaac tatacgcccg attacgtgtt cctgctgcgc
1921 tccgtgatgc gaagcaaccc ggagcaagga gctggtttcg cctctatgtt ggtggccgag
1981 gaagagccac tggcggacat caatcagatt gtggacatct tcatggagca ctccatggtg
2041 cagcagtgca ctgcattcct gctggacgcc ctcaagcata accgtcccgc cgagggtgcc
2101 ctccagacgc gcctgctgga aatgaatctg atgtctgctc cgcaggtggc cgacgccatc
2161 ctgggcaatg ctatgttcac ccactacgat cgggcccaca ttgcccagct gtgcgagaag
2221 gctggactgc tccagcgcgc cctcgagcac tacacggatc tgtatgacat taagcgggcc
2281 gttgtgcaca cgcacatgct gaatgccgaa tggctggtca gtttctttgg cacgctgtcg
2341 gtggaggact cgctggaatg tctgaaggca atgcttacgg cgaatttgcg ccagaacttg
2401 cagatctgtg tgcagattgc caccaagtat cacgaacagc tgaccaacaa ggcactgatt
2461 gatctgttcg aaggtttcaa gagctacgac ggactgttct acttcctgag cagcattgtc
2521 aacttctcac aggatcccga agtgcacttc aaatacattc aggcggcatg caagactaat
2581 cagattaagg aggtggagcg aatttgccgt gaatcaaact gctacaatcc cgaacgggtg
2641 aagaacttct tgaaggaggc caagctgacg gatcagctac cattaattat tgtttgtgat
2701 cgttttgatt tcgtgcacga cttggtgctt tacctgtatc gtaacaatct gcagaagtac
2761 attgagatct atgtgcagaa agtgaatcca tcccgcttgc cagtggtagt gggtggtctt
2821 cttgatgttg attgcagtga ggatataatt aaaaatctaa ttctcgtggt caagggacaa
2881 ttctcaaccg acgaactggt cgaggaggtc gagaagcgca accgtctcaa gcttctcctt
2941 ccctggctgg agtcccgagt tcacgagggc tgcgtcgagc cagccaccca caacgcgttg
3001 gccaagatct acattgactc gaacaacaat cccgagagat atcttaagga gaatcagtac
3061 tacgatagcc gtgtggtcgg tcgctactgc gagaagcggg atccccattt ggcgtgtgtc
3121 gcctacgagc gtggattgtg cgatcgcgag ctgatcgccg tttgtaacga gaattctctg
3181 ttcaagagcg aagcacgcta cttggttggt cgccgcgacg ccgaactctg ggccgaggtc
3241 ctttcggaga gcaatccata caagcgccag ttgatcgatc aggtggtaca gaccgctttg
3301 tccgagaccc aggatcccga tgacatctct gtaacggtca aagcattcat gaccgccgat
3361 ttgcccaatg agctgatcga acttctcgag aagattattc tggactcgtc cgtctttagc
3421 gaccatcgca atctgcaaaa cttgctcatt ctcacagcca tcaaggctga tcgcacccga
3481 gtcatggact acattaaccg gctggagaac tacgatgcac cggacatcgc gaacattgcg
3541 atcagtaatc agttgtacga agaagccttc gccatcttca agaagttcga tgtgaacaca
3601 tcggccattc aggtgctcat cgatcaagtg aacaacctgg agcgggctaa cgagttcgcc
3661 gagcggtgca atgagccggc cgtttggtcg cagctggcca aggcccaact gcagcagggt
3721 ctggtcaagg aggctatcga ctcgtacatc aaggctgatg atccgagcgc ttacgtcgat
3781 gtcgtcgatg tggccagcaa ggtggagtct tgggatgacc tcgttcgcta tctgcaaatg
3841 gcacgcaaga aggcgcgcga atcttacatc gagagcgaat tgatctatgc ctatgcgcgc
3901 actggacgtc tggccgatct ggaggagttc atttcgggtc ccaaccatgc cgatatccag
3961 aagattggca accgttgctt cagcgacggc atgtacgatg cagcgaagct actgtacaac
4021 aatgtgagca actttgcccg tctggccatc actttggtct acctgaagga gttccaaggg
4081 gccgtggact cggcgcggaa agccaactca acgcgcacat ggaaggaggt gtgcttcgcc
4141 tgcgtggacg ccgaggagtt taggctagct cagatgtgcg gcctgcacat tgtggtgcat
4201 gccgatgagc tagaggatct gattaactac tatcaaaacc gtggatactt cgatgaattg
4261 attgcgctac tcgagtcggc tttggggctg gaacgtgcgc acatgggaat gttcaccgaa
4321 ttagctatac tttattcaaa attcaaacct tccaaaatgc gcgaacactt ggagctgttc
4381 tggtctcgcg ttaacattcc aaaagttctg cgtgccgctg aatcggctca cttgtggtcg
4441 gagctggtgt tcctgtacga taagtacgag gagtacgata acgccgtcct ggccatgatg
4501 gctcatccca cggaggcgtg gcgcgagggg cactttaagg acattatcac caaggtagcc
4561 aacattgagc tgtactacaa ggctatcgaa ttctatttgg acttcaagcc gctgctgttg
4621 aacgacatgc tgctcgtgct ggcacccagg atggatcaca ctcgtgctgt tagttacttc
4681 tccaaaaccg gctatttgcc actcgtcaag ccttatctgc gttcagtcca atctctcaat
4741 aacaaggcaa tcaacgaagc cctgaacgga ctcttaatcg acgaggagga ctaccagggt
4801 ctgcgcaatt cgatcgatgg atttgataac tttgacaaca ttgcgttggc acagaaactc
4861 gaaaagcacg aacttaccga attccgtagg attgccgcct acttgtacaa gggaaataat
4921 cgctggaaac agagcgttga gctctgcaaa aaggataaac tctacaagga tgctatggag
4981 tacgccgccg aatcttgcaa gcaagatatt gccgaagagt tgttgggttg gttcctagaa
5041 cgtgacgctt acgattgttt tgcagcttgt ctttatcagt gttacgactt gctgcgccct
5101 gatgttatct tggagttggc ctggaaacac aaaatcgttg actttgccat gccctatttg
5161 attcaggttc tgcgcgaata cacaacaaag gtggacaaac tggagttgaa cgaggctcag
5221 cgcgagaagg aggacgattc cactgagcac aaaaacatta ttcagatgga gccacaactg
5281 atgatcaccg ctggcccagc aatgggcatt cctccacaat atgcacagaa ttatccacct
5341 ggtgcagcaa cggtaacggc ggcaggagga cgcaacatgg gctatcccta cttgtaggac
5401 ttgcgcccga taatgagatc atcaaaacaa ctttaaaaac aaatgtatca gcattaaagg
5461 aataccaata aaataagaaa aataatctat taattgcatg tgcgccaaaa attaccaaaa
5521 cgaaacaaca cagtaaacat aactttgaac atcattatat taaagcaatg caattaacaa
5581 tccaattttt ggtttgtttg tagacagtta aaaattgatt atatgttttg ataattacag
5641 cactctttat tcaaatataa cagaatatat tactttttat cat
//