LOCUS NM_206610 9439 bp mRNA linear INV 26-DEC-2023
DEFINITION Drosophila melanogaster Protostome-specific GEF (PsGEF), transcript
variant E, mRNA.
ACCESSION NM_206610
VERSION NM_206610.5
DBLINK BioProject: PRJNA164
BioSample: SAMN02803731
KEYWORDS RefSeq.
SOURCE Drosophila melanogaster (fruit fly)
ORGANISM Drosophila melanogaster
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE 1 (bases 1 to 9439)
AUTHORS Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
Strelets,V., Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: Impact of
High-Throughput Data
JOURNAL G3 (Bethesda) 5 (8), 1721-1736 (2015)
PUBMED 26109357
REMARK Publication Status: Online-Only
REFERENCE 2 (bases 1 to 9439)
AUTHORS Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
Russo,S.M. and Gelbart,W.M.
CONSRTM FlyBase Consortium
TITLE Gene Model Annotations for Drosophila melanogaster: The
Rule-Benders
JOURNAL G3 (Bethesda) 5 (8), 1737-1749 (2015)
PUBMED 26109356
REMARK Publication Status: Online-Only
REFERENCE 3 (bases 1 to 9439)
AUTHORS Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
TITLE The Release 6 reference sequence of the Drosophila melanogaster
genome
JOURNAL Genome Res 25 (3), 445-458 (2015)
PUBMED 25589440
REFERENCE 4 (bases 1 to 9439)
AUTHORS Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
TITLE Sequence finishing and mapping of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1625-1628 (2007)
PUBMED 17569867
REFERENCE 5 (bases 1 to 9439)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
TITLE The Release 5.1 annotation of Drosophila melanogaster
heterochromatin
JOURNAL Science 316 (5831), 1586-1591 (2007)
PUBMED 17569856
REMARK Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE 6 (bases 1 to 9439)
AUTHORS Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
Ashburner,M. and Anxolabehere,D.
TITLE Combined evidence annotation of transposable elements in genome
sequences
JOURNAL PLoS Comput Biol 1 (2), 166-175 (2005)
PUBMED 16110336
REFERENCE 7 (bases 1 to 9439)
AUTHORS Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
Celniker,S.E., Rubin,G.M. and Karpen,G.H.
TITLE Heterochromatic sequences in a Drosophila whole-genome shotgun
assembly
JOURNAL Genome Biol 3 (12), RESEARCH0085 (2002)
PUBMED 12537574
REFERENCE 8 (bases 1 to 9439)
AUTHORS Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
Rubin,G.M., Ashburner,M. and Celniker,S.E.
TITLE The transposable elements of the Drosophila melanogaster
euchromatin: a genomics perspective
JOURNAL Genome Biol 3 (12), RESEARCH0084 (2002)
PUBMED 12537573
REFERENCE 9 (bases 1 to 9439)
AUTHORS Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
Smith,C.D., Tupy,J.L., Whitfield,E.J., Bayraktaroglu,L.,
Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
Rubin,G.M. and Lewis,S.E.
TITLE Annotation of the Drosophila melanogaster euchromatic genome: a
systematic review
JOURNAL Genome Biol 3 (12), RESEARCH0083 (2002)
PUBMED 12537572
REFERENCE 10 (bases 1 to 9439)
AUTHORS Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
Gibbs,R.A. and Rubin,G.M.
TITLE Finishing a whole-genome shotgun: release 3 of the Drosophila
melanogaster euchromatic genome sequence
JOURNAL Genome Biol 3 (12), RESEARCH0079 (2002)
PUBMED 12537568
REFERENCE 11 (bases 1 to 9439)
AUTHORS Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
Weinstock,G.M., Weissenbach,J., Williams,S.M., Woodage,T.,
Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
TITLE The genome sequence of Drosophila melanogaster
JOURNAL Science 287 (5461), 2185-2195 (2000)
PUBMED 10731132
REFERENCE 12 (bases 1 to 9439)
AUTHORS Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
Smith,E., Yu,C. and Rubin,G.
CONSRTM Berkeley Drosophila Genome Project
TITLE Drosophila melanogaster release 4 sequence
JOURNAL Unpublished
REFERENCE 13 (bases 1 to 9439)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (20-DEC-2023) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 14 (bases 1 to 9439)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 15 (bases 1 to 9439)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 16 (bases 1 to 9439)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (20-APR-2020) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 17 (bases 1 to 9439)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (22-APR-2019) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 18 (bases 1 to 9439)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 19 (bases 1 to 9439)
CONSRTM FlyBase
TITLE Direct Submission
JOURNAL Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE 20 (bases 1 to 9439)
AUTHORS Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
Park,S., Svirskas,R. and Karpen,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 21 (bases 1 to 9439)
AUTHORS Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
Svirskas,R. and Rubin,G.
TITLE Direct Submission
JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
64-121, Berkeley, CA 94720, USA
REMARK Direct Submission
REFERENCE 22 (bases 1 to 9439)
AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
CONSRTM Drosophila Heterochromatin Genome Project
TITLE Direct Submission
JOURNAL Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE 23 (bases 1 to 9439)
AUTHORS Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
TITLE Direct Submission
JOURNAL Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
Rockville, MD 20850, USA
COMMENT REVIEWED REFSEQ: This record has been curated by FlyBase. This
record is derived from an annotated genomic sequence (NC_004354).
On Jan 16, 2013 this sequence version replaced NM_206610.4.
##Genome-Annotation-Data-START##
Annotation Provider :: FlyBase
Annotation Status :: Full annotation
Annotation Version :: Release 6.54
URL :: http://flybase.org
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..9439
/organism="Drosophila melanogaster"
/mol_type="mRNA"
/db_xref="taxon:7227"
/chromosome="X"
/genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
gene 1..9439
/gene="PsGEF"
/locus_tag="Dmel_CG43947"
/old_locus_tag="Dmel_CG14045"
/old_locus_tag="Dmel_CG14047"
/gene_synonym="CG14045; CG14047; CG43947; Dmel\CG43947;
DmPsGEF; EG:BACH48C10.4; EG:BACH7M4.1; EG:BACH7M4.2;
FBgn0264598; null; psGEF; PsGef"
/note="Protostome-specific GEF"
/map="3A2-3A2"
/db_xref="FLYBASE:FBgn0264598"
/db_xref="GeneID:31224"
CDS 260..8593
/gene="PsGEF"
/locus_tag="Dmel_CG43947"
/old_locus_tag="Dmel_CG14045"
/old_locus_tag="Dmel_CG14047"
/gene_synonym="CG14045; CG14047; CG43947; Dmel\CG43947;
DmPsGEF; EG:BACH48C10.4; EG:BACH7M4.1; EG:BACH7M4.2;
FBgn0264598; null; psGEF; PsGef"
/note="CG43947 gene product from transcript CG43947-RE;
CG43947-PE; PsGEF-PE; protostome-specific GEF"
/codon_start=1
/product="Protostome-specific GEF, isoform E"
/protein_id="NP_996333.4"
/db_xref="FLYBASE:FBpp0305709"
/db_xref="GeneID:31224"
/db_xref="FLYBASE:FBgn0264598"
/translation="MPTMTRMHRHSSSSAVVEESRGRRRGVGVPGVGDANKENFGVHF
MSSPFGNASLIALQDLSNVHGKSPQRRSFSEGSGPRQATPQLAALRCLPRTTGGAVAL
EDSQLSSSRMGDTTLDRMLDAIIESARKEVRCTKTLPGAGATTSTTILADNAEWSETS
VHEMEVRTPTHLKRQRVVRRKNPHKTTTINQTQSHQAKKLEPLQLVPSTKRCLSFSSS
SASSDLDEDEQQVAKRSSLASPTTTPPSHCTTTTSSISSSSSSNGGADMEASQRGSID
VSIAFDAKEQQLNVHVIRCRDLQRSHGSGNGSINAYVKVALSGGAQPPGYGGHSSGGS
MSSGYQRTAVHRHSGRPYFDQRFNFQISSGEETAGQYLQLAVWHRDRHLNSVPVTQSE
WQARVKSGTERRTAERLAKLENAGNEAKRSEFLGCSTFPLNELVHPDSGVSAGSYKLH
AQACPPPTSRHSQPKANAGQDQKQETEAAKDQDKQQDDSPAEMAAAVTVTPQKPVATG
SGSGSNSAAMALNDEVISISIDSMGKEDPLLPQQPQQPMKLSKKALHQRDADENLFLR
FLELDPPADGNANSTTTQAQATGSQSSASKANESNANHLNNGTSGGRRQSTMPNSGGS
SVGGAVRQQQGRTPFTMTKRLTRTEERGFGFSIVWTHPPRVEKIEAGLSADRCGILPG
DYVIFVDKHNVVTMPEADVLNLIRSQGSSLTLEIFRRSGAGATTITSTDLGQNNVHIS
TRLGAAVGLGSEEHTLATTATGTTTVMSLQRTTSTRIQPLANSMSRPATACSGTTSSI
EAAKRRLHLPQVTFSKESIVPVTDNRRRFLLQLISREQNFTAALHFGVDRFVQPLGER
KDLISPNDHRTLFQNIDELLRIAEDILEQLCSSDQQDQEPQMNFASRVYLSKTTAICA
AYKKYCNGIKRADCVLVNKSRQTGSEFIAFITEPAVPRKRPDLTMFIHRPLQHFREIL
KLMQLLAGNCHVDTEEHKNFSTVINELQAAYREITVSSGLMEPLGEGRPLLTLQDLES
RMVFTKCKPFTLAVQGRQWIFGGDLSRVEGRSVKPYWTLLFSDIIVFAKVSRDRVLFI
TEEPIPIANVVDSCFHMRKKTTEFRLTVDPNGRLAESPTGYCAPDLTRTPKRGARRKS
LILRAPSLELKAVWQNLLQRQIFLVNAALGSTPLSSPLDSPDVLNTLVPLSDIGLTTA
SMGSMKLPSLDSIHLKQQQKQQRTNNSHGHVHSATGVATDRTADRSHGHNHGIHGGHV
EQIELLIDEKCRILNKTGTPKSSALHLANWMKGQLDKQQQQARLAAIARSQENVSAED
EQLIFNSDSEQDERITYWTRQQLEKRTKELNLAKENGGLSARPNFGGKRLSGVEELSM
SATSDIYSTSEAEGVTSVISQSHSTTSDSQITVRSSPIVLDKLAVCRHCHKNCQQSGP
GGARPGGGGGVNNSSSTPVLLCNSLKVQHSQSSPNRCCKLSNGIAGATEMANPKNSNR
TGTGTGSVTSMSSSTITGELSSKVVASSSRRETGDTESDVAQLITDEISQSQSEATDD
SVGAMPNSSRSAGEGNPVITGSRLRHLDPPVSVSSLPNGHCSAGVGGGSPKPLPPPRR
TRVVAQMCQEPPRPKANQQVKAIVTITETARIMRSSPTKGSGSGSATGTGSVGGSPAK
YAACHCRCTPEDFTHAQLSPDKMEHQTCKLLESPKQTTTTLEQKQEDEQLSLMLIGLA
QLAPAATLCGQERIPKEENSTPTIAVVPPTPDSVLTKTSTHVWDNSGCSSNGGGTSTT
STATTTTKQPRQAIIENIPEDSCDESPLDEEPPYRPMSSALRRFGTMSSLEKLPSDDR
MDEADELDDDLDSFKPNGHDGNSPQNEEDDEDADSERALVQNDLGASSSSGVIVNGEG
LASGAWTNRAGAYVSDKMSFFEESRAFIDKYLGRWNAGDAQQQHQQGTASETDEQMDE
CTSGATSGEEVWGTPTSGGDNDDQDLQLINSENTHSSPTKSSTSLNDDDDTELMMDEL
LMAPPMTASTIRGLLPRFYRRRLEPLFEEETESDEEKTQQDSDDIKKGEATTNGHYLE
DSAGSSSDSESVPPAPPLVPRPVVPDREEDPSAEGQQENLAMEDQEDLHSSVERLLPN
LATSAMGPRPLLTTRLLPTTAIDPQPPAVLVASSCEYLLDQRPSSVPGRDLAGGSIGT
TTMTSSRPTPLSPAKAAATPIGMNATMTTKTTTTTMMTPSRAPSSTISTISSASDTTS
SGGRASGRPPRFVPPPPPPRRLLLTQTDLIAGQAKTELPRKKSGATADSGCSVAVGDA
ASASDPTAIGSRTMPSTPSTSSLASAVAPKRPSSTIIETCLLGAPRPASATSSATASR
MAKPIRQSPPTQHQSQPRGQSCPAAVPAPSTAAPDAPTVPMPHAGCGLSPRLEMRLAL
NHDILGDEDLICYEPGPDLTTILGHDLSTFHRLTGRDLLSRSATNRVQPKEAVISYSQ
QRNSKMDTPTVNRRPRPLSAGALGGSTSNSNPNNSRSSSSHGNSPLGGGLTRAGVEPV
GQQQLVVQAGGGSAGRVGGGSGSAGGNSTACSSNRGGSKLGDLEILARREKIYSMSQS
RSGCRVRETTSTTTLMAMATEMRSGMDSEKRDSSWPKRAAAAASLTRTRSSTSMDGAG
AISTTTTTTTTMTEEAFLETEVGRSKRLINFIKRRNSEITASSTSSTPNFSDGAIGEQ
PQPSHSQSSLLLQKLPPGQSQAQDSVEMAQMSPRKDNDKPSLNRRLWKQITKRRRTNS
VSQIVAG"
ORIGIN
1 atttccgttt ctgagctcgt gctaagcaga cacgtgaact tcatagcaga aaagaacgaa
61 aatacctttg gcttgaagaa attagttggc ttattttttt tttcgtgtaa aagatctatc
121 aaaatttgtg tgccgccgac gcgcaaggat cacattttta aacaaaatca gtgctgttaa
181 tcaagttgaa cctaagctaa atcgagttgc agttcgccag gcgacgtagt cgcaaaaaga
241 tcaagttacc agtagggaca tgccaaccat gacacggatg catcgccact ccagttccag
301 tgccgtggtg gaggagtcgc gtggccggcg ccgtggagtg ggcgtgccag gagtcggaga
361 cgcgaacaag gagaatttcg gggtgcactt catgagcagt cccttcggca atgccagttt
421 gattgccctg caggatttga gcaacgtgca cggaaagagt ccgcagcgca ggagtttcag
481 cgagggcagt ggtccgcggc aggccacgcc acaattggcg gcactgcgct gcttgccacg
541 caccaccggt ggtgcagttg ctctggagga ttctcagctc tcctcatcgc gaatgggcga
601 taccacactg gaccgcatgc tggatgccat catagagtcg gccaggaagg aggtgaggtg
661 tacgaagaca ctgccaggcg ccggcgccac cacttcgacc accattttgg cggataatgc
721 cgagtggagc gagaccagtg tccacgaaat ggaggtgcgc acgcccaccc acttgaagcg
781 gcagcgggtg gtgcggcgca agaacccaca caagactacc accattaacc aaacgcaaag
841 tcaccaagcc aagaagttgg agccactgca gttggttccc agcaccaaac gctgcctgag
901 cttctcatcc agttccgcct cgagtgattt ggacgaggat gagcagcagg tggccaagag
961 gagttcgctg gcctccccca ccaccacacc tccctcccac tgcaccacca ccaccagcag
1021 catcagcagc agcagcagca gtaatggtgg tgcggacatg gaggccagtc agcggggcag
1081 catcgatgtg agcatcgcat tcgatgccaa ggagcagcaa cttaatgtac atgtgattcg
1141 ctgtcgcgac ttgcagcgct cccatggcag tggaaacggc agcatcaacg cctacgtcaa
1201 ggtggccctg tccggaggag cccagccacc cggatacggt ggccactcgt cggggggaag
1261 catgagttcc gggtaccaac gcaccgccgt ccatcggcac tcgggccgcc cgtacttcga
1321 tcagcgattc aacttccaga tttccagtgg cgaggagacc gctggacagt atcttcagtt
1381 ggccgtgtgg catcgggatc gccatttaaa ctcagttccc gttactcagt ccgagtggca
1441 agctcgcgtg aaaagcggca cagaacggcg aacggcagaa cggctggcga agctcgaaaa
1501 cgcaggaaac gaggcaaaac gcagcgagtt cctgggctgc agcacttttc cactgaatga
1561 gttagtgcat ccggattcgg gcgtctcggc aggatcctac aagctgcatg cacaggcatg
1621 tcctccgcct actagccgcc atagccaacc aaaagccaac gcaggccagg atcaaaagca
1681 ggaaacggaa gcggcgaagg atcaggataa acagcaggac gattcccccg cagaaatggc
1741 agcagccgtc acagtcacgc cccaaaaacc ggttgccact ggatccggtt cgggatcgaa
1801 ttccgcagcg atggcactga acgacgaggt gatcagcatt agcatcgata gcatgggcaa
1861 agaggatcct ctgctacccc agcagcccca gcagcccatg aagctcagca agaaggcact
1921 ccaccagcga gatgccgacg agaatctctt tttaagattc ctggagctgg atccgccagc
1981 ggatggcaat gcaaacagta ccaccaccca ggcacaagcg acgggctccc agtcctcggc
2041 gtcgaaggcc aacgaaagca atgccaacca cttgaacaat gggacatccg gtggacgcag
2101 acagtccacc atgccaaata gcggaggatc tagcgtagga ggcgctgtcc gccagcagca
2161 gggacgtacc ccattcacca tgacgaaaag acttacgcga accgaggaac ggggctttgg
2221 attctctatt gtctggacgc atccgccacg ggtggagaaa atcgaggcgg gactctcggc
2281 cgatcggtgt ggcattttgc ccggcgacta tgtgatcttt gtggacaagc ataacgtcgt
2341 cacgatgccc gaggctgatg ttctgaattt gatacgatcg cagggctcgt ccttgacgtt
2401 ggagatattc agaaggtcag gcgcgggcgc cacaacaatc acgtcgacgg acttgggcca
2461 gaacaatgta cacataagca ccagattggg cgcggcagtg ggcttgggca gcgaggagca
2521 caccctggcc accaccgcaa caggaacgac caccgtaatg agtctgcaaa ggaccaccag
2581 tacgaggatc cagccattgg caaatagcat gagccggccg gctaccgcat gctcgggaac
2641 cacctcctcc atcgaggcgg ccaaaaggag gcttcacctg ccacaggtca ccttcagcaa
2701 ggagtcgatt gtgcctgtca cggacaaccg taggcgcttc ctgctccagc tgattagtcg
2761 ggagcagaac ttcacggctg ccctgcactt cggagtggac cgctttgtgc agccgcttgg
2821 cgaacgaaag gatctgatct cgccgaacga tcatcgcact ttgttccaga acatcgacga
2881 gctgctcagg attgccgagg acatcctgga gcagttgtgc agcagcgatc agcaggacca
2941 ggagccacaa atgaactttg cctcgcgggt ctacctatca aaaactacag cgatctgtgc
3001 ggcatacaag aagtactgca atggaattaa gcgggccgac tgtgttttgg tcaataagtc
3061 ccgacagact ggctccgagt ttatagcctt cattacggaa ccggccgttc cacgtaaaag
3121 acccgatctc accatgttca tccatcgacc actgcagcat tttcgtgaaa tccttaaact
3181 gatgcagttg ctcgctggaa attgccatgt ggatacggag gagcacaaga actttagcac
3241 ggttatcaac gaactgcagg ctgcctatcg ggagatcact gtgagcagtg gtctgatgga
3301 accactggga gaaggtcgcc ccctgctcac cctccaggat ctggagtccc ggatggtctt
3361 taccaaatgc aagcctttta cgttggccgt tcaggggcgc cagtggatct ttggcggcga
3421 tttgtcccgg gtcgaaggtc gctccgttaa gccctattgg acactactat tcagtgatat
3481 catcgttttt gccaaggtca gccgtgatcg agtcctgttt atcacagagg agcccattcc
3541 catcgccaat gtggttgact cttgtttcca tatgcgcaag aaaaccaccg agtttcgtct
3601 cacagtggat cccaatggac gcctggcgga gagtccgacg ggttactgtg ctcccgatct
3661 cacccgcact ccaaagcgag gagcccgccg caagagcctt attttaaggg caccttcgct
3721 ggaacttaag gccgtttggc agaatcttct ccagcggcag atctttctag ttaacgccgc
3781 ccttggctcc acgcccttgt ccagtccctt ggactcgccg gatgttctca acaccctggt
3841 gccgctgagt gatattggtc tgaccaccgc ctccatgggg tcgatgaagt taccatcgct
3901 ggacagcatc cacctgaagc agcagcagaa acagcagcgt accaacaatt cccacggaca
3961 cgtccattcc gccacaggag tggcaaccga tcgaactgcg gatcgctcgc acggccacaa
4021 ccatggcatc cacggtggtc atgtggaaca gatcgagctg ctgatcgacg agaagtgtcg
4081 cattctgaac aagacaggca ctcccaagtc gagcgccttg catctggcca actggatgaa
4141 gggtcagctg gacaagcagc agcagcaggc tcgactggcg gcgatcgcgc gatcccagga
4201 gaatgtgagt gccgaggacg agcaactgat tttcaacagc gatagcgagc aggacgaacg
4261 gatcacctac tggacgcgtc agcagctgga gaagcgtacc aaggagctca atctggccaa
4321 ggaaaatggc gggctatcgg ctaggcccaa ttttgggggc aagcgtctga gtggcgtcga
4381 ggaactgagc atgagcgcca cctcggatat ctactccacg tcggaggcgg agggcgtgac
4441 cagtgtgatc agtcagtcgc atagcaccac atccgacagc cagatcaccg tacgctctag
4501 tcccattgtg ctggacaagc tggccgtttg ccgtcattgc cacaagaatt gccagcagag
4561 cggaccgggt ggagcgcgtc ctggcggcgg aggcggagtc aacaactcca gttccacgcc
4621 cgtcctgctc tgcaattcgt tgaaagtgca gcacagccaa tcgtcgccaa atcgctgctg
4681 caagcttagc aatggtattg cgggagccac ggaaatggct aacccgaaaa acagcaatcg
4741 aactggaact gggacaggaa gcgtgaccag catgtccagc agcactatta cgggggaact
4801 gtcctccaaa gtggtggcca gtagcagtcg tcgggaaacc ggcgacaccg agagcgacgt
4861 ggcccaactg ataaccgatg aaatttccca atcgcaatca gaggccaccg atgacagcgt
4921 cggagcgatg cccaatagca gtagaagtgc cggagaagga aacccagtta tcaccggctc
4981 cagactacgt catctagatc cgccagtctc agtttcatcc ttgcccaatg gccactgctc
5041 cgctggcgtc ggtggtggtt cgccaaaacc gctgccacca ccacgtcgca cccgcgttgt
5101 ggcccaaatg tgccaggagc caccaaggcc aaaggccaac cagcaggtga aggccattgt
5161 gaccattacg gagacggcca ggataatgcg gagcagtcca acgaaaggct cgggttcggg
5221 atcggcaacg ggcacgggct cggtgggtgg ttccccagcc aagtatgccg cctgccactg
5281 tcgctgtact ccagaggact ttacccacgc tcaactctcg ccggacaaaa tggagcacca
5341 gacctgcaag ctactggaat cacccaagca aacgaccacc acactggagc agaaacagga
5401 agacgagcag ttgtccctga tgctcattgg tctggcgcag cttgctccag ctgccacact
5461 ttgtgggcag gaaaggatcc ccaaggagga gaatagtaca cccaccatag ccgtggtgcc
5521 gcccactccc gattcggtgc tcactaaaac cagcactcac gtttgggaca acagtggctg
5581 ctcgtcaaac ggcggaggaa cctcaaccac cagcaccgct accactacaa cgaaacaacc
5641 aaggcaggcc attattgaga atattccaga ggactcgtgt gacgagtccc cattggatga
5701 ggaaccaccc tatcgaccca tgagcagtgc tcttcgccga ttcggcacca tgtcaagctt
5761 agagaaacta ccttccgatg atcgaatgga tgaggccgat gagctagatg atgatttgga
5821 ttcatttaag ccaaatggcc acgacggcaa cagtccccaa aacgaggaag acgacgagga
5881 tgcggactcg gaacgagctt tggtacaaaa cgacctgggg gcctccagtt cgtccggagt
5941 gattgtcaat ggcgagggcc tggccagtgg tgcgtggaca aatcgggcgg gagcctatgt
6001 ctccgacaag atgtccttct tcgaggaatc gcgcgctttc attgataagt acttgggtcg
6061 ctggaatgct ggcgatgcac aacagcagca ccagcaggga accgcctcgg agacggatga
6121 gcaaatggac gagtgcacct cgggagccac cagtggtgaa gaggtctggg gcacacccac
6181 cagtggcggg gataacgatg atcaggactt gcagctgatc aactcggaaa acacacattc
6241 gtcgccaacc aagtcgagca cctccttgaa tgacgacgac gacacggagt tgatgatgga
6301 cgagctgtta atggcgccac ctatgactgc cagcacaatt cggggattgt tgccacgctt
6361 ttacagacgt cgccttgagc ccctttttga ggaggagacg gagagcgacg aagagaagac
6421 gcaacaggac agcgatgaca tcaaaaaggg ggaagcaacg accaatggcc actacctaga
6481 ggattcggct ggaagttcaa gcgacagcga gtcggtgcca ccagccccac cactagttcc
6541 tcggccagtg gtgccggatc gggaggagga tccgtcagcg gagggccagc aggagaatct
6601 ggcgatggag gaccaggagg acttgcactc cagcgtggag cgactgcttc cgaatctggc
6661 cacgagtgcg atggggccgc gtcccttact caccacccgt ctactgccca ccacggctat
6721 cgatcctcag ccgccggcag tactggtggc ctcatcctgc gagtatctgc tcgatcagcg
6781 cccctcgagc gtaccaggac gcgatctcgc cggcgggagc ataggcacca ccacgatgac
6841 atcctctcgg ccgacgcctc tgtctccggc gaaggcagca gcgacaccga tcgggatgaa
6901 tgcgacgatg acaacgaaga cgacgacgac cacgatgatg acgccgagtc gagctccctc
6961 atcgactatt tcaacaatca gcagcgcgag cgacactacc agctccggcg gcagagccag
7021 cggcagacct ccgagatttg taccgccgcc accgcctccg cgtcgcttgc tcctcacgca
7081 gaccgatcta atcgctggtc aggccaaaac ggagctacct cgaaaaaaat ctggcgctac
7141 tgctgacagt ggctgctcgg tggctgtcgg cgacgccgcc agcgcttcgg acccaaccgc
7201 tatcgggtca aggacaatgc catcaacacc gagtaccagc agcttggcgt cagcggtggc
7261 cccaaaacga ccgtcatcaa cgataataga gacctgcctg ctgggggcac ccaggcccgc
7321 ctcagccact tcatcagcaa cagcctcgag aatggcaaag ccgatccgcc agagtccacc
7381 cacccagcac cagtcgcagc cgaggggtca gagctgcccg gctgcagtgc cagcaccatc
7441 gacggctgcg ccagacgcac ccactgtccc aatgccccac gctggatgcg gcctaagccc
7501 aaggctggaa atgcggctgg ccctcaacca cgacattctc ggggacgagg acttgatctg
7561 ctacgagcct ggtcccgatc taacgactat tctggggcac gacctctcca catttcatcg
7621 tctgacgggt cgcgatttat tgagtcgttc agccacgaac agggtgcagc caaaagaggc
7681 tgtcatatcg tactctcaac aacgcaactc aaagatggac acgccgacgg tgaaccgccg
7741 gccccgcccc ctctcagcgg gcgccttggg cggctccacc agcaacagca atccgaacaa
7801 cagtcgcagc agcagcagcc acggcaacag tccgctcggc ggtggtttaa cccgtgccgg
7861 cgtcgagcca gtgggccaac agcagctggt ggttcaagca ggtggtggat ctgctggtcg
7921 agtaggcggc gggagcggaa gcgccggcgg caacagcacc gcctgcagct ccaacagagg
7981 cgggagcaaa ttaggcgatc tggaaatctt agcgagacgc gagaagatat actccatgtc
8041 tcaatcgagg agtggctgtc gagtcaggga gacgacgtcc acgaccactc tgatggccat
8101 ggctacggag atgcgttcgg ggatggattc ggagaaacgc gactcttcct ggcccaaacg
8161 agcagctgca gcagcctctc tgacgaggac gaggagttct acttcgatgg acggagcggg
8221 agccatcagt accacaacca caaccaccac aacaatgact gaggaggcct tccttgagac
8281 cgaagtgggc agatctaagc gcttgataaa ctttataaag cgtcgcaact ccgagataac
8341 cgccagtagc accagcagca cccccaactt ctccgacggc gccattggcg agcagcccca
8401 accgagccac agccagagca gtcttcttct ccagaagctg ccgccgggtc aatcgcaggc
8461 ccaggactca gtggaaatgg cccaaatgtc gccgcgcaag gataacgaca agccatcgct
8521 gaatcgccgc ctctggaagc aaatcaccaa acgcagacgc accaactccg tgtcccaaat
8581 agtcgcaggc taggggagtc accacccgtt gatccgtcga tccgtcgccc ccacttgtcc
8641 cctctgttac tttcactaac cgtgttgtgt ggctagatct tagcctcgat ccgcgattgc
8701 cgctgagcag cagcagcaat cgacgagcga tggcagacaa caatttgcac gggacagccc
8761 gtcgataggc agtaatcaac tgtaaccata tcacggtgat cttcacgggc tgtcccaccg
8821 aaagaactgg caaaagctaa accaaaaact tagtagtatg ttaagaattt tgatacgact
8881 ttgtagctag cgaatccccc ggatcgcaac gataccgggc agctctaggc gtccaattgc
8941 gtccaaaaaa acgataagcg ataagcgata agcgactaaa acgataaaaa tgatagaaat
9001 gataaaagtc gattctcctg ccagcagagc caaggaaccg agtcccagcc aaatatgata
9061 tagacaacga acgttggtag tactgggcac actcactcac tcactcactt caaaatagac
9121 attgtagaca tcaaaaacaa aaaaaaaaaa acttaacaca taacaaataa cacttcagtc
9181 cagccaaatg aattaggatc tagaggagtt cgtaaactga ctatttttta gagatcatga
9241 ttatgtctta gcttagcagt tttgggttta tgttgcggga ttgagacagt gaatttgcct
9301 tacgtatcca atgaactatt gtatgggtag cttaacgtag tagtctttat acacataaaa
9361 tttacagcct cagcagttcg tagtagtcag tcgttagtag ttggtcgtaa taaaagttta
9421 agtacttcaa ttccacgtt
//