Comment[ArrayExpressAccession] E-GEOD-80300 MAGE-TAB Version 1.1 Public Release Date 2016-06-24 Investigation Title A comparative analysis of 5-azacytidine and zebularine induced DNA demethylation using whole-genome bisulfite sequencing Comment[Submitted Name] A comparative analysis of 5-azacytidine and zebularine induced DNA demethylation using whole-genome bisulfite sequencing Experiment Description The non-methylable cytosine analogs, 5-azacytidine and zebularine, are widely used to disrupt DNA methyltransferase activity and reduce genomic DNA methylation. In this study, whole-genome bisulfite sequencing is used to construct maps of DNA methylation with single base pair resolution in Arabidopsis thaliana seedlings treated with each demethylating agent. We find that 5-azacytidine and zebularine-treated seedlings have nearly indistinguishable patterns of DNA methylation genome-wide and that 5-azacytidine, despite being more unstable in aqueous solution, has a slightly greater demethylating effect at higher concentrations across the genome. Transcriptome analyses revealed a substantial number of up-regulated genes and transposable element genes, particularly CACTA-like elements, demonstrating that chemical demethylating agents have a disproportionately large effect on loci that are silenced by DNA methylation. Bisulfite-Seq and RNA-Seq Term Source Name ArrayExpress EFO Term Source File http://www.ebi.ac.uk/arrayexpress/ http://www.ebi.ac.uk/efo/efo.owl Person Last Name Schmitz Schmitz Person First Name Robert Robert Person Mid Initials J J Person Email schmitz@uga.edu Person Affiliation University of Georgia Person Address Genetics, University of Georgia, B416 Davison Life Sciences, Athens, GA, USA Person Roles submitter Protocol Name P-GSE80300-3 P-GSE80300-2 P-GSE80300-1 Protocol Description Data were mapped to the TAIR 10 MethylC-seq analysis were performed as described in Schmtiz et al. Epigenome-wide inheritance of DNA methylation variants in a recombinant inbred population. Genome Research (2013). Genome_build: TAIR 10 Supplementary_files_format_and_content: MethylC-seq data (marked as “allc_sampel.tsv”) column 1 = chr = chromosome; column 2 = pos = coordinate for the cytosine position on the chromosome; column 3 = strand = + or - strand; column 4 = mc_class = context of the cytosine and the two following bases from the same strand; column 5 = mc_count = number of reads supporting a methylated cytosine; column 6 = total = total number of reads at that position; column 7 = methylated = cytosine is considered methylated if there is a 1 or unmethylated if there is a 0 Supplementary_files_format_and_content: RNA-seq data (marked as gene_exp.diff) column 1 = Tested id = A unique identifier describing the transcipt, gene, primary transcript, or CDS being tested; column 2 = gene = The gene_name(s) or gene_id(s) being tested; column 3 = locus = Genomic coordinates for easy browsing to the genes or transcripts being tested.; column 4 = sample 1 = Label (or number if no labels provided) of the first sample being tested; column 5 = sample 2 = Label (or number if no labels provided) of the second sample being tested; column 6 = Test status = Can be one of OK (test successful), NOTEST (not enough alignments for testing), LOWDATA (too complex or shallowly sequenced), HIDATA (too many fragments in locus), or FAIL, when an ill-conditioned covariance matrix or other numerical exception prevents testing.; column 7 = FPKMx = FPKM of the gene in sample x; column 8 = FPKMy = FPKM of the gene in sample y; column 9 = log2(FPKMy/FPKMx) = The (base 2) log of the fold change y/x; column 10 = test stat = The value of the test statistic used to compute significance of the observed change in FPKM; column 11 = p = The uncorrected p-value of the test statistic; column 12 = q = The FDR-adjusted p-value of the test statistic; column 13 = significant = Can be either “yes” or “no”, depending on whether p is greater then the FDR after Benjamini-Hochberg correction for multiple-testing Tissue was flash frozen in liquid nitrogen for all experiments including RNA-seq and MethylC-seq. DNA was isolated using a Qiagen Plant DNeasy kit (Qiagen, Valencia, CA) following the manufacturer’s recommendations. RNA was isolated using TRIzol (Thermo Scientific, Waltham, MA) following the manufacturer’s instructions. RNA-seq: RNA-seq libraries were constructed using Illumina TruSeq Stranded RNA LT Kit (Illumina, San Diego, CA) following the manufacturer’s instructions with limited modifications. The starting quantity of total RNA was adjusted to 1.3 µg, and all volumes were reduced to a third of the described quantity. MethylC-seq libraries were constructed using the MethylC-seq protocol (Urich et al. 2015). Agarose gel (Ameresco) with added half-strength Linsmaier and Skoog nutrients (Caisson Laboratories, Inc) was prepared and autoclaved. 5-azacytidine (Sigma) and zebularine (APExBIO) were dissolved in dimethyl-sulfoxide (DMSO) and water, respectively, before being added to the liquefied cooling agar at final concentrations of 25 µM, 50 µM and 100 µM. Columbia-0 (col-0) background Arabidopsis thaliana seeds were then subjected to a ethanol-based seed sterilization and approximately 30 seeds were plated on the solid agar. As a control, seeds were plated on agar containing DMSO with no chemical demethylating agent (AZA mock-treated control) and agar containing no DMSO or chemical demethylating agent (untreated control). Three replicates of each treatment type were simultaneously plated and grown in parallel. After a two-day stratification period at 4ºC, the seeds were removed to room temperature and allowed to grow for 8 days under constant light. Protocol Type normalization data transformation protocol nucleic acid library construction protocol growth protocol Experimental Factor Name treatment ecotype Experimental Factor Type treatment ecotype Comment[SecondaryAccession] GSE80300 Comment[GEOReleaseDate] 2016-06-24 Comment[ArrayExpressSubmissionDate] 2016-04-14 Comment[GEOLastUpdateDate] 2016-06-27 Comment[AEExperimentType] RNA-seq of coding RNA Comment[AEExperimentType] methylation profiling by high throughput sequencing Comment[AdditionalFile:Data1] GSE80300_gene_exp.diff Comment[SecondaryAccession] SRP073326 Comment[SequenceDataURI] http://www.ebi.ac.uk/ena/data/view/SRR3384734-SRR3384748 SDRF File E-GEOD-80300.sdrf.txt