Comment[ArrayExpressAccession] E-MTAB-1132 Investigation Title Transcription profiling by array on 123 paired tumor and non-tumor tissue samples from patients with non-small cell lung carcinoma to gain a systems biology insight into the current clinical classification Comment[Submitted Name] Transcription profiling by array on 123 paired tumor and non-tumor tissue samples from patients with non-small cell lung carcinoma to gain a systems biology insight into the current clinical classification Experiment Description Non-small cell lung cancer (NSCLC), a leading cause of cancer deaths, represents a heterogeneous group of neoplasms, mostly comprising squamous cell carcinoma (SCC), adenocarcinoma (AC) and large-cell carcinoma (LCC). The aim of this study was to gain a systems biology insight into the current clinical classification. Patients and Methods: Comparative genomic hybridization followed by mutational analysis, gene expression and miRNA microarray profiling were performed on 123 paired tumor and non-tumor tissue samples from patients with NSCLC. Using integrated systems biology approaches, we sought to find out if combining data types from different levels of biology would improve clinical assessment of NSCLC. Results: At both DNA, RNA and miRNA levels we could identify molecular markers that discriminated significantly between the various clinicopathological entities of NSCLC. Conclusions: We report proofs of distinct molecular profiles that contribute to distinguishing NSCLC tumor subtypes even in small biopsies. The Gene expression experiments have been made in dual color and dye_swap with Agilent human Human Genome Exon 244K arrays (custom design 14891, from commercial 4x44K (design 014850 plus 195000 oligo - 1 per exon- defined with RefSeq hg18 and 1840 probes from viral transcripts). Note date of surgery is the date of the sample was frozen. Experimental Design disease_state_design dye_swap_design co-expression_design Comment[AEExperimentType] transcription profiling by array Experimental Factor Name HISTOLOGY DISEASE_STATE Experimental Factor Type histology disease_state Quality Control Type dye_swap_quality_control biological_replicate technical_replicate Public Release Date 2013-12-11 Person Last Name DESSEN Person First Name Philippe Person Mid Initials Person Email dessen@igr.fr Person Phone 33142114490 Person Address Institut Gustave Roussy, 114 rue Edouard Vaillant, 91805 Villejuif, Cedex, France Person Affiliation IGR / INSERM U985 Person Roles submitter PubMed ID 24299561 Publication Author List Lazar V, Suo C, Orear C, van den Oord J, Balogh Z, Guegan J, Job B, Meurice G, Ripoche H, Calza S, Hasmats J, Lundeberg J, Lacroix L, Vielh P, Dufour F, LehtiM-CM-6 J, Napieralski R, Eggermont A, Schmitt M, Cadranel J, Besse B, Girard P, Blackhall F, Validire P, Soria JC, Dessen P, Hansson J, Pawitan Y Publication Title Integrated molecular portrait of non-small cell lung cancers Publication Status in preparation Protocol Name P-MTAB-26661 P-MTAB-26662 P-MTAB-26663 P-MTAB-26664 P-MTAB-26665 Protocol Type nucleic_acid_extraction labeling hybridization image_acquisition bioassay_data_transformation Protocol Description Total RNAs were extracted using the TriReagent method (TriReagent, Euromedex, Strasbourg, France). 1) Tissue lysis/homogenization. For each sample, the following sequences were performed as quickly as possible: Cuts of the samples were placed in 1ml TriReagent and grinded by using a tissue ruptor. Then 600M-5l of TriReagent were added. 2) Separation of the aqueous phase and isopropanol precipitation. Tissue homogenates (1,6 ml) were added by 320 ul of chloroform and incubated at room temperature for 15 min and then centrifuged at 12M- 000 g for 15 min at 4C. The upper aqueous phase (about 800 ul) containing RNA was collected. RNA was then precipitated from the aqueous phase by addition of isopropanol (1 volume isopropanol for one volume aqueous solution) 10 min at room temperature and centrifugation at 12 000g for 10 min at 4C. Pellets were washed twice with ethanol 75 % (7 500 g, 5 min, 4C), resuspended in 100 ml sterile RNAse-free water, and frozen in M-^V20C. 4) Quality control. One microliter was used for determination of concentration using a Nanodrop spectrophotometer (www.nanodrop.com). Quality of RNA-preparations was further assessed using Lab-on-a-chip Bioanalyser 2000 technology (Agilent technologies), based on the 28S/18S ribosomal RNAs ratio. All samples included in this study displayed a ratio of ribosomal RNAs between 1.8 and 2. Concentration of all RNAs was adjusted at 100 ng/ul, and verified with a second measurement on a Nanodrop spectrophotometer. (500 ng, Amplification=RNA polymerases). Agilent oligo Cy5 or Cy3 probes labelling protocol. Kit used for probe labelling: Agilent Fluorescent Low input Linear Amplification kit (G2554A) adapted for small amount of total RNA (500 ng total RNA per reaction). In 1.5 ml tubes add 500 ng total RNA from sample. Add 1.2 ul T7 promoter primer, 2ul Spike-in A for Cy3 labelling or B for Cy5 labelling and add nuclease free water (Invitrogen ref:10977-015) to bring the volume to 11 ul. Denature by incubating at 65C for 10 minutes. Place the reactions on ice and incubate 5 min. Spin briefly. Prepare a Reverse Transcription master mix, adding for one reaction 4 ul First strand Buffer 5X, 2 ul DTT 0.1M, 1 ul dNTP 10 mM mix, 0.5 ul Random Hexamer, 1 ul MMLV Reverse Transcriptase (200 U/ul), 0.5 ul RNAse out (40 U/ul), 1 ul nuclease free water. Master mix is prepared in batch for all the samples included in the study (vol per one reaction multiplied by number of samples. In each reaction tube containing denaturated RNA and T7 promoter primer in a volume of 11 ul, add 9 ul of Reverse Transcriptase master mix, and mix by gently pipetting. Incubate at 40C in a circulating water bath for 2 hours. Move samples to 65 C for 15 minutes to inactivate MMLV RT, and incubate on ice for 5 minutes. Spin briefly. Prepare a in vitro transcription master mix, adding for one reaction : 20.1 ul Nuclease free water, 20 ul Transcription buffer 4X, 6 ul DTT 0.1 M, 8 ul NTP mix, 0.5 ul RNAse out, 0.6 ul Inorganic Pyrophosphatase, 0.8 ul T7 RNA Polymerase. Transcription master mix is prepared in batch for all the samples included in the study: (vol per one reaction multiplied by number of samples). Master mix is splited in two aliquots, one for Cy5 and one for Cy3. Add 1.6 ul (multiplied by reactions number) CTP-Cy5 25 mM (Perkin Elmer ref NEL 581) or 2.4 ul CTP-Cy3 (multiplied by reactions number) (Perkin Elmer, ref NEL 582) to a total volume of 60 ul/ reaction. To each RT reaction, add 60 ul Transcription master mix. Incubate at 40C for 3 hours. Add 20 ul nuclease free water and freeze at M-^V20C. Labeled probes are purified using Qiagen Rneasy mini kit and protocol provided by Agilent. For each probe, add 350 ul RLT buffer, and 250 ul ethanol 100. Mix by gently vortexing. Apply 700 ul on Rneasy columns and spin at 13,000 g for 30 s at 4C. Discard flow-through. Wash twice with RPE buffer. Dry the column and elute in 60 ul nuclease free water. Measure concentration and Cy5/Cy3 incorporation using a Nanodrop spectrophotometer. Adjust concentration at 100 ng/ul. Freeze at M-^V20C until hybridisation. Agilent Hybridization Protocol (Chamber type: Agilent SureHyb Chamber; Quantity of labelled extract: 825 ng per labelling ; duration: 17 hours; volume: 100 ul per array; Temperature in C: 65). Add the following components in clean 1.5 ml tubes: 825 ng linearly amplified cRNA labeled with Cy5 for sample 1, and 825 ng linearly amplified cRNA labeled with Cy3 for sample 2. Conversely, for dye swapped arrays, mix 825 ng linearly amplified cRNA labelled with Cy3 for sample1, and 825 ng linearly amplified cRNA labeled with Cy5 for sample 2. In each case, add 50 ul Bocking Agent 10X, 10 ul of Fragmentation buffer 25X and nuclease free water up to 250 ul. Mix by vortexing. Incubate at 60C for 30 minutes in dark. Spin briefly, add 250 ul of 2X hi-hybridization buffer. Mix gently. Assembly the Sure-Hyb hybridization chamber from Agilent. Place a backing side with the plastic inner on upper side. Gently add 490 ul of 1x hybridisation solution and cover with the Agilent 244k exon array properly oriented (active surface in contact with liquid). Finish to assembly the chamber and tight the screw. Hybridization was carried out for 17 hours at 65C in a rotating oven (Robbins Scientific, Mountain View, CA) at 10 rpm. The arrays were disassembled at room temperature in wash 1 buffer (Agilent), then washed 1 minute in a glass dish (Wheathon) at room temperature in wash 1 buffer, then 1 minute in wash 2 buffer at 37C and then 1 minute in acetonitril. Slides were dried using a nitrogen gun, and scanned by using an Agilent G2565 C DNA microarray scanner. Scanning was performed with a Agilent G2565C DNA Microarray scanner using defaults parameters (100 PMT, 5 um resolution, at 20C in low ozone concentration environment. Microarray images were analysed by using Feature Extraction software version (10.5.1.1) from Agilent technologies. Defaults settings were used. Microarray images were analysed by using Feature Extraction software version ( 10.5.1.1) from Agilent technologies. Defaults settings were used. Data processing : All processing methods used for gene expression analysis were performed on the Cy3 and Cy5 Median Signal from Agilent Feature Extraction raw data files in R, using functions and package collected in the Bioconductor project (Gentleman et al, 2004, Genome Biology, 5:R80) as well as custom written routines. First, dye swap arrays were combined (mean of intensities) to obtained only one array per condition. This combination have the result of centering the M values (log2ratios) on zero. Then, flagged spots as well as control spot were removed. A loess normalization was then performed using the M-^SnormalizeWithinArraysM-^T function from R package M-^SLIMMAM-^T (Smyth, 2004, Statistical applications in Genetics and molecular biology, vol3, N1, article3) from the Bioconductor project without a background subtraction : To assess differentially expressed genes, we first estimate the fold changes and standard errors between two groups of samples by fitting a linear model for each probes with the M-^SlmFitM-^T function of LIMMA package. Then we applied an empirical Bayes smoothing to the standard errors to the linear model previously computed with M-^SeBayesM-^T function of LIMMA. To extract a table of the top-ranked genes from the linear model fit, we used topTable function from LIMMA. The results were saved in table file format, as well as in a Volcanoplot. SDRF File E-MTAB-1132.sdrf.txt