Integrated Chromosome 19 Transcriptomic and Proteomic Datasets Derived from Six Glioma-Derived Cancer Stem Cell Lines


One sub-project within the global Chromosome 19 Consortium, part of the Chromosome-Centric Human Proteome Project, is to define chromosome 19 gene and protein expression in glioma-derived cancer stem cells (GSCs). Chromosome 19 is notoriously linked to glioma by 1p/19q co-deletions and clinical tests are established to detect that specific aberration. GSCs are tumor-initiating cells and are hypothesized to provide a repository of cells in tumors that can self-replicate and be refractory to radiation and chemotherapeutic agents developed for the treatment of tumors. In this pilot study, we performed RNA-Seq, label-free quantitative protein measurements in six GSC lines, and targeted transcriptomic analysis using a chromosome 19 specific microarray in an additional 6 GSC lines. Here, we present insights into differences in GSC gene and protein expression, including the identification of proteins listed as having no or low evidence at the protein level (such as small nuclear ribonucleoprotein G-like protein, RUXGL_HUMAN), as correlated to chromosome 19 and GSC subtype. Furthermore, the upregulation of proteins downstream of adenovirus-associated viral integration site 1 (AAVS1) in GSC11 in response to oncolytic adenovirus treatment was demonstrated. Taken together, our results may indicate new roles for chromosome 19, beyond the 1p/19q co-deletion, in the future of personalized medicine for glioma patients. Data analysis: MS files (.raw) were imported into Progenesis LC-MS (version 18.214.1528, Nonlinear Dynamics) for m/z and retention time alignment. The top 5 spectra for each feature were exported (charge deconvolution, top 1000 peaks) as a combined .mgf file for database searching in PEAKS (version 6, Bioinformatics Solutions Inc., Waterloo, ON) against the UniprotKB/Swissprot-Human database (July 2013 version, 20,264 proteins), appended with the cRAP contaminant database. PEAKS DB and Mascot (version 2.3.02, Matrix Science) searches were performed with a parent ion tolerance of 10 ppm, fragment ion tolerance of 0.025 Da, fixed carbamidomethyl cysteine, and variable modifications of oxidation (M), phosphorylation (STY), and deamidation (NQ). Trypsin was specified as the enzyme, allowing for 2 missed cleavages and a maximum of 3 PTMs per peptide. An additional search for unexpected modifications was performed with the entire Unimod database. Finally, homology searching was performed using the SPIDER algorithm to identify peptides resulting from nonspecific cleavages or amino acid substitutions. Mascot and PEAKS SPIDER searches were combined (inChorus), using a 1% false discovery rate cutoff for both search engines. The resulting peptide-spectrum matches (95% peptide probability) were imported into Progenesis LC-MS. Conflict resolution was performed manually to ensure that a single peptide sequence was assigned to each feature by removing lower scoring peptides. The resulting normalized peptide intensity data were exported, and the peptide list was filtered to remove non-unique peptides, methionine-containing peptides, and all modified peptides except cysteine carbamidomethylation. For quantification, the filtered list of peptide intensities was imported into DanteR (version 0.1.1), and intensities for peptides of the same sequence were combined to form a single entry. The resulting peptide intensities were log2 transformed and combined to protein abundances (RRollup) using the default settings, excluding one-hit wonders (50% minimum presence of at least one peptide, minimum dataset presence 3, p-value cutoff of 0.05 for Grubbs’ test, minimum of 5 peptides for Grubbs’ test). The resulting proteins were quantified by 1-way ANOVA relative to M37; p-value adjustment for multiple testing was performed according to Benjamini and Hochberg.

Sample Processing Protocol

See details in reference PMID : 24266786

Data Processing Protocol

See details in reference PMID : 24266786


HL Liu, Department of Pharmacology and Toxicology

Submission Date


Publication Date



Not available


LTQ Orbitrap Elite


Not available

Experiment Type

Bottom-up proteomics


    Lichti CF, Liu H, Shavkunov AS, Mostovenko E, Sulman EP, Ezhilarasan R, Wang Q, Kroes RA, Moskal JC, Fenyö D, Oksuz BA, Conrad CA, Lang FF, Berven FS, Végvári A, Rezeli M, Marko-Varga G, Hober S, Nilsson CL; Integrated Chromosome 19 Transcriptomic and Proteomic Data Sets Derived from Glioma Cancer Stem-Cell Lines., J Proteome Res, 2013 Dec 3, PubMed: 24266786