Please note that we have stopped the regular imports of Gene Expression Omnibus (GEO) data into ArrayExpress. This may not be the latest version of this experiment.

E-GEOD-43520 - The evolution of lncRNA repertoires and expression patterns in tetrapods

Released on 19 January 2014, last updated on 31 May 2014
Gallus gallus, Gorilla gorilla gorilla, Homo sapiens, Macaca mulatta, Monodelphis domestica, Mus musculus, Ornithorhynchus anatinus, Xenopus (Silurana) tropicalis
Samples (55)
Protocols (5)
Only a minuscule fraction of long non-coding RNAs (lncRNAs) are well characterized. The evolutionary history of lncRNAs can provide insights into their functionality, but comparative analyses have been precluded by our ignorance of lncRNAs in non-model organisms. Here, we use RNA sequencing to identify lncRNAs in eleven tetrapod species and we present the first large-scale evolutionary study of lncRNA repertoires and expression patterns. We identify ~11,000 primate-specific lncRNA families, which show evidence for selective constraint during recent evolution, and ~2,400 highly conserved lncRNAs (including ~400 genes that likely originated more than 300 million years ago). We find that lncRNAs, in particular ancient ones, are generally actively regulated and may predominantly function in embryonic development. lncRNA X-inactivation patterns reveal an extremely female-biased monotreme-specific lncRNA, which may partially compensate X-dosage in this lineage. Most lncRNAs evolve rapidly in terms of sequence and expression levels, but global patterns like tissue specificities are often conserved. We compared expression patterns of homologous lncRNA and protein-coding families across tetrapods to reconstruct an evolutionarily conserved co-expression network. This network, which surprisingly contains many lncRNA hubs, suggests potential functions for lncRNAs in fundamental processes like spermatogenesis or synaptic transmission, but also in more specific mechanisms such as placenta growth suppression through miRNA production. [Batch 1 and 2] To broaden our understanding of lncRNA evolution, we used an extensive RNA-seq dataset to establish lncRNA repertoires and homologous gene families in 11 tetrapod species.
We analyzed the polyadenylated transcriptomes of 8 organs (cortex/whole brain without cerebellum, cerebellum, heart, kidney, liver, placenta, ovary and testis) and 11 species (human, chimpanzee, bonobo, gorilla, orangutan, macaque, mouse, opossum, platypus, chicken and the frog Xenopus tropicalis), which shared a common ancestor ~370 million years (MY) ago. Our dataset included 47 strand-specific samples, which allowed us to confirm the orientation of gene predictions and to address the evolution of sense-antisense transcripts. See also GSE43721 (Soumillon et al., Cell Reports, 2013) for three strand-specific samples for mouse brain, liver and testis.
Experiment types
RNA-seq of coding RNA, RNA-seq of non coding RNA 
Anamaria Necsulea, Angélica Liechti, Frank Grutzner, Henrik Kaessmann, Julie Baker, Magali Soumillon, Maria Warnefors, Tasman Daish, Ulrich Zeller
The evolution of lncRNA repertoires and expression patterns in tetrapods. Necsulea A, Soumillon M, Warnefors M, Liechti A, Daish T, Zeller U, Baker JC, Grützner F, Kaessmann H. PMID:24463510
Investigation description: E-GEOD-43520.idf.txt
Sample and data relationship: E-GEOD-43520.sdrf.txt
Processed data (32)