Gene trees

This presentation shows how gene/protein trees are produced in Ensembl and how to fetch them via the Ensembl API.

Exercises 3

  1. Print the protein tree with the stable ID ENSGT00390000003602.
  2. Print all the members of the tree containing the human ncRNA gene ENSG00000238344.
  3. Count the number of duplication events in the tree of the zebrafish protein-coding gene ENSDARG00000003399.

Matthieu explains the answers to these exercises in this 6-minute video.
You can download his sample scripts and outputs:

  1. sample script and output
  2. sample script and output
  3. sample script and output
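
The logic behind exercises 2 and 3 (listing the members of a tree and counting its duplication events) comes down to walking the tree and inspecting each node. The sketch below illustrates only that traversal; it is not one of Matthieu's sample scripts and does not call the Ensembl API. It is written in Python rather than the Perl used by the Ensembl API, and it works on a small hand-made tree. The node layout (a "children" list, an "events" entry with a "type" on internal nodes, an "id" on leaves) is an assumption chosen for illustration, and the gene names are made up.

```python
# Hypothetical miniature gene tree, NOT real Ensembl data.
# Assumed layout: internal nodes carry "events" -> "type" and a "children"
# list; leaves (the tree members) carry only an "id".
SAMPLE_TREE = {
    "events": {"type": "speciation"},
    "children": [
        {
            "events": {"type": "duplication"},
            "children": [{"id": "geneA1"}, {"id": "geneA2"}],
        },
        {
            "events": {"type": "duplication"},
            "children": [
                {"id": "geneB1"},
                {
                    "events": {"type": "speciation"},
                    "children": [{"id": "geneB2"}, {"id": "geneB3"}],
                },
            ],
        },
    ],
}


def iter_nodes(node):
    """Yield every node of the tree, each parent before its children."""
    stack = [node]
    while stack:
        current = stack.pop()
        yield current
        # Push children in reverse so the first child is visited first.
        stack.extend(reversed(current.get("children", [])))


def count_events(tree, event_type):
    """Count internal nodes whose event type equals event_type."""
    return sum(
        1
        for n in iter_nodes(tree)
        if n.get("events", {}).get("type") == event_type
    )


def leaf_ids(tree):
    """Return the ids of all leaves (nodes without children)."""
    return [n["id"] for n in iter_nodes(tree) if not n.get("children")]


if __name__ == "__main__":
    print("Members:", ", ".join(leaf_ids(SAMPLE_TREE)))
    print("Duplication events:", count_events(SAMPLE_TREE, "duplication"))
```

With a real tree fetched from Ensembl, the same two steps apply: visit every node, then either collect the leaves or count the nodes annotated as duplications. The field or method names to use would have to be taken from the API documentation rather than from this sketch.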