Virtual course
CABANA Workshop: Plant bioinformatics - analysis of crop genomics data
With optional Train the Trainer workshop
This course will introduce crop biologists to methods and approaches for analysing crop genomics data for genotyping. During the course participants will learn about genotyping experimental design, variant calling, comparative genomics using Ensembl, analyse variation data with Ensembl Variant Effect Predictor and explore concepts of genome wide association studies (GWAS). The content will explain how to make use of existing public data, retrieve, submit and share your data with the community.
Trainers will cover the basics of diversity analysis using R, before analyzing various genomes to assess diversity, investigating gene-phenotype relationships and microsatellites. The course will be delivered through a combination of hands-on training sessions and lectures.
This course will be a virtual event. The content will be delivered over Zoom. In order to make the most out of the course, you should make sure to have a stable internet connection throughout the week.
Computational practicals will be run on EMBL-EBI's virtual training infrastructure; this means there is no need to have a powerful computer to run exercises or a requirement to install complex software before the course.
Who is this course for?
This course is aimed at wet-lab researchers within Latin America (excluding Chile and Uruguay due to funding restrictions) who are working with and/or generating their own crop genomic data or who are working on genotyping technologies, including data analysis software development.
Researchers who apply for CABANA courses should have projects in one or more of the CABANA challenge areas (sustainable crop production, communicable diseases, protection of biodiversity).
Please note this course will be taught in English, however some trainers are fluent in either Spanish/Portuguese, and can offer language support where feasible. Priority will also be given to those who have not attended a CABANA event yet.
Scientists from underrepresented ethnic and gender groups are especially encouraged to apply for this workshop, for example women and those with Black and/or Indigenous heritage.
Prerequisites: basic computational or previous bioinformatics experience is required for this workshop, such as basic commands in R. Undergraduate-level knowledge of plant biology would be an advantage.
Please note: The course will use genomic data from crop plant species throughout the week. It will not explore in detail methods for assembling genomes.
What will I learn?
Learning outcomes
After the course, participants should be able to:
- Describe how to design a crop genotyping experiment
- Conduct variant calling and search for a variety of crop genomic motifs such as SSRs and indels
- Use Ensembl for comparative analysis to explore plant genomes
- Access plant variation data from public resources and use appropriate tools to analyse variation
- Input and study genomics data using R to evaluate genomic diversity
- Select the best method to investigate genomic diversity
Course content
During this course you will learn about:
- How to design sequence-based genotyping experiments and conduct variant calling
- Ensembl tools for browsing plant genomes and variant data
- Assess crop diversity
- Genome wide association studies (GWAS)
This training course is associated with a virtual Train the Trainer workshop you can register for. For any information about the workshop and the application process, please refer to the "How to apply" section on this webpage.
Trainers
Andrés Gatica-Arias
Universidad de Costa Rica, CR José Pablo Jiménez
Instituto Tecnológico de Costa Rica, CR Alexandra Holinski
EMBL-EBI Sarah Dyer
EMBL-EBI Ensembl Outreach Team
EMBL-EBI Carla Filippi
FAGRO - Universidad de la Republica Baron Koylass
EMBL-EBI Luis Barboza
Universidad de Costa Rica, CR Brad Till
University of California, Davis Noelle Anglin
Agricultural Research Service, US Department of Agriculture Manuel Humberto Reyes-Valdés
Universidad Autónoma Agraria Antonio Narro (UAAAN), Mexico Diana Lopez
Universidad Nacional de Colombia, Colombia
Universidad de Costa Rica, CR
Instituto Tecnológico de Costa Rica, CR
EMBL-EBI
EMBL-EBI
EMBL-EBI
FAGRO - Universidad de la Republica
EMBL-EBI
Universidad de Costa Rica, CR
University of California, Davis
Agricultural Research Service, US Department of Agriculture
Universidad Autónoma Agraria Antonio Narro (UAAAN), Mexico
Universidad Nacional de Colombia, Colombia
Programme
The course will run according to Costa Rican time (UTC-6). Check the current time in Costa Rice here.
This is a preliminary programme. Small changes are still possible.
Time (UTC-6) |
Subject |
Trainers |
Day 1 – Monday 13th December 2021 |
||
08:30-09:00 |
Welcome - How this course will run |
Andrés Gatica-Arias |
09:00-09:30 |
Icebreaker |
Alexandra Holinski |
09:30 - 10:15 |
Principles for good data management and overview of EMBL-EBI resources |
Alexandra Holinski |
10:15 - 10:30 |
Break |
|
10:30 - 11:30 |
Methodologies for genotyping and NGS overview |
Brad Till |
11:30 - 12:15 |
Scientific networking - flash talks |
All |
12:15 - 13:00 |
Lunch break |
|
13:00 - 14:00 |
Introduction to genome assembly - basic concepts |
Jose P. Jimenez |
14:00 - 14:45 |
Planning a genotyping experiment |
Brad Till |
14:45 - 15:00 |
Break |
|
15:00 - 16:00 |
Keynote lecture - pre-recording |
Noelle Anglin |
Day 2 - Tuesday 14th December 2021 |
||
08:30 - 08.45 |
Questions and Answers and reflection on Day 1 |
Andrés Gatica-Arias, Jose P. Jimenez |
08.45 - 09:15 |
Introduction to variant calling |
Carla Filippi |
09:15 - 10:15 |
Whole genome variant calling: Strategies for SNP, small indels and SSR whole genome calling |
Carla Filippi |
10:15 - 10:30 |
Break |
|
10:30 - 11:30 |
Structural Variant calling: Large insertions, deletions, translocations |
Carla Filippi |
11:30 - 12:30 |
European Variation Archive |
Baron Koylass |
12:30 - 13:15 |
Scientific networking – flash talks |
|
13:15 - 14:00 |
Lunch break |
|
14:00 - 14:15 |
Check-in |
All |
14:15 - 16:00 |
Optional: Basic training on R and the Unix command line |
Jose P. Jimenez |
16:00 - 16:30 |
Check-in |
All |
Day 3 - Wednesday 15th December 2021 |
||
08:30 - 08:45 |
Questions and Answers and reflection on Day 2 |
Andrés Gatica-Arias, Jose P. Jimenez |
08:45 - 09:15 |
Questions and Answers for exercises from previous day |
Carla Filippi, Baron Koylass |
09:15 - 09:45 |
Introduction to Ensembl plant resource |
Sarah Dyer |
09:45 - 10:30 |
Ensembl Plant browser, Compara and Variant Effect Predictor (demos) |
Ensembl Outreach team |
10:30 - 10:45 |
Break |
|
10:45 - 12:15 |
Ensembl Plant browser, Compara and Variant Effect Predictor (demos) |
Ensembl Outreach team |
12:15 - 13:00 |
Scientific networking – flash talks |
All |
13:00 - 14:00 |
Lunch Break |
|
14:00 - 15:30 |
Practical work (in your own time) |
|
15:30 - 16:00 |
Check-in |
All |
Day 4 - Thursday 16th December 2021 |
||
08:30 - 08:45 |
Questions and Answers and reflection on Day 3 |
Andrés Gatica-Arias, Jose P. Jimenez |
08:45-09:15 |
Questions and Answers for Ensembl practical |
Sarah Dyer, Ben Moore |
09:15 - 09:45 |
Interacting with Ensembl: requesting new genomes, annotations or variants |
Sarah Dyer |
09:45 - 10:15 |
Break |
|
10:15 - 11:45 |
Genomic molecular markers |
Luis Barboza |
11:45 - 12:30 |
Scientific networking - flash talks |
|
12:30 -13:30 |
Lunch Break |
|
13:30 - 15:00 |
Measuring diversity |
Manuel Humberto Reyes-Valdés |
15:00 - 15:15 |
Break |
|
15:15 - 15:45 |
Q+A for keynote lecture |
Noelle Anglin |
15:45 - 16:30 |
Open training, discussion and questions and answers |
Andrés Gatica-Arias, Jose |
16:30 |
End of day |
|
Day 5 - Friday 17th December 2021 |
||
08:30 - 08:45 |
Questions and Answers and reflection on Day 4 |
Andrés Gatica-Arias, Jose P. Jimenez |
08:45 - 10:15 |
Introduction to genome wide association studies (GWAS) |
Diana Lopez |
10:15 - 10:30 |
Break |
|
10:30 - 11:45 |
Practical: GWAS in diploid/polyploid crops data |
Diana Lopez |
11:45 - 12:00 |
Feedback and course wrap-up |
Andrés Gatica-Arias, Jose P. Jimenez |
Application for Plant bioinformatics course
Incomplete applications will NOT be considered.
If you have any general queries about the workshop application/registration process please email Guilherme Oliveira and Piv Gopalasingam.
Please note that this course is free, but unexplained absence will result in blacklisting for future courses and opportunities.
There will be a maximum of 30 participants on this course and registration is dependent on selection after successful completion of the application process
Application/Registration will close on 1st November 2021 at 12:00 midday (GMT).
Please note, applicants for the Plant bioinformatics course can also apply for the Train the Trainer workshop.
Application for Train the Trainer workshop
The CABANA Plant bioinformatics workshop will be associated with a Train the Trainer (TtT) workshop. The TtT workshop is aimed at plant researchers from Latin America interested in developing their bioinformatics training skills with the intention of running their own courses in the future. For more information about CABANA TtT workshops please refer to the CABANA website.
The workshop will take place live across four half-day sessions between 7-10 December 2021 virtually using Zoom. Each session will take place around 09:00-13:00 Central Standard Time (UTC-6) with breaks.
The Train the Trainer component of the workshop will:
- provide guidance on general training techniques and appropriate use of methods based on learner needs, including some general 'DO's' and 'DO NOTs' for successful training
- provide a framework for successful curriculum design and further development, to enable trainers to build a course appropriate to their learners' needs
- provide opportunities to practice virtual training delivery.
Please note, applications for both of these workshops have now closed.
Participant flash talks
All participants of the plant bioinformatics course will be asked to give a short presentation about their research work as part of the course. These provide an opportunity to share their research with the other participants and provide a forum for discussion. Further details will be provided for selected applicants.