Eukaryotic metabarcoding (EUKB01) https://www.prinformatics.com/course/eukaryotic-metabarcoding-eukb01/ 23rd - 27th July 2018 in Glasgow Delivered by Dr. Owen Wangensteen Course Overview: Metabarcoding techniques are a set of novel genetic tools for assessing biodiversity of natural communities. Their potential applications include (but are not limited to) accurate water quality, soil diversity assessment, trophic analyses of digestive contents, early detection of non-indigenous species, studies of global ecological patterns and biomonitoring of anthropogenic impacts. This course will give an overview of metabarcoding procedures with an emphasis on practical problem-solving and hands-on work using analysis pipelines on real datasets. After completing the course, students should be in a position to (1) understand the potential and capabilities of metabarcoding, (2) run complete analyses of metabarcoding pipelines and obtain diversity inventories and ecologically interpretable data from raw next-generation sequence data and (3) design their own metabarcoding projects, using bespoke primer sets and custom reference databases. All course materials (including copies of presentations, practical exercises, data files, and example scripts prepared by the instructing team) will be provided electronically to participants. Intended Audience This workshop is mainly aimed at researchers and technical workers with a background in ecology, biodiversity or community biology who want to use molecular tools for biodiversity research and researchers in other areas of bioinformatics who want to learn ecological applications for biodiversity- assessment. In general, it is suitable for every researcher who wants to join the growing community of metabarcoders worldwide. Course Programme Monday 23rd – Classes from 09:00 to 17:00 Session 1. Introduction to metabarcoding procedures. The metabarcoding pipeline. In this session students will be introduced to the key concepts of metabarcoding and the different next-generation sequencing platforms currently available for implementing this technology. The kind of results that we may obtain from metabarcoding projects is explained using examples from real life. We will outline the different steps of a typical metabarcoding pipeline and introduce some key concepts. In this session, we will check that the computing infrastructure for the rest of the course is in place and all the needed software is installed. Core concepts introduced: next-generation sequencer, multiplexing, NGS library, metabarcoding pipeline, metabarcoding marker, clustering algorithms, molecular operational taxonomic unit (MOTU), taxonomic assignment. Session 2. Metabarcoding markers. Primer design. PCR and library preparation protocols. In this session students will learn about the various kinds of molecular markers that can be used for metabarcoding different kinds of samples and the quality of the information which can be retrieved from them. They will learn about the most commonly used primer sets for each target taxonomic group and how to use the software available for designing their own custom metabarcoding primers. They will know about sample tags, library tags, adapter sequences, PCR protocols and library preparation procedures. Core concepts introduced: metabarcoding marker, universality, specificity, taxonomic range, taxonomic resolution, primer bias, amplification errors, sequencing errors, in silico PCR, sample tags, library tags, adapter sequences, PCR, library preparation kits, PCR-free methods, avoiding contaminations, good laboratory practice. Tuesday 24th – Classes from 09:00 to 17:00 Session 3. The OBITools pipeline. First steps and quality control. In this session, we will start to work with the OBITools software suite, using a real sequence dataset as example for testing our metabarcoding pipeline. We will outline the steps needed to start analysing raw data from next-generation sequencers. The students will learn about the different data formats used by OBITools for working with sequences and they will perform protocols for quality control, paired-end alignment, sequence filtering, removal of chimeric sequences, sample demultiplexing, format conversion and dereplication of unique sequences. Core concepts introduced: fastq, fasta and extended fasta formats, Phred quality score, paired-end alignment, demultiplexing, sequence filtering, chimeras, dereplication, unique sequences, reads. Session 4. Clustering algorithms. Constant and variable identity thresholds. In this session, we will introduce different algorithms available for clustering sequences into molecular operational taxonomic units (MOTUs). We will learn the differences between constant and variable identity percent threshold for delineating the MOTUS. We will run some of these algorithms with our example dataset and will analyse the results from different methods. Core concepts introduced: MOTU, reference clustering, de novo clustering, unsupervised-learning clustering, Bayesian clustering, step-by- step aggregation methods, identity threshold, variable identity threshold, singleton sequences, sequence mapping, abundance recalculation. Wednesday 25th – Classes from 09:00 to 17:00 Session 5. Taxonomic assignment. The ecotag algorithm. Reference databases. In this session the students will learn about different algorithms for taxonomic assigment of MOTUs. The ecotag algorithm will be used for adding taxonomic information to the MOTUs in our example dataset and the results will be compared to those from other assignment software. Core concepts introduced: reference database, identity assignment, BLAST, phylogenetic assignment, best match, higher taxa assignment. Session 6. Generating, improving and curating reference databases. The quality of the reference database used for taxonomic assignment is crucial for the accuracy and applicability of the resulting datasets from any metabarcoding project. In this session the students will learn how to build local reference databases from the information available in public sequence repositories and how to add custom sequences to existing reference databases. They will also learn how sequence reference databases interact with taxonomy databases for retrieving the phylogenetic information needed by the assignment algorithms. Core concepts introduced: ecoPCR, sequence reference database, taxonomic database, taxonomic identifier (taxid), GenBank, European Nucleotide Archive (ENA), Barcode Of Life Datasystems (BOLD), SILVA database. Thursday 23rd – Classes from 09:00 to 17:00 Session 7. Refining and analysing the final dataset. Collapsing, renormalising and blank correction. α- and ß- diversity patterns. In this session, students will learn about procedures for refining the final datasets obtained from the previous pipeline. They will learn about blank correction, renormalization procedures for deleting false positive results, and taxonomy collapsing of related MOTUs for obtaining enhanced final datasets. We will also discuss how to interpret these final datasets to obtain ecologically relevant information. Resampling and rarefying procedures are introduced. Qualitative and quantitative indices for assessing dissimilarity between samples are explained. We will introduce the UniFrac dissimilarity distance between samples, an index taking in account not only abundances of the different MOTUs but also their taxonomic affinity. Core concepts introduced: renormalization, taxonomy collapsing, blank correction, α-diversity, ß-diversity, rarefaction, MOTU richness, UniFrac distances, multidimensional scaling (MDS). Session 8. Presenting the final results. Online resources and future developments. In this session we will continue with the presentation of final results. Students will learn how to plot taxonomic summaries from their datasets, including krona plots, a type of graphic representation which allow to show relative abundances of reads at different taxonomic levels. The rest of the session will be dedicated to introduce current research and possible future developments of metabarcoding / metagenomics techniques and to provide a list of useful resources for further learning, continuous training and future research opportunities. Core concepts introduced: taxonomic summary, krona plots, target capture, metagenomics, mitogenomics, long range PCR, nanopore sequencing, Friday 24th – Classes from 09:00 to 16:00 Session 9. Customization. This session will be dedicated to customize individual metabarcoding projects, in function of the specific needs of the students. We will discuss the best strategies to use for obtaining good quality results from our metabarcoding projects, by optimizing time, money and computing resources. The idea is to make this session as interactive and useful as possible. We will present our current and future projects in the format of an open discussion and we will try to propose the best solutions for every potential problem in a collaborative way. Session 10. Optional free afternoon to cover previous modules, discuss data or continue with the customization session.
Email [email protected] with any questions. Check our sister sites www.PRstatistics.com (ecology and life sciences) www.PRinformatics.com (bioinformatics and data science) www.PSstatistics (behaviour and cognition) Upcoming courses below 1. February 19th – 23rd 2018 MOVEMENT ECOLOGY (MOVE01) Margam Discovery Centre, Wales, Dr Luca Borger, Dr Ronny Wilson, Dr Jonathan Potts https://www.prstatistics.com/course/movement-ecology-move01/ 2. February 19th – 23rd 2018 GEOMETRIC MORPHOMETRICS USING R (GMMR01) Margam Discovery Centre, Wales, Prof. Dean Adams, Prof. Michael Collyer, Dr. Antigoni Kaliontzopoulou http://www.prstatistics.com/course/geometric-morphometrics-using-r-gmmr01/ ---------------------------------------------------------------------------- ---------------------------------------------------------------------------- ------------------ 3. March 5th - 9th 2018 SPATIAL PRIORITIZATION USING MARXAN (MRXN01) Margam Discovery Centre, Wales, Jennifer McGowan https://www.prstatistics.com/course/introduction-to-marxan-mrxn01/ 4. March 12th - 16th 2018 ECOLOGICAL NICHE MODELLING USING R (ENMR02) Glasgow, Scotland, Dr. Neftali Sillero http://www.prstatistics.com/course/ecological-niche-modelling-using-r- enmr02/ 5. March 19th – 23rd 2018 BEHAVIOURAL DATA ANALYSIS USING MAXIMUM LIKLIHOOD IN R (BDML01) Glasgow, Scotland, Dr William Hoppitt http://www.psstatistics.com/course/behavioural-data-analysis-using-maximum- likelihood-bdml01/ ---------------------------------------------------------------------------- ---------------------------------------------------------------------------- ------------------ 6. April 9th – 13th 2018 NETWORK ANAYLSIS FOR ECOLOGISTS USING R (NTWA02 Glasgow, Scotland, Dr. Marco Scotti https://www.prstatistics.com/course/network-analysis-ecologists-ntwa02/ 7. April 16th – 20th 2018 INTRODUCTION TO STATISTICAL MODELLING FOR PSYCHOLOGISTS USING R (IPSY01) Glasgow, Scotland, Dr. Dale Barr, Dr Luc Bussierre http://www.psstatistics.com/course/introduction-to-statistics-using-r-for- psychologists-ipsy01/ 8. April 23rd – 27th 2018 MULTIVARIATE ANALYSIS OF ECOLOGICAL COMMUNITIES USING THE VEGAN PACKAGE (VGNR01) Glasgow, Scotland, Dr. Peter Solymos, Dr. Guillaume Blanchet https://www.prstatistics.com/course/multivariate-analysis-of-ecological- communities-in-r-with-the-vegan-package-vgnr01/ 9. April 30th – 4th May 2018 QUANTITATIVE GEOGRAPHIC ECOLOGY: MODELING GENOMES, NICHES, AND COMMUNITIES (QGER01) Glasgow, Scotland, Dr. Dan Warren, Dr. Matt Fitzpatrick https://www.prstatistics.com/course/quantitative-geographic-ecology-using-r- modelling-genomes-niches-and-communities-qger01/ ---------------------------------------------------------------------------- ---------------------------------------------------------------------------- ------------------ 10. May 7th – 11th 2018 ADVANCES IN MULTIVARIATE ANALYSIS OF SPATIAL ECOLOGICAL DATA USING R (MVSP02) CANADA (QUEBEC), Prof. Pierre Legendre, Dr. Guillaume Blanchet https://www.prstatistics.com/course/advances-in-spatial-analysis-of- multivariate-ecological-data-theory-and-practice-mvsp03/ 11. May 14th - 18th 2018 INTRODUCTION TO MIXED (HIERARCHICAL) MODELS FOR BIOLOGISTS (IMBR01) CANADA (QUEBEC), Prof Subhash Lele https://www.prstatistics.com/course/introduction-to-mixed-hierarchical- models-for-biologists-using-r-imbr01/ 12. May 21st - 25th 2018 INTRODUCTION TO PYTHON FOR BIOLOGISTS (IPYB05) SCENE, Scotland, Dr. Martin Jones http://www.prinformatics.com/course/introduction-to-python-for-biologists- ipyb05/ 13. May 21st - 25th 2018 INTRODUCTION TO REMOTE SENISNG AND GIS FOR ECOLOGICAL APPLICATIONS (IRMS01) Glasgow, Scotland, Prof. Duccio Rocchini, Dr. Luca Delucchi https://www.prinformatics.com/course/introduction-to-remote-sensing-and-gis- for-ecological-applications-irms01/ 14. May 28th – 31st 2018 STABLE ISOTOPE MIXING MODELS USING SIAR, SIBER AND MIXSIAR (SIMM04) CANADA (QUEBEC) Dr. Andrew Parnell, Dr. Andrew Jackson https://www.prstatistics.com/course/stable-isotope-mixing-models-using-r- simm04/ 15. May 28th – June 1st 2018 ADVANCED PYTHON FOR BIOLOGISTS (APYB02) SCENE, Scotland, Dr. Martin Jones https://www.prinformatics.com/course/advanced-python-biologists-apyb02/ ---------------------------------------------------------------------------- ---------------------------------------------------------------------------- ------------------ 16. June 12th - 15th 2018 SPECIES DISTRIBUTION MODELLING (DBMR01) Myuna Bay sport and recreation, Australia, Prof. Jane Elith, Dr. Gurutzeta Guillera https://www.prstatistics.com/course/species-distribution-models-using-r- sdmr01/ 17. June 18th – 22nd 2018 STRUCTURAL EQUATION MODELLING FOR ECOLOGISTS AND EVOLUTIONARY BIOLOGISTS USING R (SEMR02) Myuna Bay sport and recreation, Australia, Dr. Jon Lefcheck https://www.prstatistics.com/course/structural-equation-modelling-for- ecologists-and-evolutionary-biologists-semr02/ 18. June 25th – 29th 2018 SPECIES DISTRIBUTION/OCCUPANCY MODELLING USING R (OCCU01) Glasgow, Scotland, Dr. Darryl McKenzie https://www.prstatistics.com/course/species-distributionoccupancy-modelling- using-r-occu01/ ---------------------------------------------------------------------------- ---------------------------------------------------------------------------- ------------------ 19. July 2nd - 5th 2018 SOCIAL NETWORK ANALYSIS FOR BEHAVIOURAL SCIENTISTS USING R (SNAR01) Glasgow, Scotland, Prof James Curley http://www.psstatistics.com/course/social-network-analysis-for-behavioral- scientists-snar01/ 20. July 8th – 12th 2018 MODEL BASE MULTIVARIATE ANALYSIS OF ABUNDANCE DATA USING R (MBMV02) Glasgow, Scotland, Prof David Warton https://www.prstatistics.com/course/model-base-multivariate-analysis-of- abundance-data-using-r-mbmv02/ 21. July 16th – 20th 2018 PRECISION MEDICINE BIOINFORMATICS: FROM RAW GENOME AND TRANSCRIPTOME DATA TO CLINICAL INTERPRETATION (PMBI01) Glasgow, Scotland, Dr Malachi Griffith, Dr. Obi Griffith https://www.prinformatics.com/course/precision-medicine-bioinformatics-from- raw-genome-and-transcriptome-data-to-clinical-interpretation-pmbi01/ 22. July 23rd – 27th 2018 EUKARYOTIC METABARCODING (EUKB01) Glasgow, Scotland, Dr. Owen Wangensteen http://www.prinformatics.com/course/eukaryotic-metabarcoding-eukb01/ ---------------------------------------------------------------------------- ---------------------------------------------------------------------------- ------------------ Oliver Hooker PhD. PR statistics 2017 publications - Ecosystem size predicts eco-morphological variability in post-glacial diversification. Ecology and Evolution. In press. The physiological costs of prey switching reinforce foraging specialization. Journal of animal ecology. +44 (0) 7966500340
