Course on "Reproducible data science for population genetics" (RDPG01)
This course will be delivered by Dr. Thibaut Jombart who authors theadegenet package and Dr. Zhian Kamvar. www.prstatistics.com/course/reproducible-data-science-for-population- genetics-rdpg01/ This course will run from the 23rd - 27th October at Margam discovery centre in Wales, United Kingdom. Course Overview: With the increasing availability of various types of genetic and genomic data, population genetics and molecular ecology are becoming largely data driven sciences. Understanding the evolutionary, demographic, and ecological underpinning the genetic makeup of natural populations now relies on a combination of exploratory approaches and models. This course will provide an in-depth introduction to these techniques, with a strong emphasis on reproducibility though the use of modern analytic practices and tools. After an introduction to phylogenetic reconstruction, the course will cover a number of multivariate approaches for the analysis of genetic patterns, including supervised and unsupervised factorial methods, clustering approaches, and advanced methods for describing population diversity and revealing spatial genetic patterns. The approaches introduced will be applicable to most genetic data, including markers such as microsatellites, SNPs, or AFLP, as well as nucleotide and amino-acid sequence data. Every day will start with a lecture dedicated to a type of problem and methods, followed by an introduction to a specific technique for reproducible data analysis; afternoon will be devoted to hands on praticals. The last day will be devoted to open problems, where participants will be able to analyse their own data. Monday 23rd – Classes from 09:00 to 17:00 Intro to phylogenetic reconstruction Module 1a: reconstructing phylogenies from genetic sequence data. Three main approaches covered: distance-based phylogenies; maximum parsimony; and likelihood-based approaches. Module 1b: reproducible data science using R: an introduction Practical 1: phylogenetic reconstruction using R. Three main approaches plus rooting a tree; assessing/testing for a molecular clock; and bootstrapping. Main packages: knitr, ape, phangorn. Tuesday 24th – Classes from 09:00 to 17:00 Introduction to multivariate analysis of genetic data Module 2a: key concepts in multivariate analysis. Focus on using factorial methods for genetic data analysis. Module 2b: using R to generate high-quality pdf or word documents. Practical 2: multivariate analysis of genetic data in R. Topics include: data handling, Hardy-Weinberg tests, measures of diversity, tests of population structure, principal component analysis (PCA), multidimensional scaling (MDS). Main packages: knitr, rmarkdown, adegenet, ade4, pegasa, hierfstat, ape. Wednesday 25th – Classes from 09:00 to 17:00 Exploring group diversity Module 3a: approaches for identifying and describing genetic clusters. Topics include: hierarchical clustering, K-means, genetic distances between populations, supervised factorial methods including between-group PCA and the Discriminant Analysis of Principal Components (DAPC). Module 3b: using R to generate beamer and html5 slides. Practical 3: applying the approaches covered in morning lecture and emphasising their strengths and weaknesses. Main packages: rmarkdown, adegenet, ade4, hierfstat. Thursday 26th – Classes from 09:00 to 17:00 Spatial genetic structures Module 4a: on the origins of spatial genetic patterns, how to test for them, and how to reveal and visualise them. Module 4b: asking questions the right way with reproducible code. Practical 4: visualising and analysing spatial genetic data. Topics: spatial density estimates, univariate and multivariate tests of spatial structure (Moran and Mantel tests), mapping principal components from unsupervised methods (PCA), spatial PCA. Main packages: reprex, adegenet, spdep, ade4. Friday 27th – Classes from 09:00 to 16:00 Reproducible data science for population genetics in practice Open problem day – analyse your own data using R Main packages: knitr, rmarkdown, adegenet, ade4, ape, pegas, phangorn, hierfstat, poppr, ggplot2, etc. If you have any questions please email oliverhoo...@prstatistics.com or visit www.prstatistics.com www.prstatistics.com/course/reproducible-data-science-for-population- genetics-rdpg01/ Other up-coming courses are as follows 1. SPATIAL ANALYSIS OF ECOLOGICAL DATA USING R #SPAE 7th – 12th August 2017, Scotland, Prof. Jason Matthiopoulos, Dr. James Grecian http://www.prstatistics.com/course/spatial-analysis-ecological-data-using-r- spae05/ 2. ECOLOGICAL NICHE MODELLING USING R #ENMR 16th – 20th October 2017, Scotland, Dr. Neftali Sillero http://www.prstatistics.com/course/ecological-niche-modelling-using-r- enmr01/ 3. REPRODUCIBLE DATA SCIENCE FOR POPULATION GENETICS 23rd – 27th October, Wales, Dr. Thibaut Jombart, Zhian Kavar http://www.prstatistics.com/course/genetic-data-analysis-exploration-using- r-gdar03/ 4. STRUCTURAL EQUATION MODELLING FOR ECOLOGISTS AND EVOLUTIONARY BIOLOGISTS USING R #SEMR 23rd – 27th October, Wales, Prof Jarrett Byrnes, Dr. Jon Lefcheck http://www.prstatistics.com/course/structural-equation-modelling-for- ecologists-and-evolutionary-biologists-semr01/ 5. LANDSCAPE (POPULATION) GENETIC DATA ANALYSIS USING R #LNDG 6th – 10th November, Wales, Prof. Rodney Dyer http://www.prstatistics.com/course/landscape-genetic-data-analysis-using-r- lndg02/ 6. APPLIED BAYESIAN MODELLING FOR ECOLOGISTS AND EPIDEMIOLOGISTS #ABME 20th - 25th November 2017, Scotland, Prof. Jason Matthiopoulos, Dr. Matt Denwood http://www.prstatistics.com/course/applied-bayesian-modelling-ecologists- epidemiologists-abme03/ 7. ADVANCING IN STATISTICAL MODELLING USING R #ADVR 11th – 15th December 2017, Wales, Dr. Luc Bussiere, Dr. Tom Houslay, Dr. Ane Timenes Laugen, http://www.prstatistics.com/course/advancing-statistical-modelling-using-r- advr07/ 8. INTRODUCTION TO BAYESIAN HIERARCHICAL MODELLING #IBHM 29th Jan – 2nd Feb 2018, Scotland, Dr. Andrew Parnell http://www.prstatistics.com/course/introduction-to-bayesian-hierarchical- modelling-using-r-ibhm02/ 9. PHYLOGENETIC DATA ANALYSIS USING R (TBC) #PHYL 28th Jan – Feb 2nd Dr. Emmanuel Paradis – Date and location to be confirmed https://www.prstatistics.com/course/introduction-to-phylogenetic-analysis- with-r-phyg-phyl02/ 10. ANIMAL MOVEMENT ECOLOGY (ACOUSTIC TELELEMTRY DATA ANALYSIS (February 2018) #ANME 19th – 23rd February 2018, Wales, Dr Luca Borger, Dr. John Fieberg 11. GEOMETRIC MORPHOMETRICS USING R #GMMR 5th – 9th June 2017, Scotland, Prof. Dean Adams, Prof. Michael Collyer, Dr. Antigoni Kaliontzopoulou http://www.prstatistics.com/course/geometric-morphometrics-using-r-gmmr01/ 12. FUNCTIONAL ECOLOGY FROM ORGANISM TO ECOSYSTEM: THEORY AND COMPUTATION #FEER 5th – 9th March 2018, Scotland, Dr. Francesco de Bello, Dr. Lars Götzenberger, Dr. Carlos Carmona http://www.prstatistics.com/course/functional-ecology-from-organism-to- ecosystem-theory-and-computation-feer01/ 13. INTRODUCTION TO MARXAN FOR PLANNING MPA’S #MRXN 12TH – 16TH March 2018, Wales, Jennifer McGowan https://www.prstatistics.com/course/introduction-to-marxan-mrxn01/ 14. ECOLOGICAL NICHE MODELLING USING R #ENMR 26th – 30th March 2018, Wales, Dr. Neftali Sillero http://www.prstatistics.com/course/ecological-niche-modelling-using-r- enmr02/ 15. NETWORK ANAYLSIS FOR ECOLOGISTS USING R #NTWA 9th – 13th April 2018, Scotland, Dr. Marco Scotti 16. MULTIVARIATE ANALYSIS OF ECOLOGICAL COMMUNITIES IN R WITH THE VEGAN PACKAGE #VGNR 23rd – 27th April 2018, Scotland, Prof. Peter Solymos, Dr Guillaume Blnachet 17. ECOLOGICAL AND EVOLUTIONARY BIOGEOGRAPHY #EEBR 30TH April – 4th May 2018, Scotland, Dan Warren, Matt Fitzpatrick 18. ADVANCES IN MULTIVARIATE ANALYSIS OF SPATIAL ECOLOGICAL DATA USING R #MVSP 14th – 18th May 20128, Canada, Prof. Pierre Legendre, Dr. Guillaume Blanchet (TBC) 19. INTRODUCTIO9N TO MIXED MODELS FOR ECOLOGISTS USING R #IMMR 28TH May – 1st June 2018, Canada, Subhash Lele 20. MODEL BASE MULTIVARIATE ANALYSIS OF ABUNDANCE DATA USING R #MBMV0 8th – 12th July 2018, Scotland, Prof David Warton https://www.prstatistics.com/course/model-base-multivariate-analysis-of- abundance-data-using-r-mbmv02/ 21. STABLE ISOTOPE MIXING MODELS USING SIAR, SIBER AND MIXSIAR #SIMM Dr. Andrew Parnell, Dr. Andrew Jackson – Date and location to be confirmed https://www.prstatistics.com/course/stable-isotope-mixing-models-using-r- simm03/# 22. MULTIVARIATE ANALYSIS OF SPATIAL ECOLOGICAL DATA #MASE Prof. Subhash Lele, Dr. Peter Solymos - Date and location to be confirmed https://www.prstatistics.com/course/multivariate-analysis-of-spatial- ecological-data-using-r-mase01/ 23. INTRODUCTION TO R AND STATISTICS FOR BIOLOGISTS #IRFB Dr. Luc Bussiere, Dr Ane Laugen - Date and location to be confirmed https://www.prstatistics.com/course/introduction-to-statistics-and-r-for- biologists-irfb/ 24. TIME SERIES MODELS FOR ECOLOGISTS USING R (JUNE 2017 #TSME Dr. Andrew Parnell – Date and location to be confirmed https://www.prstatistics.com/course/time-series-models-foe-ecologists- tsme01/ 25. META-ANALYSIS IN ECOLOGY, EVOLUTION AND ENVIRONMENTAL SCIENCES #METR01 Prof. Julia Koricheva, Prof. Elena Kulinskaya - Date and location to be confirmed https://www.prstatistics.com/course/meta-analysis-in-ecology-evolution-and- environmental-sciences-metr01/ ---------------------------------------------------------------------------- -------------- 1. INTRODUCTION TO BIOINFORMATICS USING LINUX #IBUL 16th – 20th October, Scotland, Dr. Martin Jones http://www.prstatistics.com/course/introduction-to-bioinformatics-using- linux-ibul02/ 2. INTRODUCTION TO PYTHON FOR BIOLOGISTS #IPYB 27th Nov – 1st Dec, Wales, Dr. Martin Jones http://www.prinformatics.com/course/introduction-to-python-for-biologists- ipyb04/ 3. INTRODUCTION REMOTE SENSING AND GIS APPLICATIONS FOR ECOLOGISTS #IRMS 27th Nov – 1st Dec, Wales, Dr Duccio Rocchini, Dr. Luca Delucchi http://www.prstatistics.com/course/introduction-to-remote-sensing-and-gis- for-ecological-applications-irms01/ 4. DATA VISUALISATION AND MANIPULATION USING PYTHON #DVMP 11th – 15th December 2017, Wales, Dr. Martin Jones http://www.prinformatics.com/course/data-visualisation-and-manipulation- using-python-dvmp01/ 5. EUKARYOTIC METABARCODING 23rd – 27th July 2018, Scotland, Dr. Owen Wangensteen http://www.prinformatics.com/course/eukaryotic-metabarcoding-eukb01/ 6. CODING, DATA MANAGEMENT AND SHINY APPLICATIONS USING RSTUDIO FOR EVOLUTIONARY BIOLOGISTS AND ECOLOGISTS #CDSR Dr. Aline Quadros 7. BIOINFORMATICS FOR GENETICISTS AND BIOLOGISTS #BIGB Scotland, Dr. Nic Blouin, Dr. Ian Misner -- Oliver Hooker PhD. PR statistics 2017 publications - Ecosystem size predicts eco-morphological variability in post-glacial diversification. Ecology and Evolution. In press. The physiological costs of prey switching reinforce foraging specialization. Journal of animal ecology. prstatistics.com facebook.com/prstatistics/ twitter.com/PRstatistics groups.google.com/d/forum/pr-statistics-post-course-forum prstatistics.com/organiser/oliver-hooker/ 3/1, 128 Brunswick Street Glasgow G1 1TF +44 (0) 7966500340