Data Upskilling Short Courses 2023: Introduction to Statistics with R

University of Edinburgh

June 6-9, 2023


Instructors: Andrzej Romaniuk, Shrey Bhardwaj

Helpers: Shrey Bhardwaj, Juan Rodriguez Herrera, Mario Antonioletti, Andrzej Romaniuk, Ben King, Maria Jimenez Ramos


Introduction to Statistics with R This workshop uses a public health dataset and examples (NHANES from the US National Center for Health Statistics) but the materials are relevant to researchers more generally in the life, health and social sciences.

The workshop assumes no prior experience of statistical analysis in R. However, learners are expected to have some familiarity with R such as having done an introductory course. If you do not have any experience currently, one of these Carpentries courses would prepare you:

General Information

Where: Digital Scholarship Centre, Main Library, 6th floor - 30 George Square, Edinburgh EH8 9LJ. Get directions with OpenStreetMap or Google Maps.

When: June 6-9, 2023. Add to your Google Calendar.

Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) that they have administrative privileges on. They should have a few specific software packages installed (listed below).

Accessibility: We are committed to making this workshop accessible to everybody. The workshop organizers have checked that:

Materials will be provided in advance of the workshop and large-print handouts are available if needed by notifying the organizers in advance. If we can help making learning easier for you (e.g. sign-language interpreters, lactation facilities) please get in touch (using contact details below) and we will attempt to provide them.

Contact: Please email , or for more information.

Roles: To learn more about the roles at the workshop (who will be doing what), refer to our Workshop FAQ.

Code of Conduct

Everyone who participates in Carpentries activities is required to conform to the Code of Conduct. This document also outlines how to report an incident if needed.

Collaborative Notes

We will use this collaborative document for chatting, taking notes, and sharing URLs and bits of code.


Please be sure to complete these surveys before and after the workshop.

Pre-workshop Survey

Post-workshop Survey


The lesson taught in this workshop is being piloted and a precise schedule is yet to be established.

Day 1 : Tuesday, June 6th

Pre-workshop Setup
Statistical thinking for public health Estimating the mean, variance and standard deviation
Estimating the variation around the mean
Visualising and quantifying linear associations
Predicting means using linear associations
Lunch 12:00 - 13:00

Day 2 : Wednesday, June 7th

Simple linear regression for public health An introduction to linear regression
Linear regression with one continuous explanatory variable
Linear regression with a two-level factor explanatory variable
Making predictions from a simple linear regression model
Lunch 12:00 - 13:00

Day 3 : Thursday, June 8th

Simple linear regression for public health Assessing simple linear regression model fit and assumptions
Linear regression with a multi-level factor explanatory variable
Lunch 12:00 - 13:00

Day 4 : Friday, June 9th

Multiple linear regression for public health Linear regression with one continuous and one categorical explanatory variable
Linear regression including an interaction between one continuous and one categorical explanatory variable
Lunch 12:00 - 13:00


To participate in a workshop, you will need access to software as described below. In addition, you will need an up-to-date web browser.

We maintain a list of common issues that occur during installation as a reference for instructors that may be useful on the Configuration Problems and Solutions wiki page.

Please check the “Setup” page of the lesson site for instructions to follow to obtain the software and data you will need to follow the lesson. Also, look at the top of each lesson for prerequisites to follow the lessons.