Working with data

Overview

  • Originally presented: Day 2, June 2, 2026
  • Lead: Lead: Elizabeth Prom-Wormley
  • Topics: assumptions, QC, imputation, PLINK and R basics, phenotype distributions, and transformations

Additional Reference Reading

Lectures

This lecture series can be viewed as a YouTube playlist.

Quality control

Slides


Imputation

Slides


Plink 101

Slides

This series of lectures can be viewed as a YouTube playlist.

Some students really enjoy using Swirl to learn R https://swirlstats.com/students.html

Students are also welcome to walk through a 4-part introduction with videos and the accompanying scripts/data. This series of videos is appropriate for learners who haven’t had prior exposure to R (or limited exposure) and who want to prepare to successfully participate in hands-on-activities throughout the workshop. By the end of the videos, learners will be able to produce basic summary statistics with phenotypic data that would typically be conducted prior to GWAS.

R Basics: Downloading and installing R and RStudio (optional)

Practical files linked here are from when the lectures were originally recorded.

Installing R and RStudio is NOT necessary to complete the workshop exercises

Practical files


Finding, opening, and reviewings files in R

Practical files


Data management in R

Practical files


Graphics and basic statistics in R

Practical files

Practicals

The practical is through a Qualtrics worksheet.

Slides for the live session and practical.