Skip to content

msanter01/rstats-ed

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 

Repository files navigation

rstats-ed

Inspired by the Using Julia in the classroom page and suggestion by @Peter_Griffin's post on RStudio Community as well as the Learn the tidyverse page.

The goal is to create a repository where one can discover courses and learning materials for learning and teaching R.

This is a user curated list and is bound to be non-comprehensive at all times. If you have suggestions for courses to add, please submit a pull request or add an issue.

In person courses

University courses teaching R

These are semester/quarter long courses taught fully, or for the most part, in an in person setting.

2018

  • STA 112 - Better Living with Data Science - Duke University; Mine Çetinkaya-Rundel. Data Science course for first year undergraduates with little to no computing background. Combines techniques from statistics, math, computer science, and social sciences, to learn how to use data to understand natural phenomena, explore patterns, model outcomes, and make predictions. Data wrangling, exploratory data analysis, predictive modeling, data visualization, and effective communication of results. Discussions around reproducibility, data sharing, data privacy.

  • MAT 301 - Introduction to Probability and Statistics - City University of New York; Sebastian Hoyos-Torres. An Introduction to Probability and Statistics for undergraduates which focuses on understanding probabilistic distributions. Since calculus is not a prerequisite for this course, this course also shows users how to apply calculus conceptually to probabilistic distributions (when continuous distributions are discussed) and let R do a majority of the computations. Tidy principles are emphasized throughout the course but some base R is inevitably used.

  • MPA 630: Data Science for Public Management - Brigham Young University; Andrew Heiss. Data science and statistics class for Master of Public Administration (MPA) students with little math or computing experience. Uses ModernDive, DataCamp, R for Data Science, and OpenIntro Statistics to cover tidyverse data wrangling, inference, and hypothesis testing. All projects and in-class examples use data related to public affairs, administration, and policy.

  • MPA 635: Data Visualization - Brigham Young University; Andrew Heiss. Data visualization class for Master of Public Administration (MPA) students with some experience with R. Uses Alberto Cairo's The Truthful Art: Data, Charts, and Maps for Communication, Kieran Healy's Data Visualization: A Practical Introduction, Claus Wilke's Fundamentals of Data Visualization, and R for Data Science to cover principles of graphic design and fundamentals of visualizing data with ggplot2.

  • ESPM 288: Reproducible & Collaborative Data Science - UC Berkeley, Carl Boettiger. Data Science course for first year graduate students in both the natural and social sciences. A modular, flipped-classroom approach that combines reading, exercises and videos based on R for Data Science and DataCamp with more open-ended assignments to replicate, extend, and sometimes challenge key results from the scientific literature on global change. Note: an upper-division undergraduate version of the course is also being developed under the title Data Science for Global Change Ecology

  • STA 523: Statistical Computing - Duke University, Colin Rundel. Statistical programming with R and its interfaces with custom code development for central statistical models. Best practices and software development for reproducible results, selecting topics from: use of markup languages, understanding data structures, design of graphics, object oriented programming, vectorized code, scoping, documenting code, profiling and debugging, building modular code, and version control- all in contexts of specific applied statistical analyses.

  • DSCI 521: Computing Platforms for Data Science - University of British Columbia, Tiffany Timbers. How to install, maintain, and use the data scientific software "stack". The Unix operating system, integrated development environments (Jupyter and RStudio), and problem solving strategies.

  • CT5102: Programming for Data Analytics - NUI Galway, Jim Duggan: A module that is part of the M.Sc. Computer Science (Data Analytics). There are twelve topics, and these will be updated as the course progresses. The course structure has main three elements: (1) Base R (Vectors, Functions, Lists, Matrices and Data Frame), (2) Data Science, with the tidyverse packages in R (ggplot2, dplyr, readr, tidyr, lubridate and stringr), (3) Advanced R, including closures, object systems (S3, S4 and RC), and building packages.

  • Microsoft Research Data Science Summer School - Microsoft Research New York, Jake Hofman. An intensive, eight-week hands-on introduction to data science for college students in the New York City area focused on increasing diversity in computer science. Students learn Git, Bash, and R, focusing on concepts in statistics, modeling, and machine learning. All coursework is available on Github. Students produce an original group research project at the end of the program. Projects from the past several years are available here, along with corresponding data and code.

  • R Module - Data Visualization Diploma - Pontifical Catholic University of Chile; Pachá, Joshua Kunst. Tidy data principles for the non-expert. This course introduces the Tidyverse and covers how to import, tidy, transform, visualize, model and communicate data. The final goal of this module is to use ggplot2 as a tool to communicate and understand data.

  • SOC 4015/5050: Quantitative Analysis - Saint Louis University, Chris Prener. This course provides an introduction to applied statistical analysis for both undergraduate and graduate students with an emphasis placed on statistical techniques that are most common in the sociological literature. Students learn Git via GitHub desktop, R, and RStudio. A heavy emphasis is placed on literate programming.

  • SOC 4650/5650: Introduction to Geographic Information Science - Saint Louis University, Chris Prener. This class introduces both the theoretical and technical skills that constitute the field of Geographic Information Science (GISc). Techniques introduced include data cleaning and management, map production and cartography, and the manipulation of both tabular and spatial data. Students learn Git via GitHub desktop, R, and RStudio as well as ArcGIS. In the 2018 edition, Lectures 01-06 and 08-11 included R specific content.

2017

  • Modeling Social Data - Columbia University, Jake Hofman. One semester class for upper division undergraduate and first year graduate students that focuses on data-driven modeling for large-scale, social data. Material draws on statistics, computer science, and the social sciences. R is the primary language taught for the course, students gain experience collecting, cleaning, analyzing, and modeling with the tidyverse and related tools. All slides, code, and student-scribed notes are available on Github. Students complete a final project in small groups where work on an original research problem of their choice.

Workshops, short courses, modules

These are workshops, short courses, or modules taught fully, or for the most part, in an in person setting.

  • The Carpentries: Software Carpentry offers workshops are domain-agnostic, and teach the Unix shell, coding in R or Python, and version control using Git. Data Carpentry workshops are domain-specific, and focus on teaching skills for working with data effectively and reproducibly.

  • A Jupyter + R ( + mybinder.org) tutorial for social scientists - Goldsmiths, University of London, Caspar Addyman - A self taught or one session class taught to our Masters Level Advanced Methods class. Using an online notebook shared using MyBinder, it goes through basics of editing code and markdown in Jupyter notebooks, how to host these on MyBinder with fixed date snapshots for reproducibility. It provides very simple examples of loading local or remote data files, filtering and graphing with tidyverse and running simple statistical tests using ezANOVA.

  • Introduction to R for household survey - INDEC institute; Diego Kozlowski, Guido Weksler and Natsumi Shokida. An introductory course to R and the Households survey made by de National Institute of Statistics and Censuses from Argentina. The objective is to introduce R base, the Tidyverse, and Markdown, for beening able to reproduce the mains statistics of the labor market and poverty measures. Is oriented to people coming from a background of social science, who needs to wrangle with microdata from the survey. The Course is in spanish

  • Data Science Seminar - - Saint Louis University, led by Chris Prener and Christy Garcia. We offer a series of seminars each semester on using R for a variety of tasks, including conducting reproducible research, cleaning and plotting data, making maps, and fitting linear models. Our content is available on GitHub and a full list of our seminars is available on our website.

  • Reproducible analysis of bigger, naturally-occurring datasets using R, Rmarkdown, and the tidyverse - Michael C. Frank. A workshop originally presented at Data on the Mind 2017, this tutorial is an introduction to analyzing datasets using Tidyverse code (including readr, dplyr, tidyr, and shiny). Oriented towards researchers in psychology, cognitive science, or experimental science who may be interested in learning more about how R is a "powerful tool for statistical data analysis and reproducible research."

Online courses

MOOCs

Massive open online courses taught on platforms like Coursera, EdX, etc.

  • Statistics with R Coursera Specialization: Mine Çetinkaya-Rundel, Merlise Clyde, Colin Rundel, David Banks. 5 courses: Introduction to Data and Probability, Inferential Statistics, Linear Regression and Modeling, Bayesian Statistics, and Capstone.

  • JHU Data Science Coursera Specialization: This Specialization covers the concepts and tools you'll need throughout the entire data science pipeline, from asking the right kinds of questions to making inferences and publishing results. In the final Capstone Project, you’ll apply the skills learned by building a data product using real-world data.

  • Chromebook Data Science: Chromebook Data Science (CBDS) is a free, massive open online educational program offered through Leanpub to help anyone who can read, write, and use a computer to move into data science.

Tutorials

Built with learnr or a similar technology.

  • RStudio Cloud Primers: RStudio Cloud is a free, cloud based version of the RStudio IDE. Packaged within RStudio Cloud are primers, which are collections of interactive tutorials made with learnr. These primers teach the basics of R and the Tidyverse.

  • To R from Stata: An Introduction: The purpose of this tutorial is to provide a relatively light introduction to R for Stata users - straight to the point (mostly), and organized around the things that Stata users already know.

Other online courses

  • LinkedIn Learning offerings: LinkedIn Learning (previously lynda.com) offers beginner, intermediate, and advanced courses in R, as well as Python, SQL, Java, C, PHP, Javascript and other languages. All courses are offered as downloadable video content and are used in many universities.
  • DataCamp's R offerings: DataCamp offers interactive R (and Python, Sheets, SQL, and shell) courses on topics in data science, statistics and machine learning following a "learn by doing" philosophy. Courses run interactively in the browser.

About

List of courses teaching R

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published