Introduction to Data Science

Social Scientific Research Methods Using R

Dr Fatih Uenal

Last updated: 2020-08-25

This page is designed to provide you with the necessary resources and a step-by-step guide to learn the fundamentals of Data Science in R. It is mainly designed for a social scientist audience, meaning you are training as a psychologist, sociologist, political scientist, etc., and you want to augment your skills to run all your analyses in a powerful programming language while also expanding into more novel data science techniques such as Machine Learning, Web Scraping and more. The materials provided here are designed for people with no previous programming experience or familiarity with traditional social scientific methods.

The entire course is build on the textbook Introduction to Data Science by Prof Rafael A. Irizarry (Harvard University). The textbook has also been adapted for an online course on edX. Here, I provide an augmented version of these materials as an interactive learning course created with the learnr package ‘datsci’. The datsci package blends together the textbook materials with several complementary courses available on DataCamp, allowing you to practice and apply your skills on real-world datasets. Together, these resources form a step-by-step syllabus which will allow you to teach yourself R in an interactive and fun way! The materials used here are carefully selected and blended such that they complement and reinforce each other. The combination of online reading materials, video lectures, and practical courses and projects will allow for a smooth and efficient learning experience.

Most materials are free and open, including the primary textbook Introduction to Data Science. I will provide assessment materials to test your learning success as you work through the syllabus. For students taking my course at the University I will provide free access to the DataCamp materials by the beginning of the trimester. If you are not participating in my University course, you will need to purchase a subscription for access to DataCamp.

Copying/Adopting these materials

This website is written using R Markdown. The source code for the website is adapted from Dr. Matt Crump’s webpage here Link. For anyone interested in adopting these materials, the idea is you can fork the repo for the website. Then edit as desired for your purposes, as did I. I am grateful for the amazing materials provided by Prof. Rafael A. Irizarry (Harvard University) which are licenced under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International CC BY-NC-SA 4.0. More materials by Prof. Rafael A. Irizarry can be found here