Skip to content

marenrieker/lectures

 
 

Repository files navigation

Introduction to Data Science

Lectures | Summary | About | License

Lectures

  1. What is data science? [.html | .pdf | .Rmd]

  2. Version control and project management [.html | .pdf | .Rmd]

  3. R and the tidyverse [.html | .pdf | .Rmd]

Summary

This is a course taught by Simon Munzert at the Hertie School, Berlin.

Course contents

This course will introduce you to the modern data science workflow with R. In recent years, data analysis skills have become essential for those pursuing careers in policy advocacy and evaluation, business consulting and management, or academic research in the fields of education, health, and social science. We will cover topics like version control (Git) and project management; data collection, wrangling, storage, and visualization; model fitting and simulation; advanced workflow issues, debugging, automation; and data science ethics. The course is intended for students with some experience in working with R.

Main learning objectives

The goals are to (1) equip you with conceptual knowledge about the data science pipeline and coding workflow, data structures, and data wrangling, (2) enable you to apply this knowledge with statistical software, and (3) prepare you for our other methods electives and the master’s thesis.

License

The material in this repository is made available under the MIT license.

Many of the materials build on Grant McDermott's excellent course Data Science for Economists.

About

Introduction to Data Science

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 44.8%
  • HTML 43.8%
  • CSS 11.4%