Skip to content

intro-to-data-science-21-workshop/03-ReedGarvin-Dynamic_web_scraping-_with_RSelenium

Repository files navigation

Title

Dynamic Web Scraping with RSelenium

Summary

This repository provides materials for a session that is part of the I2DS Tools for Data Science workshop run at the Hertie School, Berlin in November 2021. The student-run workshop is part of the course Introduction to Data Science taught by Simon Munzert at the Hertie School, Berlin, in Fall 2021.

Session Content

This session will introduce you to the dynamic web scraper package RSelenium, including and overview on how to install it and how to operate it once installed.

Main Learning Objectives

The goals of this session are to (1) To show you the required steps to get RSelenium up and running on your machine, (2) explain why you would use RSelenium versus webscraping using an xpath into R, and (3) provide you with an example of how Rselenium works in real time.

Statement of Contributions

Reed Garvin and Francesca Giacco prepared the practice material and as well as completed the back research on how to operate selenium.

Francesca Giacco prepared the presentation slides. She also ran the live demo.

Reed Garvin presented the slide recording as well as the setup for selenium.

Authors

Instructors

Simon Munzert

Further Resources

  • RSelenium overview on Github
  • General information on Selenium as well as the driver download https://www.selenium.dev
  • Docker for running your selenium server in a container on your computer

License

The material in this repository is made available under the MIT license.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages