Dynamic Web Scraping with RSelenium
This repository provides materials for a session that is part of the I2DS Tools for Data Science workshop run at the Hertie School, Berlin in November 2021. The student-run workshop is part of the course Introduction to Data Science taught by Simon Munzert at the Hertie School, Berlin, in Fall 2021.
This session will introduce you to the dynamic web scraper package RSelenium, including and overview on how to install it and how to operate it once installed.
The goals of this session are to (1) To show you the required steps to get RSelenium up and running on your machine, (2) explain why you would use RSelenium versus webscraping using an xpath into R, and (3) provide you with an example of how Rselenium works in real time.
Reed Garvin and Francesca Giacco prepared the practice material and as well as completed the back research on how to operate selenium.
Francesca Giacco prepared the presentation slides. She also ran the live demo.
Reed Garvin presented the slide recording as well as the setup for selenium.
- Reed Garvin @skier921
- Francesca Giacco @francescagiacco
Simon Munzert
- RSelenium overview on Github
- General information on Selenium as well as the driver download https://www.selenium.dev
- Docker for running your selenium server in a container on your computer
The material in this repository is made available under the MIT license.