OT-ST-WS-02 | Data extraction from external online platforms using R
Prof. Dr. Kristina Klein
For everyone who is interested in analyzing (and learning from) data from external online sources such as Amazon, Instagram, or Twitter.
The course introduces the basics of extracting data from external online platforms. It discusses the general approaches as well as the legal requirements. We will work with R to program simple scrapers that systematically extract data from websites, and to access application programming interfaces (APIs) that facilitate data extraction.
In this course, students will learn how to acquire, store, and manage data from external sources (such as online platforms) for follow-up statistical analysis in their own research.
Some prior knowledge of R is beneficial.
- Own PC or laptop
- Internet, web browser (up-to-date)
- For the online format, a second screen might be beneficial
- Installation of the latest version of R and RStudio; participants will receive installation instructions prior to the workshop
Aydin, O. (2018). R Web Scraping Quick Start Guide: Techniques and tools to crawl and scrape data from websites. Packt Publishing Ltd.