Web scraping is a powerful way to access otherwise unavailable data, but it’s becoming more complex as websites deploy defenses like Captchas and anti-bot systems. At SWR Data Lab, we’ve tackled this across investigations ranging from Google price comparisons to healthcare platforms and social media scraping, each requiring a different approach.
In this session, we present a decision framework, drawn from these experiences, for selecting the right scraping strategy. Rather than promoting a single tool, we focus on choosing the right approach for your use case, weighing robustness, cost, and maintainability in a newsroom context. Using real examples, we walk through our workflow: analyzing sites with developer tools, then selecting between HTTP scraping, browser automation, and more advanced tools, along with best practices and guidance on when paid services are worth the cost.
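To give a flavor of the trade-offs, the core decision can be sketched as a tiny helper function. This is an illustrative sketch only: the input flags and strategy labels are our assumptions for this example, not a tool or taxonomy presented in the session.

```python
def choose_strategy(needs_js: bool, has_captcha: bool) -> str:
    """Illustrative sketch of a scraping-strategy decision.

    The rule of thumb: prefer the cheapest, most maintainable option
    (plain HTTP requests) and escalate only when the site forces you to.
    """
    if has_captcha:
        # Captchas and anti-bot systems usually defeat DIY approaches;
        # this is where paid scraping services may be worth the cost.
        return "paid scraping service"
    if needs_js:
        # Content rendered client-side needs a real browser,
        # e.g. driven by Playwright or Selenium.
        return "browser automation"
    # Static HTML or an open JSON API: plain HTTP scraping is
    # the most robust and maintainable choice.
    return "http scraping"


# Example: a static news listing with no defenses
print(choose_strategy(needs_js=False, has_captcha=False))  # → http scraping
```

In practice the decision involves more dimensions (request volume, login walls, rate limits), but the escalation order sketched here is the backbone of the workflow.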
To follow along, you should have some experience with scraping and, ideally, Python. Participants will leave able to extend their toolkit, make smarter choices in their scraping workflow, and handle real-world obstacles efficiently. No special tools are required to follow along.
Stephanie Jauss is a data reporter at the German public broadcaster SWR. She studied Computer Science and Media in Stuttgart as well as Investigative Journalism in Gothenburg.