Loading…
arrow_back View All Dates
Saturday, May 30
 

9:00am CEST

Coffee
Saturday May 30, 2026 9:00am - 9:30am CEST
Coffee is served in the Mediaforum (ground floor, lobby area) and on the third floor.
Saturday May 30, 2026 9:00am - 9:30am CEST
Mediaforum

9:30am CEST

Copernicus Atmosphere Monitoring Service data buffet: find stories in the data
Saturday May 30, 2026 9:30am - 10:45am CEST
Every human inhales 14 kg of air per day, and air pollution caused by human (e.g., road traffic, industrial process, agriculture) and natural (e.g., wildfires, dust storms) processes affects everybody on the planet, contributing to a growing list of health impacts. The composition of the atmosphere also plays a critical role in the climate system. A wide range of observational and model data are available for analysing the distribution and timing of air pollution episodes and their possible impacts. Yet navigating these datasets and understanding what they can reliably show is not always straightforward.

The Copernicus Atmosphere Monitoring Service (CAMS) provides a wide range of open datasets on emission sources, atmospheric composition, and air pollution covering past, present, and future conditions, that can support your air quality reporting.

In this session, you will be guided through the CAMS data landscape: where to find key datasets, how they are built, what they can (and cannot) tell you, and common pitfalls to avoid.

Speakers
avatar for Mark Parrington

Mark Parrington

Senior CAMS Scientist, Copernicus ECMWF
I´m a Senior Scientist in the Copernicus Atmosphere Monitoring Service (CAMS) Development Section at the European Centre for Medium-Range Weather Forecasts (ECMWF). I hold a DPhil in Atmospheric Physics from the University of Oxford and have more than 15 years’ experience of working... Read More →
Saturday May 30, 2026 9:30am - 10:45am CEST
3.09

9:30am CEST

How to use Earth Index, an AI tool for finding leads in satellite imagery
Saturday May 30, 2026 9:30am - 10:45am CEST
This is a hands-on workshop introducing the practical use of Earth Index, a tool that uses AI to search for user-defined patterns in satellite imagery across large areas, anywhere on the planet. It is a game-changer for environmental journalism, opening up major possibilities: from spotting hotspots of illegal mining, to quantifying industrial farming in a region, or mapping new roads pushing into forested areas.

During the session, we will do a guided walkthrough of the platform and work through step-by-step exercises to explore its core features: how to create a project, define an area of interest, generate positive and negative labels, run predictions, refine the results, and export findings. We will also share practical tips and best practices based on the Pulitzer Center’s methodology for successfully integrating Earth Index into an environmental investigation, including how to audit results, reduce false positives, and inform field reporting.

By the end of the workshop, participants will have a clear understanding of Earth Index’s strengths and limitations, and how to add it to their investigative toolbox.

IMPORTANT:

  1. You must register in advance for this session to gain access to Earth Index. Please use this form to register:  https://docs.google.com/forms/d/e/1FAIpQLSe0CXmCcA3SZyVfDMHO3l9suLK_RwCJUqz_IU-oymMRAFVSuQ/viewform
  2. Please bring a laptop with Google Earth Pro installed. You can download it here: https://maps.google.com/intl/es/earth/download/gep/agree.html

No previous experience or coding skills are required. Those who sign up will receive free access to the tool and will be able to keep it afterwards.
Speakers
avatar for Federico Acosta Rainis

Federico Acosta Rainis

Data Editor, Pulitzer Center
Federico Acosta Rainis is the data editor at the Pulitzer Center. Previously an IT consultant, in 2017 he joined La Nación in Argentina, where he contributed to award-winning investigations and carried out extensive on-the-ground coverage of the COVID-19 pandemic in Buenos Aires... Read More →
Saturday May 30, 2026 9:30am - 10:45am CEST
3.02

9:30am CEST

Gambling on the beautiful game: Following the money and suspect sponsorships in European football
Saturday May 30, 2026 9:30am - 10:45am CEST
Behind the polished facade of the multi-million euro football industry lies a darker reality. This presentation will take you inside two cross-border investigations into the opaque world of football sponsorships – tracking the unlicensed betting firms and crypto exchanges that now have a seat at the game’s top table.

The first exposed the influence and lucrative partnerships betting operators have with teams in more than 30 European leagues. The second uncovered the increasing foothold that crypto and investment companies have, revealing how unregulated sponsors infiltrated major sporting organisations.

We will explain how we obtained the underlying data and turned it into compelling stories connecting countries across the continent. The presentation will offer insights into the different tools and techniques we used – from trawling corporate accounts and financial registries, to exploiting Virtual Private Networks to assist with international research. We will discuss how these methods were used to build out investigative strands with international media partners and can be utilised for other cross-border projects.
Speakers
avatar for Chris Matthews

Chris Matthews

Investigations Editor, Investigate Europe
Chris Matthews is the investigations editor of Investigate Europe, a cross-border investigative journalism consortium. Previously, he worked on climate investigations for an environmental nonprofit, with a particular focus on commodity-driven deforestation and global supply chains... Read More →
avatar for Lorenzo Buzzoni

Lorenzo Buzzoni

Investigative Journalist and documentary filmmaker, Investigate Europe
Investigative journalist and documentary filmmaker covering environmental, financial and social issues, based in Italy and member of Investigate Europe, a cross-border cooperative of journalists. His work has been published by European outlets including The Guardian, Die Zeit and... Read More →
Saturday May 30, 2026 9:30am - 10:45am CEST
Z1.15 - Aula Donche

9:30am CEST

How to investigate AI border surveillance in Europe
Saturday May 30, 2026 9:30am - 10:45am CEST
The session will reflect on Swiss newsroom WAV Collective's year-long collaborative investigative projects – Invisible Walls and Big Business at the Borders – looking at automated border systems across Europe and asking who benefits from the push in AI in migration and border control.

Their reporting combined on-the-ground investigation in Greece with policy research in Brussels and institutional analysis in Switzerland and the U.K. They will give a step-by-step explanation on how they traced flows of money, data and accountability across borders and institutions.

What they'll cover in this talk:
- how to understand the systems, tools and implications of AI (even if you’re not a techie!).
- how to distinguish between proposed capabilities, funded projects, and implemented systems when it comes to reporting on tech.
- how to understand the functions of such systems, instead of relying on labels.
- And how to dissect the language that is often used to obfuscate real-world implications of tech.

Participants will learn how cross-border, collaborative teams can tackle investigations of complex, opaque systems that operate in regulatory grey areas and are often deliberately fragmented. Insights from this session will not only be valuable to colleagues working on similar topics, but useful to anyone investigating fragmented yet connected systems across different countries.
Speakers
avatar for Marguerite Meyer

Marguerite Meyer

Journalist, Freelance
I‘m an experienced investigative journalist from Switzerland, currently based in Zurich & Barcelona. Got a murky letterbox address in Switzerland? Happy to help! —
Things I do: International stories with a Swiss twist or vice versa // AI & tech, defense & security, organised crime, migration // moderate panels & host events // journalist by day, slam poet by night// background in History, Political Science and Media Studies // Balkans & Mena... Read More →
Saturday May 30, 2026 9:30am - 10:45am CEST
Z0.10

9:30am CEST

How the hell did they make that?: Creating a collaborative inventory of data viz tools
Saturday May 30, 2026 9:30am - 10:45am CEST
Have you ever come across a striking data visual and wondered: "How the hell did they make that?" Your follow-up question may be: "Can I achieve that too with the resources I have?" In this session, we start with a deep dive into the (technical) production of two data visuals. Then, for the third visual, it's your turn to figure out how it might have been made. Finally, we launch our online inventory and subjective guide of data visualization tools - a work in progress you can contribute to!

Heads up: we won't teach any tool in depth, but we'll highlight a range of options so you can find an approach or tool that best fits your needs and constraints (e.g., limited budget, small team, learning curve…).

Speakers
avatar for Toon Vos

Toon Vos

Freelancer
Toon Vos is a multimedia journalist and educator focused on data visualization and creative storytelling
avatar for Eleanor Denneman

Eleanor Denneman

Data journalist & designer, BRUZZ
Vers in de pers
Saturday May 30, 2026 9:30am - 10:45am CEST
1.04

9:30am CEST

AI-Assisted OSINT: Automating the investigative workflow
Saturday May 30, 2026 9:30am - 10:45am CEST
Most investigative workflows still rely on manually juggling dozens of tools. In this session, we'll walk through a live demo of a semi-automated pipeline built for real casework: web search and archiving with Playwright, face extraction, reverse image search, database cross-referencing with Telegram bots, social media analysis, and structured reporting via Obsidian mcp. All of this is orchestrated by Claude, an AI layer you can teach your own investigative methodology. At the end, participants will work through a simplified case using a workflow of their own.

Before the session, please install: Python, Claude Code. This session will teach participants to combine several smaller OSINT tools so they work together efficiently without requiring much manual effort. No special tools needed
Speakers
avatar for Anastasiia Morozova

Anastasiia Morozova

Data and investigative journalist, Onet.pl/Ringier Axel Springer
I’m a data and investigative journalist with a background in tracking Russian influence, desinformation operations and sanctions evasion in Europe. I’m especially interested in projects where I can combine data analysis and visual storytelling to expose hidden networks or financial... Read More →
avatar for Leopold Salzenstein

Leopold Salzenstein

Data coordinator, Arena for Journalism in Europe
Leopold Salzenstein is a freelance investigative data journalist and trainer based in the south of France. At Arena, he coordinates the handling of data for publications and trainings. He is also a member of the collective of journalists Environmental Investigative Forum (EIF).

... Read More →
Saturday May 30, 2026 9:30am - 10:45am CEST
3.05

9:30am CEST

Scraping with a browser emulator
Saturday May 30, 2026 9:30am - 10:45am CEST
You need to harvest data from a Web site. But there's no download button. It's time to scrape! There are many options, but one of the most consistently effective is launching an automated browser. You tell the browser where to go and what to click, and when to ingest the content. To follow along, participants should have some knowledge of coding in any language.

Participants will come away from this class knowing the basics of Web scraping with a browser emulator. To follow along, participants should have R Studio installed https://posit.co/download/rstudio-desktop/, create a new project, download this file selenium-server-standalone-3.5.3.jar into the project directory, and have the appropriate Chrome binary downloaded into the directory https://googlechromelabs.github.io/chrome-for-testing/last-known-good-versions-with-downloads.json
Speakers
avatar for Robert Gebeloff

Robert Gebeloff

Reporter, New York Times
Robert Gebeloff has worked as a data projects reporter for The New York Times since 2008 and has taught data journalism for many years in newsrooms and at conferences. He was co-winner of the George Polk Award in 2015 and was a Pulitzer Prize finalist in both 2015 and 2016 for projects... Read More →
avatar for Simon Wörpel

Simon Wörpel

Director of Technology, Data and Research Center – DARC

Saturday May 30, 2026 9:30am - 10:45am CEST
3.04

9:30am CEST

Showcase your work online: Build a portfolio with GitHub pages
Saturday May 30, 2026 9:30am - 10:45am CEST
Many journalists have published investigations, data stories, and visualizations across different outlets, but no single place to display all their work. In this hands-on session, participants will build a simple, professional portfolio website using GitHub Pages, creating a central hub where their work can live together, even without previous web development experience. By writing and modifying small pieces of HTML and CSS together, participants will see how a simple page can gradually become a polished portfolio.

To attend this session, no prior coding knowledge is required. Familiarity with GitHub or basic HTML is helpful but not necessary. After attending this session, participants will have a live portfolio website that they can continue improving and use immediately for job applications, pitching stories, or showcasing investigative work.

Participants should create a free GitHub account before the session: https://github.com/join. It is also useful to install Visual Studio Code: https://code.visualstudio.com/
Speakers
avatar for Ioanna Petsiou

Ioanna Petsiou

Data Journalist, Freelancer
Ioanna Petsiou is an investigative data journalist working across data analysis, satellite imagery, and mapping to uncover and explain complex stories. She is particularly drawn to environmental reporting and to building clear, reproducible ways of working with data that others can... Read More →
avatar for Alina Yanchur

Alina Yanchur

Data and Investigative Journalist
Investigative and data journalist with a focus on transnational corruption, sanctions evasion, and OSINT methods. Trainer and mentor for journalists working under repressive regimes. Strong background in collaborative and data-driven journalism across Belarus, Europe, and exile c... Read More →
Saturday May 30, 2026 9:30am - 10:45am CEST
Z0.15

9:30am CEST

Reclaiming the (old) public square: from data and digital to journalism with and for our communities 
Saturday May 30, 2026 9:30am - 10:45am CEST
Independent newsrooms operate in a landscape full of contradictions. Digital journalism has the potential to reach large audiences, yet people today are overwhelmed by a constant stream of online content – much of it shaped by opaque corporate algorithms and polluted with disinformation. At the same time, funders increasingly ask newsrooms to demonstrate measurable audience reach, impact, and engagement.

Against this backdrop, in-person events offer a powerful opportunity. By reclaiming the spirit of the public square, live gatherings allow newsrooms to meet their audiences face-to-face, build community, and create deeper, more meaningful reporting and engagement. Yet organising such activities and integrating them into editorial work can also be time-consuming and resource-intensive.

In this session, speakers will share practical ways independent newsrooms are combining audience data with in-person events to better listen to the public and co-produce journalism with their communities. What tools and spaces can make this possible? What are the advantages – and limitations – of small, intimate gatherings vs large public events? What works, and what doesn't? Join us to explore practical strategies for using in-person engagement to strengthen journalism, deepen audience relationships, and support the long-term sustainability of independent newsrooms.

Moderators
avatar for Jose Miguel Calatayud

Jose Miguel Calatayud

Freelance journalist and writer
I am a freelance journalist and writer based in Valencia, in Spain, focusing on feature writing and investigative journalism, mainly about Europe. As of March 2026, I am doing preliminary research and planning collaborative investigations into corporate influence, climate adaptation... Read More →
Speakers
avatar for Nuno Viegas

Nuno Viegas

Journalist, fundraising co-lead, co-manager, Fumaça
Investigative journalist, focused on long-form audio narratives. I work mostly with FOI and state archives, to research policing, prisons, and the courts. At Fumaça, I share fundraising responsability for our non-profit, membership backed experiment on non-hierarchical newsroom... Read More →
avatar for Lydia Emmanouilidou

Lydia Emmanouilidou

Investigative Journalist & Fundraising Strategist, Greece, Solomon
Lydia Emmanouilidou is an investigative journalist and audio producer based in Athens, Greece. Her work focuses on migration, surveillance, and the environment, and has been published by outlets including the New York Times, NPR, BBC, and Al Jazeera. She currently works with the Greek... Read More →
avatar for Pierluigi Bizzini

Pierluigi Bizzini

Journalist, FADA Collective
Pierluigi Bizzini is a freelance journalist based in Sicily. He is member of FADA Collective.

His research focuses on migration, displacement and algorithmic surveillance within the Mediterranean area. He covers environmental issues in Sicily, focusing on water issues.

He is a former fellow reporter for Algorithmwatch. His stories have been published by Die Zeit, Al Jazeera, WOZ, EUobserver, and others. He is among the nominees at the European Press Prize 2024. He works as a curator at The Syllabus... Read More →
Saturday May 30, 2026 9:30am - 10:45am CEST
2.03

9:30am CEST

How we uncovered an international rapist network through undercover online research
Saturday May 30, 2026 9:30am - 10:45am CEST
In our digitally connected world, perpetrators feel so safe online that they build their networks openly: on public-access porn websites and messenger apps such as Telegram. Our investigation for STRG_F/NDR in Germany uncovered an international rape network that had formed on porn websites and in dozens of private chat groups on Telegram. One group had more than 70,000 members.

Users exchange detailed information on how to drug and rape women who are already close to them, such as their wives, girlfriends, sisters, or mothers, without them noticing. The rapists share videos and photos of the assaults online. One man from Germany drugged and raped his wife for more than 15 years and generated millions of views with the footage. Our investigation triggered a police investigation, and he was stopped.

We'll share how we began investigating these networks, how we gained access to them, how we investigated the users for years by also going undercover, what tools we used, what ethical and moral challenges we have faced, and how we kept an overview of all the footage we documented and saved over the years.

We will address:
- How do you conduct online research in criminal networks? (Structures, dynamics, and mechanisms)
- How do you gain access to them?  (User's communication and behaviour) 
- What opportunities and limitations did we encounter? (Laws and journalists' rights, ethical and moral responsibilities)
- When should you consider undercover research, and what can it look like? (Journalistic standards and guidelines)
- How do you protect your own mental health when confronted with disturbing content?

Saturday May 30, 2026 9:30am - 10:45am CEST
Z1.13 - Aula Hanswijk

9:30am CEST

Investigating arms producers
Saturday May 30, 2026 9:30am - 10:45am CEST
Following Russia's invasion of Ukraine, many European countries have increased their defence spending and awarded large new contracts to arms companies. Beneficiaries include US corporations such as Lockheed, as well as European firms such as BAE Systems, Airbus, Rheinmetall, or Leonardo. At the same time, these companies are making money by exporting weapons to authoritarian regimes, and their weapons could end up on the battlefields of Libya or Yemen. Some of these companies have continued to support Israel in its destruction of Gaza by selling weapons to the country.
How can such companies be investigated? Whistleblowers can occasionally help journalists to expose dubious deals and hidden manoeuvres. But arms deals can also be investigated without the help of an insider. Videos showing military vehicles can be geolocated. Satellite pictures reveal the locations of naval vessels. Tracking websites allow you to follow the routes of warships and aeroplanes. Company employees reveal the military projects they are working on on LinkedIn.

This presentation will demonstrate how open-source methods can be used to investigate the arms industry. Drawing on specific examples from recent investigations, it will show how these methods can be employed to shed light on the activities of companies such as a European missile-maker that supplied Israel with bombs used in Gaza, German engine manufacturers whose products are being used in Russian and Chinese warships, Turkey's breaches of the arms embargo in Libya, and the role of Airbus, as well as French-made warships in the war zone around Yemen. The presentation will also highlight the activities of a German arms giant Rheinmetall,, which sent personnel through an embargoed port in Eritrea to assist with the repair of Emirati naval guns in the Red Sea.

Speakers
avatar for Hans-Martin Tillack

Hans-Martin Tillack

Investigative Reporter
Hans-Martin Tillack is an investigative reporter based in Berlin. Until 2025, he was a senior reporter on the investigative team at Welt and Welt am Sonntag. Prior to this, he led investigations at the Berlin office of Stern magazine. From 1999 to 2004, he was Stern's EU correspondent... Read More →
Saturday May 30, 2026 9:30am - 10:45am CEST
1.16

9:30am CEST

R and AI - so many options. What works for you?
Saturday May 30, 2026 9:30am - 10:45am CEST
Over the last 12 months or so, ways of getting LLMs to help you write R code have multiplied. The Ellmer package is one solution. Then Posit tested their Databot, and later the Posit Assistant. There's also GitHub CoPilot. So what have you tried? What has worked? What have you learned? Are you happy using AI in R code? Are you still waiting to take the plunge? Come to this roundtable session to learn from each other's experiences in this rapidly developing field.
Speakers
avatar for Jonathan Stoneman

Jonathan Stoneman

Arena for Journalism in Europe
Former BBC journalist, turned datajournalist, trainer, consultant. Works with Arena as Lead Trainer, Arena Academy. 
avatar for Carmen Aguilar Garcia

Carmen Aguilar Garcia

Data Journalist, The Guardian
Data journalist at The Guardian Data Project team. I work on a variety of subjects - always finding the data angle in every story. Scraping, cleaning, data analysis, but above all JOURNALISM!
Saturday May 30, 2026 9:30am - 10:45am CEST
Z2.01 - Mediadrôme

9:30am CEST

Handling leaks and source protection in a changing technological environment
Saturday May 30, 2026 9:30am - 10:45am CEST
Leaks remain a  powerful source of investigations, but they are becoming harder to handle, authenticate, and secure in an era of mass leaks, AI-generated forgeries, surveillance, and legal pressure on newsrooms.

This hands-on session offers a practical, step-by-step methodology for journalists working with data leaks today, from first contact with a source to post-publication decisions. Based on real investigative cases and newsroom practices, the session focuses on decision-making, security, verification, and workflow, rather than theory.

The session walks participants through the full lifecycle of a data leak, using concrete examples and tools:

- How to evaluate a leak: source motivations, credibility, red flags, and public-interest tests
- How to decide whether to investigate (or walk away) without wasting newsroom resources
- How to verify authenticity in an era of AI-generated documents and manipulated leaks
- How to protect sources, data, and journalists throughout the investigation
- How to make smart editorial, ethical, and legal decisions under pressure
- What to do with data after publication (retain, restrict, destroy, or share)
Speakers
avatar for Sandrine Rigaud

Sandrine Rigaud

Program Director, GIJN
Sandrine Rigaud is the Program Director of the Global Investigative Journalism Network. She is an investigative journalist, director and Emmy-winning producer who served as editor-in-chief of Forbidden Stories from 2019 to 2024. In that position, she led international collaborations... Read More →
Saturday May 30, 2026 9:30am - 10:45am CEST
2.02

10:45am CEST

Coffee break
Saturday May 30, 2026 10:45am - 11:15am CEST
Coffee is served in the Mediaforum (ground floor, lobby area) and on the third floor.

Saturday May 30, 2026 10:45am - 11:15am CEST
Mediaforum

11:15am CEST

50 shades of the housing crisis – in data
Saturday May 30, 2026 11:15am - 12:30pm CEST
For more than eight years, we have been covering the housing crisis – locally and in a European-wide comparison. We investigated big corporate landlords, calculated rent affordability for key workers, explained how real estate speculation works, and investigated the complicated relationship between the climate crisis, energy costs, and housing unaffordability. We collected numerous datasets, interviewed scientists, politicians, and investors, and found ways to translate the data and abstract financial schemes into engaging and compelling stories for readers.

Now we want to share this knowledge with other journalists.

Come to our hands-on session where we will show you the most interesting datasets and modes of visualisations you can use for your own local reporting. We will explain different data-driven ways and methodologies for approaching the housing issue, which can be replicated in any European city, discuss different themes within the housing topic and what pitfalls to avoid, as well as what issues the current reporting is lacking and could be dug deeper.
Speakers
avatar for Gaby Khazalova

Gaby Khazalova

Editorial coordinator, Urban Journalism Network
Gaby Khazalová coordinates collaborative investigations within the Urban Journalism Network, a network of local journalists, media outlets and data analysts, that is dedicated to researching common challenges faced by European cities. She publishes her local investigations with the... Read More →
Saturday May 30, 2026 11:15am - 12:30pm CEST
1.04

11:15am CEST

Cancer Calculus: Investigating big pharma's dirty tricks
Saturday May 30, 2026 11:15am - 12:30pm CEST
Big pharma is responsible for inventing and manufacturing hundreds of life-saving innovative drugs, but often those medications come with a hefty price tag. The Cancer Calculus is a global cross-border investigation into how Keytruda, a life-saving cancer drug, is unaffordable for millions across the world and how pharma giant Merck protects their billions in revenue from it.

This session will examine the different tools pharma companies use to protect their revenues, from creating an impenetrable fortress of patents around their drug, to promoting higher doses that boost revenues and wild variation in prices worldwide.

The speakers will walk you through how they approached this complex investigation, the difficulties in unraveling the pricing knots, and what they learned along the way.
Speakers
avatar for Hala Nasreddine

Hala Nasreddine

Investigative journalist, Daraj Media
Hala Nasreddine is an award-winning Lebanese investigative journalist and the Head of the Investigative Unit at Daraj Media. She has contributed to several cross-border investigative projects, including Burning Skies, Fueling Ecocide, Cyprus Confidential, Pegasus Project, Pandora... Read More →
Saturday May 30, 2026 11:15am - 12:30pm CEST
3.09

11:15am CEST

Designing stories on maps: A practical guide to map-driven scrollytelling
Saturday May 30, 2026 11:15am - 12:30pm CEST
Description to follow
Saturday May 30, 2026 11:15am - 12:30pm CEST
2.03

11:15am CEST

Choosing the right web scraping strategy
Saturday May 30, 2026 11:15am - 12:30pm CEST
Web scraping is a powerful way to access otherwise unavailable data, but it’s becoming more complex as websites deploy defenses like Captchas and anti-bot systems. At SWR Data Lab, we’ve tackled this across investigations ranging from Google price comparisons to healthcare platforms and social media scraping, each requiring a different approach. In this session, we share a practical decision framework for choosing the right scraping strategy based on robustness, cost, and maintainability.

In this session, we will present a decision framework for selecting the right scraping strategy based on our learnings. Rather than promoting a single tool, we want to focus on choosing the right approach for your use case, considering robustness, cost, and maintainability in a newsroom context. Using real examples, we walk through our workflow: from analyzing sites with dev tools to selecting between HTTP scraping, browser automation, and advanced tools—along with best practices and when paid services are worth it.

To follow along, you should have some experience in scraping and, ideally, Python. The participants will be able to extend their toolkit, make smarter choices in their scraping workflow, and handle real-world obstacles efficiently. No special tools are required to follow along
Speakers
avatar for Stephanie Jauss

Stephanie Jauss

SWR Data Lab
Stephanie Jauss is a data reporter at the German public broadcaster SWR. She studied Computer Science and Media in Stuttgart as well as Investigative Journalism in Gothenburg.
VS

Verena Steinacher

Data Engineer, pub.tech
Saturday May 30, 2026 11:15am - 12:30pm CEST
3.04

11:15am CEST

Embracing agents with Pydantic AI
Saturday May 30, 2026 11:15am - 12:30pm CEST
"Agentic AI" is all the rage, but what does it offer beyond traditional LLM workflows? In this hands-on session we'll answer this question (and more) while leveraging Python's Pydantic AI library to build a start-to-finish agentic AI workflow.

Participants will learn how agents work, when they're useful, how to build custom tools, and options for tracing and evaluation. You'll leave able to write agentic workflows to extract information from texts, do semi-autonomous research, and deliver clean, structured results.

Basic experience with Python/LLMs is helpful but not required. After attending this session, participants will be able to understand when and how to apply agentic approaches to problems. Participants should have Python/Jupyter installed or a Google account for working in the cloud.
Speakers
avatar for Jonathan Soma

Jonathan Soma

Knight Chair in Data Journalism, Columbia University
Jonathan Soma is the Knight Chair in Data Journalism at Columbia University, where he serves as Director of the Data Journalism MS program and the Lede Program, an intensive data journalism summer course. His lectures cover everything from basic Python and data analysis to interactive... Read More →
avatar for Jan van der Burgt

Jan van der Burgt

Investigative coder / AI specialist, Freelance / Open State Foundation
I leverage AI technologies to collect and analyse data at scale, uncovering the hidden patterns that build stories.

Investigative focus: lobbying, government overreach, migration, global food supply chains.
Saturday May 30, 2026 11:15am - 12:30pm CEST
3.05

11:15am CEST

From data projects to pipelines
Saturday May 30, 2026 11:15am - 12:30pm CEST
Data journalism projects often rely on manually executed scripts, spreadsheet updates, or code running on private computers. As investigations become more complex, span longer timeframes, or require regular updates, these methods become inefficient and unsustainable. Automated data pipelines offer a solution to these challenges.

This workshop provides an introduction to Apache Airflow, an open-source platform for automating and managing workflows. The session demonstrates how Airflow can be utilized to efficiently automate data journalism processes—from scraping to creating and updating visualizations. Participants should have basic programming skills.

After attending this session, the participants will know why and when to use automated pipelines and understand the basics of Airflow.
Speakers
avatar for Natalie Widmann

Natalie Widmann

Data Journalist, SWR Data Lab
I'm a Data Journalist supporting journalist and human rights activists with data, tools and automation.
I'm happy to talk about scraping data, extracting the most relevant information from it, understanding algorithms and using them for investigations.
avatar for Max Harlow

Max Harlow

Bloomberg News
Max Harlow is a data reporter at Bloomberg News. He also runs Journocoders, a community group for journalists to develop technical skills for use in their reporting.
Saturday May 30, 2026 11:15am - 12:30pm CEST
Z0.15

11:15am CEST

Lean, not broken: How independent newsrooms adapt to uncertain funding
Saturday May 30, 2026 11:15am - 12:30pm CEST
Independent newsrooms across Europe are facing a tough reality: grant funding is becoming more competitive, short-term, and unpredictable. This panel will explore the challenges small newsrooms encounter when forced to downsize, and how they can restructure teams, workflows, and output to continue producing quality journalism under financial pressure. We’ll also examine how one can try to secure new revenue streams - and how to pursue them without compromising editorial independence-and about the strain that sudden influxes of one-off projects and grants can place on already overstretched teams. Join us to discuss these challenges, share experiences, and learn practical, transferable strategies.

Moderators Speakers
avatar for Catarina Carvalho

Catarina Carvalho

Catarina Carvalho is the editor of Mensagem de Lisboa, a community media in the Portuguese city of Lisbon. She founded Mensagem after several years in the traditional press namely as executive editor of the historic Diário de Notícias. She pivoted toward a more community-centric... Read More →
avatar for Elisabetta Tola

Elisabetta Tola

Founder and editor-in-chief, Facta
Elisabetta is a science, data, and investigative journalist.

She is the co-founder and editor-in-chief of Facta.eu, an Italian independent media outlet that applies the scientific method to journalism and promotes science journalism as a cornerstone of democratic participation.

She... Read More →
Saturday May 30, 2026 11:15am - 12:30pm CEST
3.13

11:15am CEST

How to investigate China 101
Saturday May 30, 2026 11:15am - 12:30pm CEST
The Great Firewall, censorship, and language barriers make open source research on China really difficult, if not impossible. In this session, we aim not only to make you aware of obstacles you may face, but to show you solutions and strategies, especially for journalists and researchers working from outside China who don't speak the language.

We’ll show you the research resources, toolbox and we'll share tips on:

1) how to navigate Chinese social media: WeChat, Douyin, Rednote and others,
2) how to reconcile information inconsistencies in English and Chinese,
3) how to read the ownership of Chinese companies: private, state owned and central government enterprises,
4) mobile device setup and translation applications.

Speakers
YZ

Yan ZH

East Asia Researcher, OCCRP
Saturday May 30, 2026 11:15am - 12:30pm CEST
Z1.13 - Aula Hanswijk

11:15am CEST

How to spot a bot: Practical techniques and investigation ideas
Saturday May 30, 2026 11:15am - 12:30pm CEST
Do you know what the telling signs of "non-authentic behaviour" - bots - are on social media platforms? This practical, hands-on presentation is for journalists interested in investigating bots across platforms - from Telegram and YouTube to TikTok. We'll use data research on bot networks and coordinated activity to learn how to identify bots, and discuss practical stories one can develop based on the data found. We will also give practical examples stemming from our research in Ukraine and Moldova, and present some insights into bot campaigns ahead of the June 2026 election in Armenia.

A key part of the session will focus on how things are changing with the rise of generative AI and the early shift toward more agent-like systems: what this means for bots, coordination, and the limits of traditional detection methods. By the end of the workshop, participants will have a clearer sense of how large-scale CIB operations can look in practice, and how to start identifying and documenting them across different platforms. The goal is to leave the session with concrete story ideas, investigative ideas, and practical methodological knowledge.

No programming skills are required to take part in the workshop, but prior experience with collecting and analyzing data from social media platforms will help participants get the most out of the session.

Speakers
avatar for Yuliia Dukach

Yuliia Dukach

Head of Disinformation Investigation, OpenMinds
I'm a researcher and data journalist specializing in disinformation, computational propaganda, and online influence operations. Over the past seven years, my work has combined investigative methods, data analysis, and machine learning to map how propaganda networks operate — from... Read More →
Saturday May 30, 2026 11:15am - 12:30pm CEST
3.02

11:15am CEST

Look what's flying there
Saturday May 30, 2026 11:15am - 12:30pm CEST
Everyone knows that you can track planes online. But how exactly do you use plane tracking tools, and how can they help you with investigative research in particular? That's what this workshop is all about. It's aimed at journalists who have only ever tracked their vacation flights, but it also offers tips and tricks for more experienced colleagues. (It can get a little nerdy at times, but hopefully never boring.)

This session will tackle questions such as:
-How do you use flight tracking sites for research?
-Can you really track all planes? And what do you do if they don't want a flight to be found?
-What are the advantages and disadvantages of different platforms?
-How does flight data help to verify other information from your research?
-What are the biggest stumbling blocks, and what do you need to watch out for?
-How do you access older data?
-And why are plane spotters your best friends...

The tools of the trade are explained with practical examples from real research cases. These include Russian sabotage and hijackings, kings on their travels, and smuggled wild animals.
Speakers
avatar for Sebastian Erb

Sebastian Erb

Reporter, Süddeutsche Zeitung
Sebastian Erb ist Redakteur im Investigativ-Ressort der Süddeutschen Zeitung in Berlin. Zuvor arbeitete er bei der taz. Er beschäftigt sich v.a. mit Themen der Inneren Sicherheit, insbesondere Rechtsextremismus und Spionage, aber auch mit MeToo und zuletzt dem internationalen Wildtierhandel... Read More →
Saturday May 30, 2026 11:15am - 12:30pm CEST
1.16

11:15am CEST

The “perfect” MoU: an invitation
Saturday May 30, 2026 11:15am - 12:30pm CEST
Coordinators of cross-border investigations are welcome to join a session with Coordinators Without Borders, a monthly meet-up for coordinators of cross-border investigations. This will be a social gathering for members of the group and those interested in joining. All are invited to revisit the group’s collaborative attempt to craft the “perfect” memorandum of understanding. This document is an essential part of any cross-border investigation, marking an agreement between journalists over the many elements of the collaboration, including the responsibilities of the coordinator.
Speakers
avatar for Hazel Sheffield

Hazel Sheffield

Coordinator, Arena for Journalism

avatar for Stéphane Horel

Stéphane Horel

Investigative Journalist, Le Monde
Stéphane Horel is an award-winning investigative journalist at Le Monde. Author of several documentaries and books, she specializes in corporate harm, toxic industries and scientific disinformation. She coordinated the “Forever Lobbying Project” (2025) and "Forever Pollution... Read More →
avatar for Jose Miguel Calatayud

Jose Miguel Calatayud

Freelance journalist and writer
I am a freelance journalist and writer based in Valencia, in Spain, focusing on feature writing and investigative journalism, mainly about Europe. As of March 2026, I am doing preliminary research and planning collaborative investigations into corporate influence, climate adaptation... Read More →
Saturday May 30, 2026 11:15am - 12:30pm CEST
Z2.01 - Mediadrôme

11:15am CEST

Behind closed doors: Exposing human rights violations in prisons and psychiatric facilities
Saturday May 30, 2026 11:15am - 12:30pm CEST
Human rights violations in prisons, psychiatric hospitals, and correctional facilities are particularly difficult for journalists to uncover: those affected are isolated, authorities are opaque, and data is often fragmented or non-existent.

How can we investigate places that hardly anyone has access to? What methods can we use to get a glimpse behind the walls or fences of those facilities? How do we obtain data? What evidence can support our investigation?

In this presentation we will explore ways to obtain information, access those closed institutions, strengthen the investigation, and develop strategic approaches.

The session will cover:
- Strategies for gaining physical access as a journalist to prisons and detention facilities
- Using Freedom of Information requests to obtain documents and datasets
- Working with fragmentary or missing data and still finding a story in there
- Building, protecting, and maintaining sources in highly sensitive environments

The presentation is aimed at investigative reporters, data- and document-driven journalists, and FOI enthusiasts who want to strengthen their ability to report on closed institutions and other hard-to-reach environments. Participants will leave with concrete tools, workflows, and ideas they can apply to their own investigations.
Speakers
avatar for Sabrina Winter

Sabrina Winter

investigative reporter
Sabrina works as an investigative journalist. Her reporting focuses primarily on prisons, mental health, law enforcement, and politics. She studied international criminology, sociology, and political science in Dresden and Hamburg. She then completed her journalism training at He... Read More →
Saturday May 30, 2026 11:15am - 12:30pm CEST
Z1.15 - Aula Donche

11:15am CEST

Risky research - protect yourself in high-risk investigations
Saturday May 30, 2026 11:15am - 12:30pm CEST
If you have been through the Digital Hygiene session at Dataharvest, or are otherwise already aware of the baseline of digital security, you are now ready to advance to the next level.

In this session, we will dive deep into threat modelling as well as the tools and tactics to know about if your investigations involve a high degree of risk to your sources, your collaborators, or yourself.

The goal of the session is that you are able to make informed choices by assessing digital risks specific to an investigation and selecting the appropriate systems and strategies to manage them consistently.

Saturday May 30, 2026 11:15am - 12:30pm CEST
2.02

12:30pm CEST

Lunch
Saturday May 30, 2026 12:30pm - 1:45pm CEST
Lunch is served in Mediaforum on the ground floor (lobby area), and on the first floor (Z1.06 and Z1.07). Vegan food and special meals (gluten-free, dairy-free, and all allergies and special dietary requirements) are available on the first floor only.


Saturday May 30, 2026 12:30pm - 1:45pm CEST
Mediaforum

12:35pm CEST

Budgeting café
Saturday May 30, 2026 12:35pm - 1:35pm CEST
Do you want to learn how to translate your ideas into budgets, explore different financial scenarios, draft budgets for grant applications, and improve your financial reporting? Join the budgeting cafe to ask anything you’ve always wanted to know about journalism finance.

Please note: you need to book a time slot in advance. You can do it HERE.
Speakers
avatar for Cora Moyano

Cora Moyano

Finance Manager, Arena for Journalism in Europe
Happy to share and learn about finance, business models, fundraising, processes, systems, and transparency in nonprofits. It doesn't need to be boring or terribly serious ;)

Saturday May 30, 2026 12:35pm - 1:35pm CEST
2.02

12:35pm CEST

Cross-border café
Saturday May 30, 2026 12:35pm - 1:35pm CEST
This cafe/clinic is intended as a safe space for individuals or teams involved in a cross-border investigation where they can receive custom support with specific challenges. Potential topics include pitching stories to editors cold-door, finding colleagues to pursue a story, managing and resolving team conflicts, organising workflows, beginning cross-border projects from scratch, and tips for progressing when you find yourself stuck in an investigation.

Please note: you need to book a time slot in advance! You can do it HERE.
Saturday May 30, 2026 12:35pm - 1:35pm CEST
2.03

12:35pm CEST

Data helpdesk
Saturday May 30, 2026 12:35pm - 1:35pm CEST
Stuck on something technical? Come talk it through.

Maybe you've got a folder of 800 PDFs and no good way in. Maybe a source sent you a database and you don't know where to start. Maybe your scraper broke the night before deadline, or you're staring at an Excel file with merged cells and dreams. Maybe you just want to know if the thing you're trying to do is actually possible, before you sink three more days into it.

Bring it to a one-to-one session with a data engineer. No prep required, just show up with the problem.

Things people often bring:
  • Messy data that won't behave
  • Scraping and extraction that's stuck
  • An investigation idea where you don't know if the data exists
  • A manual task you suspect could be automated
  • When to use LLMs, and when not to
  • Whatever else you'd like to discuss!

We may get in touch before the conference to learn more, so we can hit the ground running.
Please note: you need to book a time slot in advance. You can do it HERE.

Speakers
avatar for Johan Schujit

Johan Schujit

Data Engineer, Resolve.
I'm a data engineer responsible for EveryPolitician and PoliLoom at OpenSanctions. I'm a self-taught hacker with a stubborn belief that good data should be open and technology should serve the public interest. Previously at Follow the Money.

Saturday May 30, 2026 12:35pm - 1:35pm CEST
Z0.15

12:35pm CEST

EU/FOI café
Saturday May 30, 2026 12:35pm - 1:35pm CEST
Do you need help drafting an FOI request, writing a complaint about your FOI request being rejected, or understanding EU legislation? Are you looking for specific EU databases, experts, and sources? Come to the "Eu Cafe" for advice!

Please note: you need to book a spot in advance. You can do it HERE.
Speakers
avatar for Alexander  Fanta

Alexander Fanta

Journalist, Follow the Money
Journalist bei Follow the Money mit Fokus auf EU-Digitalpolitik. Davor Stationen bei netzpolitik.org, Austria Presse Agentur und Der Standard.
Saturday May 30, 2026 12:35pm - 1:35pm CEST
1.04

12:35pm CEST

Safety and Security café
Saturday May 30, 2026 12:35pm - 1:35pm CEST
Do you have questions or considerations about digital security? Do you need help setting up your hard disk encryption, or do you need advice on how to use a password manager? Would you like security advice on your personal software, hardware, or on precautions when traveling?

Bring your questions for a one-to-one session with our trainers. The individual consultations are suitable for small teams or individuals who have specific questions or face security concerns.

We will connect you with the digital security trainers, who may contact you before the conference to learn more about your concerns.

Topics:
  • Hardening your devices
  • Securing your accounts
  • Spyware concerns
  • Checking your digital footprint
  • Whatever else you'd like to discuss!
Please note: you need to book a time slot in advance! You can do it HERE.
Speakers
avatar for Ela Stapley

Ela Stapley

Senior Digital Security Adviser
Ela Stapley is a Senior Adviser in Digital Security and Strategy who has spent the past decade working with journalists, newsrooms, and journalist networks to provide high-level digital security support.  During this time, she has trained and provided individual assistance to over... Read More →
avatar for Benedikt Hebeisen

Benedikt Hebeisen

Arena for Journalism in Europe
Benedikt coordinates the IT at Arena for Journalism and manages the development of the Collaborative Desk, where he supports cross-border teams with tools, workflows and secure environments. He focuses his work on the intersection of investigative journalism and technology, with a... Read More →
Saturday May 30, 2026 12:35pm - 1:35pm CEST
1.16

1:45pm CEST

Climate data for non-data journalists
Saturday May 30, 2026 1:45pm - 3:00pm CEST
You don't necessarily have to be a data journalist to report on climate change using data. As data journalists, we will explain options available to non-coders. This session is intended to give you a feeling for what you can do yourself with temperature measurement data, emissions figures, and tables showing the damage caused by extreme weather events, and to overcome your fear that working with data is “too complicated.”

In the session, we will discuss existing databases, some of which already have built-in filtering and visualization options. Where they don't, we will explain what the first steps might look like if you can't or don't want to code yourself. We also will address the questions that are important for assessing the quality of a data set.

This session is explicitly not aimed at data journalists, but at climate journalists who have little or no experience with data but want to know what is possible and what they can do themselves.
Speakers
avatar for Julia Barthel

Julia Barthel

Datenjournalistin, BR Data
Julia arbeitet als Daten- und Investigativjournalistin für BR Data. Dabei verbindet sie datengetriebene Recherchemethoden mit crossmedialer Berichterstattung in Audio, Video und Text. Das heißt auch: Der beste Radiobeitrag entsteht mit einem Bild im Kopf.
avatar for Constanze Bayer

Constanze Bayer

Datenjournalistin, BR Data
Constanze arbeitet als Datenjournalistin mit an Geschichten rund um Klima und Umwelt. Das können große Storytelling-Projekte wie "Schnee war gestern" zur Zukunft des Schnees in den Alpen oder ein "CO2-Rechner" sein, der die Wirkung von Heizungsgesetz und Co illustriert, aber auch... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
1.04

1:45pm CEST

Copernicus climate change service data buffet: find stories in the data
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Behind headlines about record-breaking temperatures, extreme events affecting millions, and long-term climate trends reshaping ecosystems and our environment lies a wealth of datasets used to track how our planet is changing and better make sense of these events. Yet navigating these datasets and understanding what they can reliably show is not always straightforward.

The Copernicus Climate Change Service (C3S) provides a wide range of open climate datasets covering past, present, and future conditions that can support your climate reporting.

In this session, you will be guided through the C3S data landscape: where to find key datasets, how they are built, what they can (and cannot) tell you, and common pitfalls to avoid.

Speakers
Saturday May 30, 2026 1:45pm - 3:00pm CEST
3.09

1:45pm CEST

How to pitch biodiversity stories in a news ecosystem that rarely has space for them
Saturday May 30, 2026 1:45pm - 3:00pm CEST
This panel explores why biodiversity stories often struggle to find a place in general newsrooms and how to pitch them more effectively.

Two experienced journalists will share practical, replicable strategies, including reframing biodiversity as political, economic, or investigative stories; grounding pitches in data, regulation, and accountability; and targeting beyond environmental desks.

If you have had difficulty getting your biodiversity or environmental stories published, this session offers a chance to exchange experiences, ask questions, and learn from journalists who have successfully persuaded editors of the importance of these stories.
Speakers
avatar for Tracy Keeling

Tracy Keeling

Freelance Environmental Journalist
I am a UK-based journalist who writes about a variety of environmental subjects but specialises in biodiversity reporting, particularly in relation to the wildlife trade. My work has been published in Bloomberg Businessweek, The Bureau of Investigative Journalism, The Revelator, Yahoo... Read More →
avatar for Annick Hus

Annick Hus

Journalist and Researcher, Freelance
I’m a journalist and researcher specialising in biodiversity, wildlife, ecosystem health, livestock farming, and animal welfare. My work has been published in outlets including Apache, De Groene Amsterdammer, The Green European Journal, Falter, Follow the Money, and EOS. I’m available... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
2.04

1:45pm CEST

How to save the planet with nerds
Saturday May 30, 2026 1:45pm - 3:00pm CEST
As environmental and climate crises grow more complex, journalists are evolving to meet the challenge. Across Europe, reporters are increasingly working alongside scientists - not just as sources but as collaborators- to strengthen their investigations, boost their methodologies, and create more impact. This shift is the core of “expert-reviewed journalism” and this panel will explore how it actually works in practice. 

Drawing from investigative projects they were part of, three seasoned journalists will share how journalism-scientist partnerships are built, how trust is established, and how the roles are defined so the day-to-day work is fruitful and dream projects are done. The speakers will discuss practical realities, nerdy secrets and their lessons learned: how they managed data and workflows, navigated tensions between disciplines, and developed shared work practices where both sides work to reach the finish line while respecting each other’s professional lanes. 

Some of the projects that will be mentioned in this talk: 

The Forever Lobbying Project: https://foreverpollution.eu/lobbying/ 
The Green to Grey Project: https://greentogrey.eu/ 
Swampower: https://facta.eu/focus-on/swampower/ 

Speakers
avatar for Zeynep Sentek

Zeynep Sentek

Journalist, Arena for Journalism in Europe
Zeynep Sentek is a Turkish investigative journalist specialising in corruption, human rights, and the environment. She is now leading the climate network of Arena for Journalism in Europe. In this network, Arena facilitates journalists to do cross-border climate investigations and... Read More →
avatar for Stéphane Horel

Stéphane Horel

Investigative Journalist, Le Monde
Stéphane Horel is an award-winning investigative journalist at Le Monde. Author of several documentaries and books, she specializes in corporate harm, toxic industries and scientific disinformation. She coordinated the “Forever Lobbying Project” (2025) and "Forever Pollution... Read More →
avatar for Elisabetta Tola

Elisabetta Tola

Founder and editor-in-chief, Facta
Elisabetta is a science, data, and investigative journalist.

She is the co-founder and editor-in-chief of Facta.eu, an Italian independent media outlet that applies the scientific method to journalism and promotes science journalism as a cornerstone of democratic participation.

She... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Z0.15

1:45pm CEST

The Fueling Ecocide project: how we handled a mapping exercise of epic proportions
Saturday May 30, 2026 1:45pm - 3:00pm CEST
“Fueling Ecocide” is a cross-border, collaborative data investigation coordinated by EIC (European Investigative Collaborations) and EIF (Environmental Investigative Forum). Over the course of a year, 13 media outlets across four continents joined forces, asking a simple but crucial question: how much protected land and sea have we lost globally to oil and gas extraction?

To answer this question, we embarked on a mapping exercise of epic proportions. Using QGIS and Post-QGIS, we compared 315,000 protected areas from all over the world with 15,000 extraction blocks, spanning 120 countries.

For the first time, this work provides a clear global picture of the damage: 7,021 protected areas in 99 countries overlap with oil and gas projects. This represents a surface of approximately 690,000 km², just over the size of France, most of it under internationally recognized protection statuses. Through extensive reporting, they also identified 763 oil and gas companies involved, with the largest contributors headquartered in Europe.

In this session, the reporters involved in the project will take you through the makings of a data-driven global investigation. They will talk you through the trial-and-error process of handling and harmonising an immense geospatial dataset: which tools they used, which methodological choices they made, and which obstacles proved the hardest to overcome. They will explain how the team turned this data into impactful stories, collaborated across the globe, and what lessons they learned.

 To read more on the project and the story, go here: https://ecocide.reportersunited.gr/
Speakers
avatar for Hala Nasreddine

Hala Nasreddine

Investigative journalist, Daraj Media
Hala Nasreddine is an award-winning Lebanese investigative journalist and the Head of the Investigative Unit at Daraj Media. She has contributed to several cross-border investigative projects, including Burning Skies, Fueling Ecocide, Cyprus Confidential, Pegasus Project, Pandora... Read More →
avatar for Leopold Salzenstein

Leopold Salzenstein

Data coordinator, Arena for Journalism in Europe
Leopold Salzenstein is a freelance investigative data journalist and trainer based in the south of France. At Arena, he coordinates the handling of data for publications and trainings. He is also a member of the collective of journalists Environmental Investigative Forum (EIF).

... Read More →
avatar for Dafni Karavola

Dafni Karavola

Visual Investigator, Reporters United
Dafni is a visual-forensic investigator with a background in architectural engineering and a member of Reporters United. Her work is dedicated to advocating for Romani rights and spatial violence. Additionally, Dafni participates in various projects tackling environmental issues... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
3.02

1:45pm CEST

Naming the Unsaid: How to navigate the power dynamics in collaborative investigations
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Journalists taking part in collaborations want equity and respect alongside impact, according to a 2025 poll by Lighthouse Reports. But what does that mean on the ground, and how can journalists advance equity in practical ways?

In this session moderated by journalist Hazel Sheffield, Lighthouse managing editor Hui Yee Tan will be joined by journalist and researcher Ruona Meyer. Together they will share the highlights of the survey and discuss its implications as well as practical needs of journalists in collaborations, and solutions to overcome inequitable relations in collaborative journalism projects.

Come with your own experiences and questions to share with experienced journalists in a safe, peer-to-peer environment, and openly discuss the things everyone else might be feeling are better left unsaid. 

Moderators
avatar for Hazel Sheffield

Hazel Sheffield

Coordinator, Arena for Journalism

Speakers
avatar for Ruona Meyer

Ruona Meyer

Journalism Researcher and Trainer, Freelance
Ruona Meyer is Nigeria's first Emmy-nominated investigative journalist, a researcher, trainer with over 20 years of experience across Africa and Europe. She is currently a final-year PhD candidate, researching power dynamics within cross-continental investigative journalism networks. She also designs and partakes in media development research in various capacities: as Visiting Senior Research Associate, Faculty of Life Sciences & Medicines... Read More →
avatar for Hui Yee Tan

Hui Yee Tan

Managing Editor, Lighthouse Reports
I spend a lot of time looking at and thinking about power as managing editor of Lighthouse Reports. In my previous life, I was the Bangkok-based bureau chief of The Straits Times, helming its coverage of Thailand, Myanmar, Laos, Cambodia and Vietnam. As a co-founder of The Gender... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
2.03

1:45pm CEST

How to use court decisions to build multimedia narratives that enhance human rights
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Court decisions are treasure troves of verified facts—but they're often locked in dense legal documents that the public never sees. This session will present a specific methodology for extracting, contextualizing, and translating judicial findings into compelling multimedia narratives that serve human rights advocacy, accountability, and education.

We want to present the methodology we developed in the Balkan Investigative Reporting Network in Bosnia and Herzegovina - BIRN BiH - on how to connect court findings into coherent narratives, translating court-verified facts into accessible, informative, and educational content.

We will use three BIRN BiH developed databases to demonstrate our methodology. Additionally, we want to present how we used court facts to develop educational tools. Many journalists and human rights documentalists treat court records as supporting evidence for stories. This session will show how court decisions themselves can be the primary story—and how systematic extraction creates accountability infrastructure that outlasts individual articles or projects.
Speakers
avatar for Katarina Zrinjski

Katarina Zrinjski

Head of Programmes, Balkan Investigative Reporting Network in Bosnia and Herzegovina
Passionate about projects promoting peacebuilding and rule of law in the Balkans since 2010. 

avatar for Denis Džidić

Denis Džidić

Director, Balkan Investigative Reporting Network Bosnia and Herzegovina (BIRN BiH)

Saturday May 30, 2026 1:45pm - 3:00pm CEST
2.02

1:45pm CEST

Data Magic made simple: three ways to crunch numbers in spreadsheets
Saturday May 30, 2026 1:45pm - 3:00pm CEST
We know that thousands of lines in a dataset can be intimidating, especially if you’re not a programmer. Spreadsheets can do the heavy lifting — and mastering them is easier than you expect!

In this session, we will walk you through three different ways to dive into data using nothing but spreadsheet tools. Along the way, we’ll show you how to cross-check your calculations, ensuring your findings are accurate and reliable. Whether you’re a complete beginner or have already used spreadsheets in your work, you’ll leave with practical skills to handle data confidently without ever touching a line of code.

Bring your laptop and join us to discover how easy and powerful data analysis can be!
Speakers
avatar for Alina Yanchur

Alina Yanchur

Data and Investigative Journalist
Investigative and data journalist with a focus on transnational corruption, sanctions evasion, and OSINT methods. Trainer and mentor for journalists working under repressive regimes. Strong background in collaborative and data-driven journalism across Belarus, Europe, and exile c... Read More →
avatar for Kuang Keng Kuek Ser

Kuang Keng Kuek Ser

Senior Editor for Rainforest Investigations, Pulitzer Center
Kuang Keng Kuek Ser is the Senior Editor for Rainforest Investigations at the Pulitzer Center, a non-profit organization based in Washington, DC that supports independent journalists globally. He supports and mentors three fellowships investigating issues related to tropical rainforest... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST

1:45pm CEST

How to manage mass FOI projects using AI, vibe coding and verification
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Projects involving FOI requests to multiple bodies often create significant challenges, from different file formats and data trapped in PDFs, to organisations providing data in different structures and different levels of detail. To get the big picture often requires data extraction, cleaning, reshaping, and checking.

In this session, we will share a series of tips and tools used to manage one project — including vibe coding with AI — which can be used to make any multi-response FOI project more efficient and accurate. No prior knowledge is required. By the end of this session, attendees should be able to design a data structure for an FOI project, use a range of tools, including AI, to extract, reshape, clean, and combine data from FOI responses, and design a data validation process to check AI outputs.

You will need a laptop with Google Drive and an account with an AI tool such as ChatGPT, Gemini, Claude, or Copilot. Installing Tabula and Open Refine will help you get more out of the session.
Speakers
avatar for Paul Bradshaw

Paul Bradshaw

Journalist and Academic, BBC/Birmingham City University
Paul Bradshaw runs the MA in Data Journalism at Birmingham City University and also works as a consulting data journalist with the BBC Shared Data Unit. A journalist, writer and trainer, he has worked with news organisations including The Guardian, Telegraph, Mirror, Der Tagesspi... Read More →
avatar for Ioanna Petsiou

Ioanna Petsiou

Data Journalist, Freelancer
Ioanna Petsiou is an investigative data journalist working across data analysis, satellite imagery, and mapping to uncover and explain complex stories. She is particularly drawn to environmental reporting and to building clear, reproducible ways of working with data that others can... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
3.05

1:45pm CEST

Turning raw data into reliable sources: Python for journalists
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Have you ever tried to investigate how much groceries or rent in your city really impact people’s budgets? Journalists don’t always get all the data in one place. Often, we find it in ads, public announcements, or different sources, then clean, structure, and track it over time, compare it with other datasets, or monitor changes to uncover trends.

This hands-on workshop teaches journalists how to clean, transform, and structure real newsroom data using Python. Participants will learn practical techniques to handle messy data, including changing data types, filtering by values or dates, splitting columns, labeling and recoding, calculating averages and percentages, and extracting quantities from text fields. The session also covers tasks specific to regional datasets, such as converting scripts from Cyrillic to Latin.

With these skills, journalists can analyze grocery prices and compare them with income data or calculate meal costs to report on rising food prices, examine traffic accident data near schools, or track public officials’ gifts and benefits. By the end of the workshop, participants will have concrete tools and workflows to turn raw data into reliable sources ready for investigation and reporting.

To follow along, participants should have some experience with Python basics and working with datasets. After attending this session, participants will be able to turn messy data into clean, reliable sources, compare thousands of entries, and extract insights for investigative stories using Python. Participants should have Python installed on their own computers to follow along. This tutorial can also be accessed via Google Colab, where most of the steps are similar, though Python installed locally is the recommended option for a smoother experience.
Speakers
avatar for Teodora Curcic

Teodora Curcic

BBC
Teodora Ćurčić is an investigative and data journalist from Serbia with over seven years of experience reporting on corruption, political finance, gender-based violence, and social justice. She spent most of her career at the award-winning Center for Investigative Journalism of... Read More →
VS

Verena Steinacher

Data Engineer, pub.tech
Saturday May 30, 2026 1:45pm - 3:00pm CEST
3.04

1:45pm CEST

Tourism on stolen land: How we found tourist accommodations on Israeli settlements
Saturday May 30, 2026 1:45pm - 3:00pm CEST
This talk will show the methodology behind The Guardian’s exclusive investigation into properties advertised as tourist destinations on Airbnb and Booking.com located in Israeli settlements that are illegal under international law. The Guardian found 760 rooms being advertised in hotels, apartments, and other holiday rentals in illegal Israeli settlements in the West Bank, as NGOs and activists warned that these companies are violating international law and profiting from war crimes.

In their investigation, they used geospatial tools, like PostGIS and QGIS, to localise accommodations on a specific geographical area, and to join these accommodations with shapefiles representing the borders of illegal settlements to identify those based within settlement boundaries. Using OSINT methods and Google Sheets, they designed a methodology to solve data challenges like duplicate and multi-platform listings, and listings with approximate locations.

In this session, attendees will come up with a robust methodology to find tourist accommodations in any geographical area, so they can reproduce the method for new stories.
Speakers
avatar for Zeke Hunter-Green

Zeke Hunter-Green

Software Developer, The Guardian
Zeke is a Senior Software Engineer on the Guardian’s Digital Investigations team. The team contributes to journalistic research and builds secure tools to enable investigative journalism.

avatar for Carmen Aguilar Garcia

Carmen Aguilar Garcia

Data Journalist, The Guardian
Data journalist at The Guardian Data Project team. I work on a variety of subjects - always finding the data angle in every story. Scraping, cleaning, data analysis, but above all JOURNALISM!
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Z1.13 - Aula Hanswijk

1:45pm CEST

How to investigate conflicts of interest in science
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Industry biases science by funding research projects and organisations, as well as individual scientists. In this session, we will explore how to investigate this bias, presenting different approaches, story angles, data sources, and methods. We will focus on investigations covering food, tobacco, environment, and medicine. The attendees will get an understanding of the importance of commercial influence on science, ways to investigate the subject, as well as possible pitfalls. 
Speakers
avatar for Hristio Boytchev

Hristio Boytchev

Science and health reporter, Berlin
Hristio is a Berlin-based investigative health and science journalist, using data driven methods to tacke research integrity issues and systemic problems in medicine. Hristio is freelance investigations reporter at The BMJ (British Medical Journal) and leader of “Follow the Gra... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
0.10

1:45pm CEST

Who's behind this website?
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Reporting online today, journalists have to battle astroturf campaigns, fake news sites and sketchy shell companies to find out who is behind the story. It frequently leads to a frustratingly common question: who is behind this website?

Popular tools and approaches to investigating websites have been less reliable lately. There's more opaqueness in areas where there should be more transparency; crypto payments add a layer of confusion, and generative AI makes it easy for adversarial actors to operate hundreds of websites.

Using a range of OSINT tools and real-world investigations, we will walk you through investigating the provenance and ownership of websites: identifying the scope and scale of the network it belongs to — if any? Who’s behind the site, now and in the past? Who are the main actors promoting this website? Is it AI slop? Are foreign actors likely behind the domain?

While it is not always possible to fully unmask the owner of a site, using a thorough checklist of tools and techniques that we have used in real-world investigations we can help you make sure to reveal as much as possible about a website, and potentially uncover important clues. We will also walk you through how to conduct these investigations safely depending on your threat model, and how to document your findings reliably.

 This session is suitable for beginners and doesn't assume existing technical knowledge.
Speakers
avatar for Priyanjana Bengani

Priyanjana Bengani

Computational Journalism Fellow, Columbia University
Priyanjana Bengani is the Tow Computational Journalism Fellow at Columbia University's Tow Center for Digital Journalism. Her work focuses on using computational techniques to research the digital media landscape, including partisan local news and the intersection of platform companies... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Z1.15 - Aula Donche

1:45pm CEST

How to FOI in difficult contexts
Saturday May 30, 2026 1:45pm - 3:00pm CEST
This presentation will examine various methodologies and share practical tips — as well as common challenges — when working with freedom of information (FOI) requests in “difficult” contexts. These include investigations into closed institutions such as the police or prison system, requests made under time constraints, or cases complicated by non-transparent governmental bodies.

Drawing on examples from the presenters’ work in Portugal and Turkey, we will discuss how to craft precise and effective FOI requests under pressure, handle incomplete or evasive responses from authorities, and leverage the human connection to follow up informally and obtain information.

We will also discuss how strategic litigation, formal complaints to state regulatory bodies, and public pressure campaigns can be used to compel institutions to release information. We invite FOI practitioners and FOI-curious participants to attend the session and share their own examples of successes (and failures) during the discussion.
Moderators
avatar for Alexander  Fanta

Alexander Fanta

Journalist, Follow the Money
Journalist bei Follow the Money mit Fokus auf EU-Digitalpolitik. Davor Stationen bei netzpolitik.org, Austria Presse Agentur und Der Standard.
Speakers
avatar for Nuno Viegas

Nuno Viegas

Journalist, fundraising co-lead, co-manager, Fumaça
Investigative journalist, focused on long-form audio narratives. I work mostly with FOI and state archives, to research policing, prisons, and the courts. At Fumaça, I share fundraising responsability for our non-profit, membership backed experiment on non-hierarchical newsroom... Read More →
avatar for Elif Ince

Elif Ince

Freelance Journalist
Elif  Ince is an Istanbul based freelance journalist. Her reporting on urban  and environmental issues has received awards from  Turkey's Progressive Journalists Association (ÇGD), the Chamber of  Architects, the Chamber of Urban Planners and People's Houses  (Halkevleri). She... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
3.13

1:45pm CEST

Digital security networking roundtable
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Calling all tech‑savvy reporters, newsroom technologists, digital‑security and IT staff, emerging‑threat analysts, and forensic investigators to connect, share recent trends, and swap practical solutions from the field in this casual Dataharvest meetup. Bring a brief update on your (newsroom’s) biggest security challenges or a tool/technique that’s helped your work. Conversations will focus on real‑world risks, incident response, and collaboration opportunities. Ideal for anyone responsible for team security, sensitive reporting, or infrastructure who wants to build peer relationships and leave with actionable ideas.
Speakers
avatar for Benedikt Hebeisen

Benedikt Hebeisen

Arena for Journalism in Europe
Benedikt coordinates the IT at Arena for Journalism and manages the development of the Collaborative Desk, where he supports cross-border teams with tools, workflows and secure environments. He focuses his work on the intersection of investigative journalism and technology, with a... Read More →
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Z2.01 - Mediadrôme

1:45pm CEST

Not a chatbot: A hand-drawn case for putting primary sources first in designing AI tools for investigative journalism
Saturday May 30, 2026 1:45pm - 3:00pm CEST
Every slide in this presentation is hand-drawn. So is the thinking behind Cheatsheet, The New York Times' AI tool built from scratch for investigative reporters. We'll show what we built, in which cases we ditched off-the-shelf AI, and how we tested it with non-technical journalists.

The presentation will offer a practical roadmap for use cases (needle-in-a-haystack document search, mass translation, statistical analysis) plus an honest Dos and Don'ts list for AI in journalism and a first look at our open-source plans.
Saturday May 30, 2026 1:45pm - 3:00pm CEST
1.16

3:00pm CEST

Coffee break
Saturday May 30, 2026 3:00pm - 3:30pm CEST
Coffee is served in the Mediaforum (ground floor, lobby area) and on the third floor.
Saturday May 30, 2026 3:00pm - 3:30pm CEST
Mediaforum

3:30pm CEST

Tracking fire in a hotter Europe: Methods for long-term wildfire data
Saturday May 30, 2026 3:30pm - 4:00pm CEST
As wildfires become more frequent and intense across Europe, attracting growing public and media attention, journalists need solid methods to analyze fire data over time. But building a dataset is only the beginning. This session goes beyond the story to explore how to sustain and adapt a data project over more than a decade, using Civio’s Spain in Flames as a case study.

In this session, we'll share best practices for long-term data journalism: designing datasets that can evolve, working with fragmented official sources, handling changes in definitions, revising past data responsibly, and deciding what comparisons remain valid over time. The talk covers both data processing workflows and visualization strategies that hold up as your project grows.

In the context of increasingly severe wildfire seasons, this session offers replicable methods and tips for building climate data projects that last and that can inform responsible reporting for years to come.
Speakers
avatar for Adrián Maqueda

Adrián Maqueda

Data Analysis, Dataviz & Front-end, Civio
Saturday May 30, 2026 3:30pm - 4:00pm CEST
3.09

3:30pm CEST

Hack your CMS (and the rest of the web!): Tampermonkey 101
Saturday May 30, 2026 3:30pm - 4:00pm CEST
Tampermonkey is an age-old browser extension that allows you to inject scripts and stylesheets into any web page, turning the web into your personal playground. We'll look at how to customize your CMS with DIY features, add "Download all" buttons to paginated websites, automate tedious processes like filling out forms and redesign websites however you'd like. Best of all, Tampermonkey scripts are saveable and sharable, allowing you to give other members of your newsroom superpowers without fiddling with distributing extensions or asking them to run Python scripts. To follow along, participants should be able to install extensions in their web browser of choice.
Speakers
avatar for Jonathan Soma

Jonathan Soma

Knight Chair in Data Journalism, Columbia University
Jonathan Soma is the Knight Chair in Data Journalism at Columbia University, where he serves as Director of the Data Journalism MS program and the Lede Program, an intensive data journalism summer course. His lectures cover everything from basic Python and data analysis to interactive... Read More →
Saturday May 30, 2026 3:30pm - 4:00pm CEST
3.05

3:30pm CEST

Make a publication-ready static map with QGIS
Saturday May 30, 2026 3:30pm - 4:00pm CEST
In this demo, participants will learn how to create a static map in QGIS that is ready for publication. The session will cover setting map dimensions, selecting a basemap, adding geospatial data, and incorporating key design elements such as text annotations, a north arrow, a scale bar, an inset map, and images. Participants will also learn how to export the finished map as a JPG.

Download and install QGIS on your laptop before the session and confirm that it opens properly. MacBook users who run into security warnings when opening QGIS can follow the workaround here
Speakers
avatar for Kuang Keng Kuek Ser

Kuang Keng Kuek Ser

Senior Editor for Rainforest Investigations, Pulitzer Center
Kuang Keng Kuek Ser is the Senior Editor for Rainforest Investigations at the Pulitzer Center, a non-profit organization based in Washington, DC that supports independent journalists globally. He supports and mentors three fellowships investigating issues related to tropical rainforest... Read More →
Saturday May 30, 2026 3:30pm - 4:00pm CEST
2.03

3:30pm CEST

One template, many stories: Parameterized reports with Quarto
Saturday May 30, 2026 3:30pm - 4:00pm CEST
Learn how to build reusable report templates in Quarto that generate multiple outputs (PDF, HTML, Word documents) from a single source document. By defining parameters — such as a region, time period, or data source — you can produce dozens or even hundreds of tailored reports without duplicating code or copy-pasting results.

This is especially useful for cross-border investigations, where partners share a common dataset, but each team needs a report focused on its own country. Build the analysis once, then render a customized version for each partner with only their slice of the data.

To follow along, participants should have basic familiarity with Quarto, R Markdown, or Jupyter notebooks, and some experience writing code in R or Python.
Speakers
avatar for Leopold Salzenstein

Leopold Salzenstein

Data coordinator, Arena for Journalism in Europe
Leopold Salzenstein is a freelance investigative data journalist and trainer based in the south of France. At Arena, he coordinates the handling of data for publications and trainings. He is also a member of the collective of journalists Environmental Investigative Forum (EIF).

... Read More →
Saturday May 30, 2026 3:30pm - 4:00pm CEST
1.04

3:30pm CEST

I tracked how the EU’s €26 billion Just Transition Fund is really being spent (saunas! axe-throwing!) – find stories for your country, too
Saturday May 30, 2026 3:30pm - 4:45pm CEST
We can all agree that Europe’s green energy transition is critical. But the transition has to be fair and not leave anyone behind. Therefore, the EU’s €26.7 billion Just Transition Fund – which is intended to support communities shift away from fossil fuel industries, such as coal mining, to greener economies – is hugely important. But, surprise, surprise, those crucial billions do not appear to be going where the EU promised they would. In this presentation, we will explain how our cross-border team dug into the fund – FOIs, sources in the European Parliament, following the money – with the aim of helping you to do exactly the same and find your own exclusives.
Speakers
Saturday May 30, 2026 3:30pm - 4:45pm CEST
2.04

3:30pm CEST

Bad practice: investigating medical malpractice across borders
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Trusting your doctor is something you should be able to take for granted. Bad Practice, a collaboration co-led by OCCRP, The Times of London, and VG of Norway, exposed a European health scandal that has brought that into question.

The project identified at least 100 doctors who had been banned from practicing by medical regulators in one country but remained licensed to work in another. The stories have had a major impact, with governments across Europe and the European Commission pledging to crack down on the issue.

While a lot of cross-border projects rely on a central leak, this project required us to build a dataset from scratch by obtaining as many lists of doctors' licenses and doctors' medical disciplines as possible. This session will set out how we were able to make these findings, and the lessons we learned from building our database from fragmented records across jurisdictions, and then, once we identified likely matches, how we proved these doctors were still practicing. We would explain what went well, what didn't, and our ambitions for the future of the project as we add more countries from around the world to our database.

Attendees need no prior knowledge, just an interest in how to build cross-border investigations into regulated professions.
Speakers
avatar for George Greenwood

George Greenwood

Investigations Reporter, The Times
avatar for David Ilieski

David Ilieski

Researcher, OCCRP
Based in North Macedonia, I joined OCCRP in 2020 as an ID
researcher. Since 2018, I work as a reporter at Investigative Reporting
Lab Macedonia (IRL), an OCCRP member center, where we specializes in uncovering
corruption and political abuse of power.
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Z0.10

3:30pm CEST

Old trade, new playbook: Investigating transnational narco-trafficking
Saturday May 30, 2026 3:30pm - 4:45pm CEST
This panel will delve into the methods behind a months-long cross-border investigation that painstakingly revealed the industrialisation of the new “drop-off” method of cocaine trafficking in the Mediterranean.

What began with a single (but very dramatic) 2022 police operation involving two vessels near Sicily evolved into the reconstruction by the journalists of a transnational criminal system linking Latin American suppliers, Turkish traffickers, and European mafia. The journalists will explain how they moved from isolated events to identifying a coordinated fleet and uncovering the logistical role of key actors operating across jurisdictions.

The session will also focus on practical techniques: combining open-source maritime tracking data (AIS) with judicial records, seizure reports, and corporate registries to map connections between ships, companies, and individuals. Journalists will discuss how cross-referencing vessel movements with court documents and port data revealed patterns invisible to authorities at the time, as well as how interviews with prosecutors, law enforcement, and sources in multiple countries helped verify findings and fill gaps.

The panel will offer a step-by-step look at how to build a solid investigation that combines innovative data work with traditional reporting to expose complex criminal systems even before law enforcement fully connects the dots.
Speakers
avatar for Cecilia Anesi

Cecilia Anesi

Co-founder, IrpiMedia
Cecilia Anesi is an investigative reporter with IrpiMedia, the investigative media outlet of Italy's investigative journalism centre IRPI (Investigative Reporting Project Italy) which she co-funded in 2012. IRPI is a member of the Global Investigative Journalism Network and the OCCRP... Read More →
avatar for Craig Shaw

Craig Shaw

Journalist/editor, theblacksea.eu
I'm an award-winning journalist and the Director of The Black Sea Foundation. As a journalist, I've worked on human rights, political corruption, financial and organised crime reporting, and transnational investigative projects.I've published stories with The Guardian, The Sunday... Read More →
Saturday May 30, 2026 3:30pm - 4:45pm CEST
1.16

3:30pm CEST

Uncovering the organized economy of image-based sexual abuse
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Non-consensual intimate content sharing is a growing problem, disproportionately affecting women and marginalized people, and increasingly amplified by deepfakes and automated tools. What looks like chaotic “leaks” online is often a structured, organized, and profitable ecosystem.

In this session, we’ll show how we uncovered networks that steal, share, and monetize image-based sexual abuse. You’ll see the technical approaches we use: tracking digital traces, mapping channels, analyzing hosting setups, entering abusive groups safely, and spotting patterns in offshore companies. We’ll also explain how to map networks of abusers by understanding their ecosystem of filehosters, cloud services, and CDN layers.

Finally, we’ll share practical tips to safeguard your well-being when investigating abusive material. By the end, participants will be able to turn overwhelming, chaotic content into actionable insights, exposing the organized systems that enable and profit from abuse.
Speakers
avatar for Polina Bachlakova

Polina Bachlakova

Technology correspondent, The Fuller Project


Saturday May 30, 2026 3:30pm - 4:45pm CEST
3.02

3:30pm CEST

When numbers are missing: Building your own datasets to investigate the police
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Police work is notoriously difficult to investigate. While police actions directly affect fundamental rights and guide politics and law-making, official data on their influence is scarce.

In this session, we will share some tools and methodologies on how to investigate the police sources. We will draw on more than ten years of investigations into different aspects of police violence and state abuse of power in Germany. We will also show how the data-driven journalism tools can help to make the underlying structures more visible, and present our recent investigation into police unions in Germany: who they are, want they stand for and how they influence the public debate and politics. We pieced together data from different sources and combined it with investigative journalism to draw the full picture.

We want to show how it is possible to do a data-driven investigative story despite missing data. We will focus on the following questions:

- How to build your own datasets when no official statistics exist? Which public data sources about the police are available? Which data can we get from ministry inquiries? What about social media?
- How to clean up, classify and analyze the data? Which tools are helpful in combining manual and automated workflows?
- How to tell a complex data-driven story without this one headline? What are judicial and editorial hurdles when publishing a sensitive topic?
- What can investigative and data journalists gain from close collaboration? What worked well? What would we do differently?
Speakers
avatar for Mohamed Amjahid

Mohamed Amjahid

Freier investigativer Journalist
Mohamed Amjahid ist investigativer Journalist und Buchautor. Er schreibt für mehrere Medien wie ZEIT, Spiegel, taz, Süddeutsche Zeitung, RBB, WDR und SWR. Für seine Bücher „Unter Weißen“, „Der weiße Fleck“ und "Let's talk about sex, habibi!" hat Amjahid viel Aufmerksamkeit... Read More →
avatar for Natalie Widmann

Natalie Widmann

Data Journalist, SWR Data Lab
I'm a Data Journalist supporting journalist and human rights activists with data, tools and automation.
I'm happy to talk about scraping data, extracting the most relevant information from it, understanding algorithms and using them for investigations.
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Z1.15 - Aula Donche

3:30pm CEST

So…you want to investigate crypto?
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Analysts estimate that at least $28 billion tied to illicit activity has flowed into cryptocurrency exchanges over the last two years. ICIJ's Coin Laundry investigation, in collaboration with 37 media partners, exposed part of this shadow financial system, collecting dozens of cryptocurrency wallet addresses and uncovering hundreds of millions of dollars' worth of illicit funds linked to suspected criminals, including a Cambodian conglomerate that facilitated money laundering for hackers and scam compound operators.

In this hands-on session, the panelists will explain how any reporter can use open source resources as well as advanced techniques to investigate major crypto exchanges, find leads, and "follow the crypto." The session will blend an intro showing the kinds of stories you can do on crypto, starting from scratch, the kinds of tools you can use, and then show how we use those tools to compile the data for analysis.
Speakers
avatar for Delphine Reuter

Delphine Reuter

Head of data and research, ICIJ
Delphine Reuter is the head of the data & research team at the International Consortium of Investigative Journalists. She started collaborating with ICIJ on the LuxLeaks project in 2014. She worked for several years as a researcher for environmental organizations, has taught at conferences... Read More →
avatar for Miguel Fiandor

Miguel Fiandor

Data Analyst & Engineer, ICIJ
Data engineer and analyst also specialised in Neo4j graph databases at International Consortium of Investigative Journalists (ICIJ). I enjoy in working in any stage of data ETLs, from start to end.
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Z1.13 - Aula Hanswijk

3:30pm CEST

FOI meet-up: presentation of a cross-border guide
Saturday May 30, 2026 3:30pm - 4:45pm CEST
This informal session is intended to bring together journalists who use access to documents procedures within European Institutions and member states. It is the second (in-person) meeting of the informal network of these journalists since it was launched at Dataharvest 2024, but it is open to all journalists and researchers with knowledge of the right to information & access to documents anywhere in Europe. During the meeting, we will present a cross-border guide, a handy resource for journalists making access to information requests to governments and official bodies across borders in different European countries and with the European Union institutions. The guide is meant to provide basic information for each country, such as how to make requests, timelines, and points of contact for each country if assistance is required.
Speakers
avatar for Alexander  Fanta

Alexander Fanta

Journalist, Follow the Money
Journalist bei Follow the Money mit Fokus auf EU-Digitalpolitik. Davor Stationen bei netzpolitik.org, Austria Presse Agentur und Der Standard.
avatar for Jean Comte

Jean Comte

Reporter, MLex
I am a Brussels-based journalist, currently covering financial regulation for the financial newswire MLex.
I spent several years before that writing about transparency, ethics and lobbying in the EU institutions. I published a book on lobbying in 2023, that was reedited this year.
... Read More →
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Z2.01 - Mediadrôme

3:30pm CEST

Crossing the borders safely: securing devices, data and sources
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Are you working on cross-border investigations that lead you to physically cross a border? Are you crossing borders with devices that hold information on your sources? Do you know what data is stored on your phone and how it could be accessed at a border? If this is you and you want to find out how best to protect yourself, then this session is for you!

Your devices hold a significant amount of information on you and your sources. At a border, this data could be accessed, copied and stored, putting you and others at risk. Join us for this 75-minute session, organised by ACOS Alliance, where you will be given clear, practical guidance to help you navigate the uncertainty around securing your data at the border. 
Speakers
avatar for Ela Stapley

Ela Stapley

Senior Digital Security Adviser
Ela Stapley is a Senior Adviser in Digital Security and Strategy who has spent the past decade working with journalists, newsrooms, and journalist networks to provide high-level digital security support.  During this time, she has trained and provided individual assistance to over... Read More →
Saturday May 30, 2026 3:30pm - 4:45pm CEST
2.02

3:30pm CEST

You received a leak, now what? A hands-on OPSEC simulation
Saturday May 30, 2026 3:30pm - 4:45pm CEST
Receiving a noteworthy tip is an achievement in itself, but it's almost never the end of the story. Where does your Signal application live? How can you safely share and store leaked documents? Oh, and what if they are -- gasp --malware? This is where the concept of OPSEC (Operational Security) comes into play.

However, discussing OPSEC practices can sometimes feel too detached from the reality of a working journalist. In this session, we will strive to make things more grounded. We'll conduct a live simulation of a leak throughout its lifecycle. We'll discuss OPSEC in practice and demonstrate how to use:- a hardened phone and PC OS (GrapheneOS / QubesOS), and- some lesser-known security and privacy tools (e.g., Dangerzone, Orbot, BlinkComparison)
Finally, we'll go beyond mere tool usage and share tips and lessons learned from past OPSEC failures and wins.

This is an intermediate session on the subject of security hygiene. We don't assume any background in engineering or security, and it should be approachable to anyone familiar with some basic concepts (encryption, anonymity, threat modeling) and tools (Signal, Tor), which we will use as the foundation for the rest of the demonstration.
Speakers
Saturday May 30, 2026 3:30pm - 4:45pm CEST
3.13

4:15pm CEST

Beyond data cleaning: Enhancing OpenRefine with LLM
Saturday May 30, 2026 4:15pm - 4:45pm CEST
Data journalism has always relied on clean, structured data; but cleaning messy datasets remains one of the most time-consuming parts of the workflow. Enter OpenRefine, our old buddy for data wrangling, now enhanced by Large Language Models (LLMs).

In this 20-minute session, we explore how combining OpenRefine’s powerful transformation capabilities with modern AI unlocks new possibilities for journalists. Using the open-source LLM extension for OpenRefine, we’ll demonstrate practical workflows for:
- Automated Enrichment: Extracting entities, categorizing content, and enriching records using natural language prompts.
- Smart Disambiguation: Resolving inconsistencies and matching fuzzy data with AI-assisted reconciliation.
- Rapid Prototyping: Turning raw, unstructured text into structured datasets ready for investigation

Why This Matters Now: Journalists are increasingly working with large, messy datasets, from leaked documents to public records.

While LLMs offer powerful analysis, they often lack precision on structured data. OpenRefine provides that precision. Together, they create a workflow that is both scalable and auditable; critical for investigative reporting where accuracy is non-negotiable.

What Attendees Will Take Away:
- A clear understanding of how to integrate LLMs into existing OpenRefine workflows.
- Practical examples relevant to journalistic investigations (entity extraction, classification, enrichment).

To attend this session, participants should have experience with data cleaning
Speakers
avatar for Herve Letoqueux

Herve Letoqueux

OpenFacto
Co-Founder of OpenFacto with Lou (@CapteursOuverts) and Aliaume (@yaolri), a french NGO dedicated to online investigation for journalists and activists, I love OpenSource researches, Python, Gephi, R and OpenRefine. I used to deal with money laundering, financial frauds and terrorism... Read More →
Saturday May 30, 2026 4:15pm - 4:45pm CEST
1.04

4:15pm CEST

From 007 to n8n - build your own no-code AI Agents
Saturday May 30, 2026 4:15pm - 4:45pm CEST
With so-called low-code platforms like n8n, you can quickly click together programs that would otherwise require tedious Python coding. And you can integrate LLMs at various points to, for example, extract information from texts or summarize content. This allows you to build complex workflows. Receive a Teams message from an agent when a nearby river level approaches extreme values? No problem! Automatically monitor the police website for accident reports and generate suggestions for brief news items? With n8n, this can be automated quickly. This workshop provides an introduction to the free platform n8n. No prior knowledge is expected.
Speakers
avatar for Claus Hesseling

Claus Hesseling

Freier Journalist und Trainer
Macht Daten-Sachen für den NDR und HR, erfindet für die Interlink-Academy im EU-Projekt INJECT Tools für Newsrooms, ist Trainer bei der ARD.ZDF-Medienakademie und anderen. Twitter: @the_claus... Read More →
Saturday May 30, 2026 4:15pm - 4:45pm CEST
2.03

4:15pm CEST

No download button? Getting web data without writing a scraper
Saturday May 30, 2026 4:15pm - 4:45pm CEST
Journalists often run into data that is visible on a website but impossible to download directly: a table buried in a government page, a list of public records, or search results that change with every query. Writing a full scraper can be time-consuming and technically demanding for what is often a one-time task.

This session introduces three lightweight approaches that cover most of these cases: reading a table directly from a page using pandas, downloading raw HTML and parsing it into a dataframe and pulling data through network requests. These techniques are practical tools for everyday newsroom situations. Participants will take home a GitHub repository with a working notebook to try on their own data, though some adaptation will be needed to apply it to different websites.

The three approaches vary in complexity. Basic Python knowledge is enough to follow along, but participants with more experience will be able to go further, and the code can be adapted with the help of an LLM.
Speakers
avatar for Teodora Curcic

Teodora Curcic

BBC
Teodora Ćurčić is an investigative and data journalist from Serbia with over seven years of experience reporting on corruption, political finance, gender-based violence, and social justice. She spent most of her career at the award-winning Center for Investigative Journalism of... Read More →
Saturday May 30, 2026 4:15pm - 4:45pm CEST
3.05

4:45pm CEST

Coffee break
Saturday May 30, 2026 4:45pm - 5:15pm CEST
Coffee is served in the Mediaforum (ground floor, lobby area) and on the third floor.
Saturday May 30, 2026 4:45pm - 5:15pm CEST
Mediaforum

5:15pm CEST

"The Mechelen Connection" (Escape Room)
Saturday May 30, 2026 5:15pm - 5:45pm CEST
There's a (genuine) story hiding in plain sight. Using a mixture of OSINT skills and clues hidden in the room, you will have 30 minutes to get to the story. First come, first served, max 3-4 teams per session competing to get the exit code and get out!
Speakers
avatar for Jonathan Stoneman

Jonathan Stoneman

Arena for Journalism in Europe
Former BBC journalist, turned datajournalist, trainer, consultant. Works with Arena as Lead Trainer, Arena Academy. 
Saturday May 30, 2026 5:15pm - 5:45pm CEST
3.04

5:15pm CEST

How to look up named entities in text – fast
Saturday May 30, 2026 5:15pm - 5:45pm CEST
Have you ever stumbled at the problem "I have a bunch of documents, give me all the politicians named in it"? If yes, you know the hassle: NER is noisy, and to qualify names (Is this a politician or not) requires external services, APIs or a large language model.

Or, use "Juditha": It's an open source poor mans entity extraction and resolution tool. No external service required, just put in your list of names and then extract them from arbitrary unstructured content. Works on any laptop, super fast. Of course it works with names of criminals, too. Or company names. Whatever you need.

In this session I'll walk through how to use the "juditha" command line and how to populate it with names of interest. At the end, anyone can take it home to detect the names that matter in your material.

Knowledge about how to use a command line and install python packages helps. If you ever suffered the problems about named entity recognition, you'll have even more fun.

Speakers
avatar for Simon Wörpel

Simon Wörpel

Director of Technology, Data and Research Center – DARC

Saturday May 30, 2026 5:15pm - 5:45pm CEST

5:15pm CEST

Mining data from unstructured documents
Saturday May 30, 2026 5:15pm - 5:45pm CEST
You have a folder of documents and you want to extract data points from each one. And the data isn't in a structured table with neat rows and columns either. Here's where string functions and regular expressions can help. The demonstration will be in R but the skills are generic to all languages.
Speakers
avatar for Robert Gebeloff

Robert Gebeloff

Reporter, New York Times
Robert Gebeloff has worked as a data projects reporter for The New York Times since 2008 and has taught data journalism for many years in newsrooms and at conferences. He was co-winner of the George Polk Award in 2015 and was a Pulitzer Prize finalist in both 2015 and 2016 for projects... Read More →
Saturday May 30, 2026 5:15pm - 5:45pm CEST
3.05

5:15pm CEST

No data, no story? Generate them. Citizen science and social research for investigative data journalism
Saturday May 30, 2026 5:15pm - 6:00pm CEST
Investigating data-driven stories on environmental health, long-term pollution, or biodiversity loss often means facing a structural problem: the data don’t exist, or have never been systematically collected, not even by public institutions. In these cases, the only option is to produce them from scratch.

This session presents a journalism-led methodology that integrates citizen science and social research as core tools of investigative data journalism. Developed through collaboration with scientists while preserving editorial independence, the approach is grounded in concrete case studies and practical experience.

Drawing on projects conducted with communities exposed to chronic industrial pollution, environmental degradation, and conflict, the session shares practical tools, workflows, and lessons learned for designing, collecting, validating, and using original data when datasets are missing.

Participants will learn how to:
-design and run a citizen science project for investigative reporting, from framing research questions and defining indicators, to engaging and training citizen scientists, data collection, quality control, and journalistic use of the data- establish effective collaboration with scientists during research design, data collection, and verification through shared protocols and a memorandum of understanding
- design a rigorous social research process to investigate community needs and priorities using surveys, interviews, and qualitative methods, including collecting and analysing texts to identify trends, patterns, recurring themes, and undercovered issues
- integrate community-generated data, social research findings, and scientific measurements into original investigations and engaging stories.

The session also demonstrates how this approach can generate multiple, high-impact outputs: original investigative reporting based on co-produced data, policy briefs grounded in empirical evidence, and social research outputs in scientific journals.
Speakers
avatar for Elisabetta Tola

Elisabetta Tola

Founder and editor-in-chief, Facta
Elisabetta is a science, data, and investigative journalist.

She is the co-founder and editor-in-chief of Facta.eu, an Italian independent media outlet that applies the scientific method to journalism and promotes science journalism as a cornerstone of democratic participation.

She... Read More →
Saturday May 30, 2026 5:15pm - 6:00pm CEST
1.16

5:15pm CEST

Which schools are the most exposed to pesticides in your country? How to investigate with data, maps and scientists
Saturday May 30, 2026 5:15pm - 6:30pm CEST
In this session, we’ll show you how we approached a sensitive topic: mapping the potential exposure of all schools to the use of pesticides from surrounding agricultural activities. We will explain  how - working with scientists - we came up with a robust methodology,  found the data we needed and then crunched it to tell us where to go to make truly data-driven reporting on the ground.  

This investigation, published in Le Monde in December 2025, has sparked a lot of national and local interest thanks to its interactive map. Attendees will leave the session with a clear step-by-step guide to get started and adapt the ambition of the investigation to their own capacity. Feel free to bring with you any datasets that would help to map similar issues in your country.

You can read the main story in English here
Free access to the map in French here

Speakers
avatar for Raphaelle Aubert

Raphaelle Aubert

Data journalist, Le Monde
I'm an investigative data journalist at Le Monde. My most recent cross-border collaborations include:Green to Grey: how Europe is destroying the little nature it has left Forever Lobbying Project: revealing the cost of PFAS remediation in EuropeUnder the Surface: 300 Contaminants... Read More →
Saturday May 30, 2026 5:15pm - 6:30pm CEST
Z1.15 - Aula Donche

5:15pm CEST

Academic complicity on Palestine: Crowdsourcing and maintaining an actionable database
Saturday May 30, 2026 5:15pm - 6:30pm CEST
Across Europe, university staff and students have been challenging their institutions' complicity in oppression and occupation in the Palestinian West Bank and genocide in Gaza. This has led to numerous research efforts from ad-hoc coalitions at universities across Europe uncovering their own university's ties to institutions engaging in and enabling this violence, as well as an uptick in interest in existing research, mostly done by actors not in establishment academia.

Academic Complicity (academiccomplicity.eu) is a mapping project collating the most recent research on this and displaying it in a form usable by journalists, activists and the public. The first database was set up for the Netherlands in Summer 2024 and has since spread to Belgium, Germany, France, and Norway. It is currently expanding to Denmark and, in late May, we will have published a new database on Horizon Europe collaborations across Europe.

In this session, we will talk about how we ensured safety and anonymity of our sources and how this affected our decisions; how we maintain the volunteer-run database; how we verify the data. We will discuss our design decision and rationale based on these factors and what lessons we learned from maintaining a database like this. This session is useful to journalists working in the theme of activist repression as well as those who want to build up databases via crowdsourcing. 



Speakers
avatar for Aaron Pereira

Aaron Pereira

Co-Founder/Researcher, Solid Sustainability Research
Together with my colleague Linda Knoester, I work on research, analysis and communication about greenwashing and other forms of climate obstruction. Since 2023, we coordinate a distributed research project with students, academics and action groups mapping relationships between universities... Read More →
Saturday May 30, 2026 5:15pm - 6:30pm CEST
2.02

5:15pm CEST

Darcula Unmasked: Investigating Chinese cybercrime and scam operations
Saturday May 30, 2026 5:15pm - 6:30pm CEST
Hundreds of thousands of Europeans were tricked after receiving text messages on their phone claiming they had missed a payment at a toll road or that they needed to pay a fee for a parcel they ordered. Who is behind it?

In this session, we will share the methods that led us to unmasking the person calling himself “Darcula” - the lead developer of one of the major Chinese phishing-as-a-service platforms. The software Darcula created, called Magic Cat, is used by hundreds of scammers worldwide.

The presentation will also go into analysing a large throve of victim data shared by security researchers, re-constructing the phishing software, and infiltrating Telegram groups. The OSINT section will go into detail about using technical data sources and leveraging visual cues in photos shared by the criminals.

This investigation was led by NRK (the Norwegian Broadcasting Corporation) in collaboration with Germany's public broadcaster ARD and Le Monde.

Come to this session to learn more about Chinese cybercrime and how to investigate it, as well as hear more about the data trove the NRK journalists still have that might be relevant to your country.

The stories (in English) can be read here:
https://www.nrk.no/spesial/inside-the-scam-network-1.17399135
https://www.nrk.no/spesial/the-hunt-for-darcula-1.17399157
Saturday May 30, 2026 5:15pm - 6:30pm CEST
Z1.13 - Aula Hanswijk

5:15pm CEST

Investigating algorithms by land, sea and air
Saturday May 30, 2026 5:15pm - 6:30pm CEST
Algorithms are everywhere, but they're still extremely opaque. They affect us more and more, and often have a meaningful impact on our daily lives—especially when the public administration uses them. But it's incredibly hard to 1) get data and information about how they work, 2) understand it, and 3) explain it in a simple, impactful way.

In this session, we'll share different techniques that Civio has used to research and report on algorithms. We'll look into the ways in which one can crack open the black box of algorithms, from FOIA requests to court battles, reverse engineering to scientific reports. We will also explain how we analyse their outputs and how we make reporting that is often technical and complex more palatable for our readers. In the presentation, we will mention some particular examples from our reporting on algorithms, such as exposing facial recognition in doctor offices, algorithm bias when it comes to cancer detection, and a silly lie detector used by police.

We'll also talk about how we went to the Supreme Court to get the code of a system which decides -wrongly- who receives subsidies, and who doesn't. This session is suitable for beginners - you don't need to have had any experience reporting on algorithms in order to attend and follow the session.
Speakers
avatar for Eva Belmonte

Eva Belmonte

Co-Director, Civio
Graduate of Journalism from the Universitat Autònoma de Barcelona. She arrived to Civio after eight years in the newsroom of El Mundo in Barcelona (2004-2012). Eva Belmonte designs, leads, and monitors all of Civio's journalistic investigations. An expert in the analysis and treatment... Read More →
avatar for Adrián Maqueda

Adrián Maqueda

Data Analysis, Dataviz & Front-end, Civio
Saturday May 30, 2026 5:15pm - 6:30pm CEST
3.02

5:15pm CEST

Reporting from space: Using satellite images for investigations when access is limited
Saturday May 30, 2026 5:15pm - 6:30pm CEST
In this session, we will show how satellite imagery can support investigations into war, militarisation, and environmental changes. The methods discussed are applicable to climate investigations such as land-use change, infrastructure expansion, and environmental damage.

We will cover:
-Getting the image: How to choose between open and commercial satellite imagery based on editorial needs, such as resolution, timeliness, licensing, and costs—and what to be aware of when downloading an image.
-Analysing the image: Measuring distances and object sizes in images (e.g., airplanes), verifying the time an image was taken, and merging different satellite images.
-Presenting the image: Tools to tell stories with satellite images so that readers can understand them intuitively—for example, through before–and–after comparisons, annotations, colour correction, sliders, GIFs, and scrolly-telling, with a focus on mobile-first presentation.

The session is intended for reporters, editors, and visual journalists. No prior experience with satellite imagery is required.
Speakers
avatar for Eike Hoppmann

Eike Hoppmann

OSINT Reporter / Digital Investigations, Neue Zürcher Zeitung
Saturday May 30, 2026 5:15pm - 6:30pm CEST
0.10

5:15pm CEST

Investigative journalism on health: networking roundtable
Saturday May 30, 2026 5:15pm - 6:30pm CEST
Health is one of the most cross-border beats there is. Pharmaceutical companies, regulators, clinical trials, food and tobacco lobbying, environmental hazards, patient harm – none of it stops at borders. Whether you've been on the health beat for years, have just started chasing your first lead, or are simply drawn to the topic and wondering where to begin, come and meet others working in the same space. The goal is to leave the session knowing a few people you'd genuinely want to team up with on the next investigation.
Speakers
avatar for Hristio Boytchev

Hristio Boytchev

Science and health reporter, Berlin
Hristio is a Berlin-based investigative health and science journalist, using data driven methods to tacke research integrity issues and systemic problems in medicine. Hristio is freelance investigations reporter at The BMJ (British Medical Journal) and leader of “Follow the Gra... Read More →
Saturday May 30, 2026 5:15pm - 6:30pm CEST
Z2.01 - Mediadrôme

6:00pm CEST

"The Mechelen Connection" (Escape Room - 2nd session)
Saturday May 30, 2026 6:00pm - 6:30pm CEST
There's a (genuine) story hiding in plain sight. Using a mixture of OSINT skills and clues hidden in the room, you will have 30 minutes to get to the story. First come, first served, max 3-4 teams per session competing to get the exit code and get out!
Speakers
avatar for Jonathan Stoneman

Jonathan Stoneman

Arena for Journalism in Europe
Former BBC journalist, turned datajournalist, trainer, consultant. Works with Arena as Lead Trainer, Arena Academy. 
Saturday May 30, 2026 6:00pm - 6:30pm CEST
3.04

6:00pm CEST

Bluetooth Trackers for Investigations
Saturday May 30, 2026 6:00pm - 6:30pm CEST
Bluetooth trackers can help you develop interesting investigations. This team started using trackers while following two cars from Germany to Siberia, then a parcel from Prague to Moscow. In late 2024, they tracked more than 230 letters sent within Germany, using up to 80 trackers simultaneously. For almost 18 months they tracked 24 items of electronic waste from Germany to places as far afield as Pakistan.

In this session, the team will share the learnings and the technology behind all these projects and the scraping tools and software behind them. They will also bring some trackers and covers to inspire colleagues to use these devices, and share lessons learnt from ongoing collaborations in various countries where other journalists and newsrooms licensed them to help them move their projects forward.
Speakers
avatar for Marcus Lindemann

Marcus Lindemann

geschäftsführender Autor, autoren(werk) GmbH & Co.KG
Marcus Lindemann ist Dozent für Recherche, TV-Journalismus und Presserecht sowie geschäftsführender Autor der TV-Produktionsfirma autoren(werk). Seit 25 Jahren produziert er Magazinbeiträge und Dokumentationen für öffentlich-rechtliche Sender, insbesondere zu Wirtschafts- und... Read More →
Saturday May 30, 2026 6:00pm - 6:30pm CEST

6:00pm CEST

Modern document processing with Natural PDF
Saturday May 30, 2026 6:00pm - 6:30pm CEST
Say hello to Natural PDF, a new Python library for wrangling PDFs that's focused on usability and feature-completeness. Process PDFs with scraping-like selectors and spatially-aware queries, asking for "the red alphanumeric string" or "the content below the big Summary header." Beyond the basics, Natural PDF is also full of modern conveniences like table detection, multiple OCR engines, and citation-aware LLM data extraction.

To get the most out of this session, participants should have experience with Python and struggling with terrible PDFs.
Speakers
avatar for Jonathan Soma

Jonathan Soma

Knight Chair in Data Journalism, Columbia University
Jonathan Soma is the Knight Chair in Data Journalism at Columbia University, where he serves as Director of the Data Journalism MS program and the Lede Program, an intensive data journalism summer course. His lectures cover everything from basic Python and data analysis to interactive... Read More →
Saturday May 30, 2026 6:00pm - 6:30pm CEST
3.05
 
Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
Filtered by Date -