Loading…
Type: Data skills clear filter
arrow_back View All Dates
Sunday, May 31
 

9:30am CEST

Update your google skills
Sunday May 31, 2026 9:30am - 10:45am CEST
Google search is all the same since 1996? No, Google does change over time, but so slow that most people will not notice. The session will give you an update about recent changes (i.e. in the last 4-5 years), will point at workarounds where necessary and will show you what is really new and useful. Towards the end of the session it will give you some advanced Google dorks for immediate journalistic use, but also inspire you to build your own dorks and how to combine LLMs and Google searches.

To follow along, the participants should have used google operators before. After attending the session, you will have an up to date knowledge of Googles web search and other tools for journalistic use.

A Google account can be useful, but is not a must-have.
Speakers
avatar for Marcus Lindemann

Marcus Lindemann

geschäftsführender Autor, autoren(werk) GmbH & Co.KG
Marcus Lindemann ist Dozent für Recherche, TV-Journalismus und Presserecht sowie geschäftsführender Autor der TV-Produktionsfirma autoren(werk). Seit 25 Jahren produziert er Magazinbeiträge und Dokumentationen für öffentlich-rechtliche Sender, insbesondere zu Wirtschafts- und... Read More →
Sunday May 31, 2026 9:30am - 10:45am CEST
3.05

11:15am CEST

A map for every reader: how to generate hundreds of images for multiple audiences or partners using QGIS and Python
Sunday May 31, 2026 11:15am - 12:30pm CEST
The BBC Shared Data Unit wanted to generate a map image for each authority in the UK showing the state of flood defences in that area — so they turned to the mapping tool QGIS’s built-in Python functionality.

In this session, you will learn how to generate and export dozens of maps in QGIS centred at different points, and how AI can help speed up the process.

To follow along, participants should have some basic knowledge of QGIS and be comfortable using Python or vibe coding.

After attending this session, participants should be able to understand how Python works in QGIS and use AI to help generate, understand, and adapt code. Participants should have QGIS and Python installed on the computer (qgis.org/download + python.org/downloads) and a free account with an AI tool such as ChatGPT, Gemini, or Claude
Speakers
avatar for Paul Bradshaw

Paul Bradshaw

Journalist and Academic, BBC/Birmingham City University
Paul Bradshaw runs the MA in Data Journalism at Birmingham City University and also works as a consulting data journalist with the BBC Shared Data Unit. A journalist, writer and trainer, he has worked with news organisations including The Guardian, Telegraph, Mirror, Der Tagesspi... Read More →
avatar for Ioanna Petsiou

Ioanna Petsiou

Data Journalist, Freelancer
Ioanna Petsiou is an investigative data journalist working across data analysis, satellite imagery, and mapping to uncover and explain complex stories. She is particularly drawn to environmental reporting and to building clear, reproducible ways of working with data that others can... Read More →
Sunday May 31, 2026 11:15am - 12:30pm CEST
3.02

11:15am CEST

Text embeddings: navigating text in high dimensions
Sunday May 31, 2026 11:15am - 12:30pm CEST
Most "big data" problems in journalism aren't really data problems, they're reading problems: a big leak, a ministry dump of 12,000 pages, or a FOI coming back as zip of PDFs. The instinct is to search, but keyword search assumes you already know what you're looking for. Which sometimes is the thing you don't know yet.

This session introduces embeddings: a technique that turns any text into a point in space, positioned by meaning, so texts with similar meaning end up close together. You stop searching a pile and start looking at it.

To make the idea tangible, we'll walk through a live semantic map we built of Google's "trending now" feeds from 125 countries, projected into 3D.

The method applies beyond trending searches and is applicable to TikTok captions, YouTube transcripts, court filings, a scraped forum, or years of parliamentary speeches.

We'll cover the full workflow end to end: how to embed your corpus, how to project it without losing what matters, how to build a map you can actually navigate, and where this approach breaks.

To follow along, participants should be comfortable running basic Python scripts on their laptop or in google collab.

After attending this session, participants will be able to take a large, unstructured text corpus and turn it into a navigable semantic map.

Participants should have Python installed on their computer, or have a google account where they can run collab. A Hugging Face account is recommended for generating embeddings. We will provide examples of text to work with, but if you have your own collection, feel free bring it, but make sure it's in a text format, as we won't cover how to convert PDF's into text.
Speakers
avatar for Johan Schujit

Johan Schujit

Data Engineer, Resolve.
I'm a data engineer responsible for EveryPolitician and PoliLoom at OpenSanctions. I'm a self-taught hacker with a stubborn belief that good data should be open and technology should serve the public interest. Previously at Follow the Money.

avatar for Ada Homolova

Ada Homolova

ARENA, Austria/ Slovakia
Adriana is a freelance data journalist, trainer and public spending nerd. She coordinates the data skills training track on the Dataharvest conference, and herds frogs at The Pond.

https://homolova.sk/newsletter
Sunday May 31, 2026 11:15am - 12:30pm CEST
1.14
 
Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
Filtered by Date -