Where Do All The Leaks Go? OpenAleph Roundtable

SoS Stage H
catileptic (they/she)
Abstract: If you work involves sifting through and making sense of large amounts of data, we welcome you to a session on OpenAleph (openaleph.org), where we can brainstorm (and commiserate) together. We will show practical examples of surfacing interesting leads from leaked data and explore falsely-held beliefs that stand in the way of investigators. Making sense of a large volume of data is marketed as being a textbook use-case for generative AI. We beg to differ. Description: Making sense of large amounts of unstructured data makes users reach for prompting a chatbot. Newsrooms and organizations we work with hope to “ask questions” in a chat window, instead of searching through their data. But these same users rightfully demand accuracy and deterministic results. If they don’t find exactly what they are looking for, they doubt the efficiency of the entire software stack. Our experience, at the Data and Research Center (darc.li) with supporting research and investigations with algorithms and infrastructure led to insights about how to answer difficult questions in a deterministic way. This session will walk the audience through several features that surface names, companies, and other interesting data from large leaks. We will explore conundrums about search and deduplication features, which pose difficult questions for investigators and programmers. There are many falsehoods we tend to believe about our world. What is a name, actually? What constitutes a country, and who decides on that? How do you search for words across several languages, all at once? And how do you reveal hidden links in large volumes of data? All these will be answered without prompting a chatbot, not even once!

Additional information

Type other

More sessions

12/27/25
katy13
Komonin
Astrology is usually associated with horoscopes, prediction, or belief systems. In this self-organised session, we’ll test a different idea: using astrology as a symbolic language to reflect on daily routines, decision-making, and energy management — without fate, mysticism, or “the stars made me do it”. The session is interactive and experimental. We’ll look at how astrological concepts can function similarly to tools people already use: retrospectives, calendars, personality models, ...
12/27/25
blinry
SoS Workshop D
Jujutsu (jj) is a new version control system that uses Git as its backend. Since trying it last year, it has completely replaced Git for me. It manages to be less complex than Git, while giving you more control. I think you'll like it too! Lately, when people ask me complex Git questions, my answer is often: "First, install jj…" And that's only half a joke. :P --- A few things I like about Jujutsu: There's no index, but instead you get a subcommand for splitting changes. Commits have stable ...
12/27/25
htext
SoS Saal 6
How can we work together to improve political decision-making processes in the long term? What do you want from democracy? Motivation: While our democracy can be shaped by the people as they wish on paper, the population seems to be largely dissatisfied with political actions: - The handling of many crises appears to be inadequate - Urgent problems seem to be postponed - Democratic participation seems tedious and ineffective Dissatisfaction mixed with these perceptions can lead to the loss of ...
12/27/25
Kidspace - Workshopraum
Möchtest du uns unterstützen den Kidspace zu einem sicheren Wohlfühlort für Familien zu machen? Dann schließe dich dem Kidspace-Awareness-Team an.
12/27/25
Johannes_Max
SoS Lecture E
Wie ist das Gehirn und das Nervensystem aufgebaut? Was ist Stress und wie geht man effektiv damit um? Wie regeneriert man optimal? All das und viele Hacks lernst du hier.
12/27/25
HouseOfTea
House of Tea
Join us to get things started and be part of our Pu'Er circles! <3
12/27/25
Kidspace - Elektrotisch
Elektrobausteine/Electric circuits with building blocks