Session
FOSDEM Schedule 2021
Open Research Tools and Technologies

Datasette

An open source multi-tool for exploring and publishing data
D.research
Simon Willison
<p><a href="https://datasette.io/">Datasette</a> is a tool for exploring and publishing data. It helps people take data of any shape or size, analyze and explore it, and publish it as an interactive website and accompanying API.</p> <p>Datasette is aimed at data journalists, museum curators, archivists, local governments and anyone else who has data that they wish to share with the world. It is part of a wider ecosystem of tools and plugins dedicated to making working with structured data as productive as possible.</p>

I’ll use this talk to introduce Datasette and show how it can be used to quickly explore and publish data.

I’ll talk about why SQLite is an ideal publishing format for structured data and demonstrate several open source tools for converting data from different sources into SQLite databases for use with Datasette.

Datasette is the foundation of my Dogsheep personal data warehouse project. I’ll show how I’ve pulled together data about my own life from a wide range of sources - from Twitter and GitHub to 23AndMe and Apple Health - and used it to build my own data warehouse to help me answer questions about myself.

Additional information

Type devroom

More sessions

2/6/21
Open Research Tools and Technologies
Albert Yumol
D.research
<p>As technology advances, so as our maps. In this talk, we will explore the ever growing open map data that can help us understand, validate, and explore socio-economic indicators with the aid of network theory and machine learning techniques.</p>
2/6/21
Open Research Tools and Technologies
Olivier Aubert
D.research
<p>We will describe in this talk how to combine crowdsourcing approaches with scientific expertise in Digital Humanities projects, and some of the issues that are at stake. The talk will focus on Recital, a Digital Humanities project aiming at gaining insights on 18th-century theater through the analysis of its accounting books. It combines crowdsourcing, using the ScribeAPI free software, producing results that need to be evaluated and validated by scientific expertise, which requires ...
2/6/21
Open Research Tools and Technologies
D.research
<p>This talk will focus on our experiences with making open source tools for the study of social media platforms (amongst others, DMI-TCAT for Twitter, the YouTube Data Tools, and 4CAT for forum-like platforms such as Reddit and 4chan) in the context of social science and humanities research. We will discuss questions of reliability and reproducibility, but also how tools are taking part in shaping which questions are being asked and how research is done in practice - making open source ...
2/6/21
Open Research Tools and Technologies
Benjamin Ooghe-Tabanou
D.research
<p>The World Wide Web’s original design as a vast open documentary space built around the concept of hypertext made it a fantastic research field to study networks of actors of a specific field or controversy and analyse their connectivity. <a href="https://medialab.sciencespo.fr/outils/navicrawler/"><em>Navicrawler</em></a>, <a href="https://www.issuecrawler.net/"><em>IssueCrawler</em></a>, <a href="https://hyphe.medialab.sciences-po.fr/"><em>Hyphe</em></a>... Over the past 15 years, a ...
2/6/21
Open Research Tools and Technologies
Béatrice Mazoyer
D.research
<p>Many open-source libraries provide an interface for the Twitter API. However, most people use these tools in temporary scripts for a one-time tweets collection. Moving to a robust application for collecting and indexing tweets over long periods of time requires some programming knowledge that most social science researchers do not master. In order to meet this need, the medialab has developed gazouilloire, a tool that makes it possible to easily configure the collection parameters (keywords ...
2/6/21
Open Research Tools and Technologies
Guillaume Levrier
D.research
<p>PANDORÆ : Retrieving, curating and exploring enhanced corpi through time and space Mapping the state of research in a particular field has been made easier through commercial services providing API-based bibliometric-enhanced corpuses retrieval. Common assertions such as “the use of CRISPR technologies has skyrocketed in laboratories all around the world since 2012” can now be easily verified in both quantitative and qualitative perspectives using those platforms. Such services as ...
2/6/21
Open Research Tools and Technologies
D.research
<p>This is a live panel session which gathers speakers from three lightning talks about web mining tools and technologies.</p>