Open Research Tools and Technologies

Open research in life science: funding foundational tools, trust, and talent

D.research
Yo Yehudi
Open source software powers incredible scientific and research discoveries, from images of the black hole in 2019, visualising proteins in 3D virtual reality, and open source hardware can affordably save lives during the COVID-19 pandemic. Despite this, the academic credit/metric system does not incentivise or reward activities around code sharing, instead preferring to use research papers, and citations for structured reward and promotion systems. In this short talk, we will share the vision of the Wellcome Trust Data for Science and Health team, which is working over the next five years to incentivise the foundational tools, trust, and talent required to recognise and fund open research software sustainably.
Picture Sam, a researcher in a white coat: they spend hours in the lab with their beakers, test tubes, and agar plates. Once they’ve gathered results they think the world will want to know, they write up their conclusions, publish in a journal, and repeat. While this type of research has existed for a long time, and will continue to do so, this model doesn’t effectively acknowledge newer scenarios, where machines and mass data collection result in what is often referred to by the now-buzzword “big data” – data at a scale where it must be managed and analysed with the use of computers. This type of research gap is often addressed by Joy, the computational researcher. Joy may identify as a coder, a researcher, an engineer, a data scientist, support staff, a lab assistant, or some other title. The main theme is that Joy writes computer code in order to make their job easier, or perhaps even to make it feasible at all. Given that this computer code feeds into and creates the end results, it is reasonable to consider it a research method, albeit a less traditional research method or output than the research paper. In the current world, Joy is typically less supported than a more traditional academic like Sam: Fewer funding calls exist for funding code work, and ever fewer recognise or reward good open source behaviour, even though an incredible number of scientific analyses rely on open source software themselves, and even though the perils of not sharing research code could be completely wrong scientific results that don't look wrong since no-one can see inside the black-box of proprietary code. Things have gotten a little better - there are scientific journals that accept software papers, and even journals that are exclusive to open source software papers, but they're far from the norm. We still have a long way to go. In order to make real progress, research code must be open to allow others to verify, re-use, and build upon it. This includes recognising and funding underfunded areas such as open source community management, accessibility, software maintenance, and user experience. The Wellcome Trust has dedicated £75 million GBP over the next five years towards making trustworthy data science a first-class citizen in the research ecosystem. By sharing our vision with others in the open source world we hope to encourage this attitude, meet others working towards the same goals, and meet prominent contributors to the open source research ecosystem who might be looking for funding opportunities.

Additional information

Type devroom

More sessions

2/6/21
Open Research Tools and Technologies
Albert Yumol
D.research
As technology advances, so as our maps. In this talk, we will explore the ever growing open map data that can help us understand, validate, and explore socio-economic indicators with the aid of network theory and machine learning techniques.
2/6/21
Open Research Tools and Technologies
Olivier Aubert
D.research
We will describe in this talk how to combine crowdsourcing approaches with scientific expertise in Digital Humanities projects, and some of the issues that are at stake. The talk will focus on Recital, a Digital Humanities project aiming at gaining insights on 18th-century theater through the analysis of its accounting books. It combines crowdsourcing, using the ScribeAPI free software, producing results that need to be evaluated and validated by scientific expertise, which requires appropriate ...
2/6/21
Open Research Tools and Technologies
D.research
This talk will focus on our experiences with making open source tools for the study of social media platforms (amongst others, DMI-TCAT for Twitter, the YouTube Data Tools, and 4CAT for forum-like platforms such as Reddit and 4chan) in the context of social science and humanities research. We will discuss questions of reliability and reproducibility, but also how tools are taking part in shaping which questions are being asked and how research is done in practice - making open source ...
2/6/21
Open Research Tools and Technologies
Benjamin Ooghe-Tabanou
D.research
The World Wide Web’s original design as a vast open documentary space built around the concept of hypertext made it a fantastic research field to study networks of actors of a specific field or controversy and analyse their connectivity. Navicrawler, IssueCrawler, Hyphe... Over the past 15 years, a variety of web crawling tools, most often free and open source, have been developped by or for social sciences research labs across the world. They provide means to engage with the web as a research ...
2/6/21
Open Research Tools and Technologies
Béatrice Mazoyer
D.research
Many open-source libraries provide an interface for the Twitter API. However, most people use these tools in temporary scripts for a one-time tweets collection. Moving to a robust application for collecting and indexing tweets over long periods of time requires some programming knowledge that most social science researchers do not master. In order to meet this need, the medialab has developed gazouilloire, a tool that makes it possible to easily configure the collection parameters (keywords ...
2/6/21
Open Research Tools and Technologies
Guillaume Levrier
D.research
PANDORÆ : Retrieving, curating and exploring enhanced corpi through time and space Mapping the state of research in a particular field has been made easier through commercial services providing API-based bibliometric-enhanced corpuses retrieval. Common assertions such as “the use of CRISPR technologies has skyrocketed in laboratories all around the world since 2012” can now be easily verified in both quantitative and qualitative perspectives using those platforms. Such services as ...
2/6/21
Open Research Tools and Technologies
D.research
This is a live panel session which gathers speakers from three lightning talks about web mining tools and technologies.