Skip to main content

News

From Open Data to Industry Impact: A Journey of the BY-COVID Project

From Open Data to Industry Impact: A Journey of the BY-COVID Project 

Within PathOS we are collecting stories on how Open Science (Open Access to publications, Open/FAIR data and software, collaborations with citizens) has made a positive or negative impact. Our ultimate aim is to highlight stories of Open Science practices and how these are linked to impactful outcomes. In this way, we hope to foster a learning experience and to inspire others to follow. Join us and read the first Open Science stories!

Could you briefly introduce yourself and what your Open Science story is about, including its time (e.g. year range) and location? 

I am Despoina Sousoni, the Programme Manager for Impact, Innovation, and Industry at the ELIXIR Hub, the European Research Infrastructure for Bioinformatics. ELIXIR is a distributed digital infrastructure that unites bioinformaticians from 21 countries to manage data, compute, tool, and training resources across various life sciences domains, operating under Open Science principles. Most of these resources are completely open and free, often not even requiring registration. This openness, while beneficial, poses challenges when assessing the impact of these resources. Over the years, we have developed several methods to demonstrate the impact of our resources to funders, ensuring they are well-sustained through community efforts and remain open and free. As part of this journey, we did some work during the BY-COVID project (funded by EOSC, 2021 to 2024), where we demonstrated the value of open infectious disease data for industry innovation related to COVID-19. 

"Quantifying the value of Open Data in innovation is difficult, as academic research often takes many years before it reaches its full potential or becomes an invention and is produced at an industrial scale"

What was the context or background in which this Open Science practice was used? What were the goals or expected outcomes? 

The BY-COVID project was launched in autumn 2021 as part of the European Commission’s HERA incubator plan, 'Anticipating together the threat of COVID-19 variants.' The aim was to consolidate solutions, often rapidly assembled during the COVID-19 pandemic, to support the ongoing response to COVID-19 and prepare for future infectious disease outbreaks. The project aimed to make COVID-19 data easily accessible not only to scientists in laboratories but also to medical staff in hospitals, government officials, and anyone else who could benefit from it. 

One of the key tasks was industry engagement, which aimed to explore the usage and value of COVID-19 data and affiliated resources by industry. This task also aimed to demonstrate the importance of Open Data and research infrastructures during and beyond the COVID-19 pandemic for developing vaccines, medicines, and other industrial products. 

The outcome of this task was a deliverable report entitled "Industry value of Infectious disease data." This report included a desktop research analysis of patents and publications that mentioned the COVID-19 data portal or at least one of its integrated biodata resources. The analysis focused on identifying industrial affiliations and analysing these companies and the inventions. The report also included statements made by industrial representatives in interviews, highlighting the integration of open biodata resources in their R&D work and their operational flexibility during the pandemic. 

What was your role or relationship to this Open Science practice? Who were the key actors involved? 

I had a leading role in the industry-related task of the BY-COVID project, with the ELIXIR Hub managing the overall project. For the industry-related task, I collaborated closely with colleagues from Uppsala University, particularly during the initial stages of industry engagement activities, when we approached companies to extract use cases. After several unsuccessful efforts, I decided to transition to desktop research, where I brought in the expertise I had built over the years from my previous Open Science work in the European Commission and UNESCO, as well as the more recent knowledge gained from the PathOS project and ELIXIR's work on the bioinformatics case study "Innovation from Open Research Resources", through the PathOS Handbook of Indicators and the Cost-Benefit Analysis conducted by CSIL. Based on this transition, we managed to create a compelling narrative for the BY-COVID partners, demonstrating the impact of Open Data on innovation during a health crisis. 

How was this Open Science practice implemented, to your knowledge?  

The transition to desktop research in this task was crucial due to the difficulties in engaging with industry representatives. Based on my experience working in Open Science over the last five years, I can easily say that the hardest Open Science topics are impact assessment and open innovation. Therefore, when I first joined ELIXIR at the end of the COVID-19 pandemic (2022) and started being involved in the BY-COVID project, I very soon saw the opportunity of building a complete story regarding the applications of Open Data in innovation in the time of a crisis, like the COVID-19 pandemic. And this is what we did in this report. 

Were there any quantifiable outcomes or measurable successes linked to this practice? What metrics or indicators were used to evaluate these outcomes, if any? 

This study includes both quantitative and qualitative information to demonstrate the impact of COVID-19 open biodata resources on the operations of the private sector and COVID-19 related innovation. 

The quantitative information focuses on patent and publication analysis, along with further analysis of the affiliated industries. Over 1,000 patent mentions reference at least one of the COVID-19 Data Portal resources (5% of the total found in this search). It is worth mentioning that this number represents only the mentions of the resources' names, not the data included in the resource. 30% of these patents are affiliated with for-profit companies, with the majority of these companies being in the SME size and covering the pharma and biotech sectors. Additionally, we examined the number of citations of these patents, identifying the most impactful inventions, and analysed how many patents the identified companies had in this search (50% of the companies found had more than one patent in this search). These findings highlight the importance and successful integration of open resources in the industry sector. 

The COVID-19 Data Portal was mentioned in scientific articles, with 25 for-profit companies cited. The most cited article referred to the portal as a "great example of international collaboration for building infrastructure for a global approach." 

The qualitative information in this study includes interviews conducted during and beyond this project, as we know that innovation is not always documented in scientific publications or patent filings, and it depends on the company's mandates or operating procedures. Based on the work conducted by Lauer K.B. as part of her thesis (2022), interviewees agreed that Open Data resources, free and without restrictions, are crucial for enabling scientific discovery and benefiting society through job creation, tax contributions, and lifesaving medicines. Interviews were also conducted later in this project, aiming to understand the business operations of companies and research infrastructures that work with industry during and after the COVID-19 pandemic. All agreed that standardised data collection and sharing procedures are crucial for a rapid pandemic response, along with efforts to break the silos and build collaborative approaches in research. 

What impacts, both expected and unexpected, did this practice have? Were there any surprising developments or results? 

The immediate impact of this study has been the demonstration to funders of the socio-economic benefits of the COVID-19 Data Portal and its open resources during the pandemic. This can potentially be translated to more open research infrastructures that play a crucial role in boosting innovation in academia and industry, creating a social mandate to sustain them as open and free-of-charge resources. 

In addition to the socio-economic impact, we also observed the impact on better pandemic preparedness for the future. Some interviewees mentioned the need for standards in data collection and sharing, along with the establishment of flexible guidelines for emergency procedures. These areas are now a focus in upcoming EC-funded projects. 

What challenges were associated with this practice, from your perspective? What lessons can be drawn from its implementation? 

Despite the success and the great outcomes of the BY-COVID project, including the COVID-19 Data Portal, an infectious disease toolkit, and more (see success stories), and the continuous impact of these outcomes in pandemic preparedness (Pathogen Portal, EVORA project), the journey of the industry engagement task in the BY-COVID project was not easy and straightforward. Our initial efforts were focused around surveys and engaging through events, though the issues we identified were: 

  • Industry representatives often do not see the return on investing their time in sharing experiences, or they may not have a full story to share. 
  • COVID-19 research was no longer a high-level priority topic for companies, after early 2022. 
  • There is not a defined methodology to assess the impact of open digital data resources in innovation. 

Therefore, we managed to extract some stories from industrial representatives regarding the usage and benefits of Open Data in their COVID-19 related work, and the combination of quantitative and qualitative evidence was the best way to demonstrate the high integration of open biodata resources in the R&D sector of companies. 

A useful insight that I have kept from this work is that not all companies are willing to mention the usage of open resources in their openly available methodological description, as it might cause replication and procedural questioning. This was an important point to understand the limitations of the information when collecting desktop data and when engaging with companies throughout the project. Therefore, my lessons learned from this work is to ensure a clear communication of the underestimation of the collected numbers due to the limitations of the used methodologies, and the need for positive referencing of the companies that mention the usage of open resources in their methodology. 

How do you perceive this practice's influence on the wider scientific community or society? Has it affected your own views or approaches to research? 

When a digital resource is completely open, it typically does not require user identification. However, this makes it challenging to track who is accessing the data and how it is being used. Additionally, quantifying the value of Open Data in innovation is difficult, as academic research often takes many years before it reaches its full potential or becomes an invention and is produced at an industrial scale. These aspects of Open Science present challenges in assessing the return on value for publicly funded infrastructures and create a continuous race to find impact stories and supplementary data to demonstrate their usage in products and services. This study is an attempt to show how we could start building some good practices and stories tackling these challenges. 

In addition, this work demonstrates the importance of collaborative efforts across domains to build a common infrastructure that benefits scientists in academia and industry, as well as medical staff in hospitals, government officials, and citizens. These topics are very hard to measure the impact of, but they are essential for the success of Open Science. 

Based on your experience or observation, would you recommend this Open Science practice to others? Why or why not? 

Definitely. It is essential to establish mechanisms to continuously monitor the usage of Open Science resources in innovation, and this study highlights that. Implementing good practices is the only way to keep up with the value of Open Data in innovation and address new challenges collectively. This study demonstrates the impact of open resources on company development and in the creation of products and services with high social value. 

Additionally, it is important to ensure positive visibility for companies that are willing to acknowledge the usage of open resources. Better understanding the industrial contributions to research and society can further encourage the adoption of Open Science practices in the private sector. 

Stay tuned with PathOS updates

Sign up for our newsletter!

Follow us on LinkedIn & Twitter!

Do you want to share your own OS story? Join us and share your story here.

OSStories

Read more …From Open Data to Industry Impact: A Journey of the BY-COVID Project

Key Takeaways from the PathOS Workshop on Impact Pathways

In March 2024, PathOS hosted an invitation-only co-creation workshop on Open Science Impact Pathways. Being more than halfway into the project, the PathOS team organised the event to share advanced results of the project work and gather feedback from key Open Science experts. 

The workshop focused on presenting preliminary results from the testing of the Key Impact Pathway (KIP) framework, developed by the project team. Building on evidence from scoping reviews and case studies, PathOS is applying this pathway framework by gathering the evidence from various perspectives, taking into consideration different Open Science activities and categories of impact.

The workshop was an opportunity for participants to have early access to project results (soon to be released with Deliverable 1.3 Key Impact Pathways for Open Science), to provide insightful feedback on the work carried out so far, and help shape the direction of the project’s work to ensure relevance for policy makers and stakeholders.

Participants expressed a strong interest in the framework and preliminary results developed by the project. They also stressed that different elements must be analysed carefully when presenting the impact pathways. Such elements include the data collection process in the form of the scope,  the amount of the screened literature and the type and strength of evidence for the existence (or absence) of impacts.

The discussion identified challenges, such as the availability and diversity of evidence, particularly in delineating societal impacts. Participants underscored the significance of weighing evidence quality and considering alternative sources beyond traditional publications, such as grey literature and archived data. In addition, the detail and context for interventions, such as the type of Open Access, were stressed as crucial elements to address in the final impact pathways.

The workshop proved to be constructive for PathOS, as it provided an external validation of the general approach, whilst resulting in valuable feedback on how to refine the analysis. PathOS will use the gathered information to improve this approach in order to better address the needs of policymakers and other Open Science stakeholders.

Recap written by Izabella Martins Grapengiesser, Lennart Stoy, & Elisa Seminaroti from Technopolis Group. 

Image by rawpixel.com on Freepik

Stay tuned with PathOS updates

Sign up for our newsletter!

Follow us on LinkedIn & Twitter!

Investigating the Cross-Cutting effects of Open Research Data from National Repositories: the EASY case study

Investigating the Cross-Cutting effects of Open Research Data from National Repositories: the EASY case study

By Tim Willense & Vincent Traag, CWTS

Introduction

The EASY case study investigates the impact of data availability on its usage by researchers. It explores whether data shared through national repositories experiences higher uptake compared to other platforms like international or disciplinary repositories. Focused on the EASY repository managed by DANS in the Netherlands, the study analyzes datasets from the SSH, with a specific emphasis on Dutch research. By utilizing DataCite, it correlates data with related publications to assess citation rates, providing insights into the effectiveness of national repositories in promoting data dissemination and scholarly collaboration.

Why was this particular study selected to support testing and operationalization of Open Science indicators?

This study on the cross-cutting effects of open research data from a national repository, specifically focusing on the EASY database, was chosen to support testing and operationalization of Open Science indicators for several reasons. The case study's unique focus on data availability and its impact on data use aligns closely with some of the core objectives of Open Science. By investigating whether scientists exhibit different usage patterns based on the repository from which data is made available, the study provides a nuanced understanding on some of the dynamics involved. The EASY database, managed by DANS in the Netherlands, stands out as a case due to its commitment to the FAIR principles, making it an ideal candidate for assessing the effectiveness of open data repositories. This enables this case study to leverage insights for refining Open Science indicators related to data sharing practices.

Why do you think this is study is important for the broader Open Science context?

In the broader Open Science context, this study holds importance for multiple reasons. Firstly, it addresses a critical gap in understanding the relationship between open data repositories, data reuse, and data referencing. The investigation into the effectiveness of open data repositories in making research data openly available contributes valuable insights. Additionally, by examining how data is referenced in research and its compliance with open data reuse and repositories, the study provides a foundation for developing more effective Open Science practices. Understanding these dynamics is crucial for fostering a transparent, collaborative, and accessible research environment, aligning with the overarching principles of Open Science.

How will this study contribute to the main aims of the project? 

The EASY case study significantly contributes to the main aims of the PathOS project by offering insight into the impact pathways of open science. Specifically, the study sheds light on the impact pathway of open data repositories on open data reuse. By examining factors such as accessibility, findability, data quality, and the role of various contextual elements, the case study highlights mechanisms within the broader open science system. This understanding is pivotal for refining and operationalizing Open Science indicators, providing a comprehensive view of how data repositories influence data uptake and reuse.

What kind of impact is expected to be generated by the results/outcomes of the study for different stakeholder groups?

  • Government: While not directly affected, governments that provide funding for building national data repositories, will have a keen interest in the results. Insights from the study can inform future decisions regarding funding allocation and the development of national-level data repositories.
  • National data repository: The results would be of interest to the national data repository by offering valuable insights into its usage.
  • Scientists: Scientists who contribute data to repositories will benefit from understanding the differences in data uptake across various repositories. The study's outcomes can guide scientists in making informed decisions about where to share their data for optimal visibility and impact within the research community.

The CWTS team

Tim Willemse

Researcher

Tim is a researcher at CWTS with a background in Innovation Studies. He holds a MSc in Innovation Sciences and a BSc in Science and Innovation Management. He has great interest and academic experience in the sustainability transition, in which he has been part of several innovation projects. At CWTS Tim assists European regions in making innovation strategy decisions to address grand societal challenges by providing analytical insights in the mapping, experimentation and implementation phases. In this work Tim focuses on the question what knowledge and contextual characteristics influence the innovation capabilities of different geographical territories. 

Vincent Traag

Senior researcher and bibliometric consultant

Vincent Traag is a senior researcher at the Centre for Science and Technology Studies (CWTS) of Leiden University in the Netherlands. His main interests are mathematical models in the social sciences with a focus on (social) networks. Traag is a core member of the Engagement & Inclusion focal area, where he studies the role of science in societal debates. In addition to his scientific research, Traag also acts as a bibliometric consultant at CWTS.

Traag obtained his Master in sociology (cum laude) from the University of Amsterdam (2008). Coming from a computer science background, and taking up mathematics during his studies in sociology, he went on to obtain a PhD in applied mathematics in Louvain-la-Neuve, Belgium (2013). He joined CWTS in 2015.

More information can be found on his personal website.

Read more …Investigating the Cross-Cutting effects of Open Research Data from National Repositories: the EASY...

From Access to Impact: Exploring the Role of French Open Science Platforms in Measuring Societal Change

From Access to Impact: Exploring the Role of French Open Science Platforms in Measuring Societal Change

By Simon Apartis & Tommaso Venturini, CNRS

Introduction

The presented interview of case study leaders examines how French Open Science platforms — HAL, OpenEdition, and RechercheDataGouv— shape the impact of Open Science policies by analyzing connection logs. It focuses on user access patterns to explore the societal impact of Open Science beyond citation metrics.

Why was this particular study selected to support testing and operationalization of Open Science indicators?

Over the past decade, France has implemented ambitious Open Science (OS) policies underpinned by two National Plans for Open Science (2018–2021 and 2021–2024). These initiatives aim to foster "an ecosystem where scientific research becomes more cumulative, better supported by data, and more transparent, with faster and more universal access to results." The ultimate vision is to make "knowledge accessible to all, serving research, education, the economy, and society." 

As incentives for researchers to publish in Open Access generalized and public funding for OS expanded, two French platforms established in 2000—HAL (Hyper Articles en Ligne) and OpenEdition—have emerged as central pillars in France's open publication and Open Data ecosystem. HAL, in particular, has seen remarkable growth, with total deposits rising from 500,000 in 2010 to 3.5 million by 2024.  

More recently, the RechercheDataGouv Platform, funded as part of the National Plan for Open Science, was launched to promote data recognition as an independent research output. The platform aims to improve reproducibility, reduce redundancy, and encourage the open sharing of research data to increase citation rates. It intends to support researchers in managing their data effectively, enabling them to retrieve, reuse, and decide whether to preserve, share, or delete data, thereby hoping to minimize the digital footprint. 

Along with expanding OS practices and platforms and following the principle of evidence-based steering of public policies, there has been an increasing need for advanced monitoring tools to properly understand and demonstrate the effects fostered and expected by OS policies on science, society and the economy. Launched in 2018, the Open Science Barometer aims to address this need, but much remains to be done to align it with the indicators set by UNESCO´s working group on Open Science monitoring, to dive into the complexity of societal effects, especially on the long run, and not only to monitor but to counterfactually assess and evaluate the effects as consequences causally attributable to OS policies and OS outputs.  

Why do you think this is study is important for the broader Open Science context? 

Because Open Science Platforms act as a pivotal meeting point between, on the one hand, scientific offer and, on the other hand, scientific, economic and societal demand for science, they are crucial to study in order better to understand the effects of OS policies and practices. Access to Open Science through platforms is the first step for its reuse by a wide variety of heterogeneous players, whose societal, economic and scientific practices would not have been impacted and thus be made to impact the world around them in unprecedented ways if OS platforms had not been available in the first place.  

Our study primarily focuses on the preliminary step for impact: access. Access can be studied at the platform level by focusing on the connection logs. Our approach boils down to three core questions. Who are the leading private and public organizations using these platforms? When are these platforms used the most and why? Who are the websites referring to these platforms and how are they connected? 

We will tackle those questions by analysing the connection logs to HAL, OpenEdition and RechercheDataGouv and enriching them with data based on the IP address, referrer-based data and data from OpenAlex.

Figure 1 shows the first result of the first version of this automated classification script run on the list of all the identifiable organizations (~10%) which accessed OpenEdition during the first four days of October 2022 (96% of organizations (4267) domains divided into 22 categories, 5,07% (228) couldn’t be appropriately classified).

Figure 1: Preliminary results - distribution of identifiable organizations accessing OpenEdition, categorized by sector

Our goal is to analyze how patterns of OS usage (specifically, which categories of users access certain types of resources) vary based on their degree of openness. For example, we might find that articles published through diamond Open Access are accessed by a larger proportion of non-academic users compared to articles available only as preprints. This trend could be an indicator of the potential for greater societal impact of scientific work published in Open Access formats.

How will this study contribute to the main aims of the project?  

This case study seeks to understand the role of OS platforms as central nodes in the impact pathways of Open Science. Rather than focusing solely on citations—which primarily reflect the academic diffusion and impact of OS—we expand our analysis to include 1) referrers pointing to OS artifacts and, more importantly, 2) connection logs. This approach aims to develop replicable and scalable tools for assessing the societal impact of OS while laying the foundation for further qualitative analyses of how OS artifacts are used and reused by various actors in society.

Our methodology is closely aligned with PathOS' framework. It is structured around data sprints, fostering collaboration with platform technical teams and experts from the French Ministry of Higher Education. Following PathOS' iterative process, we integrate the causality model insights from meta-analyses on OS impact assessment and recommendations from the Open Science handbook At the same time and in a four-step iterative manner, we address technical challenges, explore the possibilities and constraints of log analysis, and incorporate the practical knowledge of platform technical teams alongside the policy needs reported by the ministry collaborators.

Who are the main actors involved and why are they important within the R&I ecosystem represented in this study?  

Based on a generalist conception of the R&I ecosystem, which still remains a widely discussed term[1], it is possible to distinguish between four main groups of stakeholders that are either indirectly challenged or directly involved in OS platforms. (1) Concurring science dissemination infrastructures, (2) academic science producers and consumers, (3) non-academic science producers and consumers, (4) Open Science professionals who manage the platforms.  Among those four categories, the most significant stakeholders are large commercial publishers; small commercial publishers; not-for-profit publishers; university libraries; universities, research communities and individual researchers at research performing organizations (RPOs); companies; patient groups; citizen sciences, non-governmental organizations (NGOs), activists and citizens; technical, administrative and support department of platforms at RPOs. It is important to note, however, that the extent and nature of these stakeholders' involvement and transformations in the R&I ecosystem remain under investigation. This inquiry includes both quantitative and econometric methods like those employed by PathOS and detailed ethnographic approaches to research ecosystems.

What kind of impact is expected to be generated by the results/outcomes of the study? 

By leading this case study we are already strengthening collaboration between Open Science platforms and the technical teams that run them. Connection logs are the common denominator of thousands of Open Science platforms, and working on a standardized log analysis method could also help achieve broader strategic goals.

For easy future upscaling, we are coding a Node.js middleware for the Ezpaarse toolkit, a set of COUNTER5 compatible digital tools for eletronic resources dedicated to the detection, collection, enrichment and dynamic visualization of usage data, which is already being used by hundreds of platforms. Our development script is available here and will be published on Ezpaarse’s own GitHub repository once it has been approved for production.

Because it relies on widely used standards and Open Source software, our method could be easily replicated and applied to greater sets of connection logs, both from for-pay and Open Access platforms, allowing for greater comparability and scale effects due to the volume of data and an even stronger contractual assessing method if for-pay and Open Access data can be used respectively as a test and control group.

The possible impacts will stem from our tools enabling everyone to easily visualize and intuitively grasp how strongly different degrees of openness influences the kind of actors who access science, possibly advocating for lasting and significant funding and support of OS policies at the highest political level, at times when science is expected to bridge its gap with society, address societal challenges and support social innovation.


[1] See for instance :

  • Altman, Micah et Philip N. Cohen (2022). “The Scholarly Knowledge Ecosystem: Challenges and Opportunities for the Field of Information,” Frontiers in Research Metrics and Analytics, vol. 6. https://doi.org/10.3389/frma.2021.751553
  • Kuehn, Evan F. (2022). “The information ecosystem concept in information literacy: A theoretical approach and definition,” JASIST, pp. 1–10. https://doi.org/10.1002/asi.24733
  • Lyle, Peter, Henrik Korsgaard et Susanne Bødker (2020). “What’s in an Ecology? A Review of Artifact, Communicative, Device and Information Ecologies,” NordiCHI ’20, October 25–29, 2020, Tallinn, Estonia. https://doi.org/10.1145/3419249.3420185
  • Mounier, Pierre et Simon Dumas Primbault (2023). “Sustaining Knowledge and Governing its Infrastructure in the Digital Age: An Integrated View”, Zenodo, 2023. https://doi.org/10.5281/zenodo.10036402
  • Star, Susan Leigh (ed.) (1995). Ecologies of Knowledge: Work and Politics in Science and Technology. New York, State University of New York Press.

 


Read more …From Access to Impact: Exploring the Role of French Open Science Platforms in Measuring Societal...

New Preprint - Introduction to causality in science studies

New Preprint: Introduction to causality in science studies

Sound causal inference is crucial for advancing the study of science. Incorrectly interpreting predictive effects as causal might be ineffective or even detrimental. Many publications in science studies lack appropriate methods to substantiate their causal claims. In this preprint we provide an introduction to structural causal models. Such models allow researchers to make their causal assumptions transparent and provide a foundation for causal inference. We illustrate how to use structural causal models based on simulated data of a hypothetical structural causal model of Open Science. We hope our introduction helps researchers in science studies to consider causality explicitly.


The PathOS context

Concerns of causality are centre stage for the PathOS project. Without a proper understanding of causality, it is impossible to provide proper policy recommendations. For example, imagine we observe that published research using open data is less reproducible. Even if open data does in fact have a positive effect on reproducibility, this negative association might appear if journals select research based on open data and rigour. That is, journals may be more likely to publish research if it has open data, but also if it is more rigorous. If published research has no open data, it therefore tends to be more rigorous, otherwise it would not be published at all. Research that is more rigorous tends to be more reproducible, and this effect might be stronger than the effect of open data. For this reason, the association between open data and reproducibility might be negative, even if the actual causal effect is positive. If we incorrectly interpret the negative association as causal, and then recommend not to incentivise open data, we would be providing ill advice.


Read the preprint here

What's next?

Having a common understanding of causality and structural causal modelling helps the PathOS project interpret the existing literature and the identification of impact pathways. This requires us to differentiate the impact of open science from the effect of openness on that impact. That is: how does the fact that something is open—be it publication, data, code, review—have a causal effect on its impact? The introduction to causality provides such a common understanding. This will be especially important as PathOS builds upon the knowledge gained through our evidence scoping and intervention logic definition to further map and validate Open Science impact pathways and their verification methods (work on which is well underway – watch this space!).

Read more …New Preprint - Introduction to causality in science studies