A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences

Ammar Ammar; Serena Bonaretti; Laurent Winckers; Joris Quik; Martine Bakker; Dieter Maier; Iseult Lynch; Jeaphianne van Rijn; Egon Willighagen

doi:10.3390/nano10102068

Earth and Environmental Sciences

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets.

Original language	English
Article number	2068
Pages (from-to)	1-14
Number of pages	14
Journal	Nanomaterials
Volume	10
Issue number	10
DOIs	https://doi.org/10.3390/nano10102068
Publication status	Published - 20 Oct 2020

Keywords

FAIR guidelines
FAIR maturity indicators
Jupyter Notebook
Life sciences

ASJC Scopus subject areas

Chemical Engineering(all)
Materials Science(all)

Access to Document

10.3390/nano10102068
Licence: Creative Commons: Attribution (CC BY)

Cite this

@article{c9d961c2d49640ee8a0db9c7ef7998b8,

title = "A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences",

abstract = "Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets.",

keywords = "FAIR guidelines, FAIR maturity indicators, Jupyter Notebook, Life sciences",

author = "Ammar Ammar and Serena Bonaretti and Laurent Winckers and Joris Quik and Martine Bakker and Dieter Maier and Iseult Lynch and van Rijn, Jeaphianne and Egon Willighagen",

year = "2020",

month = oct,

day = "20",

doi = "10.3390/nano10102068",

language = "English",

volume = "10",

pages = "1--14",

journal = "Nanomaterials",

issn = "2079-4991",

publisher = "MDPI",

number = "10",

}

TY - JOUR

T1 - A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences

AU - Ammar, Ammar

AU - Bonaretti, Serena

AU - Winckers, Laurent

AU - Quik, Joris

AU - Bakker, Martine

AU - Maier, Dieter

AU - Lynch, Iseult

AU - van Rijn, Jeaphianne

AU - Willighagen, Egon

PY - 2020/10/20

Y1 - 2020/10/20

N2 - Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets.

AB - Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets.

KW - FAIR guidelines

KW - FAIR maturity indicators

KW - Jupyter Notebook

KW - Life sciences

UR - https://doi.org/10.3390/nano10102068

UR - http://www.scopus.com/inward/record.url?scp=85093684755&partnerID=8YFLogxK

U2 - 10.3390/nano10102068

DO - 10.3390/nano10102068

M3 - Article

C2 - 33092028

SN - 2079-4991

VL - 10

SP - 1

EP - 14

JO - Nanomaterials

JF - Nanomaterials

IS - 10

M1 - 2068

ER -
