Journal article icon

Journal article

An actionable framework for AI‐ready data

Abstract:
Data is the foundation of AI. Poor‐quality data drive up costs and can lead to hidden problems for AI models, especially in complex fields such as healthcare and manufacturing. Meanwhile, biased data negatively affect the performance of AI models, and untested evaluation datasets can result in false positives or overestimates of model accuracy. For data publishers to realize their true potential in supporting the AI ecosystem and its impacts, they should take measures to ensure that their datasets support AI practitioners' needs; in other words, their data should be made AI‐ready. In this article, we present a framework for data publishers to follow to make their datasets AI‐ready. The framework provides specific, actionable guidance based on previous work and experience at the Open Data Institute and augmented with insights from literature and discussions with a range of experts. We first define AI‐ready data before briefly discussing a selection of frameworks in the literature and where they are insufficient. We then provide a visual snapshot of our framework for AI‐ready data, and a subsequent in‐depth discussion of its criteria. Finally, we demonstrate the usage of our framework with a number of example datasets. We conclude by discussing the further steps that should be taken for the entire open data ecosystem to be made AI‐ready in order to realize its true potential in supporting an innovative future.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1002/aaai.70054

Authors

More by this author
Role:
Author
ORCID:
0009-0008-3969-2514
More by this author
Institution:
University of Oxford
Role:
Author


Publisher:
Wiley
Journal:
AI Magazine More from this journal
Volume:
47
Issue:
1
Article number:
e70054
Publication date:
2026-02-21
Acceptance date:
2026-02-11
DOI:
EISSN:
2371-9621
ISSN:
0738-4602


Language:
English
Pubs id:
2384754
Local pid:
pubs:2384754
Source identifiers:
3784378
Deposit date:
2026-02-21
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP