Journal article icon

Journal article

Curator – a data curation tool for clinical real-world evidence

Abstract:

Objective
This research aims to establish an efficient, systematic, reproducible, and transparent solution for advanced curation of real-world data, which are highly complex and represent an invaluable source of information for academia and industry.

Materials and methods
We propose a novel software solution that splits the statistical analytical pipeline into two phases. The first phase is implemented through Curator, which performs data engineering and data modelling on deidentified real-world data to achieve advanced curation and provides selected information ready to be analyzed in the second phase by statistical packages. Curator is made of a suite of Python programs and uses MySQL as its database management system. Curator has been utilised with several UK primary and secondary care data sources.

Results
Curator has been used in 25 completed clinical and health economics research studies. Their output has been published in 2 NIHR-funded reports and 33 prestigious international peer-reviewed journals and presented at 38 global conferences. Curator has consistently reduced research time and costs by over 36% and made research more reproducible and transparent.

Discussion
Curator fits in well with recent UK governmental guidelines that recognise health data curation as a complex standalone technical challenge. Curator has been used extensively on UK real-world data and can handle several linked datasets. However, for Curator to be accessed by a wider audience, it needs to become more user-friendly.

Conclusion
Curator has proven to be a cost-effective and trustworthy data curation tool, which should be developed further and made available to third parties.

Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Publisher copy:
10.1016/j.imu.2023.101291

Authors


More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Botnar Research Centre
Role:
Author
ORCID:
0000-0003-0388-3403
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Botnar Research Centre
Role:
Author


Publisher:
Elsevier
Journal:
Informatics in Medicine Unlocked More from this journal
Volume:
40
Article number:
101291
Publication date:
2023-06-07
Acceptance date:
2023-06-05
DOI:
EISSN:
2352-9148


Language:
English
Keywords:
Pubs id:
1393764
Local pid:
pubs:1393764
Deposit date:
2023-06-13

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP