Journal article icon

Journal article

Comparing human vs. machine-assisted analysis to develop a new approach for Big Qualitative Data Analysis

Abstract:
Background: The exponential growth of Big Qualitative (Big Qual) data in healthcare research presents methodological challenges for traditional analysis approaches. This study evaluates the effectiveness of machine-assisted analysis using artificial intelligence (AI) tools compared to human-only analysis for processing large-scale qualitative datasets, using the Royal College of Anaesthetists’ 7th National Audit Project (NAP7) baseline survey as a test case. Methodology/Principal Findings: We conducted a comparative methodological study analysing 5,196 free-text responses about peri-operative cardiac arrest experiences. Three researchers established a human-coded reference standard following SRQR guidelines. We then applied machine-assisted analysis using Pulsar for exploratory analysis and Caplena for sentiment and thematic analysis, evaluating performance against the human gold standard using STARD-AI reporting standards. Performance metrics included accuracy, precision, recall, F1-scores, and Cohen’s Kappa, with confidence intervals calculated using bootstrap resampling. Machine-assisted analysis substantially reduced analysis time, with particularly dramatic improvements in theme identification speed. The machine-assisted approach achieved good thematic and sentiment classification accuracy compared to the human reference standard, though human analysis identified an emergent ‘ambiguous’ sentiment category that current AI tools cannot accommodate, highlighting limitations in commercial platforms’ flexibility for inductive analysis. Conclusions/Significance: Machine-assisted analysis offers substantial efficiency gains with acceptable accuracy trade-offs for large-scale qualitative data analysis. However, human expertise remains essential for capturing nuanced meanings, identifying emergent categories, and providing domain-specific interpretation. This hybrid approach represents a viable methodology for Big Qual research, though current AI tools’ constraints in accommodating emergent classification schemes remain a limitation. Our findings establish benchmarks for future development of more flexible AI systems adapted to qualitative research paradigms.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Authors

More by this author
Institution:
University of Oxford
Division:
MSD
Department:
Paediatrics
Sub department:
Paediatrics
Role:
Author
More by this author
Role:
Author
ORCID:
0000-0002-1832-0286
More by this author
Role:
Author
ORCID:
0000-0001-9479-0143


More from this funder
Funder identifier:
10.13039/501100000272
Grant:
NIHR204297
More from this funder
Funder identifier:
https://ror.org/0187kwz08
Grant:
NIHR204297


Publisher:
Public Library of Science
Journal:
PLOS Digital Health More from this journal
Volume:
5
Issue:
2
Pages:
e0000576
Article number:
e0000576
Publication date:
2026-02-25
Acceptance date:
2026-02-08
DOI:
EISSN:
2767-3170
ISSN:
2767-3170


Language:
English
Pubs id:
2384296
Local pid:
pubs:2384296
Source identifiers:
3800153
Deposit date:
2026-02-25
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP