Journal article icon

Journal article

Machine learning to attribute the source of Campylobacter infections in the United States: A retrospective analysis of national surveillance data

Abstract:

Objectives: Integrating pathogen genomic surveillance with bioinformatics can enhance public health responses by identifying risk and guiding interventions. This study focusses on the two predominant Campylobacter species, which are commonly found in the gut of birds and mammals and often infect humans via contaminated food. Rising incidence and antimicrobial resistance (AMR) are a global concern, and there is an urgent need to quantify the main routes to human infection.


Methods: During routine US national surveillance (2009–2019), 8856 Campylobacter genomes from human infections and 16,703 from possible sources were sequenced. Using machine learning and probabilistic models, we target genetic variation associated with host adaptation to attribute the source of human infections and estimate the importance of different disease reservoirs.


Results: Poultry was identified as the primary source of human infections, responsible for an estimated 68% of cases, followed by cattle (28%), and only a small contribution from wild birds (3%) and pork sources (1%). There was also evidence of an increase in multidrug resistance, particularly among isolates attributed to chickens.


Conclusions: National surveillance and source attribution can guide policy, and our study suggests that interventions targeting poultry will yield the greatest reductions in campylobacteriosis and spread of AMR in the US.


Data availability: All sequence reads were uploaded and shared on NCBI’s Sequence Read Archive (SRA) associated with BioProjects; PRJNA239251 (CDC / PulseNet surveillance), PRJNA287430 (FSIS surveillance), PRJNA292668 & PRJNA292664 (NARMS) and PRJNA258022 (FDA surveillance). Publicly available genomes, including reference genomes and isolates sampled worldwide from wild birds are associated with BioProject accessions: PRJNA176480, PRJNA177352, PRJNA342755, PRJNA345429, PRJNA312235, PRJNA415188, PRJNA524300, PRJNA528879, PRJNA529798, PRJNA575343, PRJNA524315 and PRJNA689604. Contiguous assemblies of all genome sequences compared are available at Mendeley data (assembled C. coli genomes doi: 10.17632/gxswjvxyh3.1; assembled C. jejuni genomes doi: 10.17632/6ngsz3dtbd.1) and individual project and accession numbers can be found in Supplementary tables S1 and S2, which also includes pubMLST identifiers for assembled genomes. Figshare (10.6084/m9.figshare.20279928). Interactive phylogenies are hosted on microreact separately for C. jejuni (https://microreact.org/project/pascoe-us-cjejuni) and C. coli (https://microreact.org/project/pascoe-us-ccoli).

Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publisher copy:
10.1016/j.jinf.2024.106265

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Biology
Role:
Author
ORCID:
0000-0001-6376-5121
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Biology
Role:
Author
ORCID:
0000-0002-7411-4743


More from this funder
Funder identifier:
https://ror.org/029chgv08
Grant:
218205/Z/19/Z
MR/L015080/1


Publisher:
Elsevier
Journal:
Journal of Infection More from this journal
Volume:
89
Issue:
5
Article number:
106265
Place of publication:
England
Publication date:
2024-09-07
Acceptance date:
2024-08-30
DOI:
EISSN:
1532-2742
ISSN:
0163-4453
Pmid:
39245152


Language:
English
Keywords:
Pubs id:
2030247
Local pid:
pubs:2030247
Deposit date:
2024-09-16

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP