Journal article icon

Journal article

Deep learning‐based ecological analysis of camera trap images is impacted by training data quality and quantity

Abstract:
Large image collections generated from camera traps offer valuable insights into species richness, occupancy, and activity patterns, significantly aiding biodiversity monitoring. However, the manual processing of these data sets is time‐consuming, hindering analytical processes. To address this, deep neural networks have been widely adopted to automate image labelling, but the impact of classification error on key ecological metrics remains unclear. Here, we analyze data from camera trap collections in an African savannah (82,300 labelled images, 47 species) and an Asian sub‐tropical dry forest (40,308 labelled images, 29 species) to compare ecological metrics derived from expert‐generated species identifications with those generated by deep‐learning classification models. We specifically assess the impact of deep‐ learning model architecture, the proportion of label noise in the training data, and the size of the training data set on three key ecological metrics: species richness, occupancy, and activity patterns. We found that predictions of species richness derived from deep neural networks closely match those calculated from expert labels and remained resilient to up to 10% noise in the training data set (mis‐labelled images) and a 50% reduction in the training data set size. We found that our choice of deep‐learning model architecture (ResNet vs. ConvNext‐T) or depth (ResNet18, 50, 101) did not impact predicted ecological metrics. In contrast, species‐specific metrics were more sensitive; less common and visually similar species were disproportionately affected by a reduction in deep neural network accuracy, with consequences for occupancy and diel activity pattern estimates. To ensure the reliability of their findings, practitioners should prioritize creating large, clean training sets and account for class imbalance across species over exploring numerous deep‐learning model architectures.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.1002/rse2.70052

Authors

More by this author
Role:
Author
ORCID:
0000-0001-9289-3026


More from this funder
Funder identifier:
https://ror.org/001aqnf71
More from this funder
Funder identifier:
https://ror.org/02b5d8509


Publisher:
Wiley
Journal:
Remote Sensing in Ecology and Conservation More from this journal
Publication date:
2026-01-11
Acceptance date:
2025-12-04
DOI:
EISSN:
2056-3485
ISSN:
2056-3485


Language:
English
Keywords:
Pubs id:
2361423
UUID:
uuid_489b7e1c-4216-42d6-85a5-a8e1e84a7232
Local pid:
pubs:2361423
Source identifiers:
3654053
Deposit date:
2026-01-12
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP