Conference item
Towards label-free biological reasoning synthetic dataset creation via uncertainty filtering
- Abstract:
- Synthetic chain-of-thought (CoT) traces are widely used to train large reasoning models (LRMs), improving generalization by providing step-level supervision. Yet most approaches require ground-truth labels to seed or filter these traces—an expensive bottleneck in domains like biology where wet-lab data are scarce. We propose a label-free alternative: uncertainty-based filtering, which uses a model’s own confidence—quantified through established uncertainty metrics like self-consistency and predictive perplexity—as a substitute for external labels. We sample multiple reasoning traces and retain only low-uncertainty subsets. Applied to biological perturbation prediction, a domain where wet-lab labels are especially costly, we show that the filtered subset has higher accuracy, and that supervised fine-tuning (SFT) on uncertainty-filtered data outperforms unfiltered synthetic data, narrows the gap to ground-truth training, and surpasses strong LRM baselines. Ablations show that per-class filtering corrects for class-specific uncertainty scales and that hybrid uncertainty metrics yield higher-quality datasets. Our results suggest that modelinternal confidence is a powerful signal for efficient reasoning dataset creation, enabling LRMs in domains where supervision is expensive.
- Publication status:
- Accepted
- Peer review status:
- Peer reviewed
Actions
Authors
- Publisher:
- Neural Information Processing Systems Foundation
- Acceptance date:
- 2025-10-16
- Event title:
- Thirty-Ninth Annual Conference on Neural Information Processing Systems
- Event location:
- San Diego, California, USA and Mexico City, Mexico
- Event website:
- https://neurips.cc/
- Event start date:
- 2025-11-30
- Event end date:
- 2025-12-07
- Language:
-
English
- Pubs id:
-
2328645
- Local pid:
-
pubs:2328645
- Deposit date:
-
2025-11-17
- ARK identifier:
Terms of use
- Notes:
- This paper was presented at The Thirty-Ninth Annual Conference on Neural Information Processing Systems, 30/11-7/12/2025, San Diego, California, USA and Mexico City, Mexico
If you are the owner of this record, you can report an update to it here: Report update to this record