Journal article icon

Journal article

Kinetic predictions for S N 2 reactions using the BERT architecture: comparison and interpretation

Abstract:
The accurate prediction of reaction rates is an integral step in elucidating reaction mechanisms and designing synthetic pathways. Traditionally, kinetic parameters have been derived from activation energies obtained from quantum mechanical (QM) methods and, more recently, machine learning (ML) approaches. Among ML methods, Bidirectional Encoder Representations from Transformers (BERT), a type of transformer-based model, is the state-of-the-art method for both reaction classification and yield prediction. Despite its success, it has yet to be applied to kinetic prediction. In this work, we developed a BERT model to predict experimental log k values of bimolecular nucleophilic substitution (SN2) reactions and compared its performance to the top-performing Random Forest (RF) literature model in terms of accuracy, training time, and interpretability. Both BERT and RF models exhibit near-experimental accuracy (RMSE ≈ 1.1 log k) on similarity-split test data. Interpretation of the predictions from both models reveals that they successfully identify key reaction centres and reproduce known electronic and steric trends. This analysis also highlights the distinct limitations of each; RF outperformed BERT in identifying aromatic allylic effects, while BERT showed stronger extrapolation capabilities.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1039/d5dd00192g

Authors

More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0009-0009-7842-8051
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0009-0001-9676-523X
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0002-0571-2264
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0009-0008-7942-5576
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0009-0007-7935-298X



Publisher:
Royal Society of Chemistry
Journal:
Digital Discovery More from this journal
Publication date:
2025-12-26
Acceptance date:
2025-12-23
DOI:
EISSN:
2635-098X
ISSN:
2635-098X


Language:
English
Keywords:
Pubs id:
2361444
UUID:
uuid_f544cdef-b5d6-4568-a918-880637626ff2
Local pid:
pubs:2361444
Source identifiers:
3660655
Deposit date:
2026-01-14
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP