Decode-gLM: tools to interpret, audit, and steer genomic language models

Maiwald, A; Jedryszek, P; Draye, F; Morris, GM; Crook, OM

AI Collection

Preprint

Decode-gLM: tools to interpret, audit, and steer genomic language models

Abstract:: While genomic language models are enabling the de novo design of entire genomes, they remain challenging to interpret, limiting their trustworthiness. Here, we show that sparse autoencoders (SAEs) trained on Nucleotide Transformer activations decompose hidden representations into interpretable biological features without supervision. Across layers and model sizes, SAEs identified over 100 diverse functional annotations encoded in the model’s activations. This included viral regulatory elements such as the CMV enhancer, despite viral genomes being excluded from training data. Tracing this signal revealed contamination in reference databases, demonstrating that interpretability methods can audit training data and identify hidden data leakage. We then show that Meta-SAEs, trained on the decoder weights of another SAE, can identify conceptual hierarchies encoded in the model, including a more abstract feature related to multiple HIV annotations. We confirmed that the features identified by our SAEs were learned during pretraining through probing a randomly initialised model. Finally, we demonstrate that our SAEs allow us to steer model predictions in biologically meaningful ways, showing that we can use an antibiotic-resistance SAE-feature to steer the model toward the A1408G aminoglycoside-resistance mutation in the ribosomal gene 16S rRNA. Together, these results establish SAEs as a method for both discovery and auditing, providing a toolkit for interpretable and trustworthy genomic foundation models. Readers can explore our findings at https://interpretglm.netlify.app/.

Publication status:: Published

Peer review status:: Not peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Maiwald, A., Jedryszek, P., Draye, F., Morris, G. M., & Crook, O. M. (2025). Decode-gLM: tools to interpret, audit, and steer genomic language models. bioRxiv.

MLA Style

Maiwald, A, et al. Decode-GLM: Tools to Interpret, Audit, and Steer Genomic Language Models. bioRxiv, 2025.

Chicago Style

Maiwald, A, P Jedryszek, F Draye, GM Morris, and OM Crook. 2025. Decode-GLM: Tools to Interpret, Audit, and Steer Genomic Language Models. BioRxiv.
Print

Access Document

Files:: Maiwald_et_al_2025_Decode-gLM_tools_to.pdf

(Preview, Pre-print, pdf, 5.1MB, Terms of use)

Preprint server copy:: 10.1101/2025.10.31.685860

Authors

+ Maiwald, A More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Role:: Author

+ Jedryszek, P More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Biology
Role:: Author

+ Draye, F More by this author

Role:: Author

+ Morris, GM More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Oxford college:: Green Templeton College
Role:: Author
ORCID:: 0000-0003-1731-8405

+ Crook, OM More by this author

Role:: Author

+ Engineering and Physical Sciences Research Council More from this funder

Funder identifier:: https://ror.org/0439y7842
Grant:: EP/S024093/1

Preprint server:: bioRxiv
Publication date:: 2025-11-03
Acceptance date:: 2025-11-03
DOI:: 10.1101/2025.10.31.685860
Server owner:: Cold Spring Harbor Laboratory

Language:: English
Pubs id:: 2336884
UUID:: uuid_62a5fd92-5bb6-48ca-89bf-f70c0325778e
Local pid:: pubs:2336884
Deposit date:: 2025-11-28
ARK identifier:: ark:/29072/ora_62a5fd925bb648ca89bff70c0325778e

Terms of use

Copyright holder:: Maiwald et al.
Rights statement:: The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.

Licence:: CC Attribution-NonCommercial (CC BY-NC)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Preprint

Decode-gLM: tools to interpret, audit, and steer genomic language models

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Preprint

Decode-gLM: tools to interpret, audit, and steer genomic language models

Actions

Access Document

Authors

Funding

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions