Journal article icon

Journal article

Genetic prediction with ARG-powered linear algebra

Abstract:
Ancestral recombination graphs (ARGs) are an attractive means for quantitative genetic analysis of complex traits because they encode the realized genetic relatedness between a sample of individuals in the presence of genetic drift, recombination, and mutation. Data structures for efficiently storing ARGs can also be used to rapidly process millions of genomes, and are thus promising for fitting linear mixed models to large phenotype and genome datasets. Here, we study the problems of variance component estimation and prediction of genetic values with ARGs, by describing a generative model of complex traits with additive effects on an ARG, and then developing algorithms that use the ARG to solve these problems efficiently on biobank-scale datasets. We observe nearly linear scaling of runtime with sample size, which is achieved by using the succinct tree sequence representation of the ARG for implicit matrix-vector products, along with modern randomized linear algebra algorithms. We estimate variance components using restricted maximum likelihood, which we find performs substantially better than the Haseman–Elston method. In simulation tests, both variance component estimation and prediction of genetic values (using the best linear unbiased predictor) perform nearly as well with inferred ARGs as with true ARGs. We also discuss interpretations of the variance component estimates as mutational variance and additive genetic variance. We provide an implementation of the algorithms as a Python package tslmm, which leverages the tree sequence library tskit.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Authors

More by this author
Role:
Author
ORCID:
0000-0002-4545-0027
More by this author
Role:
Author
ORCID:
0000-0001-8409-7812
More by this author
Institution:
University of Oxford
Department:
Big Data Institute
Role:
Author
ORCID:
0000-0002-7894-5253
More by this author
Role:
Author
ORCID:
0000-0001-8008-2787
More by this author
Role:
Author
ORCID:
0000-0002-9459-6866


More from this funder
Funder identifier:
10.13039/501100000268
Grant:
BB/T014067/1
More from this funder
Funder identifier:
10.13039/100005187
Grant:
346741
More from this funder
Funder identifier:
10.13039/100013398
Grant:
BBS/E/D/30002275
More from this funder
Funder identifier:
10.13039/100000051
Grant:
HG012473
More from this funder
Funder identifier:
10.13039/100000002


Publisher:
Oxford University Press
Journal:
Genetics More from this journal
Volume:
233
Issue:
1
Pages:
iyag074
Article number:
iyag074
Publication date:
2026-03-27
Acceptance date:
2026-03-12
DOI:
EISSN:
1943-2631
ISSN:
0016-6731


Language:
English
Keywords:
Pubs id:
2398021
Local pid:
pubs:2398021
Source identifiers:
4018602
Deposit date:
2026-05-06
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP