Journal article

Tree search in DAG space with an arbitrary ordering of the initial edges with model-based reinforcement learning for causal discovery

Abstract:
Identifying causal structure is central to many fields ranging from strategic decision making to biology and economics. In this work, we propose Causal Discovery Upper Confidence Bound for Trees (CD-UCT), a model-based reinforcement learning (RL) method for causal discovery based on tree search that builds directed acyclic graphs (DAGs) incrementally. We also formalize and prove the correctness of an efficient algorithm for excluding edges that would introduce cycles, which enables deeper discrete search and sampling. The proposed method can be applied broadly to causal Bayesian networks with both discrete and continuous random variables. We conduct a comprehensive evaluation on synthetic and real-world datasets showing that CD-UCT substantially outperforms both the state-of-the-art model-free RL technique that operates in DAG space and greedy search, constituting a promising advancement for combinatorial methods.
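Illustrative note: the abstract describes building a DAG one edge at a time while excluding edges that would introduce cycles. The following is a minimal Python sketch of that general idea only, not the algorithm formalized in the article. It assumes nodes are labelled `0..n-1`, and the helper names `reachable` and `valid_edges` are hypothetical. It recomputes reachability from scratch with a depth-first search, which is simple but presumably less efficient than the method the authors prove correct.

```python
from itertools import permutations


def reachable(adj, start):
    """Return the set of nodes reachable from `start`, including `start` itself."""
    seen = {start}
    stack = [start]
    while stack:
        node = stack.pop()
        for nxt in adj[node]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def valid_edges(n, edges):
    """List the directed edges (u, v) that can be added without creating a cycle.

    Assumption (not from the article): `edges` already forms a DAG over nodes
    0..n-1. Adding u -> v closes a cycle exactly when u is reachable from v.
    """
    adj = {i: set() for i in range(n)}
    for u, v in edges:
        adj[u].add(v)
    descendants = {i: reachable(adj, i) for i in range(n)}
    existing = set(edges)
    return [
        (u, v)
        for u, v in permutations(range(n), 2)
        if (u, v) not in existing and u not in descendants[v]
    ]
```

For the chain 0 -> 1 -> 2, only the edge (0, 2) remains addable, since every other candidate would either duplicate an existing edge or close a cycle. A search procedure would call such a routine after each edge insertion to obtain the set of legal next actions.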
Publication status:
Published
Peer review status:
Peer reviewed

Files:
Publisher copy:
10.1098/rspa.2024.0450

Authors

Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
ORCID:
0000-0001-9250-8175
Role:
Author
ORCID:
0000-0001-9712-4090


Publisher:
Royal Society
Journal:
Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences
Volume:
481
Issue:
2312
Article number:
20240450
Publication date:
2025-04-16
Acceptance date:
2024-10-08
DOI:
10.1098/rspa.2024.0450
EISSN:
1471-2946
ISSN:
1364-5021


Language:
English
Pubs id:
2123404
Local pid:
pubs:2123404
Deposit date:
2025-05-21