Journal article
Tree search in DAG spacewith an arbitrary ordering of the initial edges with model-based reinforcement learning for causal discovery
- Abstract:
- Identifying causal structure is central to many fields ranging from strategic decision making to biology and economics. In this work, we propose Causal Discovery Upper Confidence Bound for Trees (CD-UCT), a model-based reinforcement learning (RL) method for causal discovery based on tree search that builds directed acyclic graphs (DAGs) incrementally. We also formalize and prove the correctness of an efficient algorithm for excluding edges that would introduce cycles, which enables deeper discrete search and sampling. The proposed method can be applied broadly to causal Bayesian networks with both discrete and continuous random variables. We conduct a comprehensive evaluation on synthetic and real-world datasets showing that CD-UCT substantially outperforms the state-of-the-art model-free RL technique that operates in DAG space and greedy search, constituting a promising advancement for combinatorial methods.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 2.1MB, Terms of use)
-
- Publisher copy:
- 10.1098/rspa.2024.0450
Authors
- Publisher:
- Royal Society
- Journal:
- Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences More from this journal
- Volume:
- 481
- Issue:
- 2312
- Article number:
- 20240450
- Publication date:
- 2025-04-16
- Acceptance date:
- 2024-10-08
- DOI:
- EISSN:
-
1471-2946
- ISSN:
-
1364-5021
- Language:
-
English
- Keywords:
- Pubs id:
-
2123404
- Local pid:
-
pubs:2123404
- Deposit date:
-
2025-05-21
- ARK identifier:
Terms of use
- Copyright holder:
- Darvariu et al.
- Copyright date:
- 2025
- Rights statement:
- © 2025 The Authors. Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record