Conference item icon

Conference item

Priors for symbolic regression

Abstract:
When choosing between competing symbolic models for a data set, a human will naturally prefer the “simpler” expression or the one which more closely resembles equations previously seen in a similar context. This suggests a non-uniform prior on functions, which is, however, rarely considered within a symbolic regression (SR) framework. In this paper we develop methods to incorporate detailed prior information on both functions and their parameters into SR. Our prior on the structure of a function is based on a ngram language model, which is sensitive to the arrangement of operators relative to one another in addition to the frequency of occurrence of each operator. We also develop a formalism based on the Fractional Bayes Factor to treat numerical parameter priors in such a way that models may be fairly compared though the Bayesian evidence, and explicitly compare Bayesian, Minimum Description Length and heuristic methods for model selection. We demonstrate the performance of our priors relative to literature standards on benchmarks and a real-world dataset from the field of cosmology.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Publisher copy:
10.1145/3583133.3596327

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Physics
Sub department:
Astrophysics
Oxford college:
Oriel College
Role:
Author
ORCID:
0000-0001-9426-7723
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Physics
Sub department:
Astrophysics
Oxford college:
St John's College
Role:
Author
ORCID:
0000-0003-0685-9791
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Physics
Sub department:
Astrophysics
Oxford college:
Oriel College
Role:
Author
ORCID:
0000-0002-3021-2851


More from this funder
Funder identifier:
https://ror.org/0472cxd90
Grant:
693024
More from this funder
Funder identifier:
https://ror.org/057g20z61
Grant:
ST/W000903/1
More from this funder
Funder identifier:
https://ror.org/01cmst727


Publisher:
Association for Computing Machinery
Host title:
GECCO '23 Companion: Proceedings of the Companion Conference on Genetic and Evolutionary Computation
Pages:
2402-2411
Publication date:
2023-07-24
Event title:
Companion Conference on Genetic and Evolutionary Computation (GECCO 2023)
Event location:
Lisbon, Portugal
Event website:
https://gecco-2023.sigevo.org/HomePage.html
Event start date:
2023-07-15
Event end date:
2023-07-19
DOI:
ISBN:
9798400701207


Language:
English
Keywords:
Pubs id:
1496273
Local pid:
pubs:1496273
Deposit date:
2025-08-11

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP