Journal article
Gradient-Free De Novo Learning
- Abstract:
- This technical note applies active inference to the problem of learning goal-directed behaviour from scratch, namely, de novo learning. By de novo learning, we mean discovering, directly from observations, the structure and parameters of a discrete generative model for sequential policy optimisation. Concretely, our procedure grows and then reduces a model until it discovers a pullback attractor over (generalised) states; this attracting set supplies paths of least action among goal states while avoiding costly states. The implicit efficiency rests upon reframing the learning problem through the lens of the free energy principle, under which it is sufficient to learn a generative model whose dynamics feature such an attracting set. For context, we briefly relate this perspective to value-based formulations (e.g., Bellman optimality) and then apply the active inference formulation to a small arcade game to illustrate de novo structure learning and ensuing agency.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 3.0MB, Terms of use)
-
- Publisher copy:
- 10.3390/e27090992
Authors
- Publisher:
- MDPI
- Journal:
- Entropy More from this journal
- Volume:
- 27
- Issue:
- 9
- Pages:
- 992
- Publication date:
- 2025-09-22
- Acceptance date:
- 2025-09-12
- DOI:
- EISSN:
-
1099-4300
- ISSN:
-
1099-4300
- Pmid:
-
41008118
- Language:
-
English
- Keywords:
- Pubs id:
-
2297330
- Local pid:
-
pubs:2297330
- Source identifiers:
-
3342377
- Deposit date:
-
2025-10-05
- ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
Terms of use
- Copyright date:
- 2025
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record