Abstract
Over the last 10 to 15 years, active inference has helped to explain various brain mechanisms from habit formation to dopaminergic discharge and even modelling curiosity. However, the current implementations suffer from an exponential (space and time) complexity class when computing the prior over all the possible policies up to the time-horizon. Fountas et al. (2020) used Monte Carlo tree search to address this problem, leading to impressive results in two different tasks. In this paper, we present an alternative framework that aims to unify tree search and active inference by casting planning as a structure learning problem. Two tree search algorithms are then presented. The first propagates the expected free energy forward in time (i.e., towards the leaves), while the second propagates it backward (i.e., towards the root). Then, we demonstrate that forward and backward propagations are related to active inference and sophisticated inference, respectively, thereby clarifying the differences between those two planning strategies.
Original language | English |
---|---|
Pages (from-to) | 295-316 |
Number of pages | 22 |
Journal | Neural Networks |
Volume | 151 |
Early online date | 6 Apr 2022 |
DOIs | |
Publication status | Published - Jul 2022 |
Bibliographical note
Funding Information:We would like to thank the reviewers for their valuable feedback, which greatly improved the quality of the present paper. LD is supported by the Fonds National de la Recherche, Luxembourg (Project code: 13568875). This publication is based on work partially supported by the EPSRC, United Kingdom Centre for Doctoral Training in Mathematics of Random Systems: Analysis, Modelling and Simulation (EP/S023925/1).
Publisher Copyright:
© 2022 The Author(s)
Keywords
- Active inference
- Free energy principle
- Planning
- Tree search
- Variational message passing
ASJC Scopus subject areas
- Cognitive Neuroscience
- Artificial Intelligence