We develop an approach to policy selection in active inference that allows us to efficiently search large policy spaces by mapping each policy to its embedding in a vector space. We sample the expected free energy of representative points in the space, then perform a more thorough policy search around the most promising point in this initial sample. We consider various approaches to creating the policy embedding space, and propose using k-means clustering to select representative points. We apply our technique to a goal-oriented graph-traversal problem, for which naive policy selection is intractable for even moderately large graphs.
翻译:我们以积极的推论来制定政策选择方法,使我们能够通过绘制每一政策图来有效搜索大型政策空间,将其植入矢量空间。我们抽样研究空间中代表点的预期自由能量,然后围绕初步抽样中最有希望的点进行更彻底的政策搜索。我们考虑建立政策嵌入空间的各种方法,并提议使用 k 手段分组来选择代表点。我们运用我们的技术来解决一个面向目标的图形-三角问题,而对于这个问题,即使中度大的图表,天真的政策选择也是难以解决的。