Building models of human decision-making from observed behaviour is critical to better understand, diagnose and support real-world policies such as clinical care. As established policy learning approaches remain focused on imitation performance, they fall short of explaining the demonstrated decision-making process. Policy Extraction through decision Trees (POETREE) is a novel framework for interpretable policy learning, compatible with fully-offline and partially-observable clinical decision environments -- and builds probabilistic tree policies determining physician actions based on patients' observations and medical history. Fully-differentiable tree architectures are grown incrementally during optimization to adapt their complexity to the modelling task, and learn a representation of patient history through recurrence, resulting in decision tree policies that adapt over time with patient information. This policy learning method outperforms the state-of-the-art on real and synthetic medical datasets, both in terms of understanding, quantifying and evaluating observed behaviour as well as in accurately replicating it -- with potential to improve future decision support systems.
翻译:从观察行为中建立人类决策模式对于更好地理解、诊断和支持临床护理等现实世界政策至关重要。既定的政策学习方法仍然侧重于模仿性表现,因此没有能够解释所展示的决策进程。通过决策树(POETREE)(POETREE)(POETREE)(通过决策树(POETREE))(POETREE)(POETRE)(POETRE))(POET )(POETRE)(POET )(POETRE)(POET )(POETRE)(POETREE)(PI))(POTI)(POTI)(P)(POETRE)(POETRE)(P)(POETRE)(P)(POT)(POL)(P)(PLI)(POL(P)(POL)(POL(POL)(POL)(POL)(POL)(POL)(POL)(POL)(POL)(POL(POL)(POL)(POL)(PLI)(PLI)(POL)(POL)(POL)(POL)(POL)(POL(POL)(P)(P)(POL)(P)(P)(P)(P)(POL)(P)(P)(P)(P)(P)(POL(POL)(POL)(P)(POL)(POL)(POL)(P)(P)(P)(P)(POL(POL)(P))(P)(P)(P)(P)(P))(P)(P)(P)(P)(P)(P)(P)(P)(P)(POL(POL(P)(POL(POL(POL))(POL(P)(P))(P)(POL(P)(P)(P)))))(POL)(P)(P)(P)(P)(P)(P)(POL(LI)(POL(