We propose the use of a conjecturing machine that generates feature relationships in the form of bounds involving nonlinear terms for numerical features and boolean expressions for categorical features. The proposed \textsc{Conjecturing} framework recovers known nonlinear and boolean relationships among features from data. In both settings, true underlying relationships are revealed. We then compare the method to a previously-proposed framework for symbolic regression and demonstrate that it can also be used to recover equations that are satisfied among features in a dataset. The framework is then applied to patient-level data regarding COVID-19 outcomes to suggest possible risk factors that are confirmed in medical literature.
翻译:我们建议使用一种推论机器来产生特征关系,其形式是数字特征的非线性术语和绝对特征的布尔语表达方式等边框。提议的 \ textsc{ jecturing} 框架从数据中恢复了已知的非线性和布尔语关系。 在两种情况下,都揭示了真正的内在关系。然后,我们将该方法与先前提议的象征性回归框架进行比较,并表明它也可以用来恢复数据集各特征之间满足的方程式。然后,将该框架应用于患者一级关于COVID-19结果的数据,以提出医学文献所证实的可能风险因素。