Composite Feature Selection using Deep Ensembles

In many real-world problems, features do not act alone but in combination with each other. For example, in genomics, a disease might not be caused by any single mutation but may require the presence of multiple mutations. Prior work on feature selection either seeks to identify individual features or can only determine relevant groups from a predefined set. We investigate the problem of discovering groups of predictive features without a predefined grouping. To do so, we define predictive groups in terms of linear and non-linear interactions between features. We introduce a novel deep learning architecture that uses an ensemble of feature selection models to find predictive groups, without requiring candidate groups to be provided. The selected groups are sparse and exhibit minimal overlap. Furthermore, we propose a new metric to measure the similarity between discovered groups and the ground truth. We demonstrate the utility of our model on multiple synthetic tasks and semi-synthetic chemistry datasets, where the ground-truth structure is known, as well as on an image dataset and a real-world cancer dataset.
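To make the motivation concrete, the following is a minimal toy sketch (not the architecture described in the abstract) of why scoring features one at a time can miss a predictive group. The three binary features, the label rule `y = x0 * x1`, and the helper `corr` are all assumptions introduced purely for illustration.

```python
import itertools
import numpy as np

# All 8 sign patterns of three hypothetical binary features in {-1, +1}.
X = np.array(list(itertools.product([-1, 1], repeat=3)), dtype=float)

# Assumed toy label: depends only on the interaction of features 0 and 1.
# Feature 2 is irrelevant by construction.
y = X[:, 0] * X[:, 1]


def corr(a, b):
    """Pearson correlation between two 1-D arrays."""
    return float(np.corrcoef(a, b)[0, 1])


# Score each feature on its own: none carries any linear signal about y.
individual = [corr(X[:, j], y) for j in range(3)]

# Score each candidate pair through its product: only the pair (0, 1)
# is predictive, even though both members look useless individually.
pairwise = {
    (i, j): corr(X[:, i] * X[:, j], y)
    for i, j in itertools.combinations(range(3), 2)
}

print("individual:", individual)
print("pairwise:", pairwise)
```

In this toy setting the pair can be found by enumerating all candidate groups, which does not scale as the number of features grows; the abstract's stated goal is to discover such groups without any candidate groups being provided.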
