Deep learning can extract rich data representations if provided sufficient quantities of labeled training data. For many tasks however, annotating data has significant costs in terms of time and money, owing to the high standards of subject matter expertise required, for example in medical and geophysical image interpretation tasks. Active Learning can identify the most informative training examples for the interpreter to train, leading to higher efficiency. We propose an Active learning method based on jointly learning representations for supervised and unsupervised tasks. The learned manifold structure is later utilized to identify informative training samples most dissimilar from the learned manifold from the error profiles on the unsupervised task. We verify the efficiency of the proposed method on a seismic facies segmentation dataset from the Netherlands F3 block survey, significantly outperforming contemporary methods to achieve the highest mean Intersection-Over-Union value of 0.773.
翻译:深层学习如果能够提供足够数量的标签培训数据,就可以提取丰富的数据表述。但是,对于许多任务来说,由于在医学和地球物理图像判读任务等方面需要高标准的主题专门知识,说明数据在时间和金钱方面成本巨大。积极学习可以确定口译员培训最丰富的培训实例,从而提高效率。我们建议采用基于共同学习的受监督和不受监督任务的主动学习方法。学到的多元结构后来被用来确定与未受监督任务错误剖析中学到的多元信息培训样本最不相同的信息培训样本。我们核查了荷兰F3区块调查中拟议的地震表面分解数据集方法的效率,该方法大大优于当代方法,以达到0.773美元这一最高平均值的跨部对齐值。