Using evolutionary computation algorithms to solve multiple tasks with knowledge sharing is a promising approach. Image feature learning can be considered as a multitask problem because different tasks may have a similar feature space. Genetic programming (GP) has been successfully applied to image feature learning for classification. However, most of the existing GP methods solve one task, independently, using sufficient training data. No multitask GP method has been developed for image feature learning. Therefore, this paper develops a multitask GP approach to image feature learning for classification with limited training data. Owing to the flexible representation of GP, a new knowledge sharing mechanism based on a new individual representation is developed to allow GP to automatically learn what to share across two tasks and to improve its learning performance. The shared knowledge is encoded as a common tree, which can represent the common/general features of two tasks. With the new individual representation, each task is solved using the features extracted from a common tree and a task-specific tree representing task-specific features. To learn the best common and task-specific trees, a new evolutionary process and new fitness functions are developed. The performance of the proposed approach is examined on six multitask problems of 12 image classification datasets with limited training data and compared with three GP and 14 non-GP-based competitive methods. Experimental results show that the new approach outperforms these compared methods in almost all the comparisons. Further analysis reveals that the new approach learns simple yet effective common trees with high effectiveness and transferability.
翻译:利用进化计算算法解决知识共享的多重任务是一种很有希望的方法。图像特征学习可被视为一个多任务问题,因为不同任务可能具有相似的特征空间。基因方案(GP)已被成功地应用于图像特征学习分类。但是,大多数现有的GP方法都用足够的培训数据独立地解决一项任务。没有为图像特征学习开发多任务GP方法。因此,本文件开发了一个多重任务GP方法,以利用有限的培训数据来为分类进行特征学习。由于GP的灵活代表制,开发了一个基于新的个人代表制的新知识共享机制,使GP能够自动学习如何在两个任务中分享,并改进其学习绩效。共享知识被编码成共同树,这可以代表两个任务的共同/一般特征。随着新的个人代表制,每一项任务都用从共同树中提取的特征和代表特定任务特性的树来解决。由于GPP的灵活代表了最佳和特定任务的树种,因此,一个新的演进过程和新的健身功能得以开发。提议的GP方法的绩效将自动学习在两个任务之间分享,在六种多任务中进行新的、高层次数据分析,然后用三种不同的分析方法进行比较,这些格式,这些方法的模型将展示了所有14个通用数据格式的模型分析方法,然后用不同的分析,然后用有限的新方法将三种不同的分析,然后用不同的分析。比较方法将三种不同的分析。