Meta-learning has gained wide popularity as a training framework that is more data-efficient than traditional machine learning methods. However, its generalization ability in complex task distributions, such as multimodal tasks, has not been thoroughly studied. Recently, some studies on multimodality-based meta-learning have emerged. This survey provides a comprehensive overview of the multimodality-based meta-learning landscape in terms of the methodologies and applications. We first formalize the definition of meta-learning and multimodality, along with the research challenges in this growing field, such as how to enrich the input in few-shot or zero-shot scenarios and how to generalize the models to new tasks. We then propose a new taxonomy to systematically discuss typical meta-learning algorithms combined with multimodal tasks. We investigate the contributions of related papers and summarize them by our taxonomy. Finally, we propose potential research directions for this promising field.
翻译:元学习作为一种培训框架,其数据效率高于传统的机器学习方法,已受到广泛欢迎,然而,在多式任务等复杂任务分配方面的一般化能力尚未得到彻底研究。最近,出现了一些关于基于多式联运的元学习的研究。这项调查全面概述了基于多式联运的元学习在方法和应用方面的情况。我们首先正式确定元学习和多式的定义,同时确定这个日益增长领域的研究挑战,例如如何丰富在微小或零点情景中的投入,以及如何将模型推广到新的任务中。我们然后提议一种新的分类学,系统地讨论典型的元学习算法与多式任务相结合。我们调查相关文件的贡献,并以我们的分类法来总结这些贡献。最后,我们提出这一有希望的领域的潜在研究方向。