Meta-learning models transfer the knowledge acquired from previous tasks to quickly learn new ones. They are trained on benchmarks with a fixed number of data points per task. This number is usually arbitrary and it is unknown how it affects performance at testing. Since labelling of data is expensive, finding the optimal allocation of labels across training tasks may reduce costs. Given a fixed budget of labels, should we use a small number of highly labelled tasks, or many tasks with few labels each? Should we allocate more labels to some tasks and less to others? We show that: 1) If tasks are homogeneous, there is a uniform optimal allocation, whereby all tasks get the same amount of data; 2) At fixed budget, there is a trade-off between number of tasks and number of data points per task, with a unique and constant optimum; 3) When trained separately, harder task should get more data, at the cost of a smaller number of tasks; 4) When training on a mixture of easy and hard tasks, more data should be allocated to easy tasks. Interestingly, Neuroscience experiments have shown that human visual skills also transfer better from easy tasks. We prove these results mathematically on mixed linear regression, and we show empirically that the same results hold for few-shot image classification on CIFAR-FS and mini-ImageNet. Our results provide guidance for allocating labels across tasks when collecting data for meta-learning.
翻译:元化学习模式将以往任务获得的知识转移给从以往任务中获得的知识, 并快速学习新任务。 他们接受基准培训, 每任务有固定数量的数据点。 这个数字通常是任意的, 并且不知道它如何影响测试的绩效。 由于数据标签费用昂贵, 找到在培训任务之间最佳分配标签的方法可能会降低成本。 根据固定的标签预算, 我们是否应该使用少量高标签任务, 或者很多任务, 每个标签很少? 我们是否应该将更多的标签分配给某些任务, 而不是其他任务? 我们显示:(1) 如果任务相同, 对所有任务都有统一的最佳分配方式, 所有任务都有相同数量的数据; (2) 在固定预算, 每任务的任务数量和数据点数量之间有一个平衡, 并且具有独特和一贯的最佳性;(3) 如果经过培训后, 更难的任务应该以较少的任务获得更多的数据;(4) 当我们使用简单和困难的任务组合的培训时, 应该将更多的数据分配给一些容易的任务? 有趣的是, 内罗科学实验表明, 人类视觉技能也比简单的任务更好地转移。 在混合线性回归分析中, 我们用这些结果来进行模拟分析, 当我们收集所有MFS- FRA的分类时,我们的经验显示, 当我们的所有结果都用于进行。