Machine Learning requires large amounts of labeled data to fit a model. Many datasets are already publicly available, nevertheless forcing application possibilities of machine learning to the domains of those public datasets. The ever-growing penetration of machine learning algorithms in new application areas requires solutions for the need for data in those new domains. This thesis works on active learning as one possible solution to reduce the amount of data that needs to be processed by hand, by processing only those datapoints that specifically benefit the training of a strong model for the task. A newly proposed framework for framing the active learning workflow as a reinforcement learning problem is adapted for image classification and a series of three experiments is conducted. Each experiment is evaluated and potential issues with the approach are outlined. Each following experiment then proposes improvements to the framework and evaluates their impact. After the last experiment, a final conclusion is drawn, unfortunately rejecting this work's hypothesis and outlining that the proposed framework at the moment is not capable of improving active learning for image classification with a trained reinforcement learning agent.
翻译:机器学习需要大量标签数据才能适合模型。 许多数据集已经公开存在,但迫使机器学习的可能性被应用到这些公共数据集的领域。机器学习算法在新应用领域日益普及,需要解决这些新应用领域的数据需求。该论文的用意是积极学习,作为减少需要手工处理的数据数量的可能解决办法,只处理那些具体有利于培训强有力任务模型的数据点。新提出的设计积极学习工作流程的框架已适应图像分类,并进行一系列三个实验。每个实验都经过评估,并概述了采用这种方法的潜在问题。随后的每个实验都提出改进框架并评估其影响。在上次实验之后,得出了最后结论,不幸地否定了这项工作的假设,并概述了目前拟议的框架无法用训练有素的强化学习剂改进对图像分类的积极学习。