Few-shot instance segmentation extends the few-shot learning paradigm to the instance segmentation task, which tries to segment instance objects from a query image with a few annotated examples of novel categories. Conventional approaches have attempted to address the task via prototype learning, known as point estimation. However, this mechanism is susceptible to noise and suffers from bias due to a significant scarcity of data. To overcome the disadvantages of the point estimation mechanism, we propose a novel approach, dubbed MaskDiff, which models the underlying conditional distribution of a binary mask, which is conditioned on an object region and $K$-shot information. Inspired by augmentation approaches that perturb data with Gaussian noise for populating low data density regions, we model the mask distribution with a diffusion probabilistic model. In addition, we propose to utilize classifier-free guided mask sampling to integrate category information into the binary mask generation process. Without bells and whistles, our proposed method consistently outperforms state-of-the-art methods on both base and novel classes of the COCO dataset while simultaneously being more stable than existing methods.
翻译:微小例子分解法将微小的学习范式扩大到实例分解任务,它试图用几个附加说明的新类别实例从查询图像中分解实例对象。 常规方法试图通过原型学习(称为点估计)处理任务。 但是,这一机制容易受到噪音的影响,由于数据严重缺乏而存在偏差。 为了克服点估计机制的缺点,我们提议了一种新颖的方法,称为MaskDiff,它模拟了以对象区域和美元分解信息为条件的二元面罩的基本有条件分布。受扩增方法的启发,用高森噪音渗透低数据密度区域的数据,我们用扩散概率模型模拟遮罩的分布。此外,我们提议使用无分类的引导遮罩取样法将分类信息纳入二元遮罩生成过程。没有钟和哨子,我们提出的方法始终超越了CO数据集基类和新类的状态方法,同时比现有方法更加稳定。</s>