The size of an individual cell type, such as a red blood cell, does not vary much among humans. We use this knowledge as a prior for classifying and detecting cells in images with only a few ground truth bounding box annotations, while most of the cells are annotated with points. This setting leads to weakly semi-supervised learning. We propose replacing points with either stochastic (ST) boxes or bounding box predictions during the training process. The proposed "mean-IOU" ST box maximizes the overlap with all the boxes belonging to the sample space with a class-specific approximated prior probability distribution of bounding boxes. Our method trains with both box- and point-labelled images in conjunction, unlike the existing methods, which train first with box- and then point-labelled images. In the most challenging setting, when only 5% images are box-labelled, quantitative experiments on a urine dataset show that our one-stage method outperforms two-stage methods by 5.56 mAP. Furthermore, we suggest an approach that partially answers "how many box-labelled annotations are necessary?" before training a machine learning model.
翻译:单细胞类型的大小, 如红血细胞, 在人类中差别不大。 我们使用这种知识作为先行, 用于在图像中分类和探测单元格, 图像中只有几块地面的真相捆绑框说明, 而大多数单元格都有附加点注。 这个设置导致半监督的学习不力。 我们建议在培训过程中用随机(ST) 框或捆绑盒预测来替换点。 提议的“ 中IOU” ST 框将属于样本空间的所有框的重叠最大化, 并配有特定等级的、 大约先前的捆绑框概率分布。 我们的方法是用盒式和点标签图像一起进行分类和检测, 与现有的方法不同, 先用箱式然后用点标签图像进行训练。 在最具挑战的环境下, 当只有5%的图像被框绑定时, 尿液数据集的定量实验显示我们的单阶段方法在5.56 mAP 中超越了两阶段方法。 此外, 我们建议了一种方法, 部分解答“ 许多箱标签式的描述是必需的 ” 。