Despite their superior performance, deep-learning methods often suffer from the disadvantage of needing large-scale well-annotated training data. In response, recent literature has seen a proliferation of efforts aimed at reducing the annotation burden. This paper focuses on a weakly-supervised training setting for single-cell segmentation models, where the only available training label is the rough locations of individual cells. The specific problem is of practical interest due to the widely available nuclei counter-stain data in biomedical literature, from which the cell locations can be derived programmatically. Of more general interest is a proposed self-learning method called collaborative knowledge sharing, which is related to but distinct from the more well-known consistency learning methods. This strategy achieves self-learning by sharing knowledge between a principal model and a very light-weight collaborator model. Importantly, the two models are entirely different in their architectures, capacities, and model outputs: In our case, the principal model approaches the segmentation problem from an object-detection perspective, whereas the collaborator model a sematic segmentation perspective. We assessed the effectiveness of this strategy by conducting experiments on LIVECell, a large single-cell segmentation dataset of bright-field images, and on A431 dataset, a fluorescence image dataset in which the location labels are generated automatically from nuclei counter-stain data. Implementing code is available at https://github.com/jiyuuchc/lacss_jax
翻译:尽管深度学习方法表现出优异的性能,但它们通常需要大规模注释的训练数据。为了解决这个问题,近期文献中出现了很多关于减少注释负担的方法。本文关注单细胞分割模型的弱监督训练设置,其中唯一可用的标注信息是单个细胞大致位置。此问题具有实际研究意义,因为在生物医学文献中普遍可用的细胞核染色数据能够轻松程序化地获取单元位置标签。更具普遍意义的是,本文提出了一种自学习方法,称为基于协作式知识共享的策略,它类似于但不同于更为知名的一致性学习方法。该策略通过在主模型和轻量级合作者模型之间共享知识实现自学习。值得注意的是,两个模型在它们的架构、容量和模型输出方面完全不同:在本文中,主模型从目标检测的角度探讨分割问题,而合作者模型从语义分割的角度探讨问题。我们使用LIVECell和A431数据集进行实验评估了该策略的有效性。其中A431数据集是一组荧光图像,其位置标签是通过核染色数据自动生成的。该策略的实现代码可在https://github.com/jiyuuchc/lacss_jax上找到。