Progress in automated microscopy and quantitative image analysis has promoted high-content screening (HCS) as an efficient drug discovery and research tool. While HCS offers to quantify complex cellular phenotypes from images at high throughput, this process can be obstructed by image aberrations such as out-of-focus image blur, fluorophore saturation, debris, a high level of noise, unexpected auto-fluorescence or empty images. While this issue has received moderate attention in the literature, overlooking these artefacts can seriously hamper downstream image processing tasks and hinder detection of subtle phenotypes. It is therefore of primary concern, and a prerequisite, to use quality control in HCS. In this work, we evaluate deep learning options that do not require extensive image annotations to provide a straightforward and easy to use semi-supervised learning solution to this issue. Concretely, we compared the efficacy of recent self-supervised and transfer learning approaches to provide a base encoder to a high throughput artefact image detector. The results of this study suggest that transfer learning methods should be preferred for this task as they not only performed best here but present the advantage of not requiring sensitive hyperparameter settings nor extensive additional training.
翻译:自动化显微镜和定量图像分析的进展促进了高含量筛选,作为高效药物发现和研究工具的高效药物发现和研究工具。虽然高浓度成像中心主动提出从高含量的图像中量化复杂的细胞型细胞成象,但这一进程可能受到图像偏差的阻碍,如焦点外图像模糊、含氟磷饱和、碎片、高度噪音、意外的自动浮肿或空图像等图像偏差。虽然这个问题在文献中得到了中等程度的注意,但忽略这些工艺品会严重阻碍下游图像处理任务,并妨碍发现微妙的pheno类型。因此,在高浓度中使用质量控制是首要关切的一个先决条件。在这项工作中,我们评价不要求大量图像说明的深层次学习选择方案,以提供直接和易于使用半超强的学习方法解决这个问题。具体地说,我们比较了最近自我监控和传输学习方法的功效,以便为高投入的Artefact图像探测器提供基础编码。这一研究的结果表明,转让学习方法对于这项任务来说并不可取,因为转让方法不仅需要在这里进行最广泛的培训,而且也是目前最具有最敏感性优势。