Purpose: Image classification is perhaps the most fundamental task in imaging AI. However, labeling images is time-consuming and tedious. We have recently demonstrated that reinforcement learning (RL) can classify 2D slices of MRI brain images with high accuracy. Here we make two important steps toward speeding image classification: Firstly, we automatically extract class labels from the clinical reports. Secondly, we extend our prior 2D classification work to fully 3D image volumes from our institution. Hence, we proceed as follows: in Part 1, we extract labels from reports automatically using the SBERT natural language processing approach. Then, in Part 2, we use these labels with RL to train a classification Deep-Q Network (DQN) for 3D image volumes. Methods: For Part 1, we trained SBERT with 90 radiology report impressions. We then used the trained SBERT to predict class labels for use in Part 2. In Part 2, we applied multi-step image classification to allow for combined Deep-Q learning using 3D convolutions and TD(0) Q learning. We trained on a set of 90 images. We tested on a separate set of 61 images, again using the classes predicted from patient reports by the trained SBERT in Part 1. For comparison, we also trained and tested a supervised deep learning classification network on the same set of training and testing images using the same labels. Results: Part 1: Upon training with the corpus of radiology reports, the SBERT model had 100% accuracy for both normal and metastasis-containing scans. Part 2: Then, using these labels, whereas the supervised approach quickly overfit the training data and as expected performed poorly on the testing set (66% accuracy, just over random guessing), the reinforcement learning approach achieved an accuracy of 92%. The results were found to be statistically significant, with a p-value of 3.1 x 10^-5.
翻译:图像分类可能是成像 AI 中最基本的任务 。 然而, 标签标签图像可能是最基本的任务 AI 。 但是, 标签标签是耗时和烦琐的 。 我们最近已经证明, 强化学习( RL) 可以高精度地对 MRI 大脑图像的 2D 切片进行分类 。 在这里, 我们为加速图像分类的目的采取了两个重要步骤 : 首先, 我们自动从临床报告中提取类标签 。 第二, 我们从机构里将我们之前的 2D 分类工作扩展至 3D 图像的完整量 。 因此, 我们用 SB 的自然语言处理方法自动从报告中提取标签 。 然后, 在第二部分里, 我们用 RL 和 RL 一起的正常的标签来为 3D- 5 进行分类 。 我们用这些标签来训练的90 。 我们用一套正常的 深度的 深Q 网络( DQN QN ) 来进行分类 。 我们用一个经过训练的 S REL 版本 的 测试, 用经过训练的预估测的 S 2 校 校 。