从大语言模型临床说明中抽取的自定义可解释性特征</s> (CHiLL: Zero-shot Custom Interpretable Feature Extraction from Clinical Notes with Large Language Models)

Large Language Models (LLMs) have yielded fast and dramatic progress in NLP, and now offer strong few- and zero-shot capabilities on new tasks, reducing the need for annotation. This is especially exciting for the medical domain, in which supervision is often scant and expensive. At the same time, model predictions are rarely so accurate that they can be trusted blindly. Clinicians therefore tend to favor "interpretable" classifiers over opaque LLMs. For example, risk prediction tools are often linear models defined over manually crafted predictors that must be laboriously extracted from EHRs. We propose CHiLL (Crafting High-Level Latents), which uses LLMs to permit natural language specification of high-level features for linear models via zero-shot feature extraction using expert-composed queries. This approach has the promise to empower physicians to use their domain expertise to craft features which are clinically meaningful for a downstream task of interest, without having to manually extract these from raw EHR (as often done now). We are motivated by a real-world risk prediction task, but as a reproducible proxy, we use MIMIC-III and MIMIC-CXR data and standard predictive tasks (e.g., 30-day readmission) to evaluate our approach. We find that linear models using automatically extracted features are comparably performant to models using reference features, and provide greater interpretability than linear models using "Bag-of-Words" features. We verify that learned feature weights align well with clinical expectations.

翻译：大型语言模型(LLMS)在NLP中取得了快速和显著的进展,现在,它为新任务提供了强大的少发和零发能力,减少了批注需求。这对于医疗领域来说特别令人兴奋,因为在这个领域,监管往往很少,而且费用昂贵。与此同时,模型预测很少如此准确,以至于他们可以盲目地信任。临床医生因此倾向于偏爱“解释”分类器而不是不透明的LMS。例如,风险预测工具往往是由人工制作的预测器所定义的线性模型,必须从 EHRs 中努力提取这些模型。我们建议使用高亮的临床特征特征来使用LLLMS来允许通过零发特征提取线性模型的自然语言规格。与此同时,这一方法有望使医生能够使用其域性专长来设计对下游任务具有临床意义的特征,而无需手动地从原始 EHR 中提取这些(通常需要用手动的参考数据) 。我们受到现实世界风险预测任务的激励,但作为可复制的直线性模型,我们使用更精确的代理,我们使用对 MI-III 和 MIC 标准数据进行“我们阅读-C ” 的解读的模型来提供“我们阅读- 的“我们阅读- dal- dal- disal-d-d-d-dal ex-daldalal ” labisald exal ex-daldal exal ex ex exal exal ex exalisal exal ex ex ex exalviewaldalviewdaldaldal ex exaldald exal exal ex ex laction-daldaldal exaldal exaldaldaldaldaldal exaldaldaldal exal ledal exal exaldaldal exaldaldaldaldaldal ex ex ex exal exal lactiondal exal lactiondal-daldaldaldaldaldaldaldaldaldaldaldaldaldaldaldaldaldaldal</s>