The logic behind design decisions, called design rationale, is very valuable. In the past, researchers have tried to automatically extract and exploit this information, but prior techniques are only applicable to specific contexts and there is insufficient progress on an end-to-end rationale information extraction pipeline. Here we outline a path towards such a pipeline that leverages several Machine Learning (ML) and Natural Language Processing (NLP) techniques. Our proposed context-independent approach, called Kantara, produces a knowledge graph representation of decisions and of their rationales, which considers their historical evolution and traceability. We also propose validation mechanisms to ensure the correctness of the extracted information and the coherence of the development process. We conducted a preliminary evaluation of our proposed approach on a small example sourced from the Linux Kernel, which shows promising results.
翻译:设计决定背后的逻辑,称为设计原理,是非常宝贵的。在过去,研究人员曾试图自动提取和利用这一信息,但先前的技术只适用于具体情况,在端到端的理由信息提取管道方面进展不足。这里我们概述了一条通往利用多种机器学习和自然语言处理技术的管道的道路。我们提议的 " 以环境为独立 " 的方法,称为 " Kantara ",提出了决定及其原理的知识图表,其中考虑了其历史演变和可追溯性。我们还提议了验证机制,以确保所提取的信息的正确性和发展进程的一致性。我们对从Linux Kernel(Linux Kern)获得的一小例子(显示有希望的结果)进行了初步评估。