We demonstrate DIALITE, a novel table discovery pipeline that allows users to discover, integrate, and analyze open data tables. DIALITE has three main stages. First, it lets users discover tables from open data platforms using state-of-the-art table discovery techniques. Second, it integrates the discovered tables into a single integrated table. Finally, it lets users analyze the integration result by applying different downstream tasks to it. The pipeline is flexible: users can easily add and compare additional discovery and integration algorithms.
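The three-stage design with pluggable algorithms can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not DIALITE's actual interface: the `Pipeline` class, its method names, and the two stand-in algorithms (`overlap_discovery`, a column-name overlap search, and `outer_union`, a null-padded union) are hypothetical placeholders chosen for brevity, and are not the discovery or integration techniques the system uses.

```python
from typing import Callable, Dict, List

# Assumption: a table is modeled as a list of row dictionaries.
Table = List[Dict[str, object]]


def overlap_discovery(query: Table, lake: Dict[str, Table]) -> List[str]:
    """Stand-in discovery: names of lake tables sharing a column name with the query."""
    query_cols = set()
    for row in query:
        query_cols.update(row.keys())
    found = []
    for name, table in lake.items():
        table_cols = set()
        for row in table:
            table_cols.update(row.keys())
        if query_cols & table_cols:
            found.append(name)
    return found


def outer_union(tables: List[Table]) -> Table:
    """Stand-in integration: union all rows, padding missing columns with None."""
    columns: List[str] = []
    for table in tables:
        for row in table:
            for col in row:
                if col not in columns:
                    columns.append(col)
    return [{col: row.get(col) for col in columns} for table in tables for row in table]


class Pipeline:
    """Hypothetical registry that lets users add and compare algorithms by name."""

    def __init__(self) -> None:
        self.discoverers: Dict[str, Callable[[Table, Dict[str, Table]], List[str]]] = {}
        self.integrators: Dict[str, Callable[[List[Table]], Table]] = {}

    def add_discoverer(self, name, fn) -> None:
        self.discoverers[name] = fn

    def add_integrator(self, name, fn) -> None:
        self.integrators[name] = fn

    def run(self, query, lake, discoverer, integrator, analyzer):
        # Stage 1: discover related tables in the lake.
        names = self.discoverers[discoverer](query, lake)
        # Stage 2: integrate the query table with the discovered tables.
        integrated = self.integrators[integrator]([query] + [lake[n] for n in names])
        # Stage 3: apply a downstream analysis task to the integrated table.
        return analyzer(integrated)


# Toy data, invented purely for illustration.
query = [{"city": "Boston", "population": 650000}]
lake = {
    "parks": [{"city": "Boston", "parks": 200}],
    "stocks": [{"ticker": "X", "price": 1.0}],
}

pipeline = Pipeline()
pipeline.add_discoverer("overlap", overlap_discovery)
pipeline.add_integrator("outer_union", outer_union)

# Downstream task here is simply counting the rows of the integrated table.
result = pipeline.run(query, lake, "overlap", "outer_union", analyzer=len)
print(result)
```

Registering algorithms by name keeps each stage independent, so a second discovery or integration method could be added with one more `add_discoverer` or `add_integrator` call and compared on the same query table.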