Users frequently interact with software systems through data entry forms. However, form filling is time-consuming and error-prone. Although several techniques have been proposed to auto-complete or pre-fill fields in the forms, they provide limited support to help users fill categorical fields, i.e., fields that require users to choose the right value among a large set of options. In this paper, we propose LAFF, a learning-based automated approach for filling categorical fields in data entry forms. LAFF first builds Bayesian Network models by learning field dependencies from a set of historical input instances, representing the values of the fields that have been filled in the past. To improve its learning ability, LAFF uses local modeling to effectively mine the local dependencies of fields in a cluster of input instances. During the form filling phase, LAFF uses such models to predict possible values of a target field, based on the values in the already-filled fields of the form and their dependencies; the predicted values (endorsed based on field dependencies and prediction confidence) are then provided to the end-user as a list of suggestions. We evaluated LAFF by assessing its effectiveness and efficiency in form filling on two datasets, one of them proprietary from the banking domain. Experimental results show that LAFF is able to provide accurate suggestions with a Mean Reciprocal Rank value above 0.73. Furthermore, LAFF is efficient, requiring at most 317 ms per suggestion.
翻译:用户经常通过数据输入表格与软件系统互动,但表格填充是耗费时间和容易出错的。虽然提出了多种技术,以自动完成或预填表格形式填充字段,但它们提供了有限的支持,帮助用户填入绝对字段,即需要用户在大量选项中选择正确价值的字段。在本文件中,我们提议采用学习型自动化方法,以填补数据输入表格中绝对字段的基于学习的自动方法。LAFF首先从一系列历史输入实例中学习贝叶西亚网络模型,以了解外地依赖性,代表过去填充的字段的价值。为提高其学习能力,LAFF利用当地模型,在一组输入实例中有效挖掘外地的当地依赖性。在形式填充阶段,LAFF利用这些模型,根据表格中已经填满的字段及其依赖性的价值预测目标字段的可能价值;然后向最终用户提供预测值(根据外地依赖性和预测信任性),以代表过去填充的字段价值。为了提高学习能力,LAFF利用本地模型有效地挖掘外地的实地价值,我们评估其准确性区域结构建议,以AFF的安全标准形式向最终用户展示。