Data-driven decision making has been a common task in today's big data era, from simple choices such as finding a fast way for driving to work, to complex decisions on cancer treatment in healthcare, often supported by visual analytics. For various reasons (e.g., an ill-defined problem space, network failures or bias), visual analytics for sensemaking of data involves missingness (e.g., missing data and incomplete analysis), which can impact human decisions. For example, data, with missing records, can cost a business millions of dollars, and failing to recognize key evidence can put an innocent person into a sentence to death as a falsely convicted of murder. Being aware of missingness is critical to avoid such catastrophes. To achieve this, as an initial step, we present a framework of categorizing missingness in visual analytics from two perspectives: data-centric and human-centric. The former emphasizes missingness in three data-related categories: data composition, data relationship and data usage. The latter focuses on the human-perceived missingness at three levels: observed missingness, inferred missingness and ignored missingness. Based on the framework, we discuss possible roles of visualizations for handling missingness, and conclude our discussion with future research opportunities.
翻译:数据驱动决策是当今大数据时代的一项共同任务,从简单选择,例如找到快速开车工作的方法,到往往得到视觉分析支持的关于保健领域癌症治疗的复杂决定,从往往由视觉分析支持的简单选择,到往往由视觉分析支持的关于保健领域癌症治疗的复杂决定,由于各种原因(例如,定义不明确的问题空间、网络故障或偏差),数据感化的视觉分析涉及缺失(例如,数据缺失和不完全分析),这可能影响到人类决策。例如,数据,缺少记录,可能花费数百万美元,而不承认关键证据,可能使无辜者被判处死刑,被误判为谋杀罪。了解失踪是避免此类灾难的关键。为了实现这一点,作为第一步,我们提出了一个框架,从两个角度对视觉分析中的缺失进行分类:以数据为中心的和以人为中心的。前一个框架强调三种数据相关类别中的缺失:数据构成、数据关系和数据使用。后一个侧重于人类认知缺失的三个层面:观察到的失踪、推断失踪和忽略未来。我们观察到的缺失是避免此类灾难的关键。为了实现这一点,作为第一步,我们用视觉讨论的未来机会而结束框架。