Online reviews enable consumers to engage with companies and provide important feedback. Due to the complexity of the high-dimensional text, these reviews are often simplified as a single numerical score, e.g., ratings or sentiment scores. This work empirically examines the causal effects of user-generated online reviews on a granular level: we consider multiple aspects, e.g., the Food and Service of a restaurant. Understanding consumers' opinions toward different aspects can help evaluate business performance in detail and strategize business operations effectively. Specifically, we aim to answer interventional questions such as What will the restaurant popularity be if the quality w.r.t. its aspect Service is increased by 10%? The defining challenge of causal inference with observational data is the presence of "confounder", which might not be observed or measured, e.g., consumers' preference to food type, rendering the estimated effects biased and high-variance. To address this challenge, we have recourse to the multi-modal proxies such as the consumer profile information and interactions between consumers and businesses. We show how to effectively leverage the rich information to identify and estimate causal effects of multiple aspects embedded in online reviews. Empirical evaluations on synthetic and real-world data corroborate the efficacy and shed light on the actionable insight of the proposed approach.
翻译:在线审查使消费者能够与公司接触,并提供重要的反馈。由于高维文本的复杂性,这些审查往往被简化为单一的数值评分,例如评级或情绪评分。这项工作从经验上审查了用户生成的微粒水平在线审查的因果关系:我们考虑多个方面,例如餐馆的食品和服务。了解消费者对不同方面的意见有助于详细评价商业业绩,并有效地规划商业业务。具体地说,我们的目标是回答干预性问题,例如,如果服务质量提高10%,餐厅的受欢迎程度会是什么?对观测数据进行因果关系推断的决定性挑战在于是否存在“C confounder”,这一点可能得不到观察或衡量,例如,消费者对食品类型的偏好,造成估计的影响偏差和高度差异。为了应对这一挑战,我们可诉诸多种模式,例如消费者概况信息和消费者与企业之间的互动。我们展示如何有效地利用丰富的信息来查明和估计在线数据快速解读方式的多种方面的实际有效性。