Online review systems are the primary means through which many businesses seek to build the brand and spread their messages. Prior research studying the effects of online reviews has been mainly focused on a single numerical cause, e.g., ratings or sentiment scores. We argue that such notions of causes entail three key limitations: they solely consider the effects of single numerical causes and ignore different effects of multiple aspects -- e.g., Food, Service -- embedded in the textual reviews; they assume the absence of hidden confounders in observational studies, e.g., consumers' personal preferences; and they overlook the indirect effects of numerical causes that can potentially cancel out the effect of textual reviews on business revenue. We thereby propose an alternative perspective to this single-cause-based effect estimation of online reviews: in the presence of hidden confounders, we consider multi-aspect textual reviews, particularly, their total effects on business revenue and direct effects with the numerical cause -- ratings -- being the mediator. We draw on recent advances in machine learning and causal inference to together estimate the hidden confounders and causal effects. We present empirical evaluations using real-world examples to discuss the importance and implications of differentiating the multi-aspect effects in strategizing business operations.
翻译:在线审查体系是许多企业寻求建立品牌和传播其信息的主要手段。先前的研究研究在线审查的效果主要集中于单一数字原因,例如评级或情绪分数。我们争辩说,这种原因概念包含三个关键限制:它们仅仅考虑单一数字原因的影响,忽视文本审查中包含的多种方面 -- -- 例如食品、服务 -- -- 的不同影响;它们假定观察研究中不存在隐藏的混淆者,例如消费者的个人偏好;它们忽视了可能抵消文字审查对商业收入的影响的数字原因的间接影响。因此,我们提出对网上审查的这种单一的、基于原因的影响估计的另一种观点:在隐藏的聚合者面前,我们考虑多层次的文本审查,特别是其对商业收入的总体影响和与数字原因的直接影响 -- -- 评级 -- -- 是调解者。我们借鉴机器学习和因果关系推断的最新进展,共同估计隐藏的聚合者和因果关系效应。我们用现实世界的例子提出实证评价,以讨论区分多种业务影响的重要性和影响。