Online experimentation platforms collect user feedback at low cost and large scale. Some systems even support real-time or near real-time data processing, and can update metrics and statistics continuously. Many commonly used metrics, such as clicks and page views, can be observed without much delay. However, many important signals can only be observed after several hours or days, with noise adding up over the duration of the episode. When episodical outcomes follow a complex sequence of user-product interactions, it is difficult to understand which interactions lead to the final outcome. There is no obvious attribution logic for us to associate a positive or negative outcome back to the actions and choices we made at different times. This attribution logic is critical to unlocking more targeted and efficient measurement at a finer granularity that could eventually lead to the full capability of reinforcement learning. In this paper, we borrow the idea of Causal Surrogacy to model a long-term outcome using leading indicators that are incrementally observed and apply it as the value function to track the progress towards the final outcome and attribute incrementally to various user-product interaction steps. Applying this approach to the guest booking metric at Airbnb resulted in significant variance reductions of 50% to 85%, while aligning well with the booking metric itself. Continuous attribution allows us to assign a utility score to each product page-view, and this score can be flexibly further aggregated to a variety of units of interest, such as searches and listings. We provide multiple real-world applications of attribution to illustrate its versatility.
翻译:在线实验平台以低成本和大规模规模收集用户反馈。 有些系统甚至支持实时或近实时数据处理,并可以持续更新度量和统计数据。 许多常用的计量标准, 如点击和页面浏览, 可以立即观察。 但是, 许多重要信号只能在数小时或数日后观测, 并在事件持续期间出现噪音。 当流行病学结果遵循一个复杂的用户-产品互动序列时, 很难理解哪种互动导致最终结果。 某些系统甚至支持实时或近实时数据处理, 可以持续更新度和统计。 许多常用的计量标准, 如点击和页面浏览, 可以立即立即观察到。 但是, 许多重要信号只能在数小时或数日后才能观察到, 并且在这个插图中, 我们借用Causal Suragacy的想法, 来模拟一个长期的结果, 使用逐渐观察到的主要指标来跟踪最终结果的进展, 并且将各种用户- 产品互动步骤归为递增的。 将这一方法应用到在不同的时间段里, 将这个方法应用到客座数据库中, 更加有针对性和高效的测量度测量度测量度测量度测量度测量度测量度测量值的精确度, 直至每页的排名, 将使得我们的排名的排名的比值的比值的比值可以大幅递增分数 。