We present the first empirical study on customer churn prediction in the scholarly publishing industry. The study examines our proposed method for prediction on a customer subscription data over a period of 6.5 years, which was provided by a major academic publisher. We explore the subscription-type market within the context of customer defection and modelling, and provide analysis of the business model of such markets, and how these characterise the academic publishing business. The proposed method for prediction attempts to provide inference of customer's likelihood of defection on the basis of their re-sampled use of provider resources -in this context, the volume and frequency of content downloads. We show that this approach can be both accurate as well as uniquely useful in the business-to-business context, with which the scholarly publishing business model shares similarities. The main findings of this work suggest that whilst all predictive models examined, especially ensemble methods of machine learning, achieve substantially accurate prediction of churn, nearly a year ahead, this can be furthermore achieved even when the specific behavioural attributes that can be associated to each customer probability to churn are overlooked. Allowing as such highly accurate inference of churn from minimal possible data. We show that modelling churn on the basis of re-sampling customers' use of resources over subscription time is a better (simplified) approach than when considering the high granularity that can often characterise consumption behaviour.
翻译:我们提出了关于学术出版业客户周遭预测的第一份实证研究。该研究审查了我们提议的在6.5年期间预测客户订阅数据的方法,该方法由一位主要学术出版商提供。我们探讨了客户叛逃和建模背景下的订阅型市场,分析了这些市场的商业模式,以及这些模式如何体现学术出版业的特点。拟议的预测方法试图根据客户重新使用的供应商资源来推断其叛逃的可能性――在此情况下,下载内容的数量和频率。我们表明,这一方法既准确又独特,在企业对企业环境下有用,学术出版业务模式也具有相似之处。这项工作的主要研究结果表明,尽管所有预测型模型,特别是机器学习的混合方法,几乎提前一年就相当精确地预测了客户叛逃的可能性。即使与每种客户概率有关的具体行为特征被忽略了,这也可以进一步实现。我们经常将这种非常准确的商对企业对企业的实用性加以借鉴,而学术出版商模式与类似。这项工作的主要研究结果表明,尽管所有预测型模型,特别是机器学习的混合方法,但几乎在近一年之后,仍然可以实现这一点,即使可以忽略与每个客户的概率相关的具体行为特征。让我们能够如此精确地利用尽可能高的消费基础,从而从可能的模型化地表明客户的高度地展示。