石油储油层回收系数估计的机机学习:碳氢资产评价中去风险的工具 (Machine learning for recovery factor estimation of an oil reservoir: a tool for de-risking at a hydrocarbon asset evaluation)

Well known oil recovery factor estimation techniques such as analogy, volumetric calculations, material balance, decline curve analysis, hydrodynamic simulations have certain limitations. Those techniques are time-consuming, require specific data and expert knowledge. Besides, though uncertainty estimation is highly desirable for this problem, the methods above do not include this by default. In this work, we present a data-driven technique for oil recovery factor estimation using reservoir parameters and representative statistics. We apply advanced machine learning methods to historical worldwide oilfields datasets (more than 2000 oil reservoirs). The data-driven model might be used as a general tool for rapid and completely objective estimation of the oil recovery factor. In addition, it includes the ability to work with partial input data and to estimate the prediction interval of the oil recovery factor. We perform the evaluation in terms of accuracy and prediction intervals coverage for several tree-based machine learning techniques in application to the following two cases: (1) using parameters only related to geometry, geology, transport, storage and fluid properties, (2) using an extended set of parameters including development and production data. For both cases model proved itself to be robust and reliable. We conclude that the proposed data-driven approach overcomes several limitations of the traditional methods and is suitable for rapid, reliable and objective estimation of oil recovery factor for hydrocarbon reservoir.

翻译：众所周知的石油回收系数估算技术,如类比、体积计算、材料平衡、曲线下降分析、流体动力学模拟等,具有一定的局限性。这些技术耗时,需要具体的数据和专业知识。此外,尽管不确定性估算对于这一问题非常可取,但上述方法并不默认地包括这一点。在这项工作中,我们提出了利用储油层参数和代表性统计数据进行石油回收系数估算的数据驱动技术。我们对历史上全世界油田的数据集(超过2000年的储油层)采用先进的机器学习方法。数据驱动模型可以用作快速和完全客观估计石油回收系数的一般工具。此外,它还包括利用部分投入数据和估计石油回收系数预测间隔期的能力。我们从准确性和预测间隔的角度评价了几个基于树木的机器学习技术,这些技术适用于以下两个案例:(1) 仅使用与地理测量、地质学、运输、储存和液体特性有关的参数,(2) 使用一套扩展的参数,包括开发和生产数据。两种案例都证明是稳健和可靠的。我们的结论是,拟议的数据驱动的油库快速回收方法客观地克服了传统方法的若干限制。