Closed-loop reservoir management (CLRM), in which history matching and production optimization are performed multiple times over the life of an asset, can provide significant improvement in the specified objective. These procedures are computationally expensive due to the large number of flow simulations required for data assimilation and optimization. Existing CLRM procedures are applied asset by asset, without utilizing information that could be useful over a range assets. Here, we develop a CLRM framework for multiple assets with varying numbers of wells. We use deep reinforcement learning to train a single global control policy that is applicable for all assets considered. The new framework is an extension of a recently introduced control policy methodology for individual assets. Embedding layers are incorporated into the representation to handle the different numbers of decision variables that arise for the different assets. Because the global control policy learns a unified representation of useful features from multiple assets, it is less expensive to construct than asset-by-asset training (we observe about 3x speedup in our examples). The production optimization problem includes a relative-change constraint on the well settings, which renders the results suitable for practical use. We apply the multi-asset CLRM framework to 2D and 3D water-flooding examples. In both cases, four assets with different well counts, well configurations, and geostatistical descriptions are considered. Numerical experiments demonstrate that the global control policy provides objective function values, for both the 2D and 3D cases, that are nearly identical to those from control policies trained individually for each asset. This promising finding suggests that multi-asset CLRM may indeed represent a viable practical strategy.
翻译:闭路储油层管理(CLRM)在资产寿命期间多次进行历史匹配和生产优化,可以大大改进特定目标。由于数据同化和优化需要大量的流程模拟,这些程序计算成本很高。现有的CLRM程序按资产应用,而没有使用对各种资产有用的信息。这里,我们为不同水井的多个资产开发CLRM框架(我们用深度强化学习来培训适用于所考虑的所有资产的单一全球控制政策。新框架是最近为个别资产引入的控制政策方法的延伸。嵌入层被纳入代表中,以处理不同资产产生的不同数量的决定变量。由于全球控制政策从多个资产中学习了统一的有用特征,因此,构建一个CLRM框架的费用要小于资产逐资产培训的费用(我们从实例中看到大约3x的加速速度)。生产优化问题包括对井环境的相对变化制约,使得结果适合实际使用。我们采用了多代算的CLRM政策,将嵌入层层层层层层纳入代表了2D和3D两种不同的目标模型。这些成本分析案例都表明CRM框架和3和3D的精确分析,这些背景分析案例都表明,这些背景分析的逻辑分析,这些背景分析为稳定的分析和分析案例可以提供。