Mixture of relevance vector regression experts for reservoir properties prediction

doi:10.1016/j.petrol.2022.110498

Journal of Petroleum Science and Engineering

Volume 214, July 2022, 110498

Highlights

• The mixture of experts strategy helps divide and conquer reservoir data.
• RVR determines hyperparameters automatically and avoids the curse of dimensionality.
• MRVRE improves the accuracy of reservoir property prediction.

Abstract

Reservoir properties forecasting is one of the most indispensable tasks in oil and gas exploration and exploitation. We develop a reservoir properties prediction method based on a mixture of relevance vector regression (RVR) experts. For reservoir properties prediction, an individual machine learning model is probably insufficient when training data are limited. The mixture of experts decomposes a complicated reservoir properties prediction problem into several relatively simple sub-problems by incorporating multiple learning models, each of which processes a specific part of the data. In the proposed method, RVR is chosen as the expert because it yields sparser regressors and determines hyperparameters automatically. It can also project data into a high-dimensional space through kernel functions to resolve the non-linear problem and avoid the curse of dimensionality. First, a mixture of RVR experts model is trained on the well data samples. The input features are elastic properties and the output is a reservoir property. Then, the learned model is applied to the test dataset to generate the corresponding reservoir properties. The proposed method is applied to two field datasets, in which the learning model is trained on well data and tested on well and seismic data, respectively. Compared with an individual RVR expert, metrics such as the mean absolute deviation (MAD), root mean square error (RMSE), coefficient of determination ($R^{2}$), and Akaike information criterion (AIC) are effectively improved. The successful implementation demonstrates the feasibility of the method and confirms its superiority in likelihood of fit and accuracy.

Introduction

Reservoir characterization is a process to quantitatively predict and describe reservoirs by using multidisciplinary information. One of the main tasks is to simulate the spatial distribution of reservoir parameters (including elastic attributes, lithofacies, porosity, permeability, etc.) by using various observed data. Predicting reservoir properties is imperative to reservoir engineers for appraising reservoirs, determining optimal well locations, and promoting production. Therefore, reservoir characterization research is significant in the exploration, development, and evaluation of oil and gas fields (Fournier and Derain, 1995; Alvarez et al., 2003; Feng et al., 2018; Babasafari et al., 2020).

Geophysicists are tasked with estimating unknown reservoir properties in the extensive inter-well area from limited known target values and other information. Seismic data contain lateral and vertical changes that are signatures of the response caused by a reservoir. Obtaining reservoir properties directly from seismic data is essential but difficult, because the relationship between reservoir properties and seismic data is influenced by many interacting factors (Grana and Rossa, 2010; Baron and Holliger, 2011; Saggaf et al., 2003). Seismic elastic attributes can be used to estimate reservoir properties (Tetyukhina, 2011; Wang, 2012; Yu et al., 2020). Traditional methods predict reservoir properties by establishing petrophysical models under certain assumptions or by building empirical formulas, which are then applied to seismic elastic attributes (Babasafari et al., 2021; Bashir et al., 2021). However, accuracy suffers when the assumptions are not fully satisfied.

Machine learning predicts reservoir properties by training a model on known data and applying it to unknown data, independently of traditional petrophysical hypotheses (Liu et al., 2021a). The first step is to train a regression model on the observed data; the learned model then performs prediction on incoming data. A wide range of techniques has been developed for geophysical applications (Zhang et al., 2018; Liu et al., 2021b). Support vector regression (SVR) and neural networks are two of the most widely used machine learning algorithms in geophysics for reasons of computational tractability. Li et al. (2005) and Zhong and Carr (2019) used the support vector machine to detect reservoirs. Saggaf et al. (2003) and Ahmed et al. (2010) estimated reservoir properties from seismic data using neural network approaches. However, both algorithms are very sensitive to parameters such as the penalty parameter (which balances the empirical risk and the structural risk) and the learning rate (Suykens and Vandewalle, 1999; Wohlberg et al., 2005; Liu et al., 2020). Inappropriate parameters produce unsatisfactory predictions and increase uncertainty. Many publications have investigated how to optimize these parameters with a global optimization algorithm, such as genetic algorithms (Li et al., 2018), particle swarm algorithms (Zhang and Liu, 2008), and quantum particle swarm algorithms (Liu et al., 2019). However, finding the ideal parameters is time-consuming (Liu et al., 2020).

Relevance vector regression (RVR) is an alternative Bayesian method first described in Tipping (2001). It can, in principle, overcome the drawbacks of SVR. RVR does not rely on human experience to set the penalty parameter. RVR involves dramatically fewer key vectors than SVR at comparable accuracy, so the RVR learning model is sparser, which shortens prediction time on test data (Bishop and Tipping, 2000). Through kernel learning, RVR also projects the original data into a high-dimensional space, which makes the data easier to predict. In addition, RVR does not require the kernel function to meet Mercer's condition, so in theory any function can be used as a kernel (Burden and Winkler, 2015). RVR has been successfully applied in many fields, whereas applications in geophysics have rarely been reported.
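To make the sparsity and automatic hyperparameter estimation concrete, the following is a minimal sketch of sparse Bayesian regression in the style of Tipping (2001), not the authors' implementation. It assumes a Gaussian (RBF) kernel, one prior precision `alpha` per weight, a noise precision `beta`, and the classic iterative re-estimation updates; the function names, the kernel width, the pruning threshold, and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np

def rbf_design(X, centers, width):
    """Design matrix: a bias column followed by Gaussian kernel columns."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.hstack([np.ones((X.shape[0], 1)), np.exp(-d2 / (2.0 * width ** 2))])

def fit_rvr(X, t, width=2.0, n_iter=500, prune_at=1e6):
    """Sparse Bayesian regression by iterative re-estimation of alpha and beta."""
    n = X.shape[0]
    Phi = rbf_design(X, X, width)
    alpha = np.ones(Phi.shape[1])            # prior precision of each weight
    beta = 1.0 / (0.1 * np.var(t) + 1e-12)   # initial noise precision (heuristic)
    keep = np.arange(Phi.shape[1])           # basis functions that are still active
    for _ in range(n_iter):
        P = Phi[:, keep]
        # Posterior covariance and mean of the active weights
        Sigma = np.linalg.inv(np.diag(alpha[keep]) + beta * P.T @ P)
        mu = beta * Sigma @ P.T @ t
        # gamma_i measures how well weight i is determined by the data
        gamma = np.clip(1.0 - alpha[keep] * np.diag(Sigma), 1e-12, 1.0)
        alpha[keep] = gamma / np.maximum(mu ** 2, 1e-24)
        resid = t - P @ mu
        beta = max(n - gamma.sum(), 1e-6) / (resid @ resid + 1e-12)
        # Weights whose precision diverges are effectively zero: prune them
        keep = keep[alpha[keep] < prune_at]
    # Recompute the posterior mean on the final active set
    P = Phi[:, keep]
    Sigma = np.linalg.inv(np.diag(alpha[keep]) + beta * P.T @ P)
    mu = beta * Sigma @ P.T @ t
    return {"keep": keep, "mu": mu, "centers": X, "width": width}

def predict_rvr(model, X_new):
    """Predict with the retained (relevance) basis functions only."""
    Phi = rbf_design(X_new, model["centers"], model["width"])
    return Phi[:, model["keep"]] @ model["mu"]

# Synthetic one-dimensional example (hypothetical data, not from the paper)
rng = np.random.default_rng(0)
X_demo = np.linspace(-10.0, 10.0, 60)[:, None]
t_demo = np.sinc(X_demo[:, 0] / np.pi) + 0.1 * rng.standard_normal(60)
model_demo = fit_rvr(X_demo, t_demo)
print("active basis functions:", model_demo["keep"].size, "of", 61)
```

No penalty parameter is tuned by hand here: the precisions are re-estimated from the data, and basis functions whose precision exceeds the pruning threshold are discarded, which is where the sparsity comes from.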

In practice, the distribution of the data in different wells is not completely consistent. Within a single well, the data distribution for different lithofacies may also be inconsistent. This issue could be addressed by training an individual machine learning model with a large number of training samples. Unfortunately, training data are limited in practice, so a single expert usually cannot account for all the features. In this situation, we can train multiple models that are responsible for different parts of the data to enhance accuracy. The mixture of experts (ME) can achieve this goal. The original ME is a neural network that trains multiple models for local regions of the input data (Jacobs et al., 1991; Meeds and Osindero, 2006; Jain, 2019). It is competitive in regression and classification for non-stationary data. In addition, ME is flexible because, in theory, most machine learning algorithms can be combined within it (Lima et al., 2007; Chao and Neubauer, 2008; Yuksel and Gader, 2010). Each model represents an expert and all experts are integrated by a gating function. ME is a compromise between a single global learning model and multiple local learning models (Meeds and Osindero, 2006; Kim-Anh et al., 2010). ME allows each expert to specialize in a different, smaller part of a complex problem (Kim-Anh et al., 2010). The gating function is responsible for partitioning the input dataset and assigning regions to the individual experts.

We develop a reservoir properties prediction method based on ME to divide and conquer the original dataset. We employ RVR models as experts, and the learning model is a weighted sum of the experts, with weights given by a gating function. The proposed mixture of relevance vector regression experts (MRVRE) method decomposes a large, complex prediction problem into several small regression problems; its structure is shown in Fig. 1. The presented method does not assume that the data are stationary. It also does not depend on the rock physics modeling that many traditional methods use to transform elastic attributes into reservoir properties.
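The weighted sum of experts described above can be sketched as follows. This is an illustration under stated assumptions rather than the paper's formulation: the gating function is assumed to be a linear softmax over the input features (a common choice in the ME literature), and the names `softmax`, `mixture_predict`, `gate_W`, and `gate_b` are hypothetical. Each expert is any callable that maps inputs to predictions, so RVR models could be plugged in.

```python
import numpy as np

def softmax(z):
    """Row-wise softmax, shifted for numerical stability."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def mixture_predict(X, gate_W, gate_b, experts):
    """Combine expert predictions with input-dependent gating weights.

    X       : (N, d) input features
    gate_W  : (d, K) gating weights (assumed linear-softmax gate)
    gate_b  : (K,) gating biases
    experts : list of K callables, each mapping (N, d) -> (N,)
    """
    gates = softmax(X @ gate_W + gate_b)                 # (N, K), rows sum to 1
    preds = np.column_stack([f(X) for f in experts])     # (N, K)
    return (gates * preds).sum(axis=1), gates

# Two toy experts standing in for trained regressors (hypothetical)
experts_demo = [lambda X: 2.0 * X[:, 0], lambda X: -X[:, 0]]
X_demo2 = np.array([[-2.0], [0.5], [3.0]])
gate_W_demo = np.array([[4.0, -4.0]])   # favors expert 0 for positive inputs
gate_b_demo = np.zeros(2)
y_demo2, g_demo2 = mixture_predict(X_demo2, gate_W_demo, gate_b_demo, experts_demo)
print(y_demo2)
print(g_demo2)
```

Because the gating weights depend on the input, each sample is dominated by the expert assigned to its region of the feature space, which is the sense in which the gate partitions the data.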

The key contributions of this manuscript are: (1) the superiority of combining the mixture of experts with the relevance vector regression algorithm is demonstrated in terms of likelihood of fit and accuracy; (2) the method for reservoir properties prediction has been successfully applied to well and seismic datasets; (3) the proposed method could potentially incorporate other experts, and may stimulate more investigations into this learning strategy in the geological and geophysical fields.

In the following sections, we will introduce the mathematical formulation and the related algorithms in detail, and then show some applications of the novel method on well and seismic data.

Section snippets

The input and output data

We aim to predict reservoir properties in wells and seismic areas where reservoir properties are not observed or interpreted. For each sample, the input is a vector consisting of input features, such as logging measurements and elastic attributes. The training data contain input attributes and target reservoir properties, while the test data contain only input attributes. Assuming a training dataset $D = \{X, r\}$ with $N$ samples where $X = \{x^{(n)}, n = 1, \dots, N\}$ denotes the input elastic attributes (density, P-

Test on well data

Initially, we test the method on a well dataset from a work area in China. Only three wells (A, B, and C) are available. Well A, with a total of 4000 samples, is used as training data, and well B, with a total of 3800 samples, is used as a validation well to analyze the performance of MRVRE with varying numbers of experts. Well C contains 5440 samples. The distance between wells A and B is about 3500 m, and well C is far from well A, at a distance of 2700 m. Then, the
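The abstract compares models by the mean absolute deviation (MAD), root mean square error (RMSE), coefficient of determination ($R^{2}$), and Akaike information criterion (AIC). A minimal sketch of how such metrics could be computed on a validation well is given below. The exact definitions used in the paper are not shown in this excerpt, so the following are assumptions: MAD is taken as the mean absolute prediction error, and AIC uses the least-squares form $N \ln(\mathrm{RSS}/N) + 2k$ under Gaussian errors, where `n_params` ($k$) is a hypothetical count of free model parameters.

```python
import numpy as np

def regression_metrics(y_true, y_pred, n_params):
    """Return MAD, RMSE, R^2, and a least-squares AIC for a set of predictions."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    resid = y_true - y_pred
    n = y_true.size
    rss = float(resid @ resid)                       # residual sum of squares
    tss = float(np.sum((y_true - y_true.mean()) ** 2))  # total sum of squares
    return {
        "MAD": float(np.mean(np.abs(resid))),
        "RMSE": float(np.sqrt(rss / n)),
        "R2": 1.0 - rss / tss,
        "AIC": float(n * np.log(rss / n) + 2 * n_params),  # assumed Gaussian form
    }

# Toy values (hypothetical, not from the paper)
scores = regression_metrics([1.0, 2.0, 3.0, 4.0], [1.5, 2.0, 2.5, 4.0], n_params=2)
print(scores)
```

Lower MAD, RMSE, and AIC and higher $R^{2}$ indicate a better model; AIC additionally penalizes the number of parameters, which matters when comparing mixtures with different numbers of experts.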

Discussion

The computational time of the method is mainly spent on the training process and depends on the number of samples. The method includes three iterative processes. The first iteration (Step 3) is for the whole model, and the iteration number in all examples of this paper is about 5 to 15. Too few iterations produce unsatisfactory predictions, whereas too many extend the training time. The inner iteration involves the training of experts (Step 3.2) and the calculation of

Conclusion

We propose a novel quantitative reservoir properties prediction method that employs a mixture of relevance vector regression experts model. The major advantage of the proposed method is that it uses multiple experts, in cooperation with a gating function, to dynamically divide and conquer the data, which is more targeted than using an individual learning model. Therefore, it allows capturing the intricate relations between reservoir properties and elastic attributes, then improving the

CRediT authorship contribution statement

Xingye Liu: Conceptualization, Methodology, Software, Writing. Guangzhou Shao: Resources, Data curation, Writing – review & editing. Cheng Yuan: Reviewing, Resources. Xiaohong Chen: Investigation, Supervision, Validation. Jingye Li: Supervision, Validation. Yangkang Chen: Visualization, Writing – review & editing.

References (55)

Babasafari, A.A., et al., 2021. Practical workflows for monitoring saturation and pressure changes from 4D seismic data: A case study of Malay Basin. J. Appl. Geophys.
Gholami, R., et al., 2014. Applications of artificial intelligence methods in prediction of permeability in hydrocarbon reservoirs. J. Pet. Sci. Eng.
Lima, C.A.M., et al., 2007. Hybridizing mixtures of experts with support vector machines: Investigation into nonlinear dynamic systems identification. Inform. Sci.
Liu, X., et al., 2021. Extreme learning machine for multivariate reservoir characterization. J. Pet. Sci. Eng.
Otchere, D.A., et al., 2021. Application of supervised machine learning paradigms in the prediction of petroleum reservoir properties: Comparative analysis of ANN and SVM models. J. Pet. Sci. Eng.
Xing, H., et al., 2008. An adaptive fuzzy c-means clustering-based mixtures of experts model for unlabeled data classification. Neurocomputing.
Zhou, L., et al., 2020. Bayesian time-lapse difference inversion based on the exact Zoeppritz equations with blockiness constraint. J. Environ. Eng. Geophys.
Ahmed, O.A., et al., 2010. Reservoir property prediction using abductive networks. Geophysics.
Alvarez, G., et al., 2003. Lithologic characterization of a reservoir using continuous-wavelet transforms. IEEE Trans. Geosci. Remote Sens.
Babasafari, A.A., et al., 2020. A new approach to petroelastic modeling of carbonate rocks using an extended pore-space stiffness method, with application to a carbonate reservoir in Central Luconia, Sarawak, Malaysia. Leading Edge.
Baron, L., et al., 2011. Constraints on the permeability structure of alluvial aquifers from the poro-elastic inversion of multifrequency P-wave sonic velocity logs. IEEE Trans. Geosci. Remote Sens.
Bashir, Y., et al., 2021. Seismic expression of Miocene carbonate platform and reservoir characterization through geophysical approach: application in Central Luconia, offshore Malaysia. J. Pet. Explor. Prod.
Bishop, C.M. Bayesian regression and classification.
Bishop, C.M., et al., 2000. Variational relevance vector machines. Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence.
Bo, L., et al. Fast algorithms for large scale conditional 3D prediction.
Burden, F.R., et al., 2015. Relevance vector machines: sparse classification methods for QSAR. J. Chem. Inform. Model.
Chao, Y., et al. Variational mixture of Gaussian process experts.
Feng, R., et al., 2018. Reservoir lithology determination by hidden Markov random fields based on a Gaussian mixture model. IEEE Trans. Geosci. Remote Sens.
Fournier, F., et al., 1995. A statistical methodology for deriving reservoir properties from seismic data. Geophysics.
Grana, D., et al., 2010. Probabilistic petrophysical-properties estimation integrating statistical rock physics with seismic inversion. Geophysics.
Gu, Y., et al., 2017. Multiple kernel learning for hyperspectral image classification: A review. IEEE Trans. Geosci. Remote Sens.
Ioannis, P., et al., 2010. Multiclass relevance vector machines: sparsity and accuracy. IEEE Trans. Neural Netw.
Jacobs, R.A., et al., 1991. Adaptive mixtures of local experts. Neural Comput.
Jain, V., 2019. Class-based machine learning for next-generation wellbore data processing and interpretation. In: SPWLA...
Jordan, M.I., et al., 1994. Hierarchical mixtures of experts and the EM algorithm. Neural Comput.
Kanaujia, A., et al. Learning ambiguities using Bayesian mixture of experts.
Kim-Anh, L.C., et al., 2010. Integrative mixture of experts to combine clinical factors and gene markers. Bioinformatics.

Funding: This work is financially supported by the Fundamental Research Funds for the Central Universities, CHD (300102261504) and Natural Science Basic Research Program of Shaanxi Province, China (2021JQ-561).
