Thursday, October 23, 2025

Enhancing Petrographic Image Classification of Complex Carbonate Rocks in Brazil’s Pre-Salt using Convolutional Neural Networks

Share

Interpreter Facies Classification: An In-Depth Exploration

Introduction to Facies Classification

The classification of facies in geology is crucial for understanding subsurface formations, particularly in carbonates where the textural complexities can significantly impact reservoir quality. This article delves into the manual classification of thin sections, highlighting the various classification schemes used to categorize carbonate facies. The classification not only aids in geological interpretation but also enhances predictive models that inform drilling and production strategies.

Overview of Classification Schemes

Figure 4 illustrates the manual classification of thin sections across the dataset, emphasizing the unevenness in data distribution. In the simplified classification scheme, over 50% of the samples fall under the category of grainstones (represented in light blue). This dominance persists in both intermediate (grainstone) and complete (mixed grainstone) classifications, although the proportions decrease to 49% and 42%, respectively. This trend reflects the uneven subdivision of "subfacies" within the grainstone category, underscoring the necessity for more detailed classifications.

The second most predominant category consists of silicified carbonates (shown in yellow), which account for 20% of the total samples. Within this group, the silicified grainstone "subfacies" emerges as the most common, illustrating significant variability that is captured more fully in the complete classification scheme. Interestingly, several facies are underrepresented in the overall dataset, with nine classes comprising only 1% or less of the thin sections, reflecting the challenges of comprehensive classification.

Porosity Evaluation across Classifications

The estimated porosity behaviors for facies reveal several insights as we compare the different classification schemes. The simplified classification provides a reasonable overview of porosity trends, but it lacks granularity. As noted, key distinctions are obscured when "subfacies" with distinctive porosity characteristics are grouped together.

The intermediate classification adeptly distinguishes between pure and volcanic grainstones, which exhibit markedly different petrophysical behaviors. This differentiation is particularly important since pure grainstones typically show enhanced reservoir potential due to their superior porosity characteristics. Moreover, various types of mudstones display distinct behaviors, further reinforcing the necessity for nuanced classification.

The complete classification scheme uncovers subtler variations, such as the higher porosity observed in completely silicified carbonates versus the lower porosity in silicified spherulitestones. These findings underscore how variations in textural and genetic types of silica directly correlate with porosity, an essential consideration for reservoir modeling.

Computational Model Analysis: Performance Metrics

To assess the efficacy of the classification schemes, our study explored five computational models, summarized in Table 4 and illustrated in Figure 5. Each model was rigorously evaluated against three classification schemes using a 70/30 train-test split to ascertain performance metrics including F1-score and accuracy.

While models performed favorably under simplified and intermediate classification schemes, a notable decline in performance was evident with the complete classification scheme. For example, the MobileNet v3 large model achieved an impressive F1-score of 0.795 for simplified classification; however, the RegNet Y 400MF model only reached an F1-score of 0.528 for the complete classification scheme. This drop is primarily attributed to the non-uniform class distribution, where certain classes had a minimal number of training samples.

Misclassifications: Insights from Confusion Matrices

Examining the confusion matrices reveals important insights about model predictions. In the simplified classification (Fig. 6), most models performed well, achieving scores above 0.7. However, misclassifications occurred primarily between shrubstones and spherulitestones, often misidentified as grainstones. This confusion extends to the intermediate classification (Fig. 7), where new categories further complicate predictions. The challenge arises partly from the complexity of distinguishing between in situ and reworked classes, particularly when grainstones incorporate fragments from multiple lithologies.

In the complete classification scheme (Fig. 8), the models faced significant difficulties classifying numerous facies with overlapping features. Misclassifications varied, with higher-permeability grainstones occasionally mistaken for low-porosity spherulitestones. Understanding these misclassifications is crucial since they can lead to interpretative errors with significant geological ramifications.

Independent Blind Testing: Generalization of Models

To rigorously evaluate the generalization capabilities of the proposed workflow, a blind test was conducted using 200 thin sections from well SB-E3, unrelated to the training or validation processes. As anticipated, model accuracy and F1-scores exhibited a downward trend with increasing classification complexity, particularly evident in the complete scheme. Despite lower overall metrics, the models consistently identified silicified carbonates with high accuracy, reinforcing their practical application.

Additionally, misclassification patterns observed during blind testing echoed those identified during internal validation. Notably, volcanic grainstones and mudstones were misclassified as grainstones, suggesting that local depositional conditions may differ from those represented in training data.

Limitations of CNN Models and Mitigation Strategies

As with any machine learning approach, CNN models are susceptible to overfitting, especially when trained on moderate-sized datasets characterized by high intra-class variability. To combat this, data augmentation techniques were employed to enhance the diversity of the training data. We also utilized architectures with varying complexity, opting for lighter models to reduce the risk of overfitting.

Image quality and consistency remained paramount throughout the study, as variations in sample preparation and lighting conditions could significantly influence model efficacy. Future efforts should focus on standardizing imaging protocols to bolster these models’ reliability.

Moreover, the interpretability of CNN outputs is a recognized challenge. While not the focus here, future research could explore explainable AI techniques to develop greater transparency in model decisions, fostering confidence in automated classification workflows.

Future Directions: Embracing Emerging Technologies

Looking ahead, integrating other data modalities—such as geochemical or mineralogical analyses—holds promise for enhancing classification accuracy. Moreover, exploring emerging architectures, like Vision Transformers (ViTs), could yield improved understanding of complex carbonate systems due to their potential for capturing subtle textural variations. Although this study did not delve deep into ViTs, their inclusion in future investigations may significantly uplift performance metrics, particularly when supported by larger, well-balanced datasets.

In essence, the journey toward effective interpreter facies classification is ongoing, marked by continuous adaptation and advancement of methodologies to meet the intricate demands of subsurface exploration and analysis.

Read more

Related updates