Automatic detection and observation of mineral extraction sites using satellite images¶

Pierre Sledz (UNIGE), Clémence Herny (Exolabs), Roxane Pott (swisstopo), Gwenaëlle Salamin (Exolabs), Gregory Giuliani (UNIGE)

Proposed by UNIGE and STDL - PROJ-SATQUARRIES
February 2025 to July 2025 - Published on October 2025

This work by STDL is licensed under CC BY-SA 4.0

Abstract: This project builds on previous work to demonstrate the effectiveness of an automatic deep learning-based object detection algorithm for identifying Mining Extraction Sites (MES) in satellite imagery from the Swiss Data Cube and the Brazil Data Cube. The algorithm was trained using Earth Observation (EO) data, leveraging Open Data Cube (ODC) infrastructure, to detect MES across large areas. The trained models achieved f1-scores ranging from 49% to 76% on validation datasets, successfully identifying potential MES, though false positives (FPs) were observed, particularly due to confusion with features such as rock outcrops, water bodies, and deforested areas, as well as the influence of spectral resolution of certain layers. Despite these challenges, the framework enabled rapid detection, processing large areas within minutes or hours depending on the ground truths, and providing a high replicability of the object detector framework.
The automatic detection of MES with this method allows for efficient, large-scale monitoring, and could support the monitoring of MES evolution inventories over time. By using satellite imagery with high temporal resolution, the algorithm can offer valuable insights into the ongoing changes in mining activities, providing a significant advantage over manual mapping and in situ surveys in terms of time/cost savings. The integration of the Open Data Cube infrastructure also facilitates multi-layers analysis, with Analysis Ready Data already available for future feature tracking. The framework offers the potential for live-time updates, making it a valuable tool for improving the speed and efficiency of MES monitoring in Switzerland and in other regions of the world.

1. Introduction¶

Mineral extraction constitutes a strategic activity worldwide. Demand for mineral resources has been growing significantly in recent decades¹, mainly due to the rapid increase in the production of batteries and electronic chips, or buildings construction, for example. As a result, the exploitation of some resources, such as rare earth elements, lithium, or sand, is putting pressure on their availability. Being able to observe the development of mineral extraction sites (MES) is of primary importance to adapt mining strategy and anticipate demand and shortage.
The extraction plants, and the activity of mineral extraction more generally, can severely impact the environment in many different ways²³⁴: such as chemical waste, heat discharge, water pollution, and air pollution. MES implies the extraction of rocks and minerals from water ponds, cliffs, and quarries. The surface affected, initially natural areas, can reach up to thousands of square kilometres¹. Economic and political interests of some resources might overwhelm land protection, and conflicts are gradually intensifying³. This is particularly applicable in developing countries, where these sectors have the greatest impact on human health and the ecosystem⁵. According to Reed (2002)⁶, extractive industries are prevalent in these countries because they are often rich in natural resources, attract extractive industries due to economic dependency, weak regulations, and the need for foreign investments, but face challenges of poverty, weak governance, and exploitation. MES are dynamic features that can evolve according to singular patterns, especially if they are small, as they are the ones that can undergo the most modifications over time across the whole spectrum of worldwide MES characteristics. A site can expand horizontally and vertically or be filled⁷⁸. Changes can happen quickly, in a matter of months. As a results, updating the MES inventory can be challenging. In most cases, the management of MES, from their creation to their registration, is conducted by public administrations. In the context of mining operations within a nation's sovereign territory, the state is often responsible for the allocation of mining permits, entrusting third parties with the exploitation of its mineral resources through the grant of concessions⁹¹⁰. Given the significance of mining resources and their role in global supply chains, this sector presents significant economic opportunities, often escaping governmental oversight at various levels, whether within established mining enterprises or at the site of small-scale artisanal mining operation¹¹. Such activities often operate illegally and are often found in regions where mineral deposits are present, akin to the historical American gold rushes.
For the reasons given above, there is a crucial need for MES mapping and observation worldwide. The majority of MES mapping is performed manually by visual inspection of images¹. Alternatively, recent improvements in the availability of high spatial and temporal resolution space/airborne imagery and computational methods have encouraged the development of automated image processing, and helped assess evolution of MES¹². Supervised classification of spectral images is an effective method but requires complex workflow³¹³⁸ and can be considered as a valuable tool for landcover change detection applied to MES¹⁴¹⁵. More recently, few studies have implemented deep learning algorithms to train models to detect extraction sites in images and have shown high levels of accuracy⁴¹⁶¹⁷.

The STDL has developed a framework named object-detector to automatically detect objects in a georeferenced imagery dataset based on deep learning method¹⁸. It was used for the automatic detection of MES¹⁶¹⁷ in order to improve the process of mapping MES in Switzerland, and to keep the database up to date with the annual acquisition of the dataset of high-resolution (up to 10 cm) aerial images SWISSIMAGE. The method has proven its efficiency detecting MES with a trained model achieving a f1-score of 82%. The MES inferences for SWISSIMAGE from 1998 to 2024 were provided and reviewed by experts from swisstopo. The model's replicability to other regions or datasets with different characteristics has not been tested yet. While the framework has shown strong results, its performance in areas with varying data quality, landscapes, or environmental conditions remains uncertain, which limits its use beyond Switzerland. Indeed, high-resolution image dataset like SWISSIMAGE requires significant time and financial investment, with data collected over multiple flights during spring and summer to cover the whole country¹⁹ and an extensive manual post-processing to merge and correct images. In situ surveys with manned or unmanned devices are still costly and slow, often still used alongside remote sensing²⁰²¹. Thanks to Switzerland’s geographic scale and economic means, high-resolution data, such as SWISSIMAGE can be feasibly collected and integrated. However, for larger or global applications, satellite imagery offers a better scale-cost balance and more frequent updates, making it well-suited for MES detection over wide areas³²². Satellite sensors also provide richer spectral information beyond RGB images, improving landscape analysis²³.
The Open Data Cube (ODC) initiative seeks to provide a free and open data architecture solution, to facilitate the use of EO satellite data²⁴, by making national data available to other users or by processing and bringing together opensource data that already exist in the country²⁵. The Swiss Data Cube²⁶ and the Brazil Data Cube²⁷ joins this network and the development of an ODC community²⁸²⁹, acknowledging the potential of a central data archive which delivers decision-ready product.

This project mainly aims to develop the object-detector framework to enable the use of satellite images for automated MES detection, with the support of an ODC infrastructure. The performance of high-resolution SWISSIMAGE aerial images and satellite images in Switzerland will be evaluated. In addition, applying the framework to a new region in Brazil will allow its performance to be evaluated in another region.

The workflow developed from the proj-dqry framework and applied to satellite images is shown in Figure 1. First, a deep learning algorithm is trained using a mapped MES dataset that serves as ground truth (GT). After evaluating the performance of the trained model, the selected model was used to perform inference detection for a given layer dataset and area of interest (AoI). The results were filtered to discard irrelevant detections. The procedure was repeated for both data sources and each provided layers.

In this report, we first describe the data used, including the image description and the definition of AoI. Then we explain the model training, evaluation and object detection procedure. Next, we present the results of potential MES detection. Finally, we provide conclusion and perspectives. This project is a continuation of the following proj-dqry, therefore some elements are similar.

2 Data¶

2.1 Images sources¶

2.1.1 Satellite programs¶

2.1.1.1 Landsat-8¶

Landsat 8, launched in 2013 by NASA and the US Geological Survey (USGS), continues the long-standing Landsat EO program that began in the 1970s³⁰. It carries two main sensors: the Operational Land Imager (OLI), which provides multispectral images at 30 meters resolution and a panchromatic band at 15 meters, and the Thermal InfraRed Sensor (TIRS), with 100 meters resolution³¹. Each scene covers an area of roughly 185 by 180 km, with a revisit time of 16 days. Since 2008, the USGS’s open data policy has dramatically increased access to Landsat images, especially benefiting regions with limited resources³². However, cloud cover and the relatively long revisit time limit the availability of cloud-free images, reducing the number of good acquisitions each year³³³⁴. Landsat 8 data is offered in several correction levels, including Level-1 (systematically corrected) and Level-2 (surface reflectance).

2.1.1.2 Sentinel-2¶

The Sentinel-2 mission, part of the European Space Agency’s Copernicus program, consists of two satellites, Sentinel-2A launched in 2015 and Sentinel-2B in 2017, each equipped with a Multispectral Instrument (MSI) capturing data in 13 spectral bands at varying spatial resolutions between 10 and 60 meters³¹³⁰. Sentinel-2 covers a wider swath of 290 km and revisits the same area approximately every 5 days, improving temporal resolution compared to Landsat 8. Sentinel-2 data is processed into different levels, including Level-1C (Top-of-Atmosphere reflectance with geometric corrections) and Level-2A (Bottom-of-Atmosphere reflectance with atmospheric corrections).

2.1.2 Open Data Cube platforms¶

2.1.2.1 Swiss Data Cube¶

The Swiss Data Cube (SDC) is a comprehensive data source for EO analytics²⁶³⁵, offering a database of over 35 years of satellite data, including from the Landsat and Sentinel missions. The system is designed to transform raw satellite imagery into standardised Analysis Ready Data (ARD), with the aim of optimising the data for the purposes of environmental monitoring and time-series analysis. The SDC facilitates access to pre-processed and validated datasets for a range of applications, including land use classification, vegetation dynamics, water resource management, and snow cover mapping³⁶²⁹³⁷. The SDC has been constructed on Open Data Cube architecture, thereby ensuring seamless data integration, storage, and analysis³⁸.

SDCworkflow — *Figure 2: Swiss Data Cube ARD products from Chatenoux et al. (2021).*

The SDC acquires and processes ARD products which are continuously updated²⁸. This archive integrates data from prominent EO satellite programs. The SDC employs Python scripts for automated data download and processing, ensuring that data from these satellites are ingested into the system for seamless access. The ingestion process includes atmospheric corrections, topographic corrections and the generation of multitemporal backscatter composites²⁶.
These ARD products are available in a consistent format, allowing for easy time-series analysis and multi-sensor integration. The data is archived with backup copies and reprocessing capabilities, ensuring that future updates or improvements to processing algorithms can be easily incorporated. This structured workflow guarantees that the archive remains up-to-date and ready for analysis, with users benefiting from the highest quality and consistency in the data. See Figure 2 for a summary of the workflow for generating ARD products.
The SDC, with its localised, real-time monitoring and time-series analysis, could be particularly useful for tracking the evolution of MES with higher temporal resolution in Switzerland.

2.1.2.2 Brazil Data Cube¶

The Brazil Data Cube (BDC) is a national initiative developed by the National Institute for Space Research (INPE), aimed at processing large volumes of medium-resolution remote sensing data for environmental monitoring across Brazil²⁷. The BDC is part of a broader effort to monitor Brazilian biomes, with a focus on the Amazon region. The workflow to generate ARD is similar to the one of SDC and follows the ODC initiative³⁹²⁴. However, the focus of the finished products differs slightly: the BDC utilizes advanced computational methods, including AI and machine learning, to analyse time-series data and to produce highly effective models for large-scale land-use and land-cover mapping across Brazil's biomes ⁴⁰⁴¹.

2.2 Images¶

This section describes the images used for this project. Table 1 summarises the main characteristics of the selected images.

Product	Type	Year	Coordinate system	Spatial resolution
SWISSIMAGE Journey	True colour RGB	2018-2020	CH1903+/MN95 (EPSG:2056)	0.10 m (\(\sigma\) \(\pm\) 0.15 m) - 0.25 m
Landsat8/9 Collection 2 Level 2 Surface reflectance and Temperature (landsat_ot_c2_l2)	True colour Bands 4 (red), 3 (green), and 2 (blue)	11/08/2020 / 20/08/2020	WGS 84 / Pseudo-Mercator (EPSG:3857)	30 m
Landsat-8/OLI image mosaic of Brazilian Amazon (mosaic-landsat-amazon-3m)	True colour Bands 4 (Red), 3 (Green) and 2 (Blue)	07/2016 - 09/2016	WGS 84 / Pseudo-Mercator (EPSG:3857)	30 m
Landsat-8/OLI image mosaic of Brazil (mosaic-landsat-brazil-6m)	False colour Bands 6 (SWIR), 5 (NIR), and 4 (Red)	07/2017 - 06/2018	WGS 84 / Pseudo-Mercator (EPSG:3857)	30 m
Sentinel-2 image Mosaic of Brazilian Amazon Biome (mosaic-s2-amazon-3m)	False colour Bands 11 (SWIR), 8A (NIR), and 4 (Red)	06/2022 - 08/2022	WGS 84 / Pseudo-Mercator (EPSG:3857)	30 m

Table 1: Characteristics of aerial images and ODC products.

2.2.1 swisstopo¶

Using EO data, the maximum zoom produced is level 14, which is lower than the zoom level 16 for SWISSIMAGE Journey products chosen for the previous project¹⁷. In order to compare models and maintain consistency, a model using SWISSIMAGE orthophotos at zoom level 14 was retrained for evaluation, and to match the maximum potential of satellite images. For this analysis only the images from mosaic of the year 2020 (a combination of 2020, 2019 and 2018 images acquisition) are needed and the images are georeferenced RGB TIF tiles with a size of 256 x 256 pixels (1 km²sup>).

2.2.2 Swiss Data Cube¶

Only Landsat data was available on the SDC via Web Map Service³⁵⁴² (WMS). Landsat true colour images at the highest correction level (Level 2) and a 30 m spatial resolution were used (Table 1). Unfortunatly, accessing the SDC through the WMS connector delivers images that lack full processing, such as visualization corrections like contrast reduction or reflectance enhancement. This omission explains their darker appearance compared to the more refined datasets accessible via the SDC's API.

We chose images from August 2020 for several reasons, first and foremost because they coincide with the swissTLM3D ground truth acquisition and mapping period, so that we can fit in as close as possible. In fact, as indicated in the introduction, the survey period for high-resolution SWISSIMAGE images is spread over a long period, starting at the very beginning of spring in the west of Switzerland and gradually covering the rest of the country, and for mountainous regions it is even necessary to wait until the very end of summer when snow cover is at its lowest. The date of 11 August 2020 has been chosen for training the model because it is the one with the largest footprint to encompass as much ground truth as possible, as well as a minimal cloud cover compared with the other dates available. August 20 has been used for detection and inference, so that the model can be applied to the entire country, as Eastern Switzerland is excluded from August 11.

The different footprints and data information’s can be checked on the SDC explorer.

2.2.3 Brazil Data Cube¶

The products used as input for the object-detector through the BDC are different from the SDC, as we will be using Landsat (30 m) and Sentinel (10 m) mosaics that have been generated over an extended period of time in order to compose cloud-free mosaics (Table 1). This is the advantage of these mosaics, as they allow us to have images of the highest quality, even if they don't have the same temporal resolution as simple updated layers. The BDC uses a Least Cloud Cover first (LCF) algorithm to perform the temporal compositing. It first applies cloud masking to each image, then selects pixels based on their reliability, favouring images with higher percentages of clear pixels⁴³. By sorting images by cloud coverage, LCF selects the clearest pixels for each time step, ensuring high-quality composites with minimal cloud impact. Two of the mosaics are focusing on the Amazon biome and the third represent Brazil as a whole. The false colour composite images, registered in RGB format³¹, make it possible to distinguish the elements of the landscape thanks to their spectral properties, with the vegetation really standing out, as well as the bare ground, and the contrasting water bodies which are much darker⁴⁴⁴⁵⁴⁶⁴⁷.

2.3 MES labels¶

2.3.1 Switzerland ground truth¶

The MES labels originate from the swiss Topographic Landscape Model 3D (swissTLM3D) produced by swisstopo. swissTLM3D is a large-scale topographic landscape model of Switzerland, including manually drawn and georeferenced vectors of objects of interest at a high resolution, including MES features. Domain experts from swisstopo have carried out extensive work to review the labeled MES and to synchronise them with the 2020 SWISSIMAGE mosaic to improve the quality of the labeled dataset. A total of 266 labels are available. The mapped MES reveal the diversity of MES characteristics, such as the presence or absence of buildings/infrastructures, trucks, water pounds, and vegetation¹⁷.
Changing the zoom level affects the resolution by a factor of 2, leading to a loss of information between zoom level 16 (resolution of 1.6 m px^-1) used in the previous study¹⁷ and zoom level 14 (resolution of 6.4 m px^-1) selected in this project. With regards to satellite images, we can see that the information visible in the MES is completely lost and that, without the presence of ground truth it could be difficult to distinguish the MES from the rest of the territory (Fig. 3).

NOTE: In this report, every figure is systematically oriented to the north, allowing us to remove the orientation of the figure layout for better clarity and synthesis of the information.

Images_z14 — *Figure 3: Side-by-side comparison of SWISSIMAGE image (left) and Landsat-8 image (right) at zoom level 14 for the same label polygon.*

In addition, as the footprint of the Landsat raster of August 11, 2020, does not cover the entire Swiss territory (Fig. 4), some labels had to be removed to retain only those that overlapped with the footprint of the raster layer. Thi leaves us with 236 labels for the model using the SDC data, compared to the 266 used for the 2020 SWISSIMAGE mosaic.
These labels are used as the ground truth (GT), i.e. the reference dataset indicating the presence of a MES in an image. The GT is used both as input to train the model to detect MES and to evaluate the model performance.

Landsat-8_footprint — *Figure 4: Footprint of the SDC Landsat-8 raster of August 11-2020, with overlapping labels.*

2.3.2 Brazil ground truth¶

Brazil is an ideal location to test the object detector with another GT dataset, as there are a large number of MES labels available, while also benefiting from access to an ODC.

Two datasets were used:

the first one is presented in Maus et al. (2020)¹. It is a GT dataset which was designed to enable other studies to validate machine learning models, remote sensing analyses, and spatial assessments in the field of mineral extraction. The dataset contains manually delineated polygons representing 21,060 mining areas, derived from satellite imagery within a 10 km radius of known active mining sites.
This dataset features 2,427 labels at a Brazil-wide scale, of which 1,487 are clipped to the Amazon biome and footprint of our BDC rasters. The labels are highly diverse in terms of both surface area and shape. They represent a comprehensive range of highly polygonal shapes with very detailed delineation as shown in Figure 5. Labels were not synchronised for each mosaic in the BDC, as the date of acquisition of the labels did not coincide with the temporal compositing of the mosaics. In addition, this dataset gives a new dimension of quarries compared to previous studied standards since, for example, MES in Figure 5C is approximately 42 km long.

GT_Maus — *Figure 5: Examples of MES from Maus et al. (2020) mapped on Sentinel-2 false colour tiles from the BDC (scale not uniform).*

the second dataset is not designed to be used as GT, since it is the outputs from the Earthrise Media Mining Detector, which is an open source project designed to automatically detect artisanal and industrial gold mining activities using Sentinel-2 satellite imagery focusing on the Amazon basin. The Earth Genome repository provides open source access to its yearly results since 2018, and we use these results, which are already the result of automatic detection, as ground truth. As this project started in 2018 it will not be available for the layer: mosaic-landsat-amazon-3m (Table 1). As this is another type of MES, with artisanal mines which are much smaller overall, we have a dataset that is different than the previous one as the Figure 6 can attest, it has a more cell-like delimitation and is more centred on the objects of study.

GT_EG — *Figure 6: Examples of MES outputs for the 2018 year from Earth Genome.*

When clipped to the corresponding BDC layers extent, it represents 2766 labels for mosaic-landsat-brazil-6m and 3970 labels for mosaic-s2-amazon-3m. Given the very large number of labels, it is impossible for us to check whether they are all quarries, and it is important to consider that these are not 100% accurate labels even though this project follows just like the object detector a heavy post-processing filtering based on confidence score threshold (> 0.6). Setting aside the size of the labels, the first sample is more detailed and polygonal, whereas the second sample takes the form of individual cells and/or aggregates.

2.4 Areas of interest (AoI)¶

As explained in Section 2.2.1, the SWISSIMAGE mosaics are composed over several years, as such acquisition footprints of yearly acquired orthophotos were used as AoI to perform MES detection over Switzerland¹⁷. For Brazil, the inference AoI was defined in order to contain a maximum number of labels for each GT dataset, and within computational limits. The AoI is an area of considerable size, with a total area of approximately 500,000 km² (Fig. 7). It encompasses a wide variety of landscapes and land covers, thereby ensuring that the terrain is varied and not uniform, and that land cover is not dominated by a single use.

AoI-Brazil — *Figure 7: AoI used for BDC models inference, approximately 500,000 km² in the central/eastern part of the Amazone Biome (layer: mosaic-landsat-amazon-3m).*

3. Satellite image fetching¶

Pre-rendered SWISSIMAGE tiles (256 x 256 px) are downloaded using a Web Map Tile Service (WMTS) via an XYZ connector. Similarly, tiles (256 x 256 px) are extracted from the georectified rasters stored on the ODC servers and are accessed directly via a WMS with an URL which acts as an end point. All tiles are served on a cartesian coordinates grid using a Web Mercator Quad projection and a coordinate reference system EPGS 3857. Position of a tile on the grid is defined by x and y coordinates and the pixel resolution of the image is defined by z, its zoom level.
The URL follows the OGC WMS protocol and differs depending on which Data Cube is used. The correct product must be extracted from the server by specifying its layer name in the URL (e.g. Landsat_ot_c2_l2), and the remaining parameters of the URL are defined according to what the server provides and is indicated in the get capabilities of the end point. Depending on the ODC and the layer selected, a temporal component may be chosen, for the landsat_ot_c2_l2 product’s raster in the SDC, in which case the server format must be respected.

The different URL queries used in the object-detector for the layers mentioned in Table 1 are summarized in the Table A1 of Appendix A and can be used as a template. Their structure differs depending on the server.

4. Automatic detection methodology¶

4.1 Deep learning algorithm for object detection¶

Training and inference detection of potential MES were performed with the object-detector framework. This project is based on the open source detectron2 framework⁴⁸, implemented with PyTorch by the Facebook Artificial Intelligence Research group (FAIR). Instance segmentation (delineation of object) was performed with a Mask R-CNN deep learning algorithm⁴⁹. It is based on a Recursive-Convolutional Neural Network (CNN) with a backbone pre-trained model ResNet-50 (50 layers deep residual network). Images were annotated with custom COCO object based on the labels. The model is trained with this dataset to later perform inference detection on images. If the object is detected by the algorithm, a pixel mask is produced with a confidence score (0 to 1) attributed to the detection. The object detector framework permits to convert detection mask to georeferenced polygon that can be used in GIS software’s (more detailed information about this part can be found in Herny et al. (2024)¹⁷).

4.2 Model training¶

Rasters from the different ODC products, for which the GT has been defined, were chosen to proceed the model training. Tiles intersecting labels were selected and split randomly into three datasets: the training dataset (70%), the validation dataset (15%), and the test dataset (15%). The primary objective of this project is to train models using various combinations of GTs and images, and to evaluate and compare their performance. The algorithm hyperparameters (Table 2) were tuned in order to optimize the model’s performances considering each datasets specific characteristics. The training durations were obtained using an NVIDIA L4 GPU machine with 16 GB of RAM.

Model	Products	GT Dataset	Number of labels	image/batch	Learning rate	Learning rate decay	Checkpoint period	Max iteration	Optimal iteration	Training duration
1	SWISSIMAGE	swissTLM3D	266	2	0.005	0.0001	200	3000	1199	11 min
2	SDC landsat_ot_c2_l2	Reduce swissTLM3D	236	2	0.005	0.0001	200	3400	1199	12.6 min
3	BDC mosaic-landsat-amazon-3m	Maus et al. (2020)	1487	12	0.01	0.0001	400	8000	3199	3.6 hr
4	BDC mosaic-landsat-brazil-6m	Maus et al. (2020)	2427	16	0.01	0.0001	400	10000	3199	6.1 hr
5	BDC mosaic-s2-amazon-3m	Maus et al. (2020)	1487	12	0.01	0.0001	400	8000	2799	3.6 hr
6	BDC mosaic-landsat-brazil-6m	Earth Genome	2766	16	0.01	0.0001	400	10000	2799	6.2 hr
7	BDC mosaic-s2-amazon-3m	Earth Genome	3970	24	0.01	0.0001	400	10000	4799	9.3 hr

Table 2: Training parameters for all models at zoom level 14.

For example, for model 4 (Landsat-8), 5,938 tiles were produced for the training process and 160,953 tiles (~20 GiB) were produce for the inference on the Brazil AoI.
In order for the models to learn effectively and avoid significant underfitting, the learning rate and number of images per batch had to be increased significantly, from model 3 to model 7, to compensate the effects of the number of labels used in input. Such parameters favour faster convergence but result in a more deterministic training which introduces noise. Corresponding total loss curves show important oscillation (Fig. B1, Appendix B), and a slower total loss decrease, meaning the model has difficulty to settle⁵⁰⁵¹⁵². Despite nearly identical hyperparameters during training, the total loss curves of models 4 and 6 exhibit noticeable differences oscillation patterns, attributable only to differences in the nature and characteristics of the labels. The complexity of the GT datasets can be the direct cause, but further tuning could improve the model. The optimal detection model is the one minimising the validation loss curve.

1.3 Metrics¶

Each model’s performance and detection reliability were assessed by comparing the results to the GT. The detection performed by the model can be either (1) a True Positive (TP), i.e. the detection is real (spatially intersecting the GT); (2) a False Positive i.e. the detection is not real (not spatially intersecting the GT) or (3) a False Negative (FN) i.e. the labelled object is not detected by the algorithm (Fig. 8).

Metrics presented in Figure C1 (Appendix C) are computed such as: - the recall, translating the amount of TP detections predicted by the model:

\[recall = \frac{\sum_k TP_k}{\sum_k (TP_k + FN_k)}\]

the precision, translating the number of well-predicted TP among all the detections:

\[precision = \frac{\sum_k TP_k}{\sum_k (TP_k + FP_k)}\]

the f1-score, the harmonic average of the precision and the recall:

\[f1 = 2 \times \frac{recall \times precision}{recall + precision}\]

5. Analysis of the automatic detection models¶

5.1 Model performance¶

The performance of each model vary (Table 3) according to all the elements mentioned so far, from the nature of the data itself to the training parameters. When considering the potential impact of GT datasets, it is also important to consider the impact of the enhanced training settings outlined in Section 4.2 for model 3 to 7.
SWISSIMAGE at zoom level 16 achieved a f1-score of 82%¹⁷. In comparison, the performance of model 2 using satellite images on the same labels was lower. With this in mind, the performance of model 2, are reasonably below the other models and is the least performing across the board. Given the GT dataset, a 30 m spatial resolution and dark true colour image doesn’t suit the needs of the mapped MES in the swissTLM3D dataset, demonstrated by the recall score of 40% meaning a significant number of objects are missed.

Model	Precision	Recall	F1
1	61%	70%	65%
2	62%	40%	49%
3	48%	42%	45%
4	64%	48%	55%
5	64%	50%	56%
6	73%	55%	63%
7	72%	61%	67%

Table 3: Metrics value computed for the validation dataset for each model.

Key observations can be made based on the presented results:

It has been suggested that false colour images may have the potential to improve performance in comparison to true colour models. It seems that this difference is highlighted by models 4 and 5 achieving an overall f1-score 10 points higher than model 3, and there appears to be a noticeable difference in precision due to the ability of false colour images to enhance contrast and feature delimitations.
A comparison of models 4 to 7 also highlights the minor impact of spatial resolution on model performance, in the context of the GT datasets used with the BDC. It could be assumed that when using true colour and false colour images, spatial resolution is outweighed by spectral resolution.
Beyond image datasets, the differences between Earth Genome models (models 6 and 7) and Maus et al. (2020)¹ models (model 3, 4 and 5), suggest that it is the characteristics of the GT labels that determines the ability of the models to perform, since the models use practically identical images and hyperparameters. The object detector may respond better to cell shaped labels than polygonised labels, which may be too complex. It is possible that the Maus et al. (2020)¹ dataset is too diverse, and that it would be more efficient to train models by label categories when datasets are to diverse unlike the swissTLM3D dataset.

Experimenting with varying label counts/characteristics during the training process could provide insights into the relationship between label specificities and performance. This approach could directly influence model adaptability to different image resolutions and GT datasets. Determining a possible optimal range of labels to use as inputs in the object detector based on image resolution (spatial and spectral) and label characteristics could be a step closer towards optimized performances.

6. Automatic detection of MES¶

6.1 Detection post-processing for Switzerland¶

Detection by inference was performed over previously mentioned AoIs (SWISSIMAGE footprints) with a minimum threshold detection score of 0.3. The low score filtering was resulting in a large number of detections. Several detections may overlap, potentially segmenting a single object. In addition, a detection might be split into multiple tiles. To improve the pertinence and the aesthetics of the raw detection polygons, a post-processing procedure was applied.
Switching the image type did not change the inherent characteristics of the Swiss landscape. Consequently, a significant number of false positives still appeared in mountainous regions with Landsat-8 images, primarily due to rock outcrops and snow. An elevation filtering was applied using a Switzerland Digital Elevation Model (DEM, about 25 m px^-1) derived from the SRTM instrument (USGS - SRTM). Based on the previous results from different filter combination of the first project, the max altitude threshold value used here was 1200 m, excluding 3 MES with this filter.

Detection aggregation was also applied: first, polygons were clustered (K-means) according to their centroid position. The method involves setting a predefined number k of clusters. The highest detection score was assigned to the clustered detection. This method preserves the final integrity of detection polygons by retaining detection that has potentially a low confidence score but belongs to a cluster with a higher confidence score improving the final segmentation of the detected object. The value of the threshold score must be kept relatively low (i.e. 0.3) when performing the detection to prevent removing too many polygons that could potentially be part of the detected object. Then, spatially close polygons were assumed to belong to the same MES and are merged according to a distance threshold of 10 m. The averaged score of the merged detection polygons was ultimately computed. Finally, score filtering was applied to the clusters keeping only detections with a minimal score of 0.95.
Detections with an area smaller than 5000 m² were filtered out, considering, as in the first project, that 13 MES are below this threshold, but for performance reasons, as previously concluded, the threshold could not be set at the real minimum of 2270 m².

In addition to rock outcrops and areas of snow, a significant number of FP detections were recorded in the extensive riverbeds due to the low water level period and the resultant sediment deposits and turbidity. This is why a slope filter has been designed to try to limit these detections. The goal was to exclude flat detections like riverbeds and improve the altitude filter by removing rock outcrops below 1200 m. The mean of the slope within the detection polygons is calculated. The calculation uses the same Switzerland Digital Elevation Model, with the slope layer processed at the same spatial resolution. A 1° to 48° slope range has been applied to remove all detections outside of this range. The maximum value is determined based on labels characteristics, ensuring a non-excluding filter. The filter is constrained by the resolution of the DEM and the Landsat-8 image. This is because detections do not focus only on the water of rivers but includes the banks themselves, and this with a significant margin (Fig. 9). Consequently, the average slope calculated is necessarily greater than 1°. Using a DEM with higher resolution, such as the swissALTI3D product, could perhaps solve this problem in combination with more precise satellite images like Sentinel products.

6.2 Detection post-processing for Brazil¶

Because of the completely different landscape, MES characteristics, and data availability the post-processing on Brazil models was lighter than the one applied to the SDC model. Based on the characterises of both GT datasets, elevation was not identified as an element useful to differentiate detections in the area studied and where the MES can be found. Due to the unavailability of accessible data, the implementation of slope filtering was not a considered option, but it remains a possible improvement and will be addressed in the conclusion. Detection aggregation was still carried out in the same way, with a 10m distance threshold and a 0.30 score value for the first dataset and a 0.90 score for the second. A lower score threshold allowed more detections to pass through and fitted better the complexity of the second dataset. The complexity and great variety of both the Earthrise Media Mining Detector and Maus et al. (2020)¹ dataset has also led us to take the decision not to set a minimum detection area.

6.3 Inference detections¶

Each trained model was used to perform inference detection respectively to their trained layer apart from model 2 where the August 20, 2020 layer was used for the inference of the 2019 footprint AoI. These initial inferences give a lot of feedback on potential improvements and how could EO data should be used for automatic MES detections. The detection results are presented in Table 4, showing the spatial intersections between GT and model detections. The evaluation of the automatic detection process, as well as the relevance of the results is difficult as the various GT datasets provided do not represent 100% of reality and do not always include every existing MES.

The results in Table 3 are not fully reflected in Table 4, but model 7 is nevertheless the best performing of all the models. Model 5 stands out for its overdetection of labels. This means that the object detector detects multiple times the same GT label, but in smaller split pieces. This aspect could be improve in the future by changing the post-processing parameters.

Model	Detections	Label detection (%)
1	1188	88%
2	1074	66.2%
3	977	45%
4	1031	86%
5	1780	131%
6	1431	68.7%
7	2378	90%

Table 4: Inference results for each model.

Despite the poor training performances of model 2 (Section 5.1), we can see that even with lower image resolution of Landsat-8, the object-detector can still accurately identify and delineate labels (Figure 10A). One of the many introductions of FP detections is linked to the presence of a cloud cover extent in the north of the layer, which is confused by the object-detector (Fig. 10B).

FP_clouds — *Figure 10: (A) Example of object detections for model 2, with labels in yellow and detected polygon in green. (B) Example of cloud cover related FP detections.*

The detections obtained from models 3, 4, and 5 demonstrate strong coherence with the distribution and characteristics of the GT labels from the Maus et al. (2020)¹ dataset. FP detections across all three models are predominantly located in riverbeds/banks and deforested areas in the central part of the AoI, this is apparent when comparing Figure 11A and 11B. It is important to note that these FP observations are not attributed to Earth Genome models. The polygonised shape of the labels may result in the object-detector experiencing confusion. Still, as demonstrated in Figure 11C, the delineation of detections is precise and follows the patterns of the labels, taking into account their shape and diverse scale.

Results_model3 — *Figure 11: (A) Distribution of model 3 detections within the AoI, (B) labels distribution, (C) example of object detections for model 3, labels in yellow and detections in blue.*

For both models, 6 and 7, the distribution of detections closely matches the Earth Genome datasets in the AoI as shown with model 7 (Fig. 12A and 12B). The object detector is able to smooth polygons and more precisely map MES compared to the labels. After manual verification, the inference can detect MES that were not part of the GT datasets which, as we recall, were not 100% accurate. This observation can be found in the inference of each model. MES present a large variety of features (buildings, water pounds, trucks, vegetation) which have been identified as a source of confusion for the algorithm in Herny et al. (2024)¹⁷. As illustrated in Figure 11C, the use of satellite data is unable to detect these anthropic objects, even when employing Sentinel-2 images. Two readings can be distinguished, characterised by either fewer confusing features or a lack of information for the algorithm, it could explain the difference in performance between the 2 projects.

Results_model7 — *Figure 12: (A) Distribution of detections within the AoI, (B) labels distribution, (C) example of object detections for model 7, labels in yellow and detections in blue.*

7. Conclusion and perspectives¶

This new project was designed to apply the object-detector framework to satellite imagery with the goal to implement it on a global scale. It has met the set objectives and demonstrated the object-detector potential to use satellite images and GTs in new areas, while accessing an ODC architecture. The project demonstrated the ability for the object-detector to quickly detect potential MES or over extensive areas, in satellite images of Switzerland and Brazil, with an automatic detection algorithm based on a deep learning approach.
The different trained model achieved a f1-score ranging from 49% to 67% on the validation dataset. The final detection polygons can in many cases accurately delineate potential MES. Although the performance of the trained models could be judged satisfactory, it should be taken with caution. Many FP detections are present in the different datasets, they are mainly due to confusion of the algorithm between MES and rock outcrops, open water bodies, construction sites, deforested areas, and this confusion can be either reduced or enhanced by false colour images. A manual verification of the relevance of the detection by experts in the field is necessary before using and interpreting the data.
Despite the required manual checks, the provided framework and detection results constitute a valuable contribution that can greatly assist the inventory and the observation of MES evolution worldwide. It can easily provide state-wide detection in a matter of hours or even minutes, with unprecedent temporal resolution which could be a considerable timesaving compared with manual mapping and in situ surveys techniques. The ODC infrastructure could allow a potential live time updating of MES mapping before checking by specialists, even if such a system would have to be adapted to the needs of each area. This method also enables MES detection with a standardised method, independent of the data used by countries/regions. Further model improvements should be considered, such as increasing the metrics by improving GT quality and improving model learning strategy. Operationalisation by training a general model that can be used in different contexts with different images seems complex, in particular because of the great diversity of MES typology across the regions and countries of the world.

Here are some technical improvements which could be considered and explored to extend the framework with satellite images:
- Improvement in the WMS queries to ODCs perfromed by the object-detector could be done so that multi-year analyses can be carried out seamlessly in order to perform feature tracking and really take advantage of the potential of the high temporal resolution of EO data. More on the data provider and server side, on multiple occasions, the requests made by the object detector exceeded the capacity of the servers, which can occasionally result in a reduction in processing speed. Nonetheless, processing in general is to be constrained, especially with regard to the potential inference surface. Consequently, in terms of the surface area of a country, this can pose a problem, as it was not possible to infer on the layer’s full extent in the conditions under which this project was tested.
- Regarding GT quality and datasets, the dataset offered by Maus et al. (2020)¹ could have even greater potential if it was not constrained by the availability of strictly national images with ODCs. In South America, there were a large number of MES labels across the region, so greater availability and accessibility to transboundary images through ODCs would be a major asset and could meet the needs of the study of MES. As seen in the case of the code developed by Earth Genome (Section 2.3.2), this tool could solve this issue but remains limited by the temporal resolution and acquisition footprint bands.
- It is not only a matter of resources availability, since the sensor footprints are not clipped by the data producers. The capacity to produce ARD products is inherent to the server that makes them available, as in the case of the images in the SDC, which extend over neighbouring countries and the trained model 2 could be used to inferred in Italy or France. In the similar manner, it would be worthwhile to cross-reference the models and draw inferences from on Landsat-8 images using a model trained on Sentinel-2, to take advantage of the qualities of each sensor.
- As we have seen, the question of image resolution remains central to the performance of the algorithm, and so, in keeping with the logic of automation, time savings and replicability, products at the crossroads of SWISSIMAGE and Landsat/Sentinel could be the right balance in order achieve these goals. swisstopo is also developing satellite imagery with the NPOC using Sentinel-2 imagery and could be a future asset.
- For EO data, improvements could be made either in pre-processing with contrasts reduction/colour enhancement as well as post-processing filtering, which can be easily implemented in the short term compared to above mentioned prospects. As is the case with the SDC, for instance, a significant amount of higher quality data is stored in the STAC interface, yet this data is not accessible via the WMS endpoint. In particular the use of new false colour composition with the object-detector. This technical solution should not be systematic, as it works well in the Brazilian context and landscape, but in the context of MES in Switzerland there is no certainty that it would work, given the environment surrounding the MES.
- Perhaps Sentinel-2 RGB images would be best, to be able to filter-out FP detections using spectral indexes, for example using NDWI/NDVI to map open water⁵³⁵⁴⁵⁵, and reject objects with a ratio of water to total surface area above a certain threshold. For FP detections of deforested and agricultural areas, as in Brazil models (BDC models 3, 4 and 5), the same principle (ratio: spectral detections/total surface area) can be used but with Normalized Burn ratio and different soil spectral properties, which may lead to FP reduction⁵⁶⁵⁷⁴.
- In future, depending on the needs, the Copernicus DEM (Copernicus DEM - Global and European Digital Elevation Model, 2025) could be used for elevation and slope filtering.
- Finally, the SDC case allows us to think about a possible filter to remove FP detections associated with cloud cover and to apply a cloud mask to the images to remove the clouds before or after inference, since relying on products such as those of the BDC with the best possible quality is not a sustainable solution. The Ukis-csmask code⁵⁸ which is based on a machine learning algorithm, could be added in the object-detector to try to mitigate errors generated by the cloud cover and cloud shadows. This would allow detections to be carried out in all seasons, not just summer, and would also reduce image pre-processing and increase the number of usable images.

Code availability¶

The codes are stored and available on the STDL's GitHub page:

proj-dqry: framework for detecting mineral extraction site. The version used to produce the results is v2.2.0 (except for the post-processing for which script from v2.0.0 were used).
object-detector: object detector framework

Acknowledgements¶

This project was made possible thanks to a tight collaboration between the STDL team and UNIGE. This project has been funded by "Stratégie suisse pour la géoinformation".

Appendix¶

A. WMS URLs¶



Swiss Data Cube:
#location: https://ows.swissdatacube.org/?service=WMS&request=GetMap&version=1.3.0&layers=landsat_ot_c2_l2&styles=simple_rgb&crs=EPSG:3857&bbox=5.82094745,45.69217318,10.58912293,47.81708853&width=256&height=256&format=image/png&time=2020-08-11
#layers: landsat_ot_c2_l2

Brazil Data Cube:
#location: https://data.inpe.br/bdc/geoserver/mosaics/ows?SERVICE=WMS&REQUEST=GetMap&VERSION=1.3.0&LAYERS=mosaic-landsat-brazil-6m&STYLES=raster&CRS=EPSG:3857&TIME=2017-07-01T00:00:00.000Z/2018-01-01T00:00:00.000Z&WIDTH=256&HEIGHT=256&BBOX=-8210729.32210553,-3743686.60247694,-3204262.47299497,585552.62310988&FORMAT=image/png
#layers: mosaic-landsat-brazil-6m
#location: https://data.inpe.br/bdc/geoserver/mosaics/ows?SERVICE=WMS&REQUEST=GetMap&VERSION=1.3.0&LAYERS=mosaic-s2-amazon-3m&STYLES=raster&CRS=EPSG:3857&TIME=2022-06-01T00:00:00.000Z/2022-08-01T00:00:00.000Z&WIDTH=256&HEIGHT=256&BBOX=-8210729.32210553,-3743686.60247694,-3204262.47299497,585552.62310988&FORMAT=image/png
#layers: mosaic-s2-amazon-3m
#location: https://data.inpe.br/bdc/geoserver/mosaics/ows?SERVICE=WMS&REQUEST=GetMap&VERSION=1.3.0&LAYERS=mosaic-landsat-amazon-3m&STYLES=raster&CRS=EPSG:3857&TIME=2016-07-01T00:00:00.000Z&WIDTH=256&HEIGHT=256&BBOX=-8210729.32210553,-3743686.60247694,-3204262.47299497,585552.62310988&FORMAT=image/png
#layers: mosaic-landsat-amazon-3m

Table A1: URL queries for Open Data Cube access.

B. Training curves¶

Figure B1: Training curves obtained for the different models at zoom level 14 with the 3 image datasets: SWISSIMAGE (1), SDC (2), BDC (3, 4, 5, 6, 7). The referring model number is detailed in Table 2. The dotted line indicates the iteration minimizing the loss curve.

C. Metrics¶

Figure C1: Evaluation of the trained models performance obtained at zoom level 14 for 7 different models (Table 5). (Left) Number of TP (blue), FN (red), and FP (green) as a function of detection score threshold for the validation dataset. (Right) Metrics value, precision (blue), recall (red), and f1-score (green) as a function of the detection score threshold for the validation dataset.

References¶

Victor Maus, Stefan Giljum, Jakob Gutschlhofer, Dieison M. Da Silva, Michael Probst, Sidnei L. B. Gass, Sebastian Luckeneder, Mirko Lieber, and Ian McCallum. A global-scale data set of mining areas. Scientific Data, 7(1):289, September 2020. URL: https://www.nature.com/articles/s41597-020-00624-w, doi:10.1038/s41597-020-00624-w. ↩↩↩↩↩↩↩↩↩
Nur Nadiatul Hidayah and Sumaiya Zainal Abidin. The evolution of mineral processing in extraction of rare earth elements using liquid-liquid extraction: A review. Minerals Engineering, 121:146–157, June 2018. URL: https://linkinghub.elsevier.com/retrieve/pii/S0892687518301250, doi:10.1016/j.mineng.2018.03.018. ↩
Vicenç Carabassa, Pau Montero, Marc Crespo, Joan-Cristian Padró, Xavier Pons, Jaume Balagué, Lluís Brotons, and Josep Maria Alcañiz. Unmanned aerial system protocol for quarry restoration and mineral extraction monitoring. Journal of Environmental Management, 270:110717, September 2020. URL: https://linkinghub.elsevier.com/retrieve/pii/S0301479720306496, doi:10.1016/j.jenvman.2020.110717. ↩↩↩↩
Chunsheng Wang, Lili Chang, Lingran Zhao, and Ruiqing Niu. Automatic Identification and Dynamic Monitoring of Open-Pit Mines Based on Improved Mask R-CNN and Transfer Learning. Remote Sensing, 12(21):3474, January 2020. URL: https://www.mdpi.com/2072-4292/12/21/3474, doi:10.3390/rs12213474. ↩↩↩
Agata Fugiel, Dorota Burchart-Korol, Krystyna Czaplicka-Kolarz, and Adam Smoliński. Environmental impact and damage categories caused by air pollution emissions from mining and quarrying sectors of European countries. Journal of Cleaner Production, 143:159–168, February 2017. URL: https://linkinghub.elsevier.com/retrieve/pii/S0959652616322004, doi:10.1016/j.jclepro.2016.12.136. ↩
Darryl Reed. Resource Extraction Industries in Developing Countries. Journal of Business Ethics, 39(3):199–226, September 2002. URL: https://link.springer.com/10.1023/A:1016538006160, doi:10.1023/A:1016538006160. ↩
Valentin Tertius Bickel and Andrea Manconi. Decadal Surface Changes and Displacements in Switzerland. Journal of Geovisualization and Spatial Analysis, 6(2):24, December 2022. URL: https://link.springer.com/10.1007/s41651-022-00119-9, doi:10.1007/s41651-022-00119-9. ↩
Haoteng Zhao, Yong Ma, Fu Chen, Jianbo Liu, Liyuan Jiang, Wutao Yao, and Jin Yang. Monitoring Quarry Area with Landsat Long Time-Series for Socioeconomic Study. Remote Sensing, 10(4):517, April 2018. URL: https://www.mdpi.com/2072-4292/10/4/517, doi:10.3390/rs10040517. ↩↩
Mike Faber and Roland Brown. Changing the rules of the game: Political risk, instability and fairplay in mineral concession contracts. Third World Quarterly, 2(1):100–119, January 1980. URL: http://www.tandfonline.com/doi/full/10.1080/01436598008419480, doi:10.1080/01436598008419480. ↩
Dusan Paredes and Nathaly M. Rivera. Mineral taxes and the local public goods provision in mining communities. Resources Policy, 53:328–339, September 2017. URL: https://linkinghub.elsevier.com/retrieve/pii/S030142071730065X, doi:10.1016/j.resourpol.2017.07.007. ↩
Gavin Hilson and Clive Potter. Why Is Illegal Gold Mining Activity so Ubiquitous in Rural Ghana? African Development Review, 15(2-3):237–270, December 2003. URL: https://onlinelibrary.wiley.com/doi/10.1111/j.1467-8268.2003.00073.x, doi:10.1111/j.1467-8268.2003.00073.x. ↩
T. Darwish, C. Khater, I. Jomaa, R. Stehouwer, A. Shaban, and M. Hamzé. Environmental impact of quarries on natural resources in lebanon. Land Degradation & Development, 22(3):345–358, 2011. URL: https://onlinelibrary.wiley.com/doi/10.1002/ldr.1011, doi:10.1002/ldr.1011. ↩
George P. Petropoulos, Panagiotis Partsinevelos, and Zinovia Mitraka. Change detection of surface mining activity and reclamation based on a machine learning approach of multi-temporal Landsat TM imagery. Geocarto International, 28(4):323–342, July 2013. URL: http://www.tandfonline.com/doi/abs/10.1080/10106049.2012.706648, doi:10.1080/10106049.2012.706648. ↩
A. O. Akanwa, F. I. Okeke, V. C. Nnodu, and E. T. Iortyom. Quarrying and its effect on vegetation cover for a sustainable development using high-resolution satellite image and GIS. Environmental Earth Sciences, 76(14):505, July 2017. URL: http://link.springer.com/10.1007/s12665-017-6844-x, doi:10.1007/s12665-017-6844-x. ↩
R. S. Moeletsi and S. G. Tesfamichael. Assessing land cover changes caused by granite quarrying using remote sensing. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLII-3/W2:119–124, November 2017. URL: https://isprs-archives.copernicus.org/articles/XLII-3-W2/119/2017/, doi:10.5194/isprs-archives-XLII-3-W2-119-2017. ↩
Huriel Reichel and Nils Hamel. Automatic Detection of Quarries and the Lithology below them in Switzerland. 2022. URL: https://tech.stdl.ch/PROJ-DQRY/. ↩↩
Clémence Herny, Shanci Li, Alessandro Cerioni, and Roxane Pott. Automatic detection and observation of mineral extraction sites in Switzerland. January 2024. URL: https://tech.stdl.ch/PROJ-DQRY-TM/. ↩↩↩↩↩↩↩↩↩
Alessandro Cerioni, Clémence Herny, Adrian Meyer, and Gwenaëlle Salamin. Object detector framework. December 2024. URL: https://tech.stdl.ch/TASK-IDET/. ↩
Vignesh Kumar and Kiran Yarrakula. Environmental impact assessment of limestone quarry using multispectral satellite imagery. Earth Science Informatics, 15(3):1905–1923, September 2022. URL: https://link.springer.com/10.1007/s12145-022-00845-0, doi:10.1007/s12145-022-00845-0. ↩
D.V. Beregovoi, J.A. Younes, and M.G. Mustafin. Monitoring of Quarry Slope Deformations with the Use of Satellite Positioning Technology and Unmanned Aerial Vehicles. Procedia Engineering, 189:737–743, 2017. URL: https://linkinghub.elsevier.com/retrieve/pii/S1877705817322415, doi:10.1016/j.proeng.2017.05.116. ↩
Giuseppe Bonifazi, Laura Cutaia, Paolo Massacci, and Ivan Roselli. Monitoring of abandoned quarries by remote sensing and in situ surveying. Ecological Modelling, 170(2-3):213–218, December 2003. URL: https://linkinghub.elsevier.com/retrieve/pii/S030438000300228X, doi:10.1016/S0304-3800(03)00228-X. ↩
Laura Cutaia, P. Massacci, and Ivan Roselli. Analysis of Landsat 5 TM Images for Monitoring the State of Restoration of Abandoned Quarries. International Journal of Surface Mining, Reclamation and Environment, 18(2):122–134, June 2004. URL: http://www.tandfonline.com/doi/abs/10.1080/13895260412331295385, doi:10.1080/13895260412331295385. ↩
Dakota Aaron McCarty, Hyun Woo Kim, and Hye Kyung Lee. Evaluation of Light Gradient Boosted Machine Learning Technique in Large Scale Land Use and Land Cover Classification. Environments, 7(10):84, October 2020. URL: https://www.mdpi.com/2076-3298/7/10/84, doi:10.3390/environments7100084. ↩
Martin Sudmanns, Hannah Augustin, Brian Killough, Gregory Giuliani, Dirk Tiede, Alex Leith, Fang Yuan, and Adam Lewis. Think global, cube local: an Earth Observation Data Cube’s contribution to the Digital Earth vision. Big Earth Data, 7(3):831–859, July 2023. URL: https://www.tandfonline.com/doi/full/10.1080/20964471.2022.2099236, doi:10.1080/20964471.2022.2099236. ↩↩
Efthimios Tambouris, Evangelos Kalampokis, and Konstantinos Tarabanis. Processing Linked Open Data Cubes. In Efthimios Tambouris, Marijn Janssen, Hans Jochen Scholl, Maria A. Wimmer, Konstantinos Tarabanis, Mila Gascó, Bram Klievink, Ida Lindgren, and Peter Parycek, editors, Electronic Government, volume 9248, pages 130–143. Springer International Publishing, Cham, 2015. URL: http://link.springer.com/10.1007/978-3-319-22479-4_10, doi:10.1007/978-3-319-22479-4_10. ↩
Bruno Chatenoux, Jean-Philippe Richard, David Small, Claudia Roeoesli, Vladimir Wingate, Charlotte Poussin, Denisa Rodila, Pascal Peduzzi, Charlotte Steinmeier, Christian Ginzler, Achileas Psomas, Michael E. Schaepman, and Gregory Giuliani. The Swiss data cube, analysis ready data archive using earth observations of Switzerland. Scientific Data, 8(1):295, November 2021. URL: https://www.nature.com/articles/s41597-021-01076-6, doi:10.1038/s41597-021-01076-6. ↩↩↩
Karine R. Ferreira, Gilberto R. Queiroz, Lubia Vinhas, Rennan F. B. Marujo, Rolf E. O. Simoes, Michelle C. A. Picoli, Gilberto Camara, Ricardo Cartaxo, Vitor C. F. Gomes, Lorena A. Santos, Alber H. Sanchez, Jeferson S. Arcanjo, José Guilherme Fronza, Carlos Alberto Noronha, Raphael W. Costa, Matheus C. Zaglia, Fabiana Zioti, Thales S. Korting, Anderson R. Soares, Michel E. D. Chaves, and Leila M. G. Fonseca. Earth Observation Data Cubes for Brazil: Requirements, Methodology and Products. Remote Sensing, 12(24):4033, December 2020. URL: https://www.mdpi.com/2072-4292/12/24/4033, doi:10.3390/rs12244033. ↩↩
Trevor Dhu, Gregory Giuliani, Jimena Juárez, Argyro Kavvada, Brian Killough, Paloma Merodio, Stuart Minchin, and Steven Ramage. National Open Data Cubes and Their Contribution to Country-Level Development Policies and Practices. Data, 4(4):144, November 2019. URL: https://www.mdpi.com/2306-5729/4/4/144, doi:10.3390/data4040144. ↩↩
M. C. A. Picoli, R. Simoes, M. Chaves, L. A. Santos, A. Sanchez, A. Soares, I. D. Sanches, K. R. Ferreira, and G. R. Queiroz. CBERS Data Cube: a powerful technology for mapping and monitoring brazilian biomes. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, V-3-2020:533–539, August 2020. URL: https://isprs-annals.copernicus.org/articles/V-3-2020/533/2020/, doi:10.5194/isprs-annals-V-3-2020-533-2020. ↩↩
Emanuele Mandanici and Gabriele Bitelli. Preliminary Comparison of Sentinel-2 and Landsat 8 Imagery for a Combined Use. Remote Sensing, 8(12):1014, December 2016. URL: https://www.mdpi.com/2072-4292/8/12/1014, doi:10.3390/rs8121014. ↩↩
Michel E. D. Chaves, Michelle C. A. Picoli, and Ieda D. Sanches. Recent Applications of Landsat 8/OLI and Sentinel-2/MSI for Land Use and Land Cover Mapping: A Systematic Review. Remote Sensing, 12(18):3062, September 2020. URL: https://www.mdpi.com/2072-4292/12/18/3062, doi:10.3390/rs12183062. ↩↩↩
Gerald Forkuor, Kangbeni Dimobe, Idriss Serme, and Jerome Ebagnerin Tondoh. Landsat-8 vs. Sentinel-2: examining the added value of sentinel-2’s red-edge bands to land-use and land-cover mapping in Burkina Faso. GIScience & Remote Sensing, 55(3):331–354, May 2018. URL: https://www.tandfonline.com/doi/full/10.1080/15481603.2017.1370169, doi:10.1080/15481603.2017.1370169. ↩
Andrea Lessio, Vanina Fissore, and Enrico Borgogno-Mondino. Preliminary Tests and Results Concerning Integration of Sentinel-2 and Landsat-8 OLI for Crop Monitoring. Journal of Imaging, 3(4):49, November 2017. URL: https://www.mdpi.com/2313-433X/3/4/49, doi:10.3390/jimaging3040049. ↩
Raziye Hale Topaloğlu, Elif Sertel, and Nebiye Musaoğlu. Assessment of classification accuracies of Sentinel-2 and Landsat-8 data for land cover/use mapping. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLI-B8:1055–1059, June 2016. URL: https://isprs-archives.copernicus.org/articles/XLI-B8/1055/2016/, doi:10.5194/isprs-archives-XLI-B8-1055-2016. ↩
Gregory Giuliani, Bruno Chatenoux, Andrea De Bono, Denisa Rodila, Jean-Philippe Richard, Karin Allenbach, Hy Dao, and Pascal Peduzzi. Building an Earth Observations Data Cube: lessons learned from the Swiss Data Cube (SDC) on generating Analysis Ready Data (ARD). Big Earth Data, 1(1-2):100–117, December 2017. URL: https://www.tandfonline.com/doi/full/10.1080/20964471.2017.1398903, doi:10.1080/20964471.2017.1398903. ↩↩
Gregory Giuliani, Bruno Chatenoux, Antonio Benvenuti, Pierre Lacroix, Mattia Santoro, and Paolo Mazzetti. Monitoring land degradation at national level using satellite Earth Observation time-series data to support SDG15 – exploring the potential of data cube. Big Earth Data, 4(1):3–22, January 2020. URL: https://www.tandfonline.com/doi/full/10.1080/20964471.2020.1711633, doi:10.1080/20964471.2020.1711633. ↩
Charlotte Poussin, Pablo Timoner, Bruno Chatenoux, Gregory Giuliani, and Pascal Peduzzi. Improved Landsat-based snow cover mapping accuracy using a spatiotemporal NDSI and generalized linear mixed model. Science of Remote Sensing, 7:100078, June 2023. URL: https://linkinghub.elsevier.com/retrieve/pii/S2666017223000032, doi:10.1016/j.srs.2023.100078. ↩
Gregory Giuliani, Gilberto Camara, Brian Killough, and Stuart Minchin. Earth Observation Open Science: Enhancing Reproducible Science Using Data Cubes. Data, 4(4):147, November 2019. URL: https://www.mdpi.com/2306-5729/4/4/147, doi:10.3390/data4040147. ↩
K. R. Ferreira, G. R. Queiroz, R. F. B. Marujo, and R. W. Costa. Building Earth observation data cubes on AWS. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLIII-B3-2022:597–602, May 2022. URL: https://isprs-archives.copernicus.org/articles/XLIII-B3-2022/597/2022/, doi:10.5194/isprs-archives-XLIII-B3-2022-597-2022. ↩
Michel E. D. Chaves, Anderson R. Soares, Ieda D. Sanches, and José G. Fronza. CBERS data cubes for land use and land cover mapping in the Brazilian Cerrado agricultural belt. International Journal of Remote Sensing, 42(21):8398–8432, November 2021. URL: https://www.tandfonline.com/doi/full/10.1080/01431161.2021.1978584, doi:10.1080/01431161.2021.1978584. ↩
Vitor C. F. Gomes, Gilberto R. Queiroz, Karine R. Ferreira, Edzer Pebesma, and Claudio C. F. Barbosa. Brazil Data Cube Workflow Engine: a tool for big Earth observation data processing. International Journal of Digital Earth, 17(1):2313099, December 2024. URL: https://www.tandfonline.com/doi/full/10.1080/17538947.2024.2313099, doi:10.1080/17538947.2024.2313099. ↩
Gregory Giuliani, Bruno Chatenoux, Erica Honeck, and Jean-Philippe Richard. Towards Sentinel-2 Analysis Ready Data: a Swiss Data Cube Perspective. In IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium, 8659–8662. Valencia, July 2018. IEEE. URL: https://ieeexplore.ieee.org/document/8517954/, doi:10.1109/IGARSS.2018.8517954. ↩
K. R. Ferreira, G. R. Queiroz, G. Camara, R. C. M. Souza, L. Vinhas, R. F. B. Marujo, R. E. O. Simoes, C. A. F. Noronha, R. W. Costa, J. S. Arcanjo, V. C. F. Gomes, and M. C. Zaglia. Using Remote Sensing Images and Cloud Services on Aws to Improve Land Use and Cover Monitoring. In 2020 IEEE Latin American GRSS & ISPRS Remote Sensing Conference (LAGIRS), 558–562. Santiago, Chile, March 2020. IEEE. URL: https://ieeexplore.ieee.org/document/9165649/, doi:10.1109/LAGIRS48042.2020.9165649. ↩
E Iman. Remote Sensing and GIS Module: Colour Composite Images and Visual Image Interpretation. University Grand Commission (UGC), MHRD, Govt of India, 2019. ↩
Pasquale Imperatore, Ramin Azar, Fabiana Calo, Daniela Stroppiana, Pietro Alessandro Brivio, Riccardo Lanari, and Antonio Pepe. Effect of the Vegetation Fire on Backscattering: An Investigation Based on Sentinel-1 Observations. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 10(10):4478–4492, October 2017. URL: https://ieeexplore.ieee.org/document/7972961/, doi:10.1109/JSTARS.2017.2717039. ↩
Peng Li, Wenyu Li, Dong Shi, and Arun Jyoti Nath. Normalized Difference Red-NIR-SWIR: A new Sentinel-2 three-band spectral index for mapping freshly-opened swiddens in the tropics. Ecological Informatics, 82:102775, September 2024. URL: https://linkinghub.elsevier.com/retrieve/pii/S1574954124003170, doi:10.1016/j.ecoinf.2024.102775. ↩
Ian Olthof and Robert H. Fraser. Mapping surface water dynamics (1985–2021) in the Hudson Bay Lowlands, Canada using sub-pixel Landsat analysis. Remote Sensing of Environment, 300:113895, January 2024. URL: https://linkinghub.elsevier.com/retrieve/pii/S0034425723004467, doi:10.1016/j.rse.2023.113895. ↩
Yuxin Wu, Alexander Kirillov, Francisco Massa, Wan-Yen Lo, and Ross Girshick. Detectron2. 2019. URL: https://github.com/facebookresearch/detectron2. ↩
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. Mask R-CNN. January 2018. arXiv:1703.06870 [cs]. URL: http://arxiv.org/abs/1703.06870, doi:10.48550/arXiv.1703.06870. ↩
Samuel L. Smith, Pieter-Jan Kindermans, Chris Ying, and Quoc V. Le. Don't Decay the Learning Rate, Increase the Batch Size. arXiv e-prints, pages arXiv:1711.00489, November 2017. URL: https://ui.adsabs.harvard.edu/abs/2017arXiv171100489S/abstract, doi:10.48550/arXiv.1711.00489. ↩
Leslie N. Smith and Nicholay Topin. Super-Convergence: Very Fast Training of Neural Networks Using Large Learning Rates. 2017. URL: https://arxiv.org/abs/1708.07120, doi:10.48550/ARXIV.1708.07120. ↩
Leslie N. Smith. A disciplined approach to neural network hyper-parameters: Part 1 – learning rate, batch size, momentum, and weight decay. arXiv e-prints, pages arXiv:1803.09820, March 2018. URL: https://ui.adsabs.harvard.edu/abs/2018arXiv180309820S/abstract, doi:10.48550/arXiv.1803.09820. ↩
Bo-cai Gao. NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space. Remote Sensing of Environment, 58(3):257–266, December 1996. URL: https://linkinghub.elsevier.com/retrieve/pii/S0034425796000673, doi:10.1016/S0034-4257(96)00067-3. ↩
Akhona Madasa, Israel R. Orimoloye, and Olusola O. Ololade. Application of geospatial indices for mapping land cover/use change detection in a mining area. Journal of African Earth Sciences, 175:104108, March 2021. URL: https://linkinghub.elsevier.com/retrieve/pii/S1464343X21000091, doi:10.1016/j.jafrearsci.2021.104108. ↩
S. K. McFeeters. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. International Journal of Remote Sensing, 17(7):1425–1432, May 1996. URL: https://www.tandfonline.com/doi/full/10.1080/01431169608948714, doi:10.1080/01431169608948714. ↩
Fabio Castaldi, Sabine Chabrillat, Axel Don, and Bas Van Wesemael. Soil Organic Carbon Mapping Using LUCAS Topsoil Database and Sentinel-2 Data: An Approach to Reduce Soil Moisture and Crop Residue Effects. Remote Sensing, 11(18):2121, September 2019. URL: https://www.mdpi.com/2072-4292/11/18/2121, doi:10.3390/rs11182121. ↩
Klara Dvorakova, Pu Shi, Quentin Limbourg, and Bas Van Wesemael. Soil Organic Carbon Mapping from Remote Sensing: The Effect of Crop Residues. Remote Sensing, 12(12):1913, June 2020. URL: https://www.mdpi.com/2072-4292/12/12/1913, doi:10.3390/rs12121913. ↩
Marc Wieland, Yu Li, and Sandro Martinis. Multi-sensor cloud and cloud shadow segmentation with a convolutional neural network. Remote Sensing of Environment, 230:111203, September 2019. URL: https://linkinghub.elsevier.com/retrieve/pii/S0034425719302159, doi:10.1016/j.rse.2019.05.022. ↩

Model	Precision	Recall	F1
1	61%	70%	65%
2	62%	40%	49%
3	48%	42%	45%
4	64%	48%	55%
5	64%	50%	56%
6	73%	55%	63%
7	72%	61%	67%

Model	Precision	Recall	F1
1	61%	70%	65%
2	62%	40%	49%
3	48%	42%	45%
4	64%	48%	55%
5	64%	50%	56%
6	73%	55%	63%
7	72%	61%	67%

Model	Precision	Recall	F1
1	61%	70%	65%
2	62%	40%	49%
3	48%	42%	45%
4	64%	48%	55%
5	64%	50%	56%
6	73%	55%	63%
7	72%	61%	67%