Skip to content

Classification of road surfaces

Gwenaëlle Salamin (swisstopo), Clémence Herny (Exolabs), Roxane Pott (swisstopo), Alessandro Cerioni (État de Genève)

Proposed by the Federal Office of Topography swisstopo - PROJ-ROADSURF
August 2022 to March 2023 - Published on August 28, 2023
All scripts are available on GitHub: https://github.com/swiss-territorial-data-lab/proj-roadsurf

Abstract: The Swiss road network extends over 83’274 km. Information about the type of road surface is useful not only for the Swiss Federal Roads Office and engineering companies, but also for cyclists and hikers. Currently, the data creation and update is entirely done manually at the Swiss Federal Office of Topography. This is a time-consuming and methodical task, potentially suitable to automation by data science methods. The goal of this project is classifying Swiss roads according to their surface type, natural or artificial.
We first searched for statistical differences between these two classes, in order to then perform supervised classification based on machine-learning methods. As we could not find any discriminant feature, we used deep learning methods.
In terms of balanced F1 score, we obtained a global score of 0.74 over the training, validation and test area, 0.56 over the inference-only area.

1. Introduction

The Swiss road network extends over 83'274 km 1. Not only cyclists and hikers can be interested in knowing whether a given road section is covered by a natural or an artificial surface, but also the Swiss Federal Roads Office, which is in charge of road maintenance, and engineering companies. This information is found within the swissTLM3D 2 dataset, the large-scale topographic model of Switzerland produced by the Federal Office of Topography (swisstopo). Keeping the swissTLM3D dataset up to date is a time-consuming work that has to be done methodically. Operators draw and georeference new elements and fill in their attributes based on the visual interpretation of stereoscopic aerial images. The update of existing elements also follows this manual approach. Data science can help by autmatizing this time-consuming and systematic tasks.

So far, the majority of data science studies on the identification of the road surface type, in particular those based on artificial intelligence, have been conducted in the context of improving the driving and security of autonomous vehicles 3456. These works rely on images shot by cameras mounted at the front of the moving vehicle itself. To our knowledge, only one study, carried out by Mansourmoghaddam et al. (2022) 7, proposed a method based on object-based classification from aerial imagery, which could successfully tell artificial roads from natural ones. Another possible approach is to use spectral indices, as done by Zhao & Zhu (2022) 8 working on the distinction between artificial surfaces and bare land. However, their method is not specifically designed for road surfaces.

The goal of this project was to determine whether the road cover is artificial or natural with the development of data science tools. For this first test, only the roads of the class "3m Strasse" are considered.

Simplified workflow
Figure 1: Overview of the workflow for this project.

As the location of roads was known, we faced a problem of supervised classification. Two approaches were tested to address it: machine learning (ML) and deep learning (DL). Both approaches used the same input data, aerial images and vector road location.

2. Data

As input data, this project used two datasets produced by the Federal Office of Topography: swissTLM3D and SWISSIMAGE RS. We worked with data for the year 2018, for which the images and ground truth, i.e. the manually vectorized and classified roads, are available for the area of interest (AOI). Coordinates are expressed in the EPSG:2056 reference system.

2.1. Area of interest

Area of interest
Figure 2: Delimitation of the area of interest with the tile numbers of the 1:25'000 Swiss national map.

The area of interest (AOI) defined for this study was represented by the tiles 1168, 1188, 1208 and 1228 of the Swiss national map at a scale of 1:25'000. This zone covers an area of 840 km2 and was chosen because of its representativeness of the Swiss territory.

2.2. swissTLM3D

The swissTLM3D 2 dataset is a large-scale topographic model of Switzerland. It contains geospatial data necessary o the national map, such as roads, buildings and land cover. Periodical updates rely on the manual work of specialized operators. They interpret stereoscopic images and fill in attributes with the help of some additional information, like cadastral surveys and terrestrial images. The specification of aerial imagery is similar to the SWISSIMAGE RS product.
The road layer contains lines with the identifier, the structure (none, bridge, tunnel, etc.), the object type (highways, 8m roads, 1 m paths, etc.) and the surface type as attributes. The two possible classes of the surface type are defined in the metadata: artificial (German: Hart) and natural (Natur). The artificial class contains surfaces of hard artificial materials like asphalt, concrete or slabs. The natural class contains roads with a surface of natural materials like gravel or dirt, and untreated surfaces.

In this project, it was decided to test the classification for the type "3m Strasse" (3 m roads). This class encompasses roads that are between 2.81 m and 4.20 m wide. Within this subset, 6486 roads have an artificial surface and 289 a natural one. The dataset is heavily unbalanced toward the artificial roads.

In addition, the swissTLM3D dataset was used to identify the forests. Indeed, they prevent us from observing roads from aerial images; hence those roads cannot be used in our study. As no layer in the swissTLM3D is specifically devoted to forested areas, they were deduced from the land cover classes. A filter was applied to only keep forests ("Wald") and open forests ("Wald offen").

Over the AOI, all the roads in quarries have a natural surface. We used our own layer from the project on the detection of mineral extraction sites to know their location. However, it is possible to use the information on the area of use from the swissTLM3D dataset which has a class on gravel quarries and one on stone quarries.

2.3. SWISSIMAGE RS

The product SWISSIMAGE RS 9 contains aerial images of Switzerland composed by four bands: near-infrared (NIR), red (R), green (G) and blue (B). The ground resolution equals 0.10 m over the area of interest, except in some high altitude regions or regions with complex topography, where a resolution of 0.25 m is deemed sufficient. The standard deviation is +/- 0.15 m (1 sigma) for a ground resolution of 0.10 m and +/- 0.25 m (1 sigma) for a ground resolution of 0.25 m, +/- 3-5 m (1 sigma). The dataset is composed of a collection of 16-bit encoded GeoTIFF orthorectified images. The overlap between images varies, but stays always present.

3. Preprocessing

Both the swissTLM3D and SWISSIMAGE RS dataset were processed to be suitable for the algorithms we wanted to develop. This was achieved with two procedures: the generation of the road domain and the creation of a raster mosaic.

3.1. Generation of the Road Domain

The swissTLM3D contains a vector layer representing every road section as a 3D line with some attached attributes. As a first test, the beneficiaries requested us to perform the analysis only on roads of the type "3m Strasse", i.e the roads wider than 2.81 m and thinner than 4.20 m. The engineered structures were excluded based on the attribute "KUNSTBAUTE". Only bridges and road sections without structures were kept.
Data preparation differs slightly between the two performed analyses, machine and deep learning. Results for both approaches are shown here below.

Road labels for the machine learning.
Figure 3: Resulting labels (left) from the initial TLM lines (right) in the case of the machine learning.
Road labels for the deep learning.
Figure 4: Resulting labels (left) from the initial TLM lines (right) in the case of deep learning.

For the machine learning analysis, only the 3m roads were kept (figure 3). For the deep learning analysis, we judged safer to keep all the visible roads (figure 4). Therefore, the neighboring roads were also considered. We made the hypothesis that we would obtain better results by training the model on all the visible roads, rather than on the 3m ones only. Still, the focus on "3m Strasse" class was enforced through the selection of raster tiles: only those tiles containing the specific class were used as input data.
Road geometries, originally linear, were transformed into polygons by adding a buffer with a flat cap style. This procedure generated unwanted overlapping areas in the neighborhood of the intersection points between contiguous road sections. Such artifacts were handled differently depending on the road types:

  • in the case of roads with different widths, the zone of the overlap inherited the class of the widest road section among the intersecting ones.
  • in the case of roads with the same width,
    • in the case of the machine learning analysis, overlapping parts were discarded;
    • in the case of the deep learning analysis, it was supposed that leaving the overlapping parts would produce more understandable labels for the algorithm than suppressing the overlap.

Once that the polygons were generated, sections hidden by a forest canopy were excluded. A buffer of 2 m was also added around forests as the canopy was often seen to be extending beyond the forest delimitation as recorded in the swissTLM3D dataset.

We considered adding some information about the altitude of the length of the roads to the labels. Natural and artificial roads share pretty much the same distribution in terms of altitude. For the length, the longest roads all had an artificial surface. However, the experts could not tell us if it was the case for all Switzerland or a coincidence on our AOI.
For the deep learning analysis, we tried to improve the overlap between labels and images by taking cadastral data into account. A larger buffer was used on the lines for the TLM. Then, only the parts of the buffer intersecting the road surfaces from cadastral surveying were kept. As described in the deep learning analysis section, we tested the labels straight out of the TLM and the ones augmented by the cadastral surveying. We also tried to merge the labels by width type or by surface type.

After the pre-processing step described here above,

  • for the machine learning analysis, 2'781 road sections were obtained, including 110 natural ones;
  • for the deep learning analysis, 12'208 road sections were obtained, including 3078 natural ones.

Let us remind that there were many more roads labeled in the second case as we considered all the visible roads. Especially for natural roads, the vast majority did not belong to the class of interest, but rather to the "1m Weg" and "2m Weg" classes.

3.2. Raster Mosaic Generation

As said in the description of SWISSIMAGE RS, a large overlap between images is present in the dataset. To remove this overlap, a mosaic was created. Instead of merging all the images into one, we decided to set up a XYZ raster tile service, allowing us to work at different resolutions.
The first step consists in reprojecting images in the EPSG:3857 projection, compliant with standard tile map services. Then, to save memory and disk space, images were converted from 16 to 8 bits. Besides, normalization was performed to optimize the usage of the available dynamic range. Finally, images were exported to the Cloud-Optimized GeoTIFF (COG) format. COG files can then be loaded by the TiTiler application, an Open Source dynamic tile server application 10. The MosaicJSON specification was used to store image metadata 11. Zoom levels were bound between 17 and 20, corresponding to resolutions between 1.20 m and 0.15 m.

4. Machine Learning Analysis

4.1. Methodology

Before delving into machine learning, we performed some exploratory data analysis, aiming at checking whether already existing features were discriminant enough to tell natural roads from artificial ones. Additional predictive features were also generated, based on

  • ratios between pixel values in different bands (G/R, B/R, NIR/R, G/B, G/NIR, B/NIR);
  • one spectral index;
  • per-band statistics (mean, median, minimum, maximum, standard deviation).

The machine learning analysis was performed only on the two middle tiles of the AOI.

The most promising spectral index we found in the literature is the Artificial Surface Index (ASI) defined by Zhao & Zhu (2022) 12. Unfortunately, the computation of the ASI requires the shortwave infrared (SWIR) band which is not available in the SWISSIMAGE RS data. The SWIR band can be available in satellite imagery (e.g.: Landsat 8, Sentinel 2), yet spatial resolution (20-30 m/px) is not enough for the problem at hand.

Instead, the VgNIR-BI index 13 could be computed in our case, since it combines the green and NIR bands:

\[\begin{align} \ \mbox{VgNIR-BI} = {\rho_{green} - \rho_{NIR} \over \rho_{green} + \rho_{NIR}} \ \end{align}\]

where ρ stands for the atmospherically corrected surface reflectance values of the band. In our case, no atmospheric correction was applied, because we dealt with aerial imagery instead of satellite imagery.

Boxplots were generated to visualize the distribution of the aforementioned predictive features. Principal component analysis (PCA) were performed, too. The group of values passed to the PCA were the following: - pixel values: Each pixel displays 11 attributes corresponding to (1) its values on the different bands (R, G, B, NIR), (2) the ratio between bands (G/R, B/R, NIR/R, G/B, G/NIR, B/NIR), and (3) the VgNIR-BI spectral index 13. - summary statistics: Each road has 5 attributes for each band: the mean, the median, the minimum (min), the maximum (max), and the standard deviation (std).

Let us note that:

  1. these two analyses were carried out on a curated selection of input data, which was ought to maximize the separation between natural and artificial roads. The road sections were filtered by pixel number and margin of error on their mean to avoid small or crowded road sections.
    Were the results promising, the analysis would have been carried out on the entire dataset.
  2. We confirmed that results were not biased by performing each analysis on the entire dataset and a balanced subset.
  3. We also confirmed that results were stable with respect to the chosen zoom level.

4.2. Results

In order not to make the presentation too cumbersome, here we only show results produced at zoom level 18, on the entire dataset, and considering road sections corresponding to the following criteria:

  1. covering at least 10 pixels;
  2. enjoying a margin of error (MOE) on the mean below 12.5, computed with a coefficient level of 0.95.
Boxplots of the distribution of pixel values on each band for each type of surface
Figure 5: Boxplots of the statistical distribution of pixel values on each band for each surface type. Each graph represents a band and each boxplot a surface type.

We can see on the figure 5 that both the median and the upper quartile are systematically higher for natural than for artificial roads across all the bands, meaning the natural roads have brighter parts. Unfortunately, we have that pixel value statistics do not allow a sharp distinction between the two classes, as the lower quartile are very close.

Boxplots of the ratios and index for each type of surface
Figure 6: Boxplots of the pixel distribution on the VgNIR-BI index and the ratios between bands. Each graph represents a ratio or the index and each boxplot a surface type.
Boxplots of the ratios and index for each type of surface
Figure 6bis: Boxplots of the pixel distribution on the ratios between bands. Each graph represents a ratio and each boxplot a surface type.

The ratios between bands and the VgNIR-BI present similar values for the artificial and natural roads, allowing no distinction between the classes.

Boxplots of road statistics on the blue band
Figure 7: Boxplots of the distribution for the road summary statistics on the blue band. Each graph represents a statistic and each boxplot a type of road surface.

Boxplots produced with the summary statistics computed per band and per road section lead to similar conclusions. Natural roads tend to be lighter than artificial ones. However, the difference is not strong enough to affect the lower quartiles and allow a sharp distinction between classes.

PCA of the pixels based on their value on each band
Figure 8: PCA of the pixels based on their value on each band.
PCA of the pixels based on their value on each band
Figure 9: PCA of the roads based on their statistics on the blue band.

The figures 8 and 9 present respectively the results of the PCA on the pixel values and on the statistics over road sections. Once more, we have to acknowledge that, unfortunately, artificial and natural roads cannot be separated.

4.3. Discussion

Although boxplots reveal that some natural roads can be brighter than artificial roads, statistical indicators overlap in such a way that no sharp distinction between the two classes can be drawn. The PCA confirms such an unfortunate finding.

Those results are not surprising. As a matter of fact, natural roads which are found in the "3m Strasse" type are mainly made by gravel or similar materials which, color-wise, make them very similar to artificial roads.

5. Deep Learning Analysis

5.1. Methodology

To perform the detection and classification of roads, the object detector (OD) framework developed by the STDL 14 was used. It is described in details in the dedicated page.

The two central parts of the AOI constitute the training zone, i.e. the zone for the training, validation and test datasets. The two exterior parts constitute the inference-only zone, i.e. for the "other" dataset, to test the trained model on an entirely new zone.

To assess the predictions, a script was written, final_metrics.py instead of using the one directly from the STDL's OD. We decided to take advantage that: 1. Predictions are not exclusive between classes. Every road section was detected several times with predictions of different class overlapping. 2. The delimitation of the roads are already known.

Therefore, rather than choosing one correct prediction, we aggregated the predictions in a natural and an artificial index over each label. Those indices were defined as follows:

\[\begin{align} \ \mbox{index}_{class} = \frac{\sum_{i=1}^{n} (A_{\%,i} \cdot \mbox{score}_{class,i})}{\sum_{i=1}^{n} A_{\%,i}} \ \end{align}\]

where n is the number of predictions belonging to the class, \(A_{\%, i}\) is the percentage of overlapping area between the label and the prediction, \(\mbox{score}_{class,i}\) is its confidence score.

\[\begin{align} \ \text{final class} = \begin{cases} \mbox{artificial} \quad \mbox{ if } \quad \mbox{index}_{artificial} \gt \mbox{index}_{natural}\\ \mbox{natural} \quad \mbox{ if } \quad \mbox{index}_{artificial} \lt \mbox{index}_{natural} \\ \mbox{undetected} \quad \text{ if } \quad \mbox{index}_{artificial} = 0 \; \text{ and } \; \mbox{index}_{natural} = 0 \\ \mbox{undetermined} \quad \text{ if } \quad \mbox{index}_{artificial} = \mbox{index}_{natural} \; \text{ and }\; \mbox{index}_{artificial} \neq 0\\ \end{cases} \ \end{align}\]

The largest index indicates the right class as better predictions are supporting it.
Once every road has an attributed class, the result was evaluated in terms of recall, precision and balanced F1 score.

\[\begin{align} \ P_{class} = \frac{TP_{class}}{TP_{class}+FP_{class}} \text{ and } P = \frac{P_{natural} + P_{artificial}}{2} \ \end{align}\]
\[\begin{align} \ R_{class} = \frac{TP_{class}}{TP_{class}+FN_{class}} \text{ and } R = \frac{R_{natural} + R_{artificial}}{2} \ \end{align}\]
\[\begin{align} \ F1\text{ }score = \frac{2PR}{P + R} \ \end{align}\]

where

  • \(FN_{class}\) is the number of roads that are not detected or wrongly assigned to the other class,
  • \(FP_{class}\) is the number of roads from the other class wrongly attributed to the considered class,
  • \(P_{class}\) is the precision, i.e. the percentage of correct roads in the detected class,
  • \(R_{class}\) is the recall, i.e. the percentage of correctly found and classified roads from a class,
  • \(TP_{class}\) is the number of roads correctly found in the class.

The predictions are not necessarily all taken into account. They are filtered based on their confidence score. Thresholds were tested over the balanced F1 score of the validation dataset.

The current dataset exhibits a very strong class imbalance. Therefore, we decided to use balanced metrics, giving the same weight to both classes. The balanced F1 score was chosen as the determining criterion between the different tested models.
As it gives equal weight to both classes, the quality of the classification for the natural road was well taken into consideration. However, we have to keep in mind that we gave great importance to this class compared to its number of individuals.

A great risk exists that the model would be biased toward artificial roads, because of the imbalance between classes. Therefore, we decided on a baseline model (BLM) where all the roads in the training zone are classified as artificial. Its metrics are the following:

Artificial Natural Global
Precision 0.97 0 0.49
Recall 1 0 0.5
F1 score 0.98 0 0.49

Table 1: Metrics for the BLM with all the roads classified as artificial

The trained models should improve the global F1 score of 0.49 to be considered as an improvement.

Finally, we wanted to know if the artificial and natural index could constitute a confidence score for their respective classes. The reliability diagram has been plotted to visualize the accuracy of the classification at different levels of those indices.

Representation of the different tests carried out
Figure 10: Listing of the various tests carried out.

To achieve the best possible results, several input parameters and files for the model training were tested. 1. We tried to improve the quality of the labels by integrating data from cadastral surveying and by merging the roads based on their cover, on their type, or not at all. 2. We trained the model with different zoom level images, from 17 to 20. 3. The influence of different band combinations on the model performance was investigated: true colors (RGB) and false colors (NirRG).

For each test, the best configuration was chosen based on the global balanced F1 score. This method supposes that the best choice for one parameter did not depend on the others.

5.2. Results

When testing different procedures to create the labels, using only the TLM and excluding the data from the cadastral survey gave the best metrics. Besides, cutting the label corresponding to the road sections and not merging them by road type or surface gave better metrics.
Increasing the zoom level improved the balanced F1 score.
Using the bands RGB and RG with NIR gave very similar results and an equal F1 score.
Therefore, the best model is based on labels deduced from the TLM and using the RGB bands at a zoom level 20.

Artificial Natural Global
Precision 0.99 0.74 0.87
Recall 0.97 0.74 0.86
F1 score (1) 0.98 0.74 0.86
F1 score for the BLM (2) 0.98 0 0.49
Improvement: (1)-(2) 0 0.74 0.32

Table 2: metrics for the best model over the training, test and validation area.

The F1 score for the natural roads and the global one outperformed the BLM. The per-class F1 scores has been judged as satisfying by the beneficiaries.

Artificial Natural Global
Precision 0.98 0.22 0.60
Recall 0.95 0.26 0.61
F1 score (1) 0.96 0.24 0.60
F1 score for the BLM (2) 0.98 0 0.49
Improvement: (1)-(2) -0.02 0.24 0.11

Table 3: metrics for the best model over the inference-only area.

Those metrics are worse than the ones obtained over the training area. The global F1 score is still higher than for the BLM. However, the natural F1 score is not high enough.

Absolute and relative repartition of the roads in the inference-only zone
Figure 11: Absolute and relative repartition of the roads in the inference-only zone.

93.2% of the roads are correctly classified, 4.2% are in the wrong class and 2.6% are undetected or undetermined. Nearly half of the natural roads are either undetected or in the wrong class, but as they represent a tiny proportion of the dataset, they impact little the accuracy.
In the training zone, only 2% of the roads are in the wrong class and 1.7% are undetected or undetermined.

Reliability curve for the training zone and the inference-only zone.
Figure 12: Reliability curves for the training and the inference-only zone.

The artificial index can be used as confidence score for the artificial roads. The natural index can be used as confidence score for the natural ones. Indeed, the accuracy of the results for each class increases with their value.

5.3. Discussion

The F1 score obtained is 0.86 over the area to train and validate the model and 0.60 over the rest of the AOI. The difference is essentially due to the decrease in the F1 score of the natural roads, passing from 0.74 to 0.24.
The first intuition is that we were facing a case of overfitting. However, the validation loss was controlled in order to stop the training on time and avoid this problem.
Another possibility would be that the two zones differ significantly and that a model trained on one cannot apply on the other. Hence, we also split the tiles randomly between the training and the inference-only zone. The gap between the balanced F1 score of the training and inference-only zone passed from 0.25 to 0.19 with the same hyper-parameters.

The high recall for artificial roads indicates that the model properly detects them. However, once the artificial recall is high, the high artificial precision is in this case necessarily due. As the roads have a known location, the false positives not due to a class confusion are eliminated from our assessment. Then, only the roads classified in the wrong class can affect precision. As there are not a lot of natural roads, even if they were all wrongly classified as artificial like in the BLM, the precision would still remain well at 0.97. In the current case, the precision of the trained model is 0.01 higher than the one of the BLM.
The drop in the natural F1 score is due to all the roads predicted in the wrong class. As they are only a few natural roads, errors of the model affect them more heavily. The part of the misclassified road increased by 44% between the training and the inference-only zone. Meanwhile, the part of undetermined roads only increased by 1%

The F1 score could maybe be further improved by focusing more strictly on the 3m roads. Indeed, we considered it would be safer to teach the algorithm to differentiate only between surfaces and not between road types, which are defined by width. Therefore, the tiles were selected because they intersected 3m roads, but then all the roads on the tiles were transformed into labels. Because of the rarity of 3m natural roads, most of the natural roads seen by the algorithm are 2m roads and those often have a surface with grass, where the 3m natural roads have a surface made only of gravel or dirt. Over the training zone, 110 natural roads are 3m ones and 1183 ones are 2 m and 1 m paths. Maybe, labelling only the 3m roads would give better results than labelling all the visible roads.
We did not tune the hyperparameter used by the deep learning model once we found a satisfying enough combination. In addition, as the algorithm is based on detectron2, not everything can easily be tuned. Using an entirely new framework and tuning the loss weights would allow better handling the class imbalance. A new framework could also allow integrating an attention mask and take advantage of the known road location like recommended by Epel (2018)15. Using a new framework could also allow to use images with 4 bands and integrating the NIR. However, we decided here to first try the tools we already had in our team.

We can say that there is a bias in the model encouraging it to predict artificial roads. However, it is still better than the BLM. Therefore, this model is adapted for its purpose.

5.3.1. Elements specific to the application on the SwissTLM3D product

All these findings seem negative, which is why it is appropriate to recall the significant imbalance between the classes. If we look at the percentages, 93.2% of the dataset is correctly classified over the inference-only zone. This could represent a significant gain of time compared to an operator who would do the classification manually. Indeed, once the model trained, the procedure documented here only needs 20 minutes to classify the roads of the AOI. Besides, the artificial and natural indices allow us to find most of the misclassified roads and limit the time needed for a visual verification.
In addition, the information of the road surface type is already available for the whole Switzerland. When using the algorithm to update the swissTLM3D dataset, it would be possible to perform change detection between the previous and new surface type. Then, those changes could be visually verified.

6. Conclusion

Keeping the swissTLM3D dataset up to date is a time consuming and methodical task. This project aimed at finding a method to automatize the determination of the road surface type (artificial vs. natural). We focused on roads belonging to the "3m Strasse" class and discovered that statistics stemming from pixel values are not enough discriminating to tell artificial roads from natural ones.
Therefore, we decided not to attempt any supervised classification based on machine learning. Instead, deep learning methods are performed. With 93% of the roads classified correctly, this method gave better results in regard to the global F1 score than a baseline model classifying all the roads as artificial. However, the model classifies 4.2% of the roads in the wrong class and has difficulties performing new zones.
To ensure the quality of the swissTLM3D product, we advise to first perform a classification with the algorithm, then to check roads with a low class index or a change in surface type compared to the previous version years. It could represent a huge time saver for the operators who currently classify and check a second time all the roads.

Despite our investigations, we could not find the cause of the gap between the metrics for the training and the inference-only zone. Further investigation is needed.
The next step for this project would be to extend the algorithm to paths of 1 to 2 m wide. The natural roads of 3 m are mostly made of gravel, which strongly resembles asphalt, while natural paths are mostly made of dirt and can grow grass. Therefore, when mixing the two road width classes in one model, the natural roads of 3 m could be too difficult to distinguish from artificial roads and end up neglected.

7. References


  1. Office fédéral de la statistique. Longueur des routes en 2020 | Office fédéral de la statistique. https://www.bfs.admin.ch/news/fr/2020-0273, November 2020. 

  2. swisstopo. swissTLM3D. https://www.swisstopo.admin.ch/de/geodata/landscape/tlm3d.html. 

  3. Lushan Cheng, Xu Zhang, and Jie Shen. Road surface condition classification using deep learning. Journal of Visual Communication and Image Representation, 64:102638, October 2019. doi:10.1016/j.jvcir.2019.102638

  4. Susi Marianingsih, Fitri Utaminingrum, and Fitra Abdurrachman Bachtiar. Road Surface Types Classification Using Combination of K-Nearest Neighbor and Naïve Bayes Based on GLCM. International Journal of Advanced Software Computer Application, 11(2):15–27, 2019. 

  5. Marcus Nolte, Nikita Kister, and Markus Maurer. Assessment of Deep Convolutional Neural Networks for Road Surface Classification. In 2018 21st International Conference on Intelligent Transportation Systems (ITSC), 381–386. Maui, HI, November 2018. IEEE. doi:10.1109/ITSC.2018.8569396

  6. Viktor Slavkovikj, Steven Verstockt, Wesley De Neve, Sofie Van Hoecke, and Rik Van De Walle. Image-Based Road Type Classification. In 2014 22nd International Conference on Pattern Recognition, 2359–2364. Stockholm, August 2014. IEEE. doi:10.1109/ICPR.2014.409

  7. Mohammad Mansourmoghaddam, Hamid Reza Ghafarian Malamiri, Fahime Arabi Aliabad, Mehdi Fallah Tafti, Mohamadreza Haghani, and Saeed Shojaei. The Separation of the Unpaved Roads and Prioritization of Paving These Roads Using UAV Images. Air, Soil and Water Research, 15:117862212210862, January 2022. doi:10.1177/11786221221086285

  8. Hailing Zhou, Hui Kong, Lei Wei, Douglas Creighton, and Saeid Nahavandi. On Detecting Road Regions in a Single UAV Image. IEEE Transactions on Intelligent Transportation Systems, 18(7):1713–1722, July 2017. doi:10.1109/TITS.2016.2622280

  9. swisstopo. SWISSIMAGE RS. https://www.swisstopo.admin.ch/fr/geodata/images/ortho/swissimage-rs.html. 

  10. TiTiler. https://developmentseed.org/titiler/. 

  11. Vincent Sarago, Sean Harkins, and Drew Bollinger. Developmentseed / mosaicjson-spec. https://github.com/developmentseed/mosaicjson-spec, 2021. 

  12. Yongquan Zhao and Zhe Zhu. ASI: An artificial surface Index for Landsat 8 imagery. International Journal of Applied Earth Observation and Geoinformation, 107:102703, March 2022. doi:10.1016/j.jag.2022.102703

  13. Ronald C. Estoque and Yuji Murayama. Classification and change detection of built-up lands from Landsat-7 ETM+ and Landsat-8 OLI/TIRS imageries: A comparative assessment of various spectral indices. Ecological Indicators, 56:205–217, September 2015. doi:10.1016/j.ecolind.2015.03.037

  14. Swiss Territorial Data Lab. Object detector. February 2023. URL: https://github.com/swiss-territorial-data-lab/object-detector

  15. Sagi Eppel. Classifying a specific image region using convolutional nets with an ROI mask as input. December 2018. arXiv:1812.00291