Erosion-SAM
Semantic segmentation of soil erosion by water
The Erosion-SAM project presents an automated approach for semantic segmentation of soil erosion features caused by water in agricultural landscapes. This work demonstrates how fine-tuning the Segment Anything Model (SAM) enables accurate detection of erosion and deposition features from high-resolution aerial imagery.
Overview
Soil erosion by water poses significant challenges in agricultural landscapes. Traditional detection methods rely on manual field surveys or visual interpretation of aerial imagery—processes that are time-consuming and subjective. This project leverages the Segment Anything Model (SAM), a foundation model pre-trained on over 1 billion masks, and fine-tunes it for the specialized task of identifying soil erosion features.
Study Area and Dataset
Location: Southeastern Bavaria, Germany
Data Acquisition: The project involved acquiring orthophotos following heavy rainfall events in erosion-prone areas between May and September of 2011 and 2012. To determine suitable acquisition times, RADOLAN rainfall data, with a spatial resolution of 1 × 1 km², were analyzed. When rainfall events met specific thresholds (either a total rainfall of at least 10 mm or a maximum 30-minute intensity exceeding 10 mm/h) indicating high erosion potential, aerial surveys were conducted using a small aircraft within 30 days after the identified erosive event.
Dataset: The images were then georeferenced. We manually identified and masked 405 individual images from the aerial imagery that exhibited erosion and deposition features such as ephemeral gullies, rills and sediment fans. The fields were categorized into 3 land cover categories:
- Grassland: 128 images
- Cropland: 277 images (vegetated: 131, bare: 146)
Methodology
We applied fine-tuning to adapt SAM’s pre-trained weights to soil erosion detection. Fine-tuning is a transfer learning approach that modifies a pre-trained model for a specific task, improving performance while reducing the need for large labeled datasets and computational resources.
Data Processing:
- Large orthophotos divided into 256×256 pixel patches
- Empty masks filtered out
- Data split: 70% training, 15% validation, 15% testing
Model Configuration:
- Base: SAM (vit_b variant)
- Optimizer: AdamW
- Loss: Combined Dice loss and focal loss
- Strategy: Fine-tuning the mask decoder
Results
The fine-tuned Erosion-SAM model successfully detected soil erosion features across different land cover types. We evaluated multiple fine-tuning approaches and compared them against the original SAM model.
Performance Metrics
The table below shows the performance of different approaches across land cover types. The numbers in blue and orange represent the increase and decrease in performance compared to the original SAM baseline.
The Resizing approach with prompt demonstrated the best overall performance with substantial improvements across all metrics and land cover types, particularly excelling in grassland detection (IoU: 0.75, +0.38 improvement).
Segmentation Examples
Applications
The generated data sets can be applied to machine learning-based SE modeling, providing accurate and consistent training data across different land cover types, and offering a reliable alternative to traditional SE models. In addition, erosion-SAM can make a valuable contribution to the precise monitoring of SE with high temporal resolution over large areas, and its results could benefit reinsurance and insurance-related risk solutions.
Publication
Citation:
Shokati, H., Engelhardt, A., Seufferheld, K., Taghizadeh, R., Fiener, P., Lensch, H., & Scholten, T. (2025). Erosion-SAM: Semantic segmentation of soil erosion by water. CATENA, 254, 108954.
DOI: 10.1016/j.catena.2025.108954
Status: Published in CATENA (Shokati et al., 2025)