SAM-CFFNet: SAM-Based Cross-Feature Fusion Network for Intelligent Identification of Landslides
Laidian Xi, Junchuan Yu☨, Daqing Ge, Yunxuan Pang, Ping Zhou, Changhong Hou, Yichuan Li, Yangyang Chen, Yuanbiao Dong
☨corresponding author: [email protected]
- [2024.02.23] Paper submission in review process.
- [2023.03.17] Dataset uploaded.
- [2024.03.30] Code test.
- [2024.03.30] Code realease.
- [2024.07.01] Code modification
The landslide datasets used in this study are all open source datasets, and the links to access the datasets and the literature are shown below
- Bijie Landlside Dataset: Link, Paper.
- Landslide4Sense Dataset: Link, Paper.
- GVLM Dataset: Link, Paper.
You can also download our processed data directly: Baidu Disk.
-
SAM-CFFNet backbone from SAM, some code references Attention UNet, HRNet, Deeplabv3+. SAM-CFFNet achieved the highest accuracy across three open-source remote sensing landslide datasets compared to other contrastive models.
-
Click the links below to download the checkpoint for the corresponding model type.
-
ViT-L SAM
: Link
Save the file in your download directory:/pretrained/sam_vit_l_0b3195.pth
-
SAM-CFFNET
: Baidu DiskSave the file in your download directory:
/sam_cffnet_weight/{gvlm_best.pth.tar,BJL_best.pth.tar,L4S_best.pth.tar}
-
-
The experimental code has been uploaded, and the training and validation of the model can be achieved through the ipynb file in it, and the supporting library information of the code is shown below:
Package | Version |
---|---|
GDAL | 3.6.2 |
h5py | 3.9.0 |
matplotlib | 3.7.2 |
numpy | 1.24.1 |
opencv-python | 4.8.0.74 |
scipy | 1.10.1 |
tensorboard | 2.10.1 |
tensorboardX | 2.6.2.2 |
torch | 1.12.1 |
torchsummary | 1.5.1 |
torchvision | 0.13.1 |
tqdm | 4.65.0 |
- SAM. SAM is the Segment Anything Model (SAM) proposed by Meta, which breaks through the boundaries of segmentation and greatly promotes the development of basic computer vision models.
- Attention UNet. Attention mechanism enhances feature representation by focusing on informative regions during image segmentation.
- HRNet. High-Resolution Network maintains high-resolution feature maps throughout the network to capture fine details in images effectively.
- Deeplabv3+. DeepLabv3+ utilizes atrous spatial pyramid pooling and decoder module to achieve accurate semantic segmentation with multi-scale features.
If you're using SAM-CFFNet in your research or applications, please cite using this BibTeX:
@Article{rs16132334,
AUTHOR = {Xi, Laidian and Yu, Junchuan and Ge, Daqing and Pang, Yunxuan and Zhou, Ping and Hou, Changhong and Li, Yichuan and Chen, Yangyang and Dong, Yuanbiao},
TITLE = {SAM-CFFNet: SAM-Based Cross-Feature Fusion Network for Intelligent Identification of Landslides},
JOURNAL = {Remote Sensing},
VOLUME = {16},
YEAR = {2024},
NUMBER = {13},
ARTICLE-NUMBER = {2334},
URL = {https://www.mdpi.com/2072-4292/16/13/2334},
ISSN = {2072-4292},
DOI = {10.3390/rs16132334}
}