GitHub - Chen-Yang-Liu/RSCaMa: RSCaMa: Remote Sensing Image Change Captioning with State Space Model

RSCaMa: Remote Sensing Image Change Captioning with State Space Model

Give us a ⭐ if you find this repo interesting.

This repository contains the PyTorch implementation of "RSCaMa: Remote Sensing Image Change Captioning with State Space Model".

Installation and Dependencies

git clone https://github.com/Chen-Yang-Liu/RSCaMa.git
cd RSCaMa
conda create -n RSCaMa_env python=3.9
conda activate RSCaMa_env
pip install -r requirement.txt

Data Preparation

Download the LEVIR-CC dataset: [LEVIR-CC].
The data structure of LEVIR-CC is organized as follows:

├─/root/Data/LEVIR_CC/
        ├─LevirCCcaptions.json
        ├─images
             ├─train
             │  ├─A
             │  ├─B
             ├─val
             │  ├─A
             │  ├─B
             ├─test
             │  ├─A
             │  ├─B

Here, folder A contains the pre-phase (earlier) images and folder B contains the post-phase (later) images.
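
Before preprocessing, it can help to confirm that the dataset matches the tree above. The following is a minimal sketch and is not part of the repository: the helper name `find_missing_levir_cc_paths` is hypothetical, and the root directory is whichever location you downloaded the dataset to.

```python
from pathlib import Path

# Expected layout, taken from the directory tree above.
SPLITS = ("train", "val", "test")
PHASES = ("A", "B")  # A: pre-phase images, B: post-phase images
CAPTIONS_FILE = "LevirCCcaptions.json"


def find_missing_levir_cc_paths(root):
    """Return the expected files and folders that are absent under `root`.

    Hypothetical helper for illustration; it is not part of RSCaMa.
    """
    root = Path(root)
    expected = [root / CAPTIONS_FILE]
    for split in SPLITS:
        for phase in PHASES:
            expected.append(root / "images" / split / phase)
    return [str(path) for path in expected if not path.exists()]
```

Calling `find_missing_levir_cc_paths("/root/Data/LEVIR_CC")` returns an empty list when the caption file and all six image folders are present, and otherwise lists the paths that still need to be created or downloaded.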

Extract text files for the change descriptions of each image pair in LEVIR-CC:

python preprocess_data.py --input_captions_json /DATA_PATH/Levir-CC-dataset/LevirCCcaptions.json

!NOTE: When preparing the text token files, we suggest setting the word count threshold of LEVIR-CC to 5 and Dubai_CC to 0 for fair comparisons.
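
To illustrate what a word count threshold does to the vocabulary, here is a minimal sketch. It is not the code in preprocess_data.py: the function name, the special token names, and the choice of ">=" for the comparison are all assumptions made for illustration, and the repository's script may differ in each of them.

```python
from collections import Counter


def build_vocab(captions, word_count_threshold):
    """Keep words that occur at least `word_count_threshold` times.

    Assumption: the comparison is ">="; the repository's script may differ.
    """
    counts = Counter(
        word for caption in captions for word in caption.lower().split()
    )
    kept = sorted(w for w, c in counts.items() if c >= word_count_threshold)
    # Special token names are illustrative, not taken from the repository.
    return ["<NULL>", "<UNK>", "<START>", "<END>"] + kept


captions = [
    "a road is built",
    "a house is built near the road",
    "the scene is the same",
]
print(build_vocab(captions, 2))  # rare words are dropped
print(build_vocab(captions, 0))  # every word is kept
```

A higher threshold shrinks the vocabulary and maps rare words to the unknown token, which changes the reported caption metrics. That is why using the same threshold as other methods matters for a fair comparison.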

NOTE

Please modify the source code of the CLIP package: change CLIP.model.VisionTransformer.forward() as shown in [this].
Mamba is only supported on Linux systems.
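
Since Mamba is stated to be Linux-only, a script can warn early on other systems instead of failing later during import or compilation. This is a minimal sketch using only the standard library; the function name `is_linux` is hypothetical and not part of the repository.

```python
import platform


def is_linux(system_name=None):
    """Return True when the given (or detected) OS name is Linux."""
    name = platform.system() if system_name is None else system_name
    return name == "Linux"


if not is_linux():
    print("Warning: Mamba is only supported on Linux; detected " + platform.system())
```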

Training

python train_CC.py --data_folder /DATA_PATH/Levir-CC-dataset/images

!NOTE: If the program fails with the error "'Meteor' object has no attribute 'lock'", we recommend installing Java with sudo apt install openjdk-11-jdk to resolve the issue.
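
Because the suggested fix is to install a JDK, the METEOR evaluator appears to need a `java` executable. A quick check before starting a long training run could look like the sketch below. The function name `java_available` is hypothetical, and the check only tests whether `java` is on PATH, not which version is installed.

```python
import shutil


def java_available(which=shutil.which):
    """Return True if a `java` executable is found on PATH."""
    return which("java") is not None


if not java_available():
    print("java not found on PATH; try: sudo apt install openjdk-11-jdk")
```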

Evaluate

python test.py --data_folder /DATA_PATH/Levir-CC-dataset/images --checkpoint xxxx.pth

Alternatively, you can download our pretrained model here: [Hugging Face].

Experiment:

Citation:

@ARTICLE{liu2024rscama,
  author={Liu, Chenyang and Chen, Keyan and Chen, Bowen and Zhang, Haotian and Zou, Zhengxia and Shi, Zhenwei},
  journal={IEEE Geoscience and Remote Sensing Letters}, 
  title={RSCaMa: Remote Sensing Image Change Captioning With State Space Model}, 
  year={2024},
  volume={21},
  number={},
  pages={1-5},
  keywords={Decoding;Visualization;Transformers;Task analysis;Solid modeling;Remote sensing;Feature extraction;Change captioning;Mamba;spatial difference-guided SSM;state space model (SSM);temporal traveling SSM},
  doi={10.1109/LGRS.2024.3404604}}

