GitHub

CP2M: Clustered-Patch-Mixed Mosaic Augmentation for Aerial Image Segmentation

With the spirit of reproducible research, this repository contains all the codes required to produce the results in the manuscript:

Yijie Li, Hewei Wang, Jinfeng Xu, Zixiao Ma, Puzhen Wu, Shaofan Wang, and Soumyabrata Dev, CP2M: Clustered-Patch-Mixed Mosaic Augmentation for Aerial Image Segmentation, IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2025.

Citing CP2M

If you find CP2M useful in your research, please consider citing our paper.

@article{li2025cp2m,
  title={CP2M: Clustered-Patch-Mixed Mosaic Augmentation for Aerial Image Segmentation},
  author={Li, Yijie and Wang, Hewei and Xu, Jinfeng and Ma, Zixiao and Wu, Puzhen and Wang, Shaofan and Dev, Soumyabrata},
  journal={arXiv preprint arXiv:2501.15389},
  year={2025}
}

1. Summery

Remote sensing image segmentation is vital for earth observation but limited annotation data in remote sensing often leads to overfitting in deep learning models, driving research into data augmentation techniques. Many current approaches, however, rely on simple transformations that fail to enhance data diversity or model generalization effectively. Hence, We propose Clustered-Patch-Mixed Mosaic (CP2M), a novel augmentation strategy addressing these challenges. CP2M combines Mosaic augmentation, which merges four random samples, with a clustered patch mix phase leveraging connected component labeling to maintain spatial coherence and avoid irrelevant semantics. Experiments on the ISPRS Potsdam dataset show CP2M significantly reduces overfitting, achieving state-of-the-art segmentation accuracy and robustness for remote sensing tasks.

Overall Pipeline The figure below shows the overall pipeline for CP2M.

Segmentation Model The figure below illustrates our designed MobileNetV2-UNet model, which we used to validate the effectiveness of CP2M. In our approach, the MobileNetV2 encoder extracts multi-scale features through its hierarchical layers (E1–E5), which are then passed through a channel reduction module (RD) before being concatenated with the corresponding decoder outputs (D1–D4). CP2M augments the input samples fed to this architecture, enhancing data diversity while maintaining spatial coherence. By leveraging CP2M's Mosaic and clustered patch mix phases, the segmentation model achieves better generalization, reduced overfitting, and improved performance on tasks like aerial image analysis, as demonstrated in experiments with the ISPRS Potsdam dataset.

2. Dependencies

2.1 PaddlePaddle

For CUDA 12

python -m pip install paddlepaddle-gpu==2.6.2.post120 -i https://www.paddlepaddle.org.cn/packages/stable/cu120/

2.2 Others

pip install numpy pillow pandas matplotlib seaborn wandb tqdm albumentations

2.3 Dataset

The offical website of ISPRS-Potsdam is link. You can download the data from link. Please place the data in ./dataset following the structure:

.
├── Potsdam
│   ├── 2_Ortho_RGB
│   │   │── top_potsdam_4_12_RGB.tif
|   |   └── ...
│   └── 5_Labels_all
│       │── top_potsdam_4_12_label.tif
|       └── ...
└── README.md

Then please run python make_potsdam_dataset.py to generate the data for training and testing.

3. Usage

3.1 Train

usage: train.py [-h] [--img_path IMG_PATH] [--gt_path GT_PATH] [--percentage PERCENTAGE] [--epochs EPOCHS] [--batchsize BATCHSIZE] [--lr LR] [-aug]
                [--p_mosaic P_MOSAIC] [--p_cpm P_CPM] [--name NAME] [--key KEY] [--proj PROJ]

options:
  -h, --help            show this help message and exit
  --img_path IMG_PATH  
  --gt_path GT_PATH
  --percentage PERCENTAGE
  --epochs EPOCHS
  --batchsize BATCHSIZE
  --lr LR
  -aug
  --p_mosaic P_MOSAIC  # probability of applying MOSAIC augmentation
  --p_cpm P_CPM  # probability of applying CP2M augmentation
  --name NAME
  --key KEY
  --proj PROJ

Train Using Default Setting

python train.py --name <EXPPERIMENT NAME> --key <YOUR WANDB KEY> -aug

3.2 Test

python test.py --gt_path <PATH OF THE CHECKPOINT>

4. Results

You can download our pre-trained checkpoints for all experiments from link

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
assets		assets
ckpts		ckpts
dataset		dataset
doc		doc
models		models
results		results
utils		utils
.gitignore		.gitignore
README.md		README.md
make_potsdam_dataset.py		make_potsdam_dataset.py
notebook.ipynb		notebook.ipynb
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CP2M: Clustered-Patch-Mixed Mosaic Augmentation for Aerial Image Segmentation

Citing CP2M

1. Summery

2. Dependencies

2.1 PaddlePaddle

2.2 Others

2.3 Dataset

3. Usage

3.1 Train

3.2 Test

4. Results

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Att100/CP2M

Folders and files

Latest commit

History

Repository files navigation

CP2M: Clustered-Patch-Mixed Mosaic Augmentation for Aerial Image Segmentation

Citing CP2M

1. Summery

2. Dependencies

2.1 PaddlePaddle

2.2 Others

2.3 Dataset

3. Usage

3.1 Train

3.2 Test

4. Results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages