
Lightweight modal-guided cross-attention fusion network for visible-infrared object detection

This is the official PyTorch implementation of our LCAFNet. The paper can be downloaded at LCAFNet.

1. Dependencies

Create a conda virtual environment and activate it.

  1. conda create --name MOD python=3.9
  2. conda activate MOD
  3. pip install -r requirements.txt
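After activating the environment, a quick sanity check can confirm the interpreter version and that the core packages are importable. The package names below are assumptions based on a typical PyTorch detection repo's requirements.txt, not a list taken from this repository:

```python
# Sanity-check the activated environment; package names here are assumptions
# (torch, numpy, cv2) and may not match this repo's actual requirements.txt.
import importlib.util
import sys

print(f"Python {sys.version_info.major}.{sys.version_info.minor}")
for pkg in ("torch", "numpy", "cv2"):
    found = importlib.util.find_spec(pkg) is not None
    print(f"{pkg}: {'found' if found else 'missing'}")
```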

2. Dataset download

Download these datasets and create a dataset folder to hold them.

  1. FLIR dataset: FLIR
  2. LLVIP dataset: LLVIP
  3. M3FD dataset: M3FD
  4. MFAD dataset: MFAD
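A minimal sketch of creating the dataset folder described above. The per-dataset subfolder names are an assumption for illustration; the internal layout (image/label splits, etc.) should follow each dataset's own convention and the paths expected by train.py:

```python
# Create a "dataset" folder with one subfolder per dataset.
# Subfolder names are illustrative; match the paths used in your config.
import os

for name in ("FLIR", "LLVIP", "M3FD", "MFAD"):
    os.makedirs(os.path.join("dataset", name), exist_ok=True)

print(sorted(os.listdir("dataset")))
```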

3. Pretrained weights

Download our LCAFNet weights and create a weights folder to hold them.

  1. FLIR dataset: LCAFNet_FLIR.pt
  2. LLVIP dataset: LCAFNet_LLVIP.pt
  3. M3FD dataset: LCAFNet_M3FD.pt
  4. MFAD dataset: LCAFNet_MFAD.pt
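Similarly, a small sketch for the weights folder: create it and list any of the released LCAFNet_*.pt checkpoints already placed inside. This only prepares the folder; the .pt files themselves must be downloaded from the links above:

```python
# Create the "weights" folder and list any LCAFNet_*.pt checkpoints in it.
import glob
import os

os.makedirs("weights", exist_ok=True)
checkpoints = sorted(glob.glob(os.path.join("weights", "LCAFNet_*.pt")))
print(checkpoints)  # empty until the released weights are downloaded
```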

4. Training our LCAFNet

Modify the dataset path, GPU, batch size, and other settings as needed for your setup.

python train.py

5. Testing our LCAFNet

python test.py

6. Citation

If you find LCAFNet helpful for your research, please consider citing our work.

@article{Wu2026,
  author       = {Wencong Wu and
                  Hongxi Zhang and
                  Xiuwei Zhang and
                  Hanlin Yin and
                  Yanning Zhang},
  title        = {Lightweight modal-guided cross-attention fusion network for visible-infrared object detection},
  journal      = {Pattern Recognition},
  volume       = {177},
  pages        = {113350},
  year         = {2026}
}
