[ICLR 2025 Spotlight] Diffusion Bridge AutoEncoders for Unsupervised Representation Learning (DBAE)
_{_{Official PyTorch implementation of DBAE}}

Yeongmin Kim, Kwanghyeon Lee, Minsang Park, Byeonghu Na, Il-Chul Moon

Dependencies

The requirements for this code are the same as DDBM.

In our experiment, we utilized CUDA 11.4 and PyTorch 1.12.

Datasets

For FFHQ, CelebA, CelebA-HQ, and LSUN datasets (LMDB format), please follow instructions from DiffAE. The directory structure looks like:

data
├─ffhq
|  ├─data.mdb
|  └lock.mdb
├─celebahq
|    ├─data.mdb
|    └lock.mdb
├─celeba
|    ├─data.mdb
|    └lock.mdb
├─bedroom
|    ├─data.mdb
|    └lock.mdb
├─horse
|    ├─data.mdb
|    └lock.mdb

Set your dataset path at args.sh.

Model training

We provide the training bash file train_dbae.sh with dbae_train.py. Set variables DATASET_NAME and SCHEDULE_TYPE:

DATASET_NAME sets the dataset. We support FFHQ, CelebA, CelebA-HQ, and LSUN.
SCHEDULE_TYPE sets forward diffusion. Set ve or vp.
STO sets encoder type. Set true or false.

We use 4 $\times$ L40S GPUs for FFHQ and LSUN datasets and 4 $\times$ RTX 3090 GPUs for celeba64.

To train, run

bash train_dbae.sh $DATASET_NAME $SCHEDULE_TYPE $STO
bash train_dbae.sh ffhq vp true
bash train_dbae.sh celeba vp true

(Eval) Downstream Inference

We provide the code for downstream inference (regression/classification).

For regression, it needs to download LFW datasets.

To evaluate regression capability (Pearson'r, MSE), run

bash infer_reg_dbae.sh ffhq vp {YOUR_PATH}/DBAE/ckpt/ffhq/sto/ema_0.9999_1020000.pt true
bash infer_reg_dbae.sh ffhq vp {YOUR_PATH}/DBAE/ckpt/ffhq/det/ema_0.9999_1020000.pt false
bash infer_reg_dbae.sh celeba vp {YOUR_PATH}/DBAE/ckpt/celeba/sto/ema_0.9999_980000.pt true
bash infer_reg_dbae.sh celeba vp {YOUR_PATH}/DBAE/ckpt/celeba/det/ema_0.9999_1020000.pt false

To evaluate classification capability (AP), run

bash infer_class_dbae.sh ffhq vp {YOUR_PATH}/DBAE/ckpt/ffhq/sto/ema_0.9999_1020000.pt true
bash infer_class_dbae.sh ffhq vp {YOUR_PATH}/DBAE/ckpt/ffhq/det/ema_0.9999_1020000.pt false
bash infer_class_dbae.sh celeba vp {YOUR_PATH}/DBAE/ckpt/celeba/sto/ema_0.9999_980000.pt true
bash infer_class_dbae.sh celeba vp {YOUR_PATH}/DBAE/ckpt/celeba/det/ema_0.9999_1020000.pt false

You can regenerate the results in Table 1 with DBAE/ckpt.

(Eval) Reconstruction

We provide the reconstruction bash file recon_dbae.sh with dbae_reconstruction.py. Set variables MODEL_PATH, CHURN_STEP_RATIO, RHO, GEN_SAMPLER, N, and STO :

MODEL_PATH sets your checkpoint path
CHURN_STEP_RATIO sets SDE(0<, <1) or ODE (0) sampling. We recommend SDE to measure LPIPS and ODE for the other metrics.
RHO sets time-discretization interval selection.
GEN_SAMPLER sets the order of the sampler.
N sets sampling step number.
STO is set to true if a stochastic encoder is used.

To reconstruction, run

bash recon_dbae.sh $DATASET_NAME $SCHEDULE_TYPE $MODEL_PATH $MODEL_PATH $CHURN_STEP_RATIO 1 train $RHO $GEN_SAMPLER $N $STO
bash recon_dbae.sh celebahq vp {YOUR_PATH}/DBAE/ckpt/ffhq/det/ema_0.9999_1020000.pt 0.0 1 train 7 euler 100 false
bash recon_dbae.sh celebahq vp {YOUR_PATH}/DBAE/ckpt/ffhq/det/ema_0.9999_1020000.pt 0.33 1 train 7 euler 333 false
bash recon_dbae.sh celebahq vp {YOUR_PATH}/DBAE/ckpt/ffhq/sto/ema_0.9999_1020000.pt 0.0 1 train 7 euler 100 false
bash recon_dbae.sh celebahq vp {YOUR_PATH}/DBAE/ckpt/ffhq/sto/ema_0.9999_1020000.pt 0.33 1 train 7 euler 333 false

To evaluate reconstruction, run

python3 eval_reconstruction.py --sample_path=$SAMPLE_PATH
python3 eval_reconstruction.py --sample_path={YOUR_PATH}/DBAE/ckpt/ffhq/det/sample_1020000/euler_7.0_100_w=1.0_churn=0.0
python3 eval_reconstruction.py --sample_path={YOUR_PATH}/DBAE/ckpt/ffhq/det/sample_1020000/heun_7.0_333_w=1.0_churn=0.33
python3 eval_reconstruction.py --sample_path={YOUR_PATH}/DBAE/ckpt/ffhq/sto/sample_1020000/euler_7.0_100_w=1.0_churn=0.0
python3 eval_reconstruction.py --sample_path={YOUR_PATH}/DBAE/ckpt/ffhq/sto/sample_1020000/heun_7.0_333_w=1.0_churn=0.33

You can regenerate the results in Table 2 with DBAE/ckpt/ffhq.

Interpolation

To interpolate two images (assets/img1.png and assets/img2.png), run

bash interpol_dbae.sh celebahq vp {YOUR_PATH}/DBAE/ckpt/ffhq/sto/ema_0.9999_1020000.pt 0.33 1 train 7 heun 167 true

Attribute Manipulation

It needs the trained linear classifier.

You can train your linear classifier with the infer_class_dbae.sh code. (The dataset may need to be changed.)

We provide the trained linear classifier (best.pt) on 40 binary attributes of CelebA-HQ in the latent space of DBAE trained on FFHQ.

To manipulate attributes, run

bash manipul_dbae.sh celebahq vp {YOUR_PATH}/ffhq/sto/ema_0.9999_1020000.pt 0.33 1 train 7 heun 167 true

Reference

If you find the code useful for your research, please consider citing

@inproceedings{
kim2025diffusion,
title={Diffusion Bridge AutoEncoders for Unsupervised Representation Learning},
author={Yeongmin Kim and Kwanghyeon Lee and Minsang Park and Byeonghu Na and Il-chul Moon},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=hBGavkf61a}
}

Acknowledgements

This work is heavily built upon the code from

Zhou, L., Lou, A., Khanna, S., & Ermon, S. Denoising Diffusion Bridge Models. In The Twelfth International Conference on Learning Representations.
Preechakul, K., Chatthee, N., Wizadwongsa, S., & Suwajanakorn, S. (2022). Diffusion autoencoders: Toward a meaningful and decodable representation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 10619-10629).
Liu, B., Zhu, Y., Song, K., & Elgammal, A. (2020, October). Towards faster and stabilized gan training for high-fidelity few-shot image synthesis. In International conference on learning representations.
Yue, Z., Wang, J., Sun, Q., Ji, L., Eric, I., Chang, C., & Zhang, H. Exploring Diffusion Time-steps for Unsupervised Representation Learning. In The Twelfth International Conference on Learning Representations.

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
assets		assets
datasets		datasets
ddbm		ddbm
diffae		diffae
diti		diti
scripts		scripts
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dbae_infer_class.py		dbae_infer_class.py
dbae_infer_reg.py		dbae_infer_reg.py
dbae_interpolation.py		dbae_interpolation.py
dbae_manipulation.py		dbae_manipulation.py
dbae_reconsctruction.py		dbae_reconsctruction.py
dbae_train.py		dbae_train.py
eval_reconstruction.py		eval_reconstruction.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[ICLR 2025 Spotlight] Diffusion Bridge AutoEncoders for Unsupervised Representation Learning (DBAE)
_{_{Official PyTorch implementation of DBAE}}

Dependencies

Datasets

Model training

(Eval) Downstream Inference

(Eval) Reconstruction

Interpolation

Attribute Manipulation

Reference

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

[ICLR 2025 Spotlight] Diffusion Bridge AutoEncoders for Unsupervised Representation Learning (DBAE) Official PyTorch implementation of DBAE

Dependencies

Datasets

Model training

(Eval) Downstream Inference

(Eval) Reconstruction

Interpolation

Attribute Manipulation

Reference

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

[ICLR 2025 Spotlight] Diffusion Bridge AutoEncoders for Unsupervised Representation Learning (DBAE)
_{_{Official PyTorch implementation of DBAE}}

Packages