- [04/14/2026] 🔥 Training code is now released.
- [03/29/2026] 🔥 ComfyUI support (community) is available.
- [03/27/2026] 🔥 arXiv paper is now available.
- [03/26/2026] 🔥 Demo is live, give it a try 🎮
- [03/25/2026] 🔥 Inference Code and Benchmark Data are released.
- [03/24/2026] 🔥 Project Page and Model Weight (Preview) are released.
- Project Page
- Model Weight (Preview)
- Inference Code
- Benchmark Data
- Online Demo
- Training Code
- Benchmark Code
- Model Weight (Stable)
A community implementation for ComfyUI is available here:
Thanks to @judian17 for making this possible.
Quick start for PixelSmile inference.
- Install the environment in Installation.
- Download the base model and PixelSmile weights in Model Download.
- Run inference in Inference.
Clone the repository and enter the project directory:

```bash
git clone https://github.com/Ammmob/PixelSmile.git
cd PixelSmile
```

Create and activate a clean conda environment:

```bash
conda create -n pixelsmile python=3.10
conda activate pixelsmile
```

Install the inference dependencies:

```bash
pip install -r requirements.txt
```

Patch the diffusers installation to work around the Qwen image edit bug:

```bash
bash scripts/patch_qwen_diffusers.sh
```

If you want to train PixelSmile, install the additional training dependencies on top of the inference environment:

```bash
pip install -r requirements-train.txt
```

We recommend downloading all models to `./weights`.
PixelSmile uses Qwen-Image-Edit-2511 as the base model; you can download it from Hugging Face.
| Model | Version | Data Type | Download |
|---|---|---|---|
| PixelSmile-preview | Preview | Human | Hugging Face |
✨ A more stable version is coming soon, with improved human expression editing performance and support for anime expression editing.
Training requires additional pretrained weights and auxiliary models.
| Model | Data Type | Download |
|---|---|---|
| clip-vit-large-patch14 | Human | Hugging Face |
| DanbooruCLIP | Anime | Hugging Face |
We use ArcFace for identity embedding during training.

- Download and unzip antelopev2.zip to your model directory (default: `./weights/antelopev2`).
- Convert `glintr100.onnx` to `glintr100.pth` using `onnx2torch`.
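The ONNX-to-PyTorch conversion in the last step can be sketched as below. `pth_path_for` and `convert_glintr100` are hypothetical helper names, and saving the full module (rather than a `state_dict`) is an assumption — adjust to whatever the training code actually loads.

```python
from pathlib import Path


def pth_path_for(onnx_path: str) -> Path:
    """Map e.g. glintr100.onnx -> glintr100.pth next to the original file."""
    return Path(onnx_path).with_suffix(".pth")


def convert_glintr100(onnx_path: str) -> Path:
    """Convert the ArcFace ONNX recognizer to a PyTorch checkpoint."""
    # torch and onnx2torch are assumed installed in the training environment.
    import torch
    from onnx2torch import convert

    model = convert(onnx_path)  # returns a torch.nn.Module
    out = pth_path_for(onnx_path)
    torch.save(model, out)  # full module; switch to model.state_dict() if the repo expects one
    return out


# Usage (not executed here):
#   convert_glintr100("./weights/antelopev2/glintr100.onnx")
```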
```bash
# Inference models: Qwen base model + PixelSmile LoRA
bash scripts/download_infer_models.sh

# Training CLIP models: clip-vit-large-patch14 (human) + DanbooruCLIP (anime)
bash scripts/download_train_clip_models.sh

# Training InsightFace models: download antelopev2 and convert glintr100.onnx -> glintr100.pth
bash scripts/download_train_insightface.sh
```

The command below is an example for inference; model paths use our default directory, `./weights`.
```bash
python pixelsmile/infer.py \
    --image-path /path/to/input.jpg \
    --output-dir /path/to/output \
    --model-path ./weights/Qwen-Image-Edit-2511 \
    --lora-path ./weights/PixelSmile-preview.safetensors \
    --expression happy \
    --data-type human \
    --scales 0 0.5 1.0 1.5 \
    --seed 42
```

This repository includes the training entry script at `pixelsmile/train.py`.
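To sweep several inputs or expressions, the inference command above can be composed programmatically. A minimal sketch: `build_infer_cmd` is a hypothetical helper, and any expression name other than `happy` is an assumption about what the model accepts.

```python
import shlex
from typing import Sequence


def build_infer_cmd(image_path: str, output_dir: str, expression: str,
                    scales: Sequence[float], seed: int = 42,
                    model_root: str = "./weights") -> list:
    """Compose the pixelsmile/infer.py command line from the README example."""
    return [
        "python", "pixelsmile/infer.py",
        "--image-path", image_path,
        "--output-dir", output_dir,
        "--model-path", f"{model_root}/Qwen-Image-Edit-2511",
        "--lora-path", f"{model_root}/PixelSmile-preview.safetensors",
        "--expression", expression,
        "--data-type", "human",
        "--scales", *[str(s) for s in scales],
        "--seed", str(seed),
    ]


cmd = build_infer_cmd("input.jpg", "out", "happy", [0, 0.5, 1.0, 1.5])
print(shlex.join(cmd))  # pass the list to subprocess.run(cmd) to execute
```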
Use `pixelsmile/configs/example.yaml` as a reference and configure your training file at `pixelsmile/configs/config.yaml`.
- Configure model paths. `example.yaml` already uses our default model directory layout under `./weights/...`. If your models are in the same location, keep these defaults:
  - `model.pretrained_path: ./weights/Qwen-Image-Edit-2511`
  - `model.insightface_detector_path: ./weights/antelopev2/scrfd_10g_bnkps.onnx`
  - `model.insightface_recognition_path: ./weights/antelopev2/glintr100.pth`
- Configure the CLIP path by data type.
  - Human data: `model.clip_path: ./weights/clip-vit-large-patch14`
  - Anime data: `model.clip_path: ./weights/DanbooruCLIP`
- Configure the dataset fields: `dataset.path` and `dataset.data_type`.
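Putting the configuration steps above together, a `config.yaml` for human data might look like the sketch below; the `dataset.path` value is a placeholder, and any field not named in the steps above is omitted rather than guessed.

```yaml
# pixelsmile/configs/config.yaml — sketch for human data; adapt from example.yaml
model:
  pretrained_path: ./weights/Qwen-Image-Edit-2511
  insightface_detector_path: ./weights/antelopev2/scrfd_10g_bnkps.onnx
  insightface_recognition_path: ./weights/antelopev2/glintr100.pth
  clip_path: ./weights/clip-vit-large-patch14   # use ./weights/DanbooruCLIP for anime
dataset:
  path: /path/to/your/dataset                   # placeholder
  data_type: human                              # or: anime
```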
Single GPU:

```bash
python pixelsmile/train.py --config pixelsmile/configs/config.yaml
```

Multi-GPU (recommended, via accelerate):

```bash
accelerate launch pixelsmile/train.py --config pixelsmile/configs/config.yaml
```

Training outputs are saved under `exps/<timestamp>/` (ckpts, logs, configs).
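Since each run gets its own `exps/<timestamp>/` directory, a small helper can locate the newest one, e.g. to pick up the latest checkpoint. `latest_run` is a hypothetical helper; it assumes the timestamp directory names sort lexicographically (e.g. zero-padded), which is a guess about the repo's naming scheme.

```python
from pathlib import Path
from typing import Optional


def latest_run(exps_root: str = "exps") -> Optional[Path]:
    """Return the most recently named run directory under exps/, or None."""
    root = Path(exps_root)
    if not root.is_dir():
        return None
    runs = [p for p in root.iterdir() if p.is_dir()]
    # Assumes timestamp names sort lexicographically (zero-padded).
    return max(runs, default=None, key=lambda p: p.name)
```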
Before full training, start with a tiny run by temporarily setting:

- `dataset.max_samples: 8`
- `training.num_epochs: 1`
- `training.batch_size: 1`
- `training.gradient_accumulation_steps: 1`
If the smoke test works, switch back to your full training config.
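The temporary smoke-test overrides can also be applied programmatically instead of hand-editing the file. A sketch, assuming the config is parsed into a nested dict; `apply_overrides` is a hypothetical helper, not part of the repo.

```python
def apply_overrides(config: dict, overrides: dict) -> dict:
    """Set dotted keys like 'training.batch_size' in a nested config dict."""
    for dotted, value in overrides.items():
        node = config
        *parents, leaf = dotted.split(".")
        for key in parents:
            node = node.setdefault(key, {})
        node[leaf] = value
    return config


# Smoke-test values from the README.
SMOKE_TEST = {
    "dataset.max_samples": 8,
    "training.num_epochs": 1,
    "training.batch_size": 1,
    "training.gradient_accumulation_steps": 1,
}
```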
If you find PixelSmile useful in your research or applications, please consider citing our work.
```bibtex
@article{hua2026pixelsmile,
  title={PixelSmile: Toward Fine-Grained Facial Expression Editing},
  author={Hua, Jiabin and Xu, Hengyuan and Li, Aojie and Cheng, Wei and Yu, Gang and Ma, Xingjun and Jiang, Yu-Gang},
  journal={arXiv preprint arXiv:2603.25728},
  year={2026}
}
```