Hands On Waidrin

Hey Pew, the AI coder and I thought we'd pimp Waidrin a little:

- Main feature is dynamic image generation with Flux 2 Klein via ComfyUI
- Every NPC gets an avatar
- Avatars are embedded in scenes via multi-reference Kontext conditioning
- Each dialogue turn generates a scene illustration
- Genre selection (Fantasy / Sci-Fi) – influences prompts, races, image style
- D&D-lite RPG stats (STR/DEX/CON/INT/WIS/CHA + HP) with genre-specific enforcement
- Language selection at the beginning → prompts are automatically translated
- Random character/world generation with re-roll dice button
- Optional custom appearance text field for the protagonist
- State JSON and the rest remain largely original (minor schema addition for image references)

I think that's pretty cool for immersion. The images are still a bit cringe, but that should be manageable.

Disadvantages:

Currently only runs with multi-GPU (1× llama.cpp / 2× ComfyUI), though single-GPU would be theoretically possible with sequential loading
Image generation lags slightly behind the text
The question is: are you interested in a PR? The changes to Waidrin are not insignificant, and until it's cleanly integrated, it could be a minor undertaking ;-)

The multi-GPU setup in particular is likely to be a deal breaker for most people. So the question of effort/return arises – ultimately, I don't care, I can just make a fork at some point.

<img width="849" height="797" alt="Image" src="https://github.com/user-attachments/assets/4ef9cefd-5b39-4daf-b490-9f45d955581c" />
<img width="847" height="613" alt="Image" src="https://github.com/user-attachments/assets/3ee18b2c-7266-4e38-8ed5-306b28cd2307" />
<img width="784" height="854" alt="Image" src="https://github.com/user-attachments/assets/e384efa6-c4ad-439f-9144-e43b2a8ca5f7" />
<img width="783" height="848" alt="Image" src="https://github.com/user-attachments/assets/e8bf859e-852e-4865-8d24-2def8a198fa7" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hands On Waidrin #52

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Hands On Waidrin #52

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions