Dev Image Chat

A tool that automatically generates character images in real time based on your Claude Code conversations and displays them in the browser.

Each time the Claude Code Assistant responds, it reads the conversation content, creates an image generation prompt via a prompt generator (Gemini or Ollama), generates an image using an image generation backend (Stable Diffusion or Gemini), and delivers it to the browser.

With Web Browser

With iTerm2 background display

Caution

This application can use the Gemini API for prompt generation and/or image generation.

Depending on usage frequency, API costs may become significant. Please monitor your usage regularly.
- Be especially careful when using Gemini for image generation, as costs tend to be high. For continuous use, we recommend setting up Stable Diffusion WebUI.
When using the free tier of the Gemini API, your conversation content may be used to improve Google products. If handling confidential information, we recommend using the paid tier API.

By using Ollama for prompt generation and Stable Diffusion for image generation, everything runs locally with no API costs.

Requirements

Go 1.24 or later
Prompt Generator (one of the following)
- Gemini (default) — Requires a Google Gemini API key, available from Google AI Studio
- Ollama — Requires a locally running Ollama instance
  - For the model, gemma3 1b or larger should work fine
Image Generation Backend (one of the following)
- Gemini — Ready to use with just a Gemini API key (no additional setup required)
- Stable Diffusion WebUI — Such as AUTOMATIC1111's stable-diffusion-webui. Must be launched with the --api option to enable the API

Installation

1. Install Go

If Go is not yet installed, use one of the following methods.

macOS (Homebrew):

brew install go

Other platforms:

Download and install from the official Go website.

After installation, verify the version:

go version
# Should show go1.24.0 or later

2. Clone the Repository

git clone https://github.com/egawata/dev-image-chat.git
cd dev-image-chat

3. Build

go build -o dev-image-chat .

This creates the dev-image-chat executable.

4. Create Configuration File

cp .env.example .env

Open the .env file and configure the settings. At minimum, choose a prompt generator and image generation backend.

If using Gemini for prompt generation (default) or image generation, set your API key:

GEMINI_API_KEY=your-api-key-here

If using Ollama for prompt generation, set the prompt generator: (Change OLLAMA_MODEL as appropriate)

PROMPT_GENERATOR=ollama
OLLAMA_MODEL=gemma3

Other settings work with their default values, but can be changed as needed.

Usage

(Optional) Using Ollama for Prompt Generation

Start Ollama.

(Optional) Using Stable Diffusion for Image Generation

Start Stable Diffusion WebUI with the API enabled.

# In the stable-diffusion-webui directory
./webui.sh --api

By default, it starts at http://localhost:7860.

Start Dev Image Chat

./dev-image-chat

Verifying Startup

If you see the following log output, the startup was successful.

Claude Code Image Chat started
  Web UI: http://localhost:8080
  Watching: /Users/<username>/.claude/projects
  Generate interval: 1m0s

Open the Web UI in Your Browser

Access http://localhost:8080 to open the image display screen.

Then use Claude Code as usual. Each time the Assistant responds, an image matching the conversation content will be automatically generated and displayed. (There is a 60-second interval by default.)

Configuration

Settings can be configured via the .env file or environment variables.

Environment Variable	Default	Description
`PROMPT_GENERATOR`	`gemini`	Prompt generator backend (`gemini` or `ollama`)
`IMAGE_GENERATOR`	`sd`	Image generation backend (`sd` or `gemini`)
`SERVER_PORT`	`8080`	Web UI port number
`CLAUDE_PROJECTS_DIR`	`~/.claude/projects`	Claude Code projects directory
`CHARACTERS_DIR`	`characters`	Directory for character configuration files
`CHARACTER_FILE`	(none)	Path to character configuration file (fallback when `CHARACTERS_DIR` is empty)
`GENERATE_INTERVAL`	`60`	Minimum interval between image generations (seconds)
`DEBUG`	`false`	Enable debug logging (`1` or `true`)
`ITERM2_BG`	`false`	Enable iTerm2 terminal background image display (`1` or `true`)

Gemini Parameters

Environment Variable	Default	Description
`GEMINI_API_KEY`	(none)	Google Gemini API key (required when `PROMPT_GENERATOR=gemini` or `IMAGE_GENERATOR=gemini`)
`GEMINI_MODEL`	`gemini-2.5-flash`	Gemini model used for prompt generation (used when `PROMPT_GENERATOR=gemini`)
`GEMINI_IMAGE_MODEL`	`gemini-2.5-flash-image`	Gemini image generation model (used when `IMAGE_GENERATOR=gemini`)

Ollama Parameters

Environment Variable	Default	Description
`OLLAMA_BASE_URL`	`http://localhost:11434`	Ollama API base URL (used when `PROMPT_GENERATOR=ollama`)
`OLLAMA_MODEL`	`gemma3`	Ollama model name (used when `PROMPT_GENERATOR=ollama`)

Stable Diffusion Image Generation Parameters

Effective when IMAGE_GENERATOR=sd (default).

Environment Variable	Default	Description
`SD_BASE_URL`	`http://localhost:7860`	Stable Diffusion WebUI URL
`IMGCHAT_SD_STEPS`	`28`	Number of generation steps
`IMGCHAT_SD_WIDTH`	`512`	Image width (px)
`IMGCHAT_SD_HEIGHT`	`768`	Image height (px)
`IMGCHAT_SD_CFG_SCALE`	`5.0`	CFG scale
`IMGCHAT_SD_SAMPLER_NAME`	`Euler a`	Sampler name
`IMGCHAT_SD_EXTRA_PROMPT`	(none)	Additional prompt appended to all images

Character Configuration

Place .md files in the characters directory to reflect character appearance and atmosphere in the generated images. Multiple character files can be placed, and one character is automatically selected per session.

Placing Character Files (Recommended)

Create .md files in the characters/ directory.

characters/
├── chara1.md
└── chara2.md

Example configuration file (characters/chara1.md):

- High school girl (2nd year)
- Height: 165cm
- Hair: Long black hair, straight bangs
- Eye color: Deep brown
- Outfit: School uniform, blazer, red ribbon, black checkered pleated skirt, black socks
- Style: Slender, calm and elegant
- Speech: Energetic manner of speaking, uses polite language
- Location: School classroom

We recommend specifying visual characteristics such as hairstyle and clothing in as much detail as possible to maintain a consistent look across images. Specifying the location is also recommended.

The directory can be changed with the CHARACTERS_DIR environment variable (default: characters).

iTerm2 Terminal Background Display (macOS)

Instead of viewing generated images in the browser, you can display them directly as the background of your iTerm2 terminal. Images update automatically as you chat with Claude Code.

Note: Escape sequences are used to switch the background image. When prompted for approval, verify that the image file path starts with /tmp/dev-image-chat-bg before approving.

Requirements

iTerm2 (tested with 3.6.6)

Setup

Enable the feature in your .env file:

ITERM2_BG=true

Start the dev-image-chat server:

./dev-image-chat

Define a shell function in your .zshrc (or .bashrc):

claude-bg() {
    /path/to/dev-image-chat/iterm2-bg-watch.sh &
    local watcher_pid=$!
    claude "$@"
    kill "$watcher_pid" 2>/dev/null
    wait "$watcher_pid" 2>/dev/null
}

Replace /path/to/dev-image-chat/ with the actual path to this repository.

Start Claude Code using claude-bg instead of claude:

cd /path/to/your/project
claude-bg

This launches a background watcher that polls for new images and updates the iTerm2 background via escape sequences.

First-time Setup

The first time an image is displayed, iTerm2 will show a confirmation dialog twice (once for each image slot). Verify that the image file path starts with /tmp/dev-image-chat-bg and has a .png extension, then select "Always" both times. After that, images will update without any confirmation.

Adjusting Background Opacity

Go to iTerm2 Preferences > Profiles > Window > Blend and adjust the slider to control the background image transparency so that terminal text remains readable. For Scaling Mode, Scale to Fit is recommended.

Clearing the Background Image

When the watcher exits normally, the background image is automatically cleared.

If the background image remains after exiting, run the following script:

/path/to/dev-image-chat/iterm2-bg-clear.sh

You can define an alias in your .zshrc (or .bashrc) for convenience:

alias clear-bg='/path/to/dev-image-chat/iterm2-bg-clear.sh'

Notes

The watcher automatically detects the project directory and only shows images for that project.
If you run claude-bg again in the same terminal, the old watcher is automatically replaced.
The browser Web UI continues to work alongside the terminal background display.

Troubleshooting

`GEMINI_API_KEY is required` is displayed

This error appears when PROMPT_GENERATOR=gemini (default) or IMAGE_GENERATOR=gemini, but GEMINI_API_KEY is not set. Either set the API key in the .env file, or switch to Ollama for prompt generation (PROMPT_GENERATOR=ollama).

Images are not being generated

Start with DEBUG=1 to check detailed logs.
For Stable Diffusion: Verify that WebUI is started with the --api option and that SD_BASE_URL is correct.
For Gemini: Verify that IMAGE_GENERATOR=gemini is set and that GEMINI_API_KEY is correct.

Image generation interval is too long

You can set the GENERATE_INTERVAL value in the .env file (in seconds).
The default is 60 seconds, but you may use a shorter value if your environment can generate images quickly.

Images are not displayed in the browser

Check that the Web UI (http://localhost:8080) is accessible.
Check the browser developer tools for WebSocket connection errors.

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
assets		assets
characters		characters
static		static
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_ja.md		README_ja.md
config.go		config.go
gemini_image_generator.go		gemini_image_generator.go
go.mod		go.mod
go.sum		go.sum
image_generator.go		image_generator.go
iterm2-bg-clear.sh		iterm2-bg-clear.sh
iterm2-bg-watch.sh		iterm2-bg-watch.sh
iterm2_bg.go		iterm2_bg.go
logger.go		logger.go
main.go		main.go
ollama_prompt_generator.go		ollama_prompt_generator.go
parser.go		parser.go
prompt_generator.go		prompt_generator.go
server.go		server.go
watcher.go		watcher.go

Folders and files

Latest commit

History

Repository files navigation

Dev Image Chat

Caution

Requirements

Installation

1. Install Go

2. Clone the Repository

3. Build

4. Create Configuration File

Usage

(Optional) Using Ollama for Prompt Generation

(Optional) Using Stable Diffusion for Image Generation

Start Dev Image Chat

Verifying Startup

Open the Web UI in Your Browser

Configuration

Gemini Parameters

Ollama Parameters

Stable Diffusion Image Generation Parameters

Character Configuration

Placing Character Files (Recommended)

iTerm2 Terminal Background Display (macOS)

Requirements

Setup

First-time Setup

Adjusting Background Opacity

Clearing the Background Image

Notes

Troubleshooting

GEMINI_API_KEY is required is displayed

Images are not being generated

Image generation interval is too long

Images are not displayed in the browser

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`GEMINI_API_KEY is required` is displayed

Packages