feat: image ingestion via Vision LLM #32

Open

RaviTharuma wants to merge 1 commit into AVIDS2:main from RaviTharuma:feature/memorix-ybj-image-ingestion

Conversation

@RaviTharuma
Contributor

Summary

Adds a memorix_ingest_image MCP tool that describes images using a Vision LLM and stores the resulting description as a memorix observation.

What's included

  • src/multimodal/image-loader.ts — Vision LLM client with:
    • Base64 encoding of local images
    • Configurable model and prompt
    • Support for PNG, JPEG, GIF, WebP, SVG formats
    • Automatic MIME type detection
  • src/server.ts — MCP tool registration (memorix_ingest_image)
  • tests/multimodal/image-loader.test.ts — 7 tests covering:
    • Image description extraction and observation storage
    • Unsupported format rejection
    • Missing file handling
    • Custom prompt passthrough
    • API error propagation
    • Correct base64 encoding
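The base64 encoding and MIME detection described above might look roughly like this (a minimal sketch; the actual function names and structure in src/multimodal/image-loader.ts may differ):

```typescript
import { promises as fs } from "fs";
import * as path from "path";

// Extension-to-MIME map mirroring the formats listed in the PR description.
const MIME_TYPES: Record<string, string> = {
  ".png": "image/png",
  ".jpg": "image/jpeg",
  ".jpeg": "image/jpeg",
  ".gif": "image/gif",
  ".webp": "image/webp",
  ".svg": "image/svg+xml",
};

// Detect the MIME type from the file extension, rejecting unsupported formats.
export function detectMimeType(filePath: string): string {
  const mime = MIME_TYPES[path.extname(filePath).toLowerCase()];
  if (!mime) throw new Error(`Unsupported image format: ${filePath}`);
  return mime;
}

// Read a local image and return a data URL suitable for a Vision LLM request.
export async function encodeImage(filePath: string): Promise<string> {
  const mime = detectMimeType(filePath);
  const data = await fs.readFile(filePath);
  return `data:${mime};base64,${data.toString("base64")}`;
}
```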

Design decisions

  • Uses existing LLM provider (callLLM from src/llm/provider.ts) — no new API configuration needed.
  • Stores description as discovery observation type with image metadata in facts.
  • No new npm dependencies — uses native fs for file reading, existing LLM infrastructure for vision.
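As a rough illustration of the second decision, the stored record could take a shape like the following (field names here are assumptions for illustration, not the PR's actual schema):

```typescript
// Hypothetical shape of the stored memorix observation (field names assumed).
interface ImageObservation {
  type: "discovery";
  content: string; // the Vision LLM's description of the image
  facts: {
    source: "image";
    path: string;     // original image path
    mimeType: string; // detected MIME type
    model: string;    // model that produced the description
  };
}

// Assemble the observation from the LLM description plus image metadata.
function buildObservation(
  desc: string,
  imagePath: string,
  mime: string,
  model: string
): ImageObservation {
  return {
    type: "discovery",
    content: desc,
    facts: { source: "image", path: imagePath, mimeType: mime, model },
  };
}
```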

Tests

7 pass, 0 fail


@chatgpt-codex-connector (bot) left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d2c62ae393

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment on lines +45 to +49
let baseUrl = getLLMBaseUrl('https://api.openai.com/v1').replace(/\/+$/, '');
if (!baseUrl.endsWith('/v1')) baseUrl += '/v1';
const model = getLLMModel('gpt-4o');

const response = await fetch(`${baseUrl}/chat/completions`, {


P1: Route vision calls by the configured LLM provider

isLLMEnabled() allows this path when the runtime is configured for non-OpenAI providers, but callVisionLLM hardcodes OpenAI defaults (https://api.openai.com/v1, gpt-4o) and always posts to /chat/completions. In environments that auto-detect anthropic from ANTHROPIC_API_KEY, image ingestion will consistently fail against the wrong endpoint/model even though LLM mode is reported as enabled. This should use provider-aware config/routing (or the existing provider abstraction) so the request format matches the active provider.
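One way to make the mismatch explicit, sketched under the assumption of a provider-detection helper (not confirmed to exist in this repo), is to fail fast before issuing the request:

```typescript
// Hypothetical guard; the Provider type and where the value comes from
// (e.g. auto-detection via ANTHROPIC_API_KEY) are assumptions.
type Provider = "openai" | "openai-compatible" | "anthropic";

// Reject providers whose API shape the vision call does not support,
// instead of silently posting to the wrong endpoint.
function assertVisionSupported(provider: Provider): void {
  if (provider === "anthropic") {
    throw new Error(
      "memorix_ingest_image currently supports only OpenAI-compatible " +
        "/chat/completions endpoints; the configured provider is 'anthropic'."
    );
  }
}
```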

Useful? React with 👍 / 👎.

@AVIDS2
Owner

AVIDS2 commented Mar 30, 2026

Useful feature direction, but I see two blockers before merge.

  1. The PR currently fails CI because the test file is written against bun:test, while this repo runs vitest.
  2. The implementation assumes an OpenAI-compatible /chat/completions Vision endpoint for whatever getLLMBaseUrl() / getLLMModel() return. That is not safe for the current Memorix config model: a valid installation can be configured for anthropic, and Anthropic is handled through a different API shape elsewhere in the codebase. So right now this tool can be configured according to Memorix but still call the wrong protocol at runtime.

I think the implementation needs to either:

  • go through the existing LLM abstraction in a provider-safe way, or
  • explicitly constrain itself to OpenAI-compatible providers and validate that up front.
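A sketch of the first option: dispatch on the configured provider and build the request body in each provider's native shape. The dispatch function and its call sites are assumptions for illustration; the two payload shapes follow the public OpenAI chat/completions and Anthropic Messages vision formats.

```typescript
// Hypothetical provider-aware payload builder (function name assumed).
type VisionRequest = { prompt: string; dataUrl: string; model: string };

function buildVisionPayload(
  provider: "openai" | "anthropic",
  req: VisionRequest
): object {
  if (provider === "anthropic") {
    // Anthropic's Messages API takes raw base64 with an explicit media_type,
    // not a data URL, so split the data URL apart.
    const [meta, b64] = req.dataUrl.split(",");
    const mediaType = meta.slice("data:".length, meta.indexOf(";"));
    return {
      model: req.model,
      max_tokens: 1024,
      messages: [
        {
          role: "user",
          content: [
            { type: "image", source: { type: "base64", media_type: mediaType, data: b64 } },
            { type: "text", text: req.prompt },
          ],
        },
      ],
    };
  }
  // OpenAI-compatible chat/completions with an image_url content part.
  return {
    model: req.model,
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: req.prompt },
          { type: "image_url", image_url: { url: req.dataUrl } },
        ],
      },
    ],
  };
}
```

The same dispatch point would also choose the endpoint path (/chat/completions vs /v1/messages) and auth header, keeping the MCP tool itself provider-agnostic.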
