Skip to content

harsh/embeddings/refactor#62

Merged
harshkedia177 merged 10 commits intomainfrom
harsh/embeddings/refactor
Mar 25, 2026
Merged

harsh/embeddings/refactor#62
harshkedia177 merged 10 commits intomainfrom
harsh/embeddings/refactor

Conversation

@harshkedia177
Copy link
Copy Markdown
Owner

  • refactor: enhance writable storage initialization and error messaging in Axon CLI
  • feat(embeddings): switch to nomic-embed-text-v1.5 with passage_embed/query_embed
  • feat(embeddings): add token budget to text generation
  • feat(storage): switch to FLOAT[384] schema and parameterized vector queries
  • feat(embeddings): add model metadata to meta.json and auto-migration on model change
  • test(embeddings): add integration test with real nomic model
  • fix(embeddings): apply _MAX_TEXT_CHARS in embed_nodes, treat missing model key as mismatch
  • feat(embeddings): enhance embedding management and background processing

… in Axon CLI

- Updated the `_initialize_writable_storage` function to include an `auto_index` parameter, allowing for more flexible error handling when no index exists.
- Improved error messages to guide users on running the indexing command.
- Adjusted the `setup` command output to clarify configuration instructions for Claude Code and Cursor.
- Modified tests to reflect changes in expected output for the setup command.
…ueries

- Change Embedding table from DOUBLE[] to FLOAT[384] (fixed-dimension, 32-bit)
- Replace inline vector literal with parameterized query using CAST($vec, 'FLOAT[384]')
- Remove _safe_vec_literal helper and unused math import
- Introduced functions to check for existing index metadata and database, improving the initialization process.
- Added background embedding generation capability, allowing embeddings to be processed asynchronously.
- Updated the embedding model handling to ensure current embeddings are used, with automatic re-embedding on model changes.
- Adjusted default batch size for embeddings from 128 to 32 for better performance.
- Enhanced tests to cover new embedding migration and initialization scenarios.
# Conflicts:
#	src/axon/cli/main.py
…ctor

# Conflicts:
#	src/axon/core/embeddings/embedder.py
#	src/axon/core/ingestion/pipeline.py
#	src/axon/core/storage/base.py
#	tests/core/test_embedder.py
@harshkedia177 harshkedia177 merged commit 99b6409 into main Mar 25, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant