NestJS RAG (Retrieval-Augmented Generation)

Un progetto NestJS che implementa un sistema di Retrieval-Augmented Generation (RAG) per generare risposte intelligenti basate su documenti indicizzati.

Note

Per l'accelerazione hardware su GPU AMD, visita il branch vulkan-gpu. Quella versione utilizza llama.cpp con backend Vulkan invece di Ollama standard.

Installazione

npm install

Variabili d'Ambiente

Configura le seguenti variabili nel file .env:

OLLAMA_BASE_URL=http://localhost:11434
EMBEDDING_MODEL=embeddinggemma:latest
SIMILARITY_THRESHOLD=0.45
LLM_MODEL=gemini-3-flash-preview
GEMINI_API_KEY=your_api_key

Avvio del Progetto

# Sviluppo
npm run start

# Modalità watch
npm run start:dev

# Produzione
npm run start:prod

Architettura RAG

Il progetto è organizzato in servizi specializzati per mantenere il codice pulito e manutenibile:

PdfIngestionService: Parsing e chunking dei file PDF
VectorStoreService: Gestione dell'indice FAISS e embeddings
DocumentRetrievalService: Ricerca semantica e filtraggio dei documenti
RagService: Orchestrazione del flusso RAG completo

Endpoint API

Ingestione PDF

POST /rag/ingest
Content-Type: multipart/form-data
Body: files[] (array di file PDF)

Query (solo retrieval)

POST /rag/query
Content-Type: application/json
Body: { "question": "Tua domanda..." }

Generazione (RAG completo)

POST /rag/generate
Content-Type: application/json
Body: { "question": "Tua domanda..." }

Deployment

Check out the NestJS deployment documentation per più informazioni.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
PDF		PDF
ollama		ollama
src		src
.env.example		.env.example
.gitignore		.gitignore
.prettierrc		.prettierrc
README.md		README.md
eslint.config.mjs		eslint.config.mjs
nest-cli.json		nest-cli.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NestJS RAG (Retrieval-Augmented Generation)

Installazione

Variabili d'Ambiente

Avvio del Progetto

Architettura RAG

Endpoint API

Ingestione PDF

Query (solo retrieval)

Generazione (RAG completo)

Deployment

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NestJS RAG (Retrieval-Augmented Generation)

Installazione

Variabili d'Ambiente

Avvio del Progetto

Architettura RAG

Endpoint API

Ingestione PDF

Query (solo retrieval)

Generazione (RAG completo)

Deployment

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages