# Description

Creates a chat-based LLM app, similar to what you would expect from ChatGPT or Gemini (but worse :D).

Features:

- Support for local (HuggingFace) or remote (OpenRouter API) backends
- Multiple models available for each backend
- Tunable generation parameters (temperature, max tokens, etc.)
- Text + image input (local backend only)
- System prompts to tune the model's output format
- LLM chat with conversation history
- Save/load conversations, or anonymous chat
- Chat with code highlighting
- Retrieval-Augmented Generation (RAG) based on a langchain + FAISS embeddings database (see the sketch below)
- A "nice" frontend interface

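The RAG feature follows the usual langchain + FAISS pattern. Here is a minimal sketch of that pipeline; the embedding model, chunk sizes, and sample text are illustrative, not the app's actual settings, and import paths may differ across langchain versions:

```python
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import FAISS
from langchain.text_splitter import RecursiveCharacterTextSplitter

# split raw documents into overlapping chunks (sizes are illustrative)
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
chunks = splitter.split_text("...your document text...")

# embed the chunks and index them in a FAISS vector store
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
db = FAISS.from_texts(chunks, embeddings)

# at query time, retrieve the top-k chunks to prepend to the LLM prompt
context_docs = db.similarity_search("user question", k=3)
```
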
---

# Setup

```bash
cd ~/dev-playground/llm_app/
# create the conda environment
conda env create -f environment.yml
# install backend packages
pip install -e .

# install frontend packages
cd ~/dev-playground/llm_app/llm_app/frontend
npm install
```
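
To sanity-check the editable install, a quick import inside the activated environment should work (assuming, from the repo layout, that the package is named `llm_app`):

```python
# run inside the activated conda env; package name assumed from the repo layout
import llm_app
print(llm_app.__file__)
```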

---

# Usage

Open two terminals: one to run the LLM backend service and another to run the frontend.

Run the LLM backend:

```bash
# NOTE: if you plan on using the OpenRouter API backend, first export the API key:
# export OPENROUTER_API_KEY="<API_KEY>"
conda activate env_llm_app
cd ~/dev-playground/llm_app/llm_app/backend
uvicorn main:app --host 0.0.0.0 --port 8000 --reload
# your backend should now be running on http://localhost:8000
# you can check the Swagger API docs at http://localhost:8000/docs
```
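
Since the backend serves Swagger docs, it is presumably FastAPI, which also exposes its schema at `/openapi.json` by default; a quick way to confirm the service is up and list its routes:

```python
import requests

# FastAPI serves the OpenAPI schema at /openapi.json by default
resp = requests.get("http://localhost:8000/openapi.json", timeout=5)
resp.raise_for_status()
print(sorted(resp.json()["paths"]))  # the backend's available endpoints
```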

Run the frontend interface:

```bash
cd ~/dev-playground/llm_app/llm_app/frontend
npm run dev
# your frontend should now be running on http://localhost:5174
```

---

# Results

<details>
<summary> Interface </summary>
  
</details>

<details>
<summary> Text and Image inputs </summary>
  
</details>

---

# Ideas / TODOs

- [ ] Display in the frontend the sources used in the RAG context

- [ ] Support web search (sketch below)
  - for each query, first search the web and download the first n pages
  - store the pages in the RAG embeddings database
  - pass additional context from the downloaded pages to the LLM

- [ ] Support speech-to-text / text-to-speech models for voice input/output (sketch below)
  - change the frontend to enable voice/audio inputs
  - add a voice-to-text intermediate model
  - pass the text to the LLM to generate an output
  - optionally convert the output back to audio using a text-to-speech model

- [ ] Real-time streaming responses using WebSockets (sketch below)

- [ ] Create a docker compose setup to run both frontend and backend, and deploy it somewhere

- [ ] Debug leftover issues in the frontend/backend communication
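
For the web-search TODO, a sketch of the indexing step: download one result page and add its chunks to the existing FAISS store. `index_page` and the `vector_db` handle are hypothetical names, and the search-provider call itself is left out:

```python
import requests
from langchain.text_splitter import RecursiveCharacterTextSplitter

def index_page(url: str, vector_db) -> None:
    """Hypothetical helper: fetch one result page and add it to the RAG store."""
    html = requests.get(url, timeout=10).text  # a real version would strip HTML tags
    splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
    chunks = splitter.split_text(html)
    # tag each chunk with its source URL so the frontend could display it later
    vector_db.add_texts(chunks, metadatas=[{"source": url}] * len(chunks))
```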
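
For the voice TODO, the HuggingFace `transformers` ASR pipeline would cover the voice-to-text step; the model choice is illustrative, and the reverse direction could use a text-to-speech pipeline in the same way:

```python
from transformers import pipeline

# speech -> text via the HuggingFace ASR pipeline (model choice is illustrative)
asr = pipeline("automatic-speech-recognition", model="openai/whisper-small")
text = asr("question.wav")["text"]
# `text` would then be sent to the LLM like any typed message
```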
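
For the streaming TODO, FastAPI's WebSocket support fits naturally with the existing uvicorn setup. A minimal self-contained sketch, where `stream_tokens` is a stand-in for whatever token generator the backend's real LLM wrapper exposes:

```python
import asyncio

from fastapi import FastAPI, WebSocket

app = FastAPI()

async def stream_tokens(prompt: str):
    # stand-in for the backend's real LLM generator (hypothetical)
    for token in ["Hello", " ", "world"]:
        await asyncio.sleep(0.1)  # simulate generation latency
        yield token

@app.websocket("/ws/chat")
async def chat_ws(websocket: WebSocket):
    await websocket.accept()
    prompt = await websocket.receive_text()
    # push tokens to the client as they are generated
    async for token in stream_tokens(prompt):
        await websocket.send_text(token)
    await websocket.close()
```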

---

# References