This project demonstrates how to use the Ollama AI provider to generate descriptive text for an image with the `llama3.2-vision:11b` model. The script sends a text prompt and an image to the model in a single request and outputs the model's description of the image.
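
As a rough illustration of how text and image data can travel in one request, the sketch below builds a single user message with a text part and an image part. The helper name `buildVisionMessages` and the local types are hypothetical; the message shape is an assumption based on the multimodal message format of the `ai` package and should be checked against the installed version. The call site appears only in comments because it needs the npm packages and a running Ollama server.

```typescript
// Local types mirroring the message shape assumed here (not imported from `ai`).
type TextPart = { type: "text"; text: string };
type ImagePart = { type: "image"; image: Uint8Array };
type UserMessage = { role: "user"; content: Array<TextPart | ImagePart> };

// Hypothetical helper: combine a prompt and raw image bytes into one user message.
function buildVisionMessages(prompt: string, image: Uint8Array): UserMessage[] {
  if (prompt.trim() === "") {
    throw new Error("prompt must not be empty");
  }
  if (image.length === 0) {
    throw new Error("image must not be empty");
  }
  return [
    {
      role: "user",
      content: [
        { type: "text", text: prompt },
        { type: "image", image },
      ],
    },
  ];
}

// Assumed call site (requires `ai`, `ollama-ai-provider`, and a running Ollama server):
//   import { readFileSync } from "node:fs";
//   import { generateText } from "ai";
//   import { ollama } from "ollama-ai-provider";
//
//   const messages = buildVisionMessages(
//     "Describe this image.",
//     readFileSync("image/image1.jpg"),
//   );
//   const { text } = await generateText({
//     model: ollama("llama3.2-vision:11b"),
//     messages,
//   });
//   console.log(text);
```

Keeping the message construction in a small pure function makes it easy to check the payload without contacting the model.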
- Uses Ollama AI provider to interact with the Llama model.
- Processes both text and image input.
- Outputs a descriptive text response from the model.
- Uses Tsup for TypeScript/JavaScript bundling and compilation.

Prerequisites:

- Ollama server running locally or accessible via a specified URL
- The `ai` and `ollama-ai-provider` npm packages installed
- The `tsup` package for building the project
- A valid image file (`image1.jpg`) in the `image` directory
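
Since the Ollama server may be local or reachable at another URL, one way to handle both cases is to resolve the base URL from the environment with a local fallback. This is only a sketch: the variable name `OLLAMA_BASE_URL`, the helper `resolveOllamaBaseUrl`, and the `/api` path suffix are assumptions, while port 11434 is Ollama's default.

```typescript
// Assumed fallback: a local Ollama server on its default port.
const DEFAULT_OLLAMA_URL = "http://localhost:11434/api";

// Hypothetical helper: pick the base URL from an env-like object, or fall back.
function resolveOllamaBaseUrl(env: Record<string, string | undefined>): string {
  const raw = (env["OLLAMA_BASE_URL"] ?? "").trim();
  if (raw === "") {
    return DEFAULT_OLLAMA_URL;
  }
  if (!/^https?:\/\//.test(raw)) {
    throw new Error("OLLAMA_BASE_URL must start with http:// or https://");
  }
  // Drop trailing slashes so later path joins stay predictable.
  return raw.replace(/\/+$/, "");
}

// Assumed call site (requires `ollama-ai-provider`):
//   import { createOllama } from "ollama-ai-provider";
//   const ollama = createOllama({ baseURL: resolveOllamaBaseUrl(process.env) });
//   const model = ollama("llama3.2-vision:11b");
```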

Install dependencies: `npm install`

Build the project: `npx tsup src/index.ts --format esm --dts`

Run the script: `node dist/index.js`
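
The tsup flags in the build command can also live in a config file. The sketch below is an assumed `tsup.config.ts` that mirrors those flags; the project itself may not include this file.

```typescript
// tsup.config.ts (hypothetical file mirroring the CLI flags of the build command)
import { defineConfig } from "tsup";

export default defineConfig({
  entry: ["src/index.ts"], // same entry point as on the command line
  format: ["esm"],         // equivalent to --format esm
  dts: true,               // equivalent to --dts (emit type declarations)
});
```

With this file in place, `npx tsup` alone should produce the same build. Note that tsup picks the output file extension based on the `type` field in `package.json`, so running `node dist/index.js` likely assumes `"type": "module"` is set there.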