MultiModalBot is an advanced AI-powered chatbot that can understand and process multiple types of input, including text, images, and audio. This versatile bot is designed to provide intelligent responses and assist users across various domains.
mmbot_test.2.mp4
- Multi-modal input processing (text, images, audio)
- Natural language understanding and generation
- Image recognition and analysis
- Speech-to-text and text-to-speech capabilities
- Customizable responses and knowledge base
- Easy integration with popular messaging platforms
To install MultiModalBot, follow these steps:
git clone https://github.com/luisesantillan/MultiModalBot.git
cd MultiModalBot
pip install -r requirements.txtSetting up your own MultiModalBot is super easy and fun! Just follow these simple steps:
-
Create a new file called
.envin the MultiModalBot folder. -
Open the
.envfile and add your secret bot token and openai api key like this:
TELEGRAM_BOT_TOKEN=your_bot_token_here GOOGLE_API_KEY=your_gemini_api_key_here OPENAI_API_KEY=your_openai_api_key_here
-
Save the
.envfile and close it. -
Open your computer's terminal or command prompt.
-
Go to the MultiModalBot folder by typing:
cd MultiModalBot- Start your awesome bot by typing:
python start_telegram_bot.pyYour bot is now ready! Open Telegram and start chatting with it.
Cool things you can do with your bot:
- Send text messages, and it will reply accordingly!
- Send pictures, and it will tell you what it sees!
- Send voice messages, and it will understand what you say!
- Type /speak followed by some text, and it will speak that back to you!
- Type /model followed by a model name to change the openai model used for text generation.
Cool things the bot can do:
- Understand and respond to text messages
- Understand and respond to images
- Understand and respond to voice messages
- Speak back to you in text or voice
- Send images based on text prompts
- OpenAI for providing such wonderful AI models!
- Telegram for providing a great platform for chatbots.
- pyTelegramBotAPI for making it easy to interact with Telegram bots.
- PollinationsAI for providing an easy tool for image generation.