Skip to content

vidlytics extracts text from multimedia, generates domain-specific summaries & insights from prompts, enabling quick understanding without direct video processing.

Notifications You must be signed in to change notification settings

chigwell/vidlytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

Vidlytics

PyPI version License: MIT Downloads LinkedIn

Vidlytics is a Python package designed to extract and process information from multimedia content descriptions, such as videos or other structured data, by utilizing advanced language models. It simplifies understanding complex multimedia materials by generating concise, domain-specific summaries or insights based on the provided textual descriptions, captions, or processed data. This enables users to quickly grasp key themes and ideas without directly analyzing the multimedia content itself.

Features

  • Utilizes language models to interpret multimedia descriptions
  • Flexible with different LLM backends (OpenAI, Anthropic, Google, etc.)
  • Simple interface for extracting structured insights
  • Built-in support for pattern matching and validation

Installation

Install Vidlytics via pip:

pip install vidlytics

Usage

Here's a sample usage demonstrating how to invoke the package:

from vidlytics import vidlytics

# Example user input
user_input = "Describe the main topics in the multimedia content."

# Calling with default LLM (ChatLLM7)
response = vidlytics(user_input)

# Using a custom LLM instance, e.g., OpenAI's ChatOpenAI
from langchain_openai import ChatOpenAI

llm = ChatOpenAI()
response = vidlytics(user_input, llm=llm)

Parameters

  • user_input (str): The textual description or summary of the multimedia content to process.
  • llm (Optional[BaseChatModel]): An instance of a language model. Defaults to ChatLLM7 if not provided.
  • api_key (Optional[str]): API key for LLM7. If not provided, the package checks the environment variable LLM7_API_KEY. A free API key can be obtained at https://token.llm7.io/.

Underlying Technology

  • Uses ChatLLM7 from the langchain_llm7 package by default.
  • Supports integration with different language model providers, including OpenAI, Anthropic, Google Generative AI, and others, by passing custom llm instances.

Additional Notes

  • The package is designed for flexibility; users can provide their own language model instances to match their preferred providers and configurations.
  • Rate limits for the free tier of LLM7 are generally sufficient for typical use cases. Higher limits are available via API keys.

License

This project is maintained by Eugene Evstafev. For issues, feature requests, or contributions, please open an issue on GitHub.

Author

License

MIT License