Transform YouTube Videos into Conversational Avatars with Sieve!
This Sieve pipeline converts a YouTube video into an interactive dialogue between two talking avatars. It consists of the following steps:
- Download YouTube Video with the youtube_to_mp4 Sieve function.
- Summarize the downloaded video into conversational-style text using visual-qa Sieve function (employ suitable prompt).
- Summary text is converted into speech using tts Sieve function.
- Talking avatars are generated with portrait-avatar Sieve function.
A detailed explanation of the pipeline is provided in this tutorial.
youtube_url: url of the youtube videovoice1andvoice2: voice for speakers in the generated dialogue (choose a non-cloning voice compatible with sieve-tts. See sieve/tts readme).image1andimage2: Input images for the talking avatars.
First ensure you have the Sieve Python SDK installed: pip install sievedata and set SIEVE_API_KEY to your Sieve API key.
You can find your API key at https://www.sievedata.com/dashboard/settings.
Then deploy the function to your account:
git clone https://github.com/sieve-community/video2dialogue
cd video2dialogue
sieve deploy pipeline.pyYou can now find the function in your Sieve account and call it via API or SDK.