# Voice-Pro Goes Free, Offers Open-Source AI Dubbing Voice-Pro, a comprehensive AI application for voice cloning, translation, and video dubbing, is now completely free and open-source. The developers at abus-aikorea have ceased active development to focus on a new project, releasing the tool's full code as a powerful, self-hosted alternative to paid SaaS platforms like ElevenLabs. The tool packages a suite of cutting-edge AI models into a single Gradio web interface, allowing creators to build a complete audio production pipeline on their own hardware. It acts as a central hub for taking a YouTube video, separating vocals from music, transcribing the speech, translating it into other languages, and then generating a new audio track using text-to-speech or a cloned voice. Previously, Voice-Pro operated on a subscription model for unlimited use. As of version 3.2, all features have been unlocked and the project is now freely available on GitHub, per the repository's README.
What Technologies Are Included?
Voice-Pro integrates several popular open-source AI models to create an all-in-one workflow, removing the need to run multiple separate tools. Its core functions are built on a foundation of specialized AI engines:Speech-to-Text: Uses various versions of OpenAI's Whisper, including Faster-Whisper and WhisperX for highly accurate transcription in over 100 languages.
Voice Cloning: Features zero-shot cloning with models like F5-TTS and CosyVoice, allowing it to replicate a voice from a short audio sample.
Text-to-Speech (TTS): Integrates Microsoft's Edge-TTS for over 400 voices, along with the high-quality `kokoro` model.
Audio Processing: Includes Demucs for separating vocals from background music and `yt-dlp` for downloading video and audio content directly from YouTube.
How Does It Compare to Paid Services?
By going fully open-source, Voice-Pro presents a direct challenge to the per-minute pricing models of many commercial AI voice platforms. For creators producing long-form content like podcasts or video essays, these costs can accumulate quickly. An analysis included in the project's documentation, based on pricing as of April 15, 2025, highlights the potential savings. Processing a single 60-minute video for subtitles, translation, and dubbing can cost anywhere from $23 to over $48 on popular SaaS platforms like Maestra, HappyScribe, or Descript. With Voice-Pro, the only costs are the initial hardware setup and electricity. This makes it a compelling option for podcasters, YouTubers, and developers who need advanced voice solutions without recurring subscription fees.The trade-off is the technical setup and maintenance. Users must install the environment themselves, which can take over an hour and requires at least 20GB of storage. The developers also note that with their focus shifting to their new WeConnect application, bug fixes and feature updates for Voice-Pro will now rely on the open-source community.








