Open Source Alternatives
Copyright © 2026 All Rights Reserved.

VoxCPM

VoxCPM is an open-source, tokenizer-free multilingual TTS model supporting voice cloning, voice design, and 48kHz studio-quality output across 30 languages.

18.7K stars · Python · Apache-2.0 · Active this month
Contents
  1. Who VoxCPM is for
  2. The problem it solves
  3. How it solves it
  4. Strengths and trade-offs
  5. Tech stack
  6. FAQ
  7. Similar open-source tools

Repository

Stars: 18.7K
Forks: 2.2K
License: Apache-2.0
Latest: 2.0.3
Last commit: 24 days ago
Last verified: May 13, 2026
Repo: OpenBMB/VoxCPM

TL;DR

    VoxCPM is an Apache-2.0 multilingual text-to-speech model for teams building voice generation, voice cloning, and speech design workflows. It replaces per-character cloud TTS APIs when AI teams need to run speech generation on their own hardware and tune the pipeline.

    Apache-2.0 · Python · 18.7K stars · Active this month

    Who VoxCPM is for

    AI teams building multilingual voice products

    Use VoxCPM when product quality depends on controlling voice generation across languages and styles.

    Skip if: you need a hosted API with support, billing, and uptime guarantees.

    Researchers testing TTS architectures

    The open model and code are useful for studying multilingual and controllable speech generation.

    Skip if: your needs are simple narration clips and you do not have GPU access.

    The problem it solves

    How it solves it

    Multilingual speech generation

    Targets multilingual TTS use cases where developers need one project for multiple languages and voice styles.

    Voice cloning and voice design

    Supports workflows around reference voices and controllable voice generation for custom voice applications.

    Self-hosted model path

    The Apache-2.0 project lets AI teams run and adapt the model locally instead of paying per character to a hosted API.
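    The trade between per-character API billing and running the model yourself comes down to simple arithmetic. The sketch below is illustrative only: the function names, the flat monthly self-hosting budget, and every price in it are hypothetical assumptions, not figures from VoxCPM or from any TTS provider.

```python
import math


def api_cost_usd(chars: int, usd_per_million_chars: float) -> float:
    """Cost of synthesizing `chars` characters on a per-character hosted API."""
    return chars * usd_per_million_chars / 1_000_000


def break_even_chars(self_hosted_monthly_usd: float,
                     usd_per_million_chars: float) -> int:
    """Smallest monthly character volume at which the hosted API costs at
    least as much as a fixed monthly self-hosting budget (GPU plus ops)."""
    if usd_per_million_chars <= 0:
        raise ValueError("usd_per_million_chars must be positive")
    return math.ceil(self_hosted_monthly_usd * 1_000_000 / usd_per_million_chars)


if __name__ == "__main__":
    # Hypothetical inputs for illustration; substitute your own quotes.
    api_rate = 20.0          # USD per million characters (assumed)
    self_hosted = 500.0      # USD per month for GPU and operations (assumed)
    monthly_volume = 50_000_000

    print("API cost at this volume:", api_cost_usd(monthly_volume, api_rate))
    print("Break-even volume:", break_even_chars(self_hosted, api_rate))
```

    Below the break-even volume a hosted API is likely cheaper; above it, self-hosting starts to pay off, provided the team can absorb the operations work listed under the trade-offs.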

    Strengths and trade-offs

    Strengths

    • Useful for privacy-sensitive voice work: Running the model yourself keeps scripts, reference audio, and generated speech inside your environment.
    • Permissive license for builders: Apache-2.0 licensing supports commercial experimentation and integration with fewer restrictions than many model releases.

    Trade-offs

    • GPU and ML operations required: VoxCPM is a model project, not a polished SaaS recorder. Teams need hardware plus skills in serving, audio evaluation, and model operations.

    What it's built on (detected from GitHub)

    Languages: Python

    FAQ

    Is VoxCPM open source?

    Yes. VoxCPM is open source under the Apache-2.0 license.

    What is VoxCPM used for?

    VoxCPM is used for multilingual text-to-speech, voice cloning, and controllable voice generation.

    How does VoxCPM compare to ElevenLabs?

    Similar open-source tools

    • Voicebox: Open source voice synthesis studio for generating audio (28.8K · TypeScript · MIT)
    • Handle: Edit UI visually in the browser and sync changes to code (34 · TypeScript · MIT)
    • OpenFlowKit: Local-first AI diagramming tool for developers and builders (464 · TypeScript · MIT)
    • orca: The ultimate IDE for coding agents (3.3K · TypeScript · MIT)
    • CLI-Anything: Empower AI agents with agent-native CLIs (41.7K · Python · Apache-2.0)
    • oh-my-pi: A coding agent with the IDE wired in (7.2K · TypeScript · MIT)

    Additional details

    Language: Python
    Open issues: 95
    Contributors: 26
    First release: 2025

    Categories: AI & Machine Learning · Design & Creative · Developer Tools

    Tags: AI SDK · LLM · Developer Tools · AI Agents · Prompt Engineering · Coding

    The problem VoxCPM targets: cloud text-to-speech APIs are convenient, but high-volume voice generation can become expensive and run into provider limits. Teams building voice products also face privacy questions when scripts, prompts, reference voices, or generated audio pass through a third-party API.

    Multilingual voice work adds another problem: quality and style controls vary by language, and many tools lock developers into a fixed hosted model with little room for research or product-specific tuning.

    Compared to ElevenLabs: ElevenLabs is a hosted voice generation service, while VoxCPM is an open-source model for teams that can run and operate speech generation themselves.