CosyVoice3

CosyVoice3 is a multilingual speech generation tool that supports multiple languages and dialects, including Mandarin, English, and Japanese, and offers voice cloning and cross-lingual synthesis.

Main Interface

[Screenshot: cosyvoice_home]

Launch Methods

  • Launch via URL
    http://127.0.0.1:8000/launcher?project=cosyvoice
  • Launch via Command
    uv run cli.py install -n cosyvoice -p 8013 --start
  • Native Launch
    uv run webui.py --port 8013
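
The launcher URL above can also be assembled in a script, for example when several projects are opened from one place. The following is a minimal sketch, assuming the launcher listens on 127.0.0.1:8000 as shown above and reads the project name from the `project` query parameter. The helper name `build_launcher_url` is hypothetical and not part of the tool.

```python
from urllib.parse import urlencode, urlunsplit


def build_launcher_url(project: str, host: str = "127.0.0.1", port: int = 8000) -> str:
    """Build the launcher URL for a project (hypothetical helper).

    Host, port, and the `/launcher?project=...` shape are taken from the
    URL shown in this document; adjust them if your setup differs.
    """
    query = urlencode({"project": project})  # escapes unusual characters
    return urlunsplit(("http", f"{host}:{port}", "/launcher", query, ""))


if __name__ == "__main__":
    print(build_launcher_url("cosyvoice"))
```

The resulting string can be pasted into a browser or passed to `webbrowser.open` from the standard library.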

Output Logs

Python 3.10.19
Using extensions/cosyvoice/.venv/scripts/python.exe
Checked 159 packages in 3ms
All installed packages are compatible
Audited 40 packages in 15ms
git 2.51.0
Using D:/Develop Files/Git/cmd/git.EXE

 _   .-')                _ .-') _     ('-.             .-')                              _ (`-.    ('-.
( '.( OO )_             ( (  OO) )  _(  OO)           ( OO ).                           ( (OO  ) _(  OO)
 ,--.   ,--.).-'),-----. \     .'_ (,------.,--.     (_)---\_)   .-----.  .-'),-----.  _.`     \(,------.
 |   `.'   |( OO'  .-.  ',`'--..._) |  .---'|  |.-') /    _ |   '  .--./ ( OO'  .-.  '(__...--'' |  .---'
 |         |/   |  | |  ||  |  \  ' |  |    |  | OO )\  :` `.   |  |('-. /   |  | |  | |  /  | | |  |
 |  |'.'|  |\_) |  |\|  ||  |   ' |(|  '--. |  |`-' | '..`''.) /_) |OO  )\_) |  |\|  | |  |_.' |(|  '--.
 |  |   |  |  \ |  | |  ||  |   / : |  .--'(|  '---.'.-._)   \ ||  |`-'|   \ |  | |  | |  .___.' |  .--'
 |  |   |  |   `'  '-'  '|  '--'  / |  `---.|      | \       /(_'  '--'\    `'  '-'  ' |  |      |  `---.
 `--'   `--'     `-----' `-------'  `------'`------'  `-----'    `-----'      `-----'  `--'      `------'

Downloading Model from https://www.modelscope.cn to directory:
D:\Program Files\CreatorBox\creatorbox\extensions\cosyvoice\pretrained_models\CosyVoice3-0.5B

Successfully downloaded model: FunAudioLLM/Fun-CosyVoice3-0.5B-2512.

extensions/cosyvoice/.venv/scripts/python.exe webui3.py --port 8013
2025-12-23 23:58:44.483 | INFO  cbinstaller.py - ✅ CosyVoice started
failed to import ttsfrd, using wetext instead
* Running on local URL:  http://0.0.0.0:8013

Configuration Parameters

None

Notes

  • By default, webui3.py is used. For the native version, use webui.py.

  • In the dubbing module, you can enter a local or remote address to enable batch speech synthesis, for example:

    • Local access: http://127.0.0.1:8013
    • Remote access: https://xxx.gradio.live, https://xxx.ngrok-free.app, or https://xxx.loca.lt
    • Your own public IP or domain name
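
Before entering an address in the dubbing module, it can help to confirm that something answers at that address. The following is a minimal sketch using only the Python standard library. The function names are hypothetical, and it assumes that any HTTP response from the base URL means the service is up; it does not call any CosyVoice-specific API.

```python
import urllib.error
import urllib.request
from urllib.parse import urlsplit


def normalize_base_url(address: str) -> str:
    """Validate an http(s) address and strip any trailing slash."""
    address = address.strip()
    parts = urlsplit(address)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"expected an http(s) URL, got: {address!r}")
    return address.rstrip("/")


def is_reachable(address: str, timeout: float = 3.0) -> bool:
    """Return True if the server at `address` sends back any HTTP response."""
    url = normalize_base_url(address)
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a 2xx status.
        return True
    except OSError:
        # Connection refused, DNS failure, timeout, and similar.
        return False


if __name__ == "__main__":
    # Local address from the example above; replace with your own.
    print(is_reachable("http://127.0.0.1:8013"))
```

Note that the log shows the service bound to 0.0.0.0, which means it listens on all interfaces; clients on the same machine still connect through 127.0.0.1.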