Seed Voice Conversion V2

Zero-shot voice and singing-voice conversion with in-context learning; no training is required. For local deployment, see the [GitHub repository](https://github.com/Plachtaa/seed-vc) for details and updates.
Reference audio longer than 25 s is automatically clipped to 25 s.
If the combined duration of the source and reference audio exceeds 30 s, the source audio is processed in chunks.
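
The two length limits above can be illustrated with a small planning function. This is only a sketch, not the demo's actual code: the name `plan_conversion` is hypothetical, and it assumes that the 30 s total is measured after the reference is clipped, that each source chunk fills whatever the reference leaves of the 30 s budget, and that chunks do not overlap. The real implementation may choose chunk sizes differently or blend neighbouring chunks.

```python
REFERENCE_LIMIT_S = 25.0  # reference clip length stated on this page
CONTEXT_LIMIT_S = 30.0    # combined source + reference limit stated on this page


def plan_conversion(source_s: float, reference_s: float) -> tuple[float, list[tuple[float, float]]]:
    """Return the usable reference length and the (start, end) source chunks, in seconds."""
    if source_s <= 0 or reference_s <= 0:
        raise ValueError("durations must be positive")

    # Reference audio beyond 25 s is discarded.
    reference_used = min(reference_s, REFERENCE_LIMIT_S)

    # Within the 30 s budget: the source is converted in a single pass.
    if source_s + reference_used <= CONTEXT_LIMIT_S:
        return reference_used, [(0.0, source_s)]

    # Assumption: each chunk uses the part of the budget the reference leaves free.
    # Because the reference is at most 25 s, a chunk is always at least 5 s long.
    chunk_s = CONTEXT_LIMIT_S - reference_used
    chunks = []
    start = 0.0
    while start < source_s:
        end = min(start + chunk_s, source_s)
        chunks.append((start, end))
        start = end
    return reference_used, chunks
```

Under these assumptions, a 40 s source with a 10 s reference would be split into two 20 s chunks, while a 10 s source with a 12 s reference would be converted in one pass.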

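Parameter ranges (minimum to maximum):
Diffusion Steps: 1 to 200
Length Adjust: 0.5 to 2
Intelligibility CFG Rate: 0 to 1
Similarity CFG Rate: 0 to 1
Top-p: 0.1 to 1
Temperature: 0.1 to 2
Repetition Penalty: 1 to 3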
Examples
Source Audio | Reference Audio | Diffusion Steps | Length Adjust | Intelligibility CFG Rate | Similarity CFG Rate | Top-p | Temperature | Repetition Penalty | Convert Style | Anonymization Only