You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+13-2Lines changed: 13 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -24,7 +24,8 @@
24
24
25
25
## 📣 News
26
26
-**Fork of the [original, unmaintained repository](https://github.com/coqui-ai/TTS). New PyPI package: [coqui-tts](https://pypi.org/project/coqui-tts)**
27
-
- 0.25.0: [OpenVoice](https://github.com/myshell-ai/OpenVoice) models now available for voice conversion.
27
+
- 0.27.0: [Caching mechanism](https://coqui-tts.readthedocs.io/en/latest/cloning.html) for cloned voices.
28
+
- 0.25.2: [OpenVoice](https://github.com/myshell-ai/OpenVoice) and [kNN-VC](https://github.com/bshall/knn-vc) models now available for voice conversion.
28
29
- 0.24.2: Prebuilt wheels are now also published for macOS and Windows (in addition to Linux as before) for easier installation across platforms.
29
30
- 0.20.0: XTTSv2 is here with 17 languages and better performance across the board. XTTS can stream with <200ms latency.
30
31
- 0.19.0: XTTS fine-tuning code is out. Check the [example recipes](https://github.com/idiap/coqui-ai-TTS/tree/dev/recipes/ljspeech).
@@ -117,7 +118,9 @@ You can also help us implement more models.
117
118
## Installation
118
119
119
120
🐸TTS is tested on Ubuntu 24.04 with **python >= 3.10, < 3.14**, but should also
120
-
work on Mac and Windows.
121
+
work on Mac and Windows. Depending on your platform, you might first want to
122
+
separately install Pytorch, `torchaudio`, and `torchcodec` with their
If you are only interested in [synthesizing speech](https://coqui-tts.readthedocs.io/en/latest/inference.html) with the pretrained 🐸TTS models, installing from PyPI is the easiest option.
123
126
@@ -140,6 +143,7 @@ The following extras allow the installation of optional dependencies:
140
143
| Name | Description |
141
144
|------|-------------|
142
145
|`all`| All optional dependencies |
146
+
|`codec`| Installs torchcodec needed with Pytorch>=2.9 |
143
147
|`notebooks`| Dependencies only used in notebooks |
144
148
|`server`| Dependencies to run the TTS server |
145
149
|`bn`| Bangla G2P |
@@ -227,6 +231,10 @@ From version 0.27.0 you can [cache cloned
227
231
voices](https://coqui-tts.readthedocs.io/en/latest/cloning.html) with a custom
228
232
`speaker` ID, so you only need to pass audio files in `speaker_wav` once.
229
233
234
+
> [!NOTE]
235
+
> For more control or additional outputs, e.g. timestamps, use the lower-level
0 commit comments