-
You'd need to share the error message...
-
I tried to start the server with the voice and got the following error telling me to use a checkpoint directory, but that is not an option in server.py...
-
When I do that, the server actually shows a bunch of voices to select from, and they all work great. But none of them is the one I expected. When I launch the server by itself, I get no voice selection. I then tried to specify the text in the URL
-
Hot diggity dog. That worked. That was a MASSIVE docker pull.
I then put /models/morgan/reference.wav in the box and the server reported...
and it spoke. The voice was sort of a sped-up version of Morgan Freeman. Not sure how to alter it, but I am off to the races. I tried a C3PO one that wasn't very good either. I'll try some more.
-
Hot diggity dog. That worked. That was a MASSIVE docker pull.
***@***.***:~# ls /models/morgan/
config.json model.pth reference.wav speakers_xtts.pth vocab.json
***@***.***:~#
python TTS/server/server.py --model_path /models/morgan --config_path /models/morgan/config.json
I then put /models/morgan/reference.wav in the box and the server reported...
/api/tts?text=There%20is%20a%20car%20coming%20up%20the%20driveway.&speaker_id=&style_wav=&speaker_wav=/models/morgan/reference.wav&language_id=en
and it spoke. The voice was sort of a sped-up version of Morgan Freeman. I am using ebook2audiobook with the same voice and it sounds much more realistic, but it also takes 30 seconds to process.
Thanks!
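For anyone scripting announcements against the same endpoint, here is a minimal Python sketch that builds the request URL the server logged above and saves the returned audio. The base URL (http://localhost:5002), the helper names build_tts_url and fetch_speech, and the output file name are my own assumptions, not anything taken from server.py, so adjust them for your setup.

```python
from urllib.parse import quote, urlencode
from urllib.request import urlopen


def build_tts_url(text, speaker_wav, language_id="en",
                  base_url="http://localhost:5002"):
    """Build a /api/tts URL with the same query fields the server logged.

    base_url is an assumption; point it at wherever your server listens.
    """
    params = {
        "text": text,
        "speaker_id": "",
        "style_wav": "",
        "speaker_wav": speaker_wav,
        "language_id": language_id,
    }
    # quote_via=quote encodes spaces as %20; safe="/" leaves the path readable.
    query = urlencode(params, quote_via=quote, safe="/")
    return f"{base_url}/api/tts?{query}"


def fetch_speech(url, out_path="announcement.wav"):
    """Request the audio and write the response body to out_path.

    Needs a running server, so it is defined here but not called.
    """
    with urlopen(url) as response, open(out_path, "wb") as out_file:
        out_file.write(response.read())
    return out_path


url = build_tts_url("There is a car coming up the driveway.",
                    "/models/morgan/reference.wav")
print(url)
```

Calling fetch_speech(url) with the server running should write the spoken announcement to announcement.wav, assuming the endpoint returns the audio as the response body.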
…On Tue, Mar 11, 2025 at 10:47 AM Enno Hermann ***@***.***> wrote:
You need to pass some reference audio from your target speaker. This is
actually possible in the demo server only since coqui-tts version 0.26.0
that I just released yesterday, so you should update to that version, then
you should see a field in the form to specify it.
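A quick way to confirm the installed version is new enough is sketched below. It assumes the package is installed under the distribution name coqui-tts; the helper names parse_version and supports_reference_audio are hypothetical, and the comparison only looks at the leading numeric parts of the version string.

```python
from importlib.metadata import PackageNotFoundError, version


def parse_version(text):
    """Turn '0.26.0' into (0, 26, 0), reading only leading digits of each part."""
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits) if digits else 0)
    # Pad short versions such as '1.0' so tuple comparison behaves as expected.
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def supports_reference_audio(installed, minimum="0.26.0"):
    """True if the installed version is at least the release mentioned above."""
    return parse_version(installed) >= parse_version(minimum)


try:
    installed = version("coqui-tts")  # distribution name is an assumption
    print(installed, supports_reference_audio(installed))
except PackageNotFoundError:
    print("coqui-tts is not installed in this environment")
```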
-
I use Coqui in my announcement setup (e.g. cars coming up the driveway and the like).
I get some models from Hugging Face, serve them with the Coqui TTS server, and then use the /api/tts endpoint to read my announcements throughout the house.
I have downloaded several models... I have a David Attenborough one working now, along with this guy's Trump. Works great.
https://huggingface.co/enlyth/baj-tts/tree/main/models
I am using the CPU Docker version right now. It works fine, and my announcements are not very long, so processing time is not an issue.
To use my current voice, David Attenborough, I downloaded the model and have my Docker entry point be...
Lots of other pretrained voices are on Hugging Face and should be compatible, but I cannot seem to get the server to use them.
I downloaded this voice to my models directory (under /morgan) and have tried nearly every combination of the server command.
This is the directory.
https://huggingface.co/drewThomasson/fineTunedTTSModels/tree/main/xtts-v2/eng/MorganFreeman
This guy developed ebook2audiobook, and I can use that voice in Docker there. I believe he uses Coqui, so this should work; I imagine the error is on my end.