Hey all,
Almost as impressive as all the LLMs these days is the voice that ChatGPT uses with its emphasis and dramatic pauses and umms, etc.
I would love to integrate that with a self-hosted Llama3 engine.
Is there a project that y’all have heard of?
This is what OP looks for. It exists! Other repos only cover partially (e.g. either ollama or tts)
You mean just the text to speech part? Look into Piper
i use these two all the time for tts:
https://github.com/JarodMica/ai-voice-cloning / https://github.com/gitmylo/audio-webui
epub2tts: https://github.com/aedocw/epub2tts
Looks like a project that utilizes coqui-AI: https://github.com/coqui-ai/TTS
Oh WOW! Thanks to all who commented. Next time I get a chance I’m going to check these all out! 👍🏻 I hope others find this thread helpful too!
New Lemmy Post: ChatGPT’s voice, self-hosted? (https://lemmyverse.link/lemmy.world/post/15336896)
Tagging: #SelfHosted(Replying in the OP of this thread (NOT THIS BOT!) will appear as a comment in the lemmy discussion.)
I am a FOSS bot. Check my README: https://github.com/db0/lemmy-tagginator/blob/main/README.md