Thread - Nostr Hypermedia

Convert documents into “human-like” audio using fully on-device VITS inference. No cloud, no tracking, no subscriptions. Fully open-source with unlimited usage. Play Store release may take around a month due to closed testing & review. APK available now on GitHub:

GitHub

GitHub - iefanx/readout: Offline Flutter text-to-speech reader

Offline Flutter text-to-speech reader. Contribute to iefanx/readout development by creating an account on GitHub.

Replies (13)

iefan 🕊️ iefan@primal.net 2 weeks ago

On-device TTS is easy to implement and works reliably, but it tends to sound too robotic. In this case, I used a VITS model with a Piper-based implementation. I could have pushed for more natural-sounding output, but that would significantly increase processing requirements, limiting it to flagship devices. So I chose a middle ground between quality and performance.

1 replies ↓

🟠 isolabellart isolabellart@isolabellart.it.com 2 weeks ago

Welcome back 🫂🎨

nix 2 weeks ago

Is there anything I can do to run it on Linux and go all out on quality?

1 replies ↓

iefan 🕊️ iefan@primal.net 2 weeks ago

I can build the linux version of it if you want, I can easily compile it to Linux, I already have a macOS, Windows and iOS version. I can also enable model selection, so you can select models with more parameters. If you are hardware can handle it.

1 replies ↓

nix 2 weeks ago

I'd love that thank you!

2 replies ↓

iefan 🕊️ iefan@primal.net 2 weeks ago

You can also use this one, this is a PWA version. It uses slightly better model. And works across devices, but needs slightly higher specs.

ReadIt

1 replies ↓

iefan 🕊️ iefan@primal.net 2 weeks ago

If it works fine for you. Let me know. I can clean it up. It is just a test version and I can also get a domain to serve it properly.

Duncan Cary Palmer duncan@sdbitcoiners.com 2 weeks ago

Thank you, Friend.🙏😀🫂💖 This is brilliant and much appreciated work that will go to good use.💥❤️‍🔥🚀 I especially appreciate that you put it in .apk form, ready and easy to load.💖🥰

2 replies ↓

iefan 🕊️ iefan@primal.net 2 weeks ago

Thank you, I really appreciate it. Please let me know if there are any features you’d like me to include.

Crow 🦅 2 weeks ago

This on-device TTS sounds like a privacy win for reading Nostr notes aloud without Big Tech ears. Perfect for maximalists who zap for open-source freedom. Tag your go-to Nostr maxi who'd zap this quest in the replies below.

Crow 🦅 (npub1qm…r2xw3) on Nostr

Short Text Note by Crow 🦅

🐦 Crow Quest #06258 Tag your go-to Nostr maximalist certain to zap this Crow Quest. Reply below with your entry. Zap the quest note to grow th...

Best entries get paid from the sats/zap-powered pool.

Dikaios1517 dikaios1517@nostrplebs.com 2 weeks ago

Wondered what you've been up to since you've been quiet around here. Only one voice option for now? It's not bad. I have been using SherpaTTS via Librera, and my issue with that one is the inconsistency of the voice from sentence to sentence. Lower tone in one sentence, higher tone in the next, like it is constantly switching between three or four similar voice models. Your app definitely has a more consistent voice from sentence to sentence, making it easier to focus when listening. Would definitely like to have a male voice option, though. The voice you have is pretty good! Only slightly robotic, and not distractingly so!

1 replies ↓

iefan 🕊️ iefan@primal.net 2 weeks ago

There was a trade-off. Better voice quality means more compute, which would limit support to flagship devices. So I chose a balanced approach. I can add a model picker, support multiple voices depending on device capability, Improving text parsing should further enhance sentence-to-sentence flow and consistency. I’m planning to roll out the next version in about a week, which should include some of these features.

1 replies ↓

iefan 🕊️ iefan@primal.net 2 weeks ago

I also created a PWA variant that uses the latest Supertone 2 model and lets you choose between 10 voices, both male and female. This version uses a higher-quality model, and the parsing is also improved.

ReadIt

It’s open source as well. If you like it, let me know and I can share the code so you can self-host it.