this post was submitted on 25 Feb 2026
1 points (100.0% liked)

Home Assistant

262 readers
1 users here now

Home Assistant is open source home automation that puts local control and privacy first. Powered by a worldwide community of tinkerers and DIY...

founded 2 years ago
MODERATORS
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/homeassistant by /u/Forsaken_Shopping481 on 2026-02-25 08:39:28+00:00.


Hey guys,

I wanted to share a small project I've been working on to solve a personal pain point: TinyTTS.

We all love our massive 70B+ LLMs, but when building local voice assistants, running a heavy TTS framework alongside them often eats up way too much precious VRAM and compute. I wanted something absurdly small and fast that "just works" locally.

TL;DR Specs:

  • Size: ~9 Million parameters
  • Disk footprint: ~20 MB checkpoint (G.pth)
  • Speed (CPU): ~0.45s to generate 3.7s of audio (~8x faster than real-time)
  • Speed (GPU - RTX 4060): ~0.056s (~67x faster than real-time)
  • Peak VRAM: ~126 MB
  • License: Apache 2.0 (Open Weights)

Why TinyTTS? It is designed specifically for edge devices, CPU-only setups, or situations where your GPU is entirely occupied by your LLM. It's fully self-contained, meaning you don't need to run a complex pipeline of multiple models just to get audio out.

How to use it? I made sure it’s completely plug-and-play with a simple Python API. Even better, on your first run, it will automatically download the tiny 20MB model from Hugging Face into your cache for you.

pip install git+https://github.com/tronghieuit/tiny-tts.git

Python API:

from tiny_tts import TinyTTS

# Auto-detects device (CPU/CUDA) and downloads the 20MB checkpoint

tts = TinyTTS()

tts.speak("The weather is nice today, and I feel very relaxed.", output_path="output.wav")

CLI:

tiny-tts --text "Local AI is the future" --device cpu

Links:

What's next? I plan to clean up and publish the training code soon so the community can fine-tune it easily. I am also looking into adding ultra-lightweight zero-shot voice cloning.

Would love to hear your feedback or see if anyone manages to run this on a literal potato! Let me know what you think.

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here