| Last week I built a local pipeline where a state machine + LLM watches my security cam and yells at Amazon drivers peeing on my house. The state machine is the magic: it flips the system from passive (just watching) to active (video/audio ingest + ~1s TTS out) only when a trigger hits. That keeps things deterministic and way more reliable than letting the LLM run solo. The LLM handles the fuzzy stuff (vision + reasoning) while the state machine handles control flow. Together it’s solid. It could just as easily be swapped to spot trespassing, log deliveries, or recognize gestures. TL;DR: gave my camera a brain and a mouth + a state machine to keep it focused. Repo in comments to see how it’s wired up. submitted by /u/Weary-Wing-6806 |
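For anyone curious what the passive/active gating might look like, here is a minimal Python sketch. It is not the code from the repo: `Mode`, `CameraStateMachine`, `describe_scene`, and `idle_limit` are all hypothetical names, the trigger is assumed to arrive as a plain boolean per frame, and the drop back to passive after a few quiet frames is an assumption the post does not describe. The point is only that the LLM stand-in gets called while the machine is active and never otherwise.

```python
from enum import Enum, auto
from typing import Callable, Optional


class Mode(Enum):
    PASSIVE = auto()  # just watching, no LLM calls
    ACTIVE = auto()   # ingesting frames and producing text for TTS


class CameraStateMachine:
    """Deterministic control flow; the LLM is only consulted while ACTIVE."""

    def __init__(self, describe_scene: Callable[[str], str], idle_limit: int = 3):
        self.mode = Mode.PASSIVE
        self.describe_scene = describe_scene  # hypothetical stand-in for the LLM
        self.idle_limit = idle_limit          # quiet frames before going passive (assumed)
        self.idle_frames = 0

    def step(self, frame: str, triggered: bool) -> Optional[str]:
        """Advance one frame; return text to speak, or None when staying quiet."""
        if self.mode is Mode.PASSIVE:
            if not triggered:
                return None               # stay passive, skip the LLM entirely
            self.mode = Mode.ACTIVE       # trigger hit: switch to active
            self.idle_frames = 0
        elif triggered:
            self.idle_frames = 0          # still triggered: reset the idle counter
        else:
            self.idle_frames += 1
            if self.idle_frames >= self.idle_limit:
                self.mode = Mode.PASSIVE  # quiet long enough: back to watching
                return None
        return self.describe_scene(frame)
```

Swapping the use case (trespassing, deliveries, gestures) would then mostly mean changing what sets `triggered` and what the `describe_scene` callable is prompted to do, while the transitions stay the same.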
submitted by /u/Jeffu
You don’t always need a heavy wrapper, a big client class, or dozens of lines…
The proliferation of Internet of Things (IoT) devices has transformed how we interact with our…
Customer service teams at fast-growing companies face a challenging reality: customer inquiries are growing exponentially,…
2025 was supposed to be the year of "AI agents," according to Nvidia CEO Jensen…
Another round of terminations, combined with previous layoffs and departures, has reduced the Centers for…