| | Last week I built a local pipeline where a state machine + LLM watches my security cam and yells at Amazon drivers peeing on my house. State machine is the magic: it flips the system from passive (just watching) to active (video/audio ingest + ~1s TTS out) only when a trigger hits. Keeps things deterministic and way more reliable than letting the LLM run solo. LLM handles the fuzzy stuff (vision + reasoning) while the state machine handles control flow. Together it’s solid. Could just as easily be swapped to spot trespassing, log deliveries, or recognize gestures. TL;DR: gave my camera a brain and a mouth + a state machines to keep it focused. Repo in comments to see how it’s wired up. submitted by /u/Weary-Wing-6806 |
I tried all the top models to find the best 3-in-1 Apple charging stations, pads,…
New studies suggest consciousness can't be judged solely by behavior, whether it's a chatbot discussing…
Introducing Comfy Desktop - official Comfy app for every ComfyUI. Same name, new app; and…
You've probably shipped this bug before, where a user types " affordable laptop " into…
We logged thousands of test miles to bring you the best running shoes for every…
Artificial intelligence (AI)-generated images have become increasingly more sophisticated than early ones that showed humans…