Made a local AI pipeline that yells at drivers peeing on my house

Last week I built a local pipeline where a state machine + LLM watches my security cam and yells at Amazon drivers peeing on my house.

The state machine is the magic: it flips the system from passive (just watching) to active (video/audio ingest plus TTS output in about 1s) only when a trigger hits. That keeps things deterministic and way more reliable than letting the LLM run solo.
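For anyone wondering what the passive/active flip can look like, here is a minimal sketch. It is not the code from the repo: the `Watcher` class, the `step` method, and the idle timeout that drops back to passive are all assumptions made for illustration, since the post only says the system goes active when a trigger hits.

```python
from enum import Enum, auto


class Mode(Enum):
    PASSIVE = auto()  # just watching, no heavy ingest or TTS
    ACTIVE = auto()   # ingesting video/audio and allowed to speak


class Watcher:
    """Hypothetical two-state controller; names and timeout are illustrative."""

    def __init__(self, idle_timeout_s: float = 10.0):
        self.mode = Mode.PASSIVE
        self.idle_timeout_s = idle_timeout_s  # assumed way back to PASSIVE
        self._last_trigger_at = None

    def step(self, triggered: bool, now: float) -> Mode:
        """Advance one tick, given whether the trigger fired at time `now`."""
        if triggered:
            self._last_trigger_at = now
            self.mode = Mode.ACTIVE
        elif (
            self.mode is Mode.ACTIVE
            and now - self._last_trigger_at >= self.idle_timeout_s
        ):
            # No trigger for a while: drop back to passive watching.
            self.mode = Mode.PASSIVE
        return self.mode
```

The point of doing it this way is that the only thing that can change the mode is the trigger signal and the clock, so the same inputs always give the same mode no matter what the model says.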

The LLM handles the fuzzy stuff (vision + reasoning) while the state machine handles control flow. Together it’s solid. The same setup could just as easily be repurposed to spot trespassing, log deliveries, or recognize gestures.
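A rough sketch of that split, assuming the model is asked a narrow yes/no question and plain code decides what happens next. Everything here is hypothetical: the `TASKS` table, the prompts, the spoken lines, and the `ask_model` callable are stand-ins for whatever vision LLM call the real pipeline uses.

```python
from typing import Callable, Dict, Optional

# Hypothetical task table: each task pairs a prompt for the vision model
# with the answers that should make the system speak, and what to say.
TASKS: Dict[str, dict] = {
    "urination": {
        "prompt": "Is anyone urinating on the house? Answer yes or no.",
        "responses": {"yes": "Hey! You are on camera. Please stop."},
    },
    "delivery": {
        "prompt": "Was a package just dropped off? Answer yes or no.",
        "responses": {"yes": "Thanks for the delivery!"},
    },
}


def decide(
    task: str,
    frame: bytes,
    ask_model: Callable[[str, bytes], str],
) -> Optional[str]:
    """Return the line to speak, or None if nothing should be said.

    ask_model is a stand-in for the real LLM call (prompt, frame) -> text.
    """
    spec = TASKS[task]
    answer = ask_model(spec["prompt"], frame).strip().lower()
    # Control flow stays deterministic: only known answers map to speech.
    return spec["responses"].get(answer)
```

Swapping the use case then means adding a new entry to the table rather than touching the control flow, and any answer the table does not recognize simply results in silence.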

TL;DR: gave my camera a brain and a mouth, plus a state machine to keep it focused. Repo in comments to see how it’s wired up.

submitted by /u/Weary-Wing-6806

Published by AI Generated Robotic Content