
Flux.2-Klein pipeline for real-time webcam stream processing at 30 FPS

I have built a pipeline based on the Flux.2-Klein-4B model that processes a video stream with low latency (about 0.2 seconds) on a single RTX 5090 GPU.
It is free and open source, and you can try it locally:
https://github.com/tensorforger/FluxRT

Under the hood, it uses a custom spatial-aware KV-cache, so it only recomputes a small number of image tokens per frame, specifically where something is moving or changing.
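
To illustrate the idea of recomputing only the tokens where something changed, here is a minimal sketch of one way to find changed regions between two frames. It is not the FluxRT implementation: the function name `changed_patch_mask`, the 16-pixel patch size, and the difference threshold are all assumptions chosen for illustration, and the actual cache logic in the repo may work quite differently.

```python
import numpy as np

def changed_patch_mask(prev, curr, patch=16, threshold=8.0):
    """Return a boolean grid marking patches that differ between two frames.

    Hypothetical helper: a patch counts as "changed" when its mean absolute
    pixel difference exceeds `threshold` (on a 0-255 scale).
    """
    h, w = curr.shape[:2]
    gh, gw = h // patch, w // patch  # patch grid size

    # Per-pixel absolute difference, computed in float to avoid uint8 wraparound.
    diff = np.abs(curr.astype(np.float32) - prev.astype(np.float32))
    if diff.ndim == 3:
        diff = diff.mean(axis=2)  # average over color channels

    # Crop to a whole number of patches, then average within each patch.
    diff = diff[: gh * patch, : gw * patch]
    per_patch = diff.reshape(gh, patch, gw, patch).mean(axis=(1, 3))
    return per_patch > threshold

# Two 64x64 frames that differ in a single 16x16 block.
prev = np.zeros((64, 64, 3), dtype=np.uint8)
curr = prev.copy()
curr[0:16, 16:32] = 255

mask = changed_patch_mask(prev, curr)
# Flat indices of the patches (image tokens) that would need recomputing.
token_ids = np.flatnonzero(mask)
```

In this toy example only one of the 16 patches is flagged, so a cache keyed by patch position could reuse the stored values for the other 15.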
It also uses frame interpolation with the RIFE model, which can multiply FPS by a factor of 2, 4, 8, etc. I have found that a factor of 4 works best for my setup.
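
As a rough sketch of what an interpolation factor means, the snippet below turns each pair of generated frames into `factor` output frames. It uses a plain linear blend as a stand-in, which is not what RIFE does (RIFE is a learned model that estimates motion between frames); the function name `interpolate_pair` is hypothetical and only the frame-count arithmetic carries over.

```python
import numpy as np

def interpolate_pair(frame_a, frame_b, factor=4):
    """Return frame_a followed by factor-1 in-between frames.

    frame_b is excluded because it starts the next pair. The linear blend
    here is only a placeholder for a real interpolation model.
    """
    a = frame_a.astype(np.float32)
    b = frame_b.astype(np.float32)
    out = []
    for i in range(factor):
        t = i / factor  # position between the two frames: 0, 1/factor, ...
        blended = (1.0 - t) * a + t * b
        out.append(blended.round().astype(np.uint8))
    return out

a = np.zeros((2, 2, 3), dtype=np.uint8)
b = np.full((2, 2, 3), 200, dtype=np.uint8)
frames = interpolate_pair(a, b, factor=4)
```

With a factor of 4, every generated frame yields four output frames, which is how the interpolation step multiplies the frame rate.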

Depending on scene dynamics, the output stream achieves up to 50 FPS in mostly static scenes and around 20 FPS when the entire input image is changing rapidly. Benchmark results are in the repo.

There is also a Gradio demo, several minimal cv2 examples, and a simple paint-style app with real-time canvas updates.

submitted by /u/TensorForger

Published by
AI Generated Robotic Content
Tags: ai images
