Categories: Image

🧠πŸ’₯ My HomeLab GPU Cluster – 12Γ— RTX 5090, AI / K8s / Self-Hosted Everything

After months of planning, wiring, airflow tuning, and too many late nights this is my home lab GPU cluster finally up and running.

This setup is built mainly for:

β€’ AI / LLM inference & training β€’ Image & video generation pipelines β€’ Kubernetes + GPU scheduling β€’ Self-hosted APIs & experiments 

πŸ”§ Hardware Overview

β€’ Total GPUs: 12 Γ— RTX 5090 β€’ Layout: 6 machines Γ— 2 GPUs each β€’ Gpu Machine Memory: 128 GB per Machne β€’ Total VRAM: 1.5 TB+ β€’ CPU: 88 cores / 176 threads per server β€’ System RAM: 256 GB per machine 

πŸ–₯️ Infrastructure

β€’ Dedicated rack with managed switches β€’ Clean airflow-focused cases (no open mining frames) β€’ GPU nodes exposed via Kubernetes β€’ Separate workstation + monitoring setup β€’ Everything self-hosted (no cloud dependency) 

🌑️ Cooling & Power

β€’ Tuned fan curves + optimized case airflow β€’ Stable thermals even under sustained load β€’ Power isolation per node (learned this the hard way πŸ˜…) 

πŸš€ What I’m Running

β€’ Kubernetes with GPU-aware scheduling β€’ Multiple AI workloads (LLMs, diffusion, video) β€’ Custom API layer for routing GPU jobs β€’ NAS-backed storage + backups 

This is 100% a learning + building lab, not a mining rig.

submitted by /u/Murky-Classroom810
[link] [comments]

AI Generated Robotic Content

Share
Published by
AI Generated Robotic Content
Tags: ai images

Recent Posts

Embed the world: Multimodal AI for searchable aerial imagery at scale

Turning a library of aerial imagery into a natural-language-searchable knowledge base is a problem that…

38 mins ago

Introducing Web Search on Amazon Bedrock AgentCore

AI agents are changing how organizations find and act on information, but they share one…

3 days ago

The Most Promising Ebola Vaccine Has Been Sitting on the Shelf for 15 Years

Years after initial tests, researchers are now racing to see if a vaccine developed in…

3 days ago

The Roadmap to Mastering AI Agent Evaluation

Let's not waste any more time.

3 days ago

SpaceX wants to build AI data centers in space. Will it work?

The race to build data centers in space is gaining momentum as AI drives unprecedented…

3 days ago

Monitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch

Monitoring and troubleshooting generative AI inference endpoints operating at scale is challenging. When your large…

4 days ago