MM-Ego: Towards Building Egocentric Multimodal LLMs

This research aims to comprehensively explore building a multimodal foundation model for egocentric video understanding. To achieve this goal, we work on three fronts. First, as there is a lack of QA data for egocentric video understanding, we automatically generate 7M high-quality QA samples for egocentric videos ranging from 30 seconds to one hour long …

ML 17713 image 1

Reduce ML training costs with Amazon SageMaker HyperPod

Training a frontier model is highly compute-intensive, requiring a distributed system of hundreds, or thousands, of accelerated instances running for several weeks or months to complete a single job. For example, pre-training the Llama 3 70B model with 15 trillion training tokens took 6.5 million H100 GPU hours. On 256 Amazon EC2 P5 instances (p5.48xlarge, …

04 Graphs Introducing GKE Optimized Infmax 1000x1000 1

New GKE inference capabilities reduce costs, tail latency and increase throughput

When it comes to AI, inference is where today’s generative AI models can solve real-world business problems. Google Kubernetes Engine (GKE) is seeing increasing adoption of gen AI inference. For example, customers like HubX run inference of image-based models to serve over 250k images/day to power gen AI experiences, and Snap runs AI inference on …

Leveraging silicon photonics for scalable and sustainable AI hardware

The emergence of AI has profoundly transformed numerous industries. Driven by deep learning technology and Big Data, AI requires significant processing power for training its models. While the existing AI infrastructure relies on graphical processing units (GPUs), the substantial processing demands and energy expenses associated with its operation remain key challenges. Adopting a more efficient …

GujBJ0EXfo2b7LGBszI6JbiBPtQgbMgk7MifRS2MCI

2000s AnalogCore v3 – Flux LoRA update

Hey everyone! I’ve just rolled out V3 of my 2000s AnalogCore LoRA for Flux, and I’m excited to share the upgrades: https://civitai.com/models/1134895?modelVersionId=1640450 What’s New Expanded Footage References: The dataset now includes VHS, VHS-C, and Hi8 examples, offering a broader range of analog looks. Enhanced Timestamps: More authentic on-screen date/time stamps and overlays. Improved Face Variety: …