Local Mechanisms of Compositional Generalization in Conditional Diffusion

Conditional diffusion models appear capable of compositional generalization, i.e., generating convincing samples for out-of-distribution combinations of conditioners, but the mechanisms underlying this ability remain unclear. To make this concrete, we study length generalization, the ability to generate images with more objects than seen during training. In a controlled CLEVR setting (Johnson et al., 2017), we …

ml 19194 arch diag 1024x570 1

Use Amazon SageMaker HyperPod and Anyscale for next-generation distributed computing

This post was written with Dominic Catalano from Anyscale. Organizations building and deploying large-scale AI models often face critical infrastructure challenges that can directly impact their bottom line: unstable training clusters that fail mid-job, inefficient resource utilization driving up costs, and complex distributed computing frameworks requiring specialized expertise. These factors can lead to unused GPU …

maxresdefault 1

Introducing Gemini Enterprise

AI is presenting a once-in-a-generation opportunity to transform how you work, how you run your business, and what you build for your customers. But the first wave of AI, while promising, has been stuck in silos, unable to orchestrate complex work across an entire organization. True transformation requires a comprehensive platform that connects to your …

Echelon’s AI agents take aim at Accenture and Deloitte consulting models

Echelon, an artificial intelligence startup that automates enterprise software implementations, emerged from stealth mode today with $4.75 million in seed funding led by Bain Capital Ventures, targeting a fundamental shift in how companies deploy and maintain critical business systems. The San Francisco-based company has developed AI agents specifically trained to handle end-to-end ServiceNow implementations — …

Are you tired of waiting for image/video generations? Now you can play Snake directly in ComfyUI while you wait!

Added to my custom nodes, just install from ComfyUI Manager (search “CrasH Utils”) and add the Snake Game node. When focused on the node you can use the arrow keys on your keyboard to control it. https://github.com/chrish-slingshot/CrasHUtils I have no idea what possessed me to do this but I’m so glad I did. submitted by …

Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers

Video Joint Embedding Predictive Architectures (V-JEPA) learn generalizable off-the-shelf video representation by predicting masked regions in latent space with an exponential moving average (EMA)-updated teacher. While EMA prevents representation collapse, it complicates scalable model selection and couples teacher and student architectures. We revisit masked-latent prediction and show that a frozen teacher suffices. Concretely, we (i) …

vxceed cpg arch 1 1024x503 1

Vxceed builds the perfect sales pitch for sales teams at scale using Amazon Bedrock

This post was co-written with Cyril Ovely from Vxceed. Consumer packaged goods (CPG) companies face a critical challenge in emerging economies: how to effectively retain revenue and grow customer loyalty at scale. Although these companies invest 15–20% of their revenue in trade promotions and retailer loyalty programs, the uptake of these programs has historically remained …