From accessibility upgrades to a custom cat-food bowl, this mobile 3D printer can autonomously add features to a room

Researchers created MobiPrint, a mobile 3D printer that can automatically measure a room and print objects onto the floor. The team’s graphic interface lets users design objects in a space that the robot has mapped out. The prototype, which the team built on a modified consumer vacuum robot, can add a range of objects to …

OpenAI unveils sCM, a new model that generates video media 50 times faster than current diffusion models

Two experts with the OpenAI team have developed a new kind of continuous-time consistency model (sCM) that they claim can generate video media 50 times faster than models currently in use. Cheng Lu and Yang Song have published a paper describing their new model on the arXiv preprint server. They have also posted an introductory …

CtrlSynth: Controllable Image-Text Synthesis for Data-Efficient Multimodal Learning

Pretraining robust vision or multimodal foundation models (e.g., CLIP) relies on large-scale datasets that may be noisy, potentially misaligned, and have long-tail distributions. Previous works have shown promising results in augmenting datasets by generating synthetic samples. However, they only support domain-specific ad hoc use cases (e.g., either image or text only, but not both), and …

ML 17337 image001

Unlocking generative AI for enterprises: How SnapLogic powers their low-code Agent Creator using Amazon Bedrock

This post is cowritten with Greg Benson, Aaron Kesler and David Dellsperger from SnapLogic. The landscape of enterprise application development is undergoing a seismic shift with the advent of generative AI. SnapLogic, a leader in generative integration and automation, has introduced the industry’s first low-code generative AI development platform, Agent Creator, designed to democratize AI …

1 Choosing the right metric GPU Utilizat.max 1000x1000 1

Save on GPUs: Smarter autoscaling for your GKE inferencing workloads

While LLM models deliver immense value for an increasing number of use cases, running LLM inference workloads can be costly. If you’re taking advantage of the latest open models and infrastructure, autoscaling can help you optimize your costs — ensuring you’re meeting customer demand while only paying for the AI accelerators you need. As a …

Listening skills bring human-like touch to robots

Researchers give robots a sense of touch by ‘listening’ to vibrations, allowing them to identify materials, understand shapes and recognize objects just like human hands. The ability to interpret the world through acoustic vibrations emanating from an object — like shaking a cup to see how much soda is left or tapping on a desk …