Mugen – Modernized Anime SDXL Base, or how to make Bluvoll tiny bit less sane

Your monthly “Anzhc’s Posts” issue have arrived. Today im introducing – Mugen – continuation of the Flux 2 VAE experiment on SDXL. We have renamed it to signify strong divergence from prior Noobai models, and to finally have a normal name, no more NoobAI-Flux2VAE-Rectified-Flow-v-0.3-oc-gaming-x. In this run in particular we have prioritized character knowledge, and …

Entropy-Preserving Reinforcement Learning

Policy gradient algorithms have driven many recent advancements in language model reasoning. An appealing property is their ability to learn from exploration on their own trajectories, a process crucial for fostering diverse and creative solutions. As we show in this paper, many policy gradient algorithms naturally reduce the entropy—and thus the diversity of explored trajectories—as …

ML 19682 image 1

How Ring scales global customer support with Amazon Bedrock Knowledge Bases

This post is cowritten with David Kim, and Premjit Singh from Ring. Scaling self-service support globally presents challenges beyond translation. In this post, we show you how Ring, Amazon’s home security subsidiary, built a production-ready, multi-locale Retrieval-Augmented Generation (RAG)-based support chatbot using Amazon Bedrock Knowledge Bases. By eliminating per-Region infrastructure deployments, Ring reduced the cost …

Robots with different bodies can now share skills: What intention-based learning changes

Robots are increasingly being used in manufacturing, agriculture and health care. But programming a team of robots to carry out individual tasks raises a question: How can robots learn from other robots if they are built differently? A multi-institutional team including Chongjie Zhang, an associate professor of computer science and engineering at WashU McKelvey Engineering, …