Now it’s TikTok parent ByteDance’s turn for a reasoning AI: enter Seed-Thinking-v1.5!
It achieved an 8.0% higher win rate over DeepSeek R1, suggesting that its strengths generalize beyond just logic or math-heavy challenges.
For the past three days, DOGE and a handful of Palantir representatives, along with dozens of career IRS engineers, have been collaborating to build a “mega API,” WIRED has learned.
It’s a game a lot of us played as children (and maybe even later in life): unspooling measuring tape to see how far it would extend before bending. But to engineers, this game was an inspiration, suggesting that measuring tape could become a great material for a robotic gripper. The grippers would be a …
Read more “A new robotic gripper made of measuring tape is sizing up fruit and veggie picking”
Anyone notice that this bill has been reintroduced? submitted by /u/Rough-Copy-5611
Be sure to check out the previous articles in this series.
This research aims to comprehensively explore building a multimodal foundation model for egocentric video understanding. To achieve this goal, we work on three fronts. First, as there is a lack of QA data for egocentric video understanding, we automatically generate 7M high-quality QA samples for egocentric videos ranging from 30 seconds to one hour long …
Read more “MM-Ego: Towards Building Egocentric Multimodal LLMs”
Training a frontier model is highly compute-intensive, requiring a distributed system of hundreds, or thousands, of accelerated instances running for several weeks or months to complete a single job. For example, pre-training the Llama 3 70B model with 15 trillion training tokens took 6.5 million H100 GPU hours. On 256 Amazon EC2 P5 instances (p5.48xlarge, …
Read more “Reduce ML training costs with Amazon SageMaker HyperPod”
When it comes to AI, inference is where today’s generative AI models can solve real-world business problems. Google Kubernetes Engine (GKE) is seeing increasing adoption of gen AI inference. For example, customers like HubX run inference of image-based models to serve over 250k images/day to power gen AI experiences, and Snap runs AI inference on …
Read more “New GKE inference capabilities reduce costs, tail latency and increase throughput”
DeepCoder-14B competes with frontier models like o3 and o1, and the weights, code, and optimization platform are open source.
President Trump’s tariffs are boosting China’s global image even as they threaten to decimate its economy.