Data Pipeline Version Control: Tracking Code & Data Together (Palantir RFx Blog Series, #3)

Editor’s note: This is the third post in the Palantir RFx Blog Series, which explores how organizations can better craft RFIs and RFPs to evaluate digital transformation software. Each post focuses on one key capability area within a data ecosystem, with the goal of helping companies ask the right questions to better assess technology. Introduction …

Cost efficient ML inference with multi-framework models on Amazon SageMaker 

Machine learning (ML) has proven to be one of the most successful and widespread applications of technology, affecting a wide range of industries and impacting billions of users every day. With this rapid adoption of ML into every industry, companies are facing challenges in supporting low-latency predictions with high availability while maximizing resource utilization …

Solve business problems end-to-end through machine learning in Amazon SageMaker JumpStart solutions

Amazon SageMaker JumpStart provides pre-trained, open-source models for a wide range of problem types to help you get started with machine learning (ML). JumpStart also provides solution templates that set up infrastructure for common use cases, and executable example notebooks for ML with Amazon SageMaker. As a business user, you get to do the following …

Emily Webber

Train gigantic models with near-linear scaling using sharded data parallelism on Amazon SageMaker

In the pursuit of superior accuracy, deep learning models in areas such as natural language processing and computer vision have significantly grown in size in the past few years, frequently counted in tens to hundreds of billions of parameters. Training these gigantic models is challenging and requires complex distribution strategies. Data scientists and machine learning …

UKG Ready, People Insights on Google Cloud

Business Problem: UKG Ready primarily operates in the Small and Medium Business (SMB) space, so inherently many customers are forced to operate and make key business decisions with less Workforce Management (WFM) / Human Capital Management (HCM) data. In addition to volume, SMB lacks the variety of data needed to create a dynamic and agile organization. …

Document AI adds one-click model training with ML Workbench

Each day, countless documents are created, revised, and shared across organizations. The result is a treasure trove of information, but because the data is primarily unstructured (without rows, columns or some other predefined organizational schema), it is difficult to interpret, analyze or use for business processes. That’s why we introduced Document AI: so users …

Non-Autoregressive Neural Machine Translation: A Call for Clarity

Non-autoregressive approaches aim to improve the inference speed of translation models by only requiring a single forward pass to generate the output sequence instead of iteratively producing each predicted token. Consequently, their translation quality still tends to be inferior to their autoregressive counterparts due to several issues involving output token interdependence. In this work, we …

Prompting for a Conversation: How to Control a Dialog Model?

Dialog modelling faces a difficult trade-off. Models are trained on a large amount of text, yet their responses need to be limited to a desired scope and style of a dialog agent. Because the datasets used to achieve the former contain language that is not compatible with the latter, pre-trained dialog models are fine-tuned on …

A Treatise On FST Lattice Based MMI Training

Maximum mutual information (MMI) has become one of the two de facto methods for sequence-level training of speech recognition acoustic models. This paper aims to isolate, identify and bring forward the implicit modelling decisions induced by the design implementation of standard finite state transducer (FST) lattice based MMI training framework. The paper particularly investigates the …

Connected Utilities: The Path to Sustainable Water Management

(An English-language version of this post can be read here.) As the climate changes, the population keeps growing (more slowly, but inexorably at least in the medium term), and global expectations for living standards rise, clean water is one of the most precious natural resources. In industrialized countries, clean running water is taken for granted, and in order to continue to …