Using ControlNet with Stable Diffusion

ControlNet is a neural network that can improve image generation in Stable Diffusion by adding extra conditions. This allows users to have more control over the images generated. Instead of trying out different prompts, the ControlNet models enable users to generate consistent images with just one prompt. In this post, you will learn how to …

Label-Efficient Sleep Staging Using Transformers Pre-trained with Position Prediction

Sleep staging is a clinically important task for diagnosing various sleep disorders but remains challenging to deploy at scale because it requires clinical expertise, among other reasons. Deep learning models can perform the task but at the expense of large labeled datasets, which are unfeasible to procure at scale. While self-supervised learning (SSL) can mitigate …

How fintech innovation is driving digital transformation for communities across the globe  

To meet the demands of today’s consumers, enterprises must be continuously innovating. But innovation doesn’t happen in silos. Fintechs, for example, have been transformational for the financial services industry, from democratizing finance to establishing digital currencies that revolutionized the way that we think of money.   As fintechs race to keep up with the needs of …

ML16556 image001

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

This is a guest post co-authored with Ville Tuulos (Co-founder and CEO) and Eddie Mattia (Data Scientist) of Outerbounds. To build a production-grade AI system today (for example, to do multilingual sentiment analysis of customer support conversations), what are the primary technical challenges? Historically, natural language processing (NLP) would be a primary research and development …

image1 dn60Vq9.max 1000x1000 1

Transforming customer feedback: analyzing audio customer reviews with BigQuery ML’s speech-to-text

BigQuery’s integrated speech-to-text functionality offers a powerful tool for unlocking valuable insights hidden within audio data. This service transcribes audio files, such as customer review calls, into text format, making them ready for analysis within BigQuery’s robust data platform. By combining speech-to-text with BigQuery’s analytics capabilities, you can delve into customer sentiment, identify recurring product …