Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations

This work investigates pre-trained audio representations for few-shot Sound Event Detection. We specifically address the task of few-shot detection of novel acoustic sequences, that is, sound events with semantically meaningful temporal structure, without assuming access to non-target audio. We develop procedures for pre-training suitable representations, and methods that transfer them to our few-shot learning scenario. Our experiments evaluate the general-purpose utility of our pre-trained representations on AudioSet, and the utility of the proposed few-shot methods via tasks constructed from…
AI Generated Robotic Content
