Keyword Transformer: A Self-Attention Model For Keyword Spotting

Ways to adapt the Transformer architecture to keyword spotting and an introduction to the Keyword Transformer (KWT).

The Transformer architecture has been successful across many domains, including natural language processing, computer vision and speech recognition. In keyword spotting, self-attention has primarily been used on top of convolutional or recurrent encoders. We investigate a range of ways to adapt the Transformer architecture to keyword spotting and introduce the Keyword Transformer (KWT), a fully self-attentional architecture that exceeds state-of-the-art performance across multiple tasks without any pre-training or additional data. Surprisingly, this simple architecture outperforms more complex models that mix convolutional, recurrent and attentive layers. KWT can be used as a drop-in replacement for these models, setting two new benchmark records on the Google Speech Commands dataset with 98.6% and 97.7% accuracy on the 12 and 35-command tasks respectively.
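To make the idea of a fully self-attentional keyword spotter concrete, the sketch below shows a minimal ViT-style classifier in PyTorch that treats each time frame of a mel-spectrogram as a token, prepends a learnable class token, and classifies from the class-token output. This is only an illustrative approximation in the spirit of KWT, not the authors' implementation; the class name, layer sizes and defaults (dim=192, depth=12, 40 mel bins, 98 frames) are assumptions made for the example.

```python
import torch
import torch.nn as nn


class KeywordTransformerSketch(nn.Module):
    """Illustrative fully self-attentional keyword-spotting classifier.

    Each time frame of a mel-spectrogram (or MFCC) input is treated as one
    token, a learnable class token is prepended, and a standard Transformer
    encoder plus a linear head produce the keyword prediction. Hyperparameters
    are placeholders, not the values reported in the KWT paper.
    """

    def __init__(self, num_frames=98, num_mels=40, dim=192,
                 depth=12, heads=3, num_classes=35):
        super().__init__()
        # Project each spectrogram frame (num_mels features) to the model dimension.
        self.frame_proj = nn.Linear(num_mels, dim)
        # Learnable class token and positional embeddings (frames + class token).
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, num_frames + 1, dim))
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=dim, nhead=heads, dim_feedforward=4 * dim,
            batch_first=True, norm_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=depth)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, spec):
        # spec: (batch, num_frames, num_mels)
        tokens = self.frame_proj(spec)
        cls = self.cls_token.expand(tokens.size(0), -1, -1)
        tokens = torch.cat([cls, tokens], dim=1) + self.pos_embed
        tokens = self.encoder(tokens)
        # Classify from the class-token representation.
        return self.head(tokens[:, 0])


if __name__ == "__main__":
    model = KeywordTransformerSketch()
    dummy = torch.randn(2, 98, 40)   # batch of 2 one-second clips
    print(model(dummy).shape)        # torch.Size([2, 35])
```

Because a model of this shape consumes a fixed-size spectrogram and emits class logits, it can be slotted in wherever a convolutional or recurrent keyword-spotting encoder is used today, which is what makes the drop-in-replacement claim above plausible.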

 

By Axel Berg1,2, Mark O’Connor1, Miguel Tairum Cruz1
1Arm ML Research Lab, UK
2Lund University, Sweden

 

Click here to read the paper. Click here to read the Arm Community introduction to the paper.
