CNN self-attention voice activity detector

Sofer, Amit; Chazan, Shlomo E.

Computer Science > Sound

arXiv:2203.02944 (cs)

[Submitted on 6 Mar 2022]

Title:CNN self-attention voice activity detector

Authors:Amit Sofer, Shlomo E. Chazan

View PDF

Abstract:In this work we present a novel single-channel Voice Activity Detector (VAD) approach. We utilize a Convolutional Neural Network (CNN) which exploits the spatial information of the noisy input spectrum to extract frame-wise embedding sequence, followed by a Self Attention (SA) Encoder with a goal of finding contextual information from the embedding sequence. Different from previous works which were employed on each frame (with context frames) separately, our method is capable of processing the entire signal at once, and thus enabling long receptive field. We show that the fusion of CNN and SA architectures outperforms methods based solely on CNN and SA. Extensive experimental-study shows that our model outperforms previous models on real-life benchmarks, and provides State Of The Art (SOTA) results with relatively small and lightweight model.

Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2203.02944 [cs.SD]
	(or arXiv:2203.02944v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2203.02944

Submission history

From: Shlomo Chazan [view email]
[v1] Sun, 6 Mar 2022 11:52:00 UTC (1,846 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.SD

< prev | next >

new | recent | 2203

Change to browse by:

cs
eess
eess.AS

References & Citations

export BibTeX citation

Computer Science > Sound

Title:CNN self-attention voice activity detector

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:CNN self-attention voice activity detector

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators