柔軟索状レスキューロボットのための空気噴射音下での単チャネル音声強調

坂東 宜昭; 安部 祐一; 糸山 克寿; 昆陽 雅司; 田所 諭; 中臺 一博; 奥乃 博

doi:10.1299/jsmermd.2019.2A2-D07

セッションID: 2A2-D07

DOI https://doi.org/10.1299/jsmermd.2019.2A2-D07

会議情報

主催: 一般社団法人日本機械学会

会議名: ロボティクス・メカトロニクス　講演会2019

開催日: 2019/06/05 - 2019/06/08

柔軟索状レスキューロボットのための空気噴射音下での単チャネル音声強調

*坂東宜昭, 安部祐一, 糸山克寿, 昆陽雅司, 田所諭, 中臺一博, 奥乃博

著者情報

キーワード: Hose-Shaped Rescue Robot, Speech Enhancement, Robot Audition

会議録・要旨集認証あり

詳細

抄録

This paper presents a monaural speech enhancement method for a hose-shaped rescue robot based on a deep speech prior. Speech enhancement is crucial to make a robot operator succeed in detecting human voices because audio signals captured by a microphone on the robot are contaminated by ego-noise. We have been developed three enhancement methods: 1) a blind speech enhancement called robust nonnegative matrix factorization (RNMF), 2) an extension of RNMF with a pre-trained noise model, and 3) another extension of RNMF with a deep speech prior, i.e., a pre-trained speech model based on deep learning. In this paper, we develop a new extension of RNMF by combining the pre-trained noise and speech models as a unified model and evaluated these methods on a hose-shaped rescue robot whose ego-noise consists of vibration-motor and air-jet noise. Experimental results show that the new method outperforms the three RNMF methods when the signal-to-noise ratio is equal to or less than +5 dB.

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）