Research data supporting [Ultrasensitive Textile Strain Sensors Redefine Wearable Silent Speech Interfaces with High Machine Learning Efficiency]

Name: Research data supporting [Ultrasensitive Textile Strain Sensors Redefine Wearable Silent Speech Interfaces with High Machine Learning Efficiency]
Published: 2024-05-07T14:50:54Z
Keywords: Machine Learning, Silent Speech Recognition, Textile Sensor

Tang, Chenyu; Xu, Muzi; Yi, Wentian; Occhipinti, Luigi

Repository URI

https://www.repository.cam.ac.uk/handle/1810/368051

Repository DOI

https://doi.org/10.17863/CAM.104307

Files

Dataset1_20 frequently used words.xlsx (17.98 MB)

Dataset2_Confusing words.xlsx (8.68 MB)

Dataset3_Different reading speeds.xlsx (4.42 MB)

New User Generalization Test.xlsx (2.15 MB)

Noise Injection Data.csv (9.4 MB)

Type

Dataset

Authors

Tang, Chenyu

https://orcid.org/0000-0002-6368-5639

Xu, Muzi

Yi, Wentian

Occhipinti, Luigi

Description

This work encompasses five related datasets, accessible via an open-source link provided at the end of the manuscript:

Dataset1_20 Frequently Used Words: This dataset contains signals of the 20 most frequently used words (10 nouns and 10 verbs) collected from participants, with 100 samples per class. Each row holds one sample of a word, and the last number in the row is the class label for that word (the same applies to the following datasets).
Dataset2_Confusing Words: This dataset contains 10 words forming 5 pairs with similar, easily confused pronunciations, with 100 samples per class.
Dataset3_Different Reading Speeds: This dataset comprises signals of 5 long words read at three different speeds: fast, medium, and slow, with approximately 33 samples for each word at each reading speed.
New User Generalization Test: This dataset contains signals of 5 commonly used words (included in Dataset1) collected from three new users, with 50 samples per class.
Noise Injection Data: This dataset includes around five minutes of noise signals recorded in the absence of speech, containing physiological noise such as breathing and swallowing.
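
The row layout described above (signal values followed by a trailing class label) can be separated with a few lines of Python. This is a minimal sketch, not the authors' loading code: the function name split_features_and_labels and the toy values are hypothetical, and it assumes the rows have already been read from the spreadsheet into plain Python lists (for example with pandas.read_excel or openpyxl, not shown here). Skipping empty trailing cells is also an assumption about how shorter rows are stored.

```python
from typing import List, Optional, Sequence, Tuple


def split_features_and_labels(
    rows: Sequence[Sequence[Optional[float]]],
) -> Tuple[List[List[float]], List[int]]:
    """Split each row into its signal values and its trailing class label."""
    signals: List[List[float]] = []
    labels: List[int] = []
    for row in rows:
        # ASSUMPTION: empty cells (None) only pad rows that are shorter
        # than the widest row, so they can be dropped safely.
        values = [v for v in row if v is not None]
        if len(values) < 2:
            raise ValueError("a row needs at least one signal value and a label")
        # Everything except the last number is the strain signal.
        signals.append([float(v) for v in values[:-1]])
        # The last number in the row is the class label.
        labels.append(int(values[-1]))
    return signals, labels


# Toy rows standing in for spreadsheet rows; the values are made up.
toy_rows = [
    [0.10, 0.25, 0.40, 3],
    [0.05, 0.15, None, 7],
]
toy_signals, toy_labels = split_features_and_labels(toy_rows)
```

The resulting lists can then be converted to tensors for training. Whether the rows in a given file share a common length is not stated in this record, so it is worth checking before stacking them.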

Software / Usage instructions

Data processing and network training were carried out in Python 3.8.13 (Miniconda 3) with PyTorch 2.0.1, with training accelerated by Apple’s Metal Performance Shaders (MPS). During the noise injection phase, each original sample was augmented with real-world noise from four different random noise windows, creating four new samples.
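
The noise injection step can be illustrated with a short sketch. It is written under stated assumptions and is not the authors' implementation: the function name inject_noise is hypothetical, the noise is assumed to be added element-wise without scaling, and each window is assumed to have the same length as the sample. The record only states that four different random noise windows were used per original sample.

```python
import random
from typing import List, Optional, Sequence


def inject_noise(
    sample: Sequence[float],
    noise: Sequence[float],
    n_windows: int = 4,
    rng: Optional[random.Random] = None,
) -> List[List[float]]:
    """Return n_windows noisy copies of sample, one per random noise window."""
    rng = rng or random.Random()
    length = len(sample)
    max_start = len(noise) - length
    if max_start + 1 < n_windows:
        raise ValueError("noise recording too short for the requested windows")
    # Draw window start positions without replacement so all windows differ.
    starts = rng.sample(range(max_start + 1), n_windows)
    # ASSUMPTION: noise is added element-wise with no scaling factor.
    return [
        [s + n for s, n in zip(sample, noise[start:start + length])]
        for start in starts
    ]


# Toy data: a flat sample and a ramp standing in for the noise recording.
toy_sample = [1.0] * 5
toy_noise = [float(i) for i in range(100)]
augmented = inject_noise(toy_sample, toy_noise, rng=random.Random(0))
```

Passing a seeded random.Random makes the choice of windows reproducible, which helps when comparing training runs.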

Keywords

Machine Learning, Silent Speech Recognition, Textile Sensor

Rights

Attribution 4.0 International (CC BY 4.0)

Sponsorship

EPSRC (EP/W024284/1)

Relationships

Supplements:

https://www.repository.cam.ac.uk/handle/1810/367649

https://doi.org/10.48550/arXiv.2311.15683

Collections

Research Data - Engineering
