Speech Recognition Tutorial

Enhancing Speech Emotion Recognition With Conditional Emotion Feature Diffusion and Progressive Interleaved Learning Strategy

Abstract: Speech emotion recognition (SER) aims to identify the speaker's emotional states in specific utterances accurately. However, existing methods still face feature confusion when attempting to ...

GitHub

ESP32 Speech-to-Text (No API Key Required)

An ESP32 client that captures audio over I2S and posts WAV to a server. A lightweight Flask/Gunicorn server that returns JSON transcriptions via speech_recognition. Designed for deterministic embedded ...

IEEE

Keyword Guided Target Speech Recognition

Abstract: This letter presents a new target speech recognition problem, where the target speech is defined by a keyword. For instance, when a person speaks “Hey Google” or “Help Me”, we hope the model ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Enhancing Speech Emotion Recognition With Conditional Emotion Feature Diffusion and Progressive Interleaved Learning Strategy

ESP32 Speech-to-Text (No API Key Required)

Keyword Guided Target Speech Recognition

Trending now