CNN that detects "umm" filler word in real time and triggers buzzer to give speech feedback

honeyimholm/AI-Speech-Coach

AI Speech Coach

This project uses a convolutional neural network (CNN) to detect the filler word "uhm" in real time and triggers an Arduino piezo buzzer to give immediate feedback. In some limited tests, I was able to reduce the number of "uhm"s per minute of speaking when given this buzzer feedback. Demo video here

diagram

Setup

You'll need an Arduino connected over serial and a piezo buzzer. In buzzer.ino, change the digital pin to the one your piezo is hooked up to. In driver.py, the serial port is set to COM8; change it to match your configuration.

If you don't have an Arduino attached, you can just set the variable "ARDUINO_ENABLED" in driver.py to False.
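As a rough illustration of how the serial port and the "ARDUINO_ENABLED" switch might fit together, here is a minimal sketch. It is not the code in driver.py: the helper names `open_buzzer_port` and `trigger_buzzer`, the baud rate, and the one-byte signal are all assumptions made for this example, and it assumes the pyserial package for the real connection.

```python
ARDUINO_ENABLED = True
SERIAL_PORT = "COM8"   # change to match your configuration
BAUD_RATE = 9600       # assumption: must match Serial.begin(...) in buzzer.ino


def open_buzzer_port(enabled=ARDUINO_ENABLED, port=SERIAL_PORT, baud=BAUD_RATE):
    """Return an open serial connection, or None when the Arduino is disabled."""
    if not enabled:
        return None
    import serial  # pyserial; imported lazily so the script runs without it
    return serial.Serial(port, baud, timeout=1)


def trigger_buzzer(port, signal=b"1"):
    """Send one byte to the board; the byte value is a made-up protocol."""
    if port is None:
        return False  # no Arduino attached, nothing to do
    port.write(signal)
    return True
```

With the flag set to False, `open_buzzer_port` returns None and `trigger_buzzer` becomes a no-op, so the detection loop can run on a machine with no board attached.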

Optionally, you can also configure the following variables in driver.py:

# only output detection if prediction exceeds confidence threshold
CONFIDENCE_THRESHOLD = .95
# min time between detections
TIME_THRESHOLD = 1
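To make the effect of these two settings concrete, the sketch below shows one way they could gate detections. It is an illustration only, not the logic in driver.py: the `DetectionGate` class is hypothetical, and it assumes TIME_THRESHOLD is measured in seconds.

```python
import time

CONFIDENCE_THRESHOLD = .95  # only report predictions above this confidence
TIME_THRESHOLD = 1          # min time between detections (assumed seconds)


class DetectionGate:
    """Hypothetical helper: decides whether a prediction is a new detection."""

    def __init__(self, confidence_threshold=CONFIDENCE_THRESHOLD,
                 time_threshold=TIME_THRESHOLD):
        self.confidence_threshold = confidence_threshold
        self.time_threshold = time_threshold
        self._last_detection = None  # timestamp of the last accepted detection

    def accept(self, confidence, now=None):
        """Return True if the prediction should be reported."""
        now = time.monotonic() if now is None else now
        # Reject predictions that do not exceed the confidence threshold.
        if confidence <= self.confidence_threshold:
            return False
        # Reject predictions that arrive too soon after the previous one.
        if (self._last_detection is not None
                and now - self._last_detection < self.time_threshold):
            return False
        self._last_detection = now
        return True
```

Raising CONFIDENCE_THRESHOLD trades missed "uhm"s for fewer false alarms, while TIME_THRESHOLD keeps one drawn-out "uhm" from setting off the buzzer several times in a row.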

Usage

Start the Arduino code in buzzer.ino.

Run python driver.py.

If you say "uhm", the terminal should print the detection with its confidence, and the piezo buzzer should sound:

Uhm detected! Confidence: 0.9995760321617126

Results

In some very limited tests I did on myself, it reduced the number of "uhm"s I said per minute of speaking from 5.4 (yikes!) to 3.4 (still yikes!).

trial
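For a sense of scale, the two rates above can be turned into an absolute and a relative drop. The helper name `relative_reduction` is made up for this snippet; the numbers are the ones reported in this section.

```python
def relative_reduction(before, after):
    """Fraction by which a rate dropped, relative to the starting rate."""
    return (before - after) / before


before, after = 5.4, 3.4  # "uhm"s per minute without and with the buzzer
print(f"absolute drop: {before - after:.1f} per minute")                  # 2.0 per minute
print(f"relative drop: {relative_reduction(before, after):.0%}")  # 37%
```

That is roughly a one-third reduction, though with tests this limited it should be read as a rough indication rather than a measured effect.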

Next Steps

I'd like to make this a full wearable using the Arduino Nano 33 BLE Sense. This would allow for speech coaching in more natural environments, e.g. casually talking to friends.

The "uhm" dataset needs augmentation: there are some false positives around words like "the".
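As a sketch of what such augmentation could look like, the snippet below applies two common audio augmentations, a random time shift and additive Gaussian noise, to a clip. Everything here is an assumption for illustration: the function names, the default parameters, and the representation of a clip as a plain list of float samples. The project's actual data format may differ, and array libraries are the usual choice for real audio.

```python
import random


def time_shift(samples, max_shift, rng):
    """Shift the clip by up to max_shift samples, padding with silence."""
    max_shift = min(max_shift, len(samples))  # keep the output length fixed
    shift = rng.randint(-max_shift, max_shift)
    if shift > 0:
        return [0.0] * shift + samples[:-shift]
    if shift < 0:
        return samples[-shift:] + [0.0] * (-shift)
    return list(samples)


def add_noise(samples, noise_std, rng):
    """Add zero-mean Gaussian noise to every sample."""
    return [s + rng.gauss(0.0, noise_std) for s in samples]


def augment(samples, rng, max_shift=1600, noise_std=0.005):
    """Produce one augmented copy of a clip (shift first, then noise)."""
    return add_noise(time_shift(samples, max_shift, rng), noise_std, rng)
```

Passing a seeded `random.Random` instance as `rng` makes the augmented set reproducible. Adding more negative examples of near-miss words such as "the" would likely matter at least as much as augmenting the "uhm" clips themselves.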

I'll be uploading more details as an end-to-end tutorial soon!
