Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
Wav2Vec for speech recognition, speech classification, and audio classification
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
Classification of 11 types of audio clips using MFCC features and an LSTM. Pretrained on the Speech Commands dataset with intensive data augmentation.
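For readers unfamiliar with that setup, a minimal sketch of MFCC extraction plus an LSTM classifier (not the repository's actual code; the 11-class output, frame length, and hyperparameters are illustrative assumptions):

```python
# Minimal sketch: MFCC features via librosa, fixed-length clips classified by an LSTM.
import numpy as np
import librosa
import torch
import torch.nn as nn

def mfcc_features(path, n_mfcc=40, sr=16000, max_frames=100):
    """Load a clip and return a (max_frames, n_mfcc) MFCC matrix, padded or truncated."""
    y, _ = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).T  # (frames, n_mfcc)
    if mfcc.shape[0] < max_frames:
        mfcc = np.pad(mfcc, ((0, max_frames - mfcc.shape[0]), (0, 0)))
    return torch.tensor(mfcc[:max_frames], dtype=torch.float32)

class LSTMClassifier(nn.Module):
    def __init__(self, n_mfcc=40, hidden=128, n_classes=11):
        super().__init__()
        self.lstm = nn.LSTM(n_mfcc, hidden, batch_first=True)
        self.fc = nn.Linear(hidden, n_classes)

    def forward(self, x):            # x: (batch, frames, n_mfcc)
        _, (h, _) = self.lstm(x)     # h: (num_layers, batch, hidden)
        return self.fc(h[-1])        # (batch, n_classes)

# Dummy batch standing in for 4 featurized clips (real clips would go through mfcc_features).
model = LSTMClassifier()
logits = model(torch.randn(4, 100, 40))
print(logits.shape)                  # torch.Size([4, 11])
```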
Transformer-based model for Speech Emotion Recognition (SER), implemented in PyTorch
A full-fledged web application. Based on sentiment classification with the NLTK library, it predicts how toxic, severely toxic, insulting, obscene, or threatening a speech is.
This repository contains the code for the paper: "DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utterances"
In this challenge, the goal is to learn to recognize which of several English words is pronounced in an audio recording. This is a multiclass classification task.
Speech Classification using Continuous Attention Mechanisms
Gender Classification with different Machine Learning models, using the LibriSpeech ASR dataset.
Qafar-af and Amharic voice command recognition project to control the movement of a wheelchair
A convolutional neural network for gender classification, which achieved an F1-score of 94.3% when tested on the RAVDESS dataset. Created as postgraduate coursework; the report is included. The report also discusses Sodiq Adebiy's CNN, which I'd recommend to anyone interested in emotion classification.
In this notebook, we recognize speech commands as a classification task, using the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python for the PyTorch platform.
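A minimal sketch of that setup, assuming the commonly used M5 hyperparameters (first kernel of size 80, stride 16) and the 35-label SPEECHCOMMANDS vocabulary; this is not necessarily the notebook's exact code:

```python
# M5: a 1-D CNN operating directly on raw waveforms (Dai et al., 2017),
# as popularized by the torchaudio speech-command classification tutorial.
import torch
import torch.nn as nn
import torch.nn.functional as F
import torchaudio

class M5(nn.Module):
    def __init__(self, n_input=1, n_output=35, n_channel=32):
        super().__init__()
        self.conv1 = nn.Conv1d(n_input, n_channel, kernel_size=80, stride=16)
        self.bn1 = nn.BatchNorm1d(n_channel)
        self.conv2 = nn.Conv1d(n_channel, n_channel, kernel_size=3)
        self.bn2 = nn.BatchNorm1d(n_channel)
        self.conv3 = nn.Conv1d(n_channel, 2 * n_channel, kernel_size=3)
        self.bn3 = nn.BatchNorm1d(2 * n_channel)
        self.conv4 = nn.Conv1d(2 * n_channel, 2 * n_channel, kernel_size=3)
        self.bn4 = nn.BatchNorm1d(2 * n_channel)
        self.fc = nn.Linear(2 * n_channel, n_output)

    def forward(self, x):                               # x: (batch, 1, samples)
        x = F.max_pool1d(F.relu(self.bn1(self.conv1(x))), 4)
        x = F.max_pool1d(F.relu(self.bn2(self.conv2(x))), 4)
        x = F.max_pool1d(F.relu(self.bn3(self.conv3(x))), 4)
        x = F.max_pool1d(F.relu(self.bn4(self.conv4(x))), 4)
        x = F.avg_pool1d(x, x.shape[-1]).squeeze(-1)     # global average pool
        return self.fc(x)

# SPEECHCOMMANDS ships 1-second, 16 kHz clips; a dummy batch checks the shapes.
# dataset = torchaudio.datasets.SPEECHCOMMANDS(root=".", download=True)
model = M5()
print(model(torch.randn(2, 1, 16000)).shape)             # torch.Size([2, 35])
```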
This repository contains code for all assignments in the Multimedia Computing and Applications (CSE563) course.
CNN-based approach for audio file classification. Contains notebooks illustrating the data preprocessing, feature extraction, model training, and model inference workflows, as well as the overall pipeline.
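A hedged sketch of such a pipeline in PyTorch; the feature choice (log-mel spectrograms), model size, and class count are illustrative assumptions, not the repository's actual configuration:

```python
# Feature extraction -> small 2-D CNN -> inference, end to end on a dummy clip.
import torch
import torch.nn as nn
import torchaudio

# Feature extraction: waveform -> log-mel spectrogram (1 x n_mels x frames)
melspec = torchaudio.transforms.MelSpectrogram(sample_rate=16000, n_mels=64)
to_db = torchaudio.transforms.AmplitudeToDB()

class SmallCNN(nn.Module):
    def __init__(self, n_classes=10):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(32, n_classes)

    def forward(self, x):                    # x: (batch, 1, n_mels, frames)
        return self.fc(self.net(x).flatten(1))

# Inference on a dummy 1-second, 16 kHz clip
waveform = torch.randn(1, 16000)
features = to_db(melspec(waveform)).unsqueeze(0)   # (1, 1, 64, frames)
model = SmallCNN()
with torch.no_grad():
    probs = model(features).softmax(dim=-1)
print(probs.shape)                                  # torch.Size([1, 10])
```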
Fall 2021 Introduction to Deep Learning - Homework 3 Part 2 (RNN-based phoneme recognition)
A Python implementation of the Iterative Feature Normalization algorithm
This project represents my research on dementia classification using audio data.