Skip to content

Commit

Permalink
Merge pull request #92 from Vishisht16/main
Browse files Browse the repository at this point in the history
Added Speech Emotion Recognition
  • Loading branch information
UppuluriKalyani authored Oct 5, 2024
2 parents 8616f82 + edc20f5 commit 967d5a2
Show file tree
Hide file tree
Showing 2 changed files with 1,239 additions and 0 deletions.
18 changes: 18 additions & 0 deletions Natural Language Processing/Speech Emotion Recognition/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
# Speech Emotion Recognition (SER) using Deep Learning

This project implements a **Speech Emotion Recognition (SER)** system using deep learning. It analyzes speech data and classifies it into different emotional states such as happy, sad, angry, neutral, etc. The model uses **CNNs (Convolutional Neural Networks)** to extract features from audio signals and predict emotions.

## Features
- **Data Augmentation**: Utilizes stretching, pitch shifting, and noise injection for robust training.
- **Preprocessing**: MFCC extraction for audio features.
- **CNN Model**: Built using Keras to classify speech emotions.
- **Training & Evaluation**: Model training on augmented data, with accuracy and loss metrics visualization.

## Datasets
The project uses four open-source datasets: **RAVDESS**, **CREMA-D**, **SAVEE**, and **TESS**, which are downloaded on the go.

## Model Architecture
- **Conv1D layers** to capture temporal features.
- **MaxPooling1D** for downsampling.
- **Dense layers** for final classification.
- **Dropout** to prevent overfitting.
Loading

0 comments on commit 967d5a2

Please sign in to comment.