Merge pull request #92 from Vishisht16/main

Added Speech Emotion Recognition
UppuluriKalyani · Oct 5, 2024 · 967d5a2 · 967d5a2
2 parents 8616f82 + edc20f5
commit 967d5a2
Show file tree

Hide file tree

Showing 2 changed files with 1,239 additions and 0 deletions.
diff --git a/Natural Language Processing/Speech Emotion Recognition/README.md b/Natural Language Processing/Speech Emotion Recognition/README.md
@@ -0,0 +1,18 @@
+# Speech Emotion Recognition (SER) using Deep Learning
+
+This project implements a **Speech Emotion Recognition (SER)** system using deep learning. It analyzes speech data and classifies it into different emotional states such as happy, sad, angry, neutral, etc. The model uses **CNNs (Convolutional Neural Networks)** to extract features from audio signals and predict emotions.
+
+## Features
+- **Data Augmentation**: Utilizes stretching, pitch shifting, and noise injection for robust training.
+- **Preprocessing**: MFCC extraction for audio features.
+- **CNN Model**: Built using Keras to classify speech emotions.
+- **Training & Evaluation**: Model training on augmented data, with accuracy and loss metrics visualization.
+
+## Datasets
+The project uses four open-source datasets: **RAVDESS**, **CREMA-D**, **SAVEE**, and **TESS**, which are downloaded on the go.
+
+## Model Architecture
+- **Conv1D layers** to capture temporal features.
+- **MaxPooling1D** for downsampling.
+- **Dense layers** for final classification.
+- **Dropout** to prevent overfitting.