Audio processing, Video processing and Computer vision - UC3M

Description

Audio processing, Video processing and Computer Vision Laboratories (UC3M - C2.350.16508).

Installation

Create a Python 3.6 virtual environment and run the following command:

pip install -r requirements.txt

Or specify the name of the project to install specific requirements.

pip install -r <PROJECT NAME>/requirements.txt

Installation PyTorch for CUDA 11.3

PIP ENVIRONMENT

pip3 install torch==1.10.0+cu113 torchvision==0.11.1+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html

CONDA ENVIRONMENT

conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch

PROJECTS

1. Scale-Space Blob Detector

Scale-space blob detector based on the Laplacian of Gaussian (LoG) filter. Full guideline here.

2. Melanoma Segmentation

Pre-processing, segmentation and post-processing for melanoma images using thresholding and clustering techniques. Full guideline here.

3. Melanoma Classification with CNNs

Testing of several CNN architectures for melanoma classification (no melanoma, melanoma, keratosis) Full lab here.

4. Object Detection with Faster-RCNN

Faster-RCNN implementation for object detection and classification using a subset of the PASCAL VOC 2012 database. Full lab here.

5. Feature Selection for Audio Classification

Feature extraction and selection for classifying dogs and cats audios using SVM. Full guideline here.

6. Audio Speech Recognition with DeepSpeech2

Comparison of 3 speech recognition architectures based on DeepSpeech2 altering the GRU layer implementation. Full lab here.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
1-Scale-Space-Blob-Detector		1-Scale-Space-Blob-Detector
2-Melanoma-Segmentation		2-Melanoma-Segmentation
3-Image-Classification-with-CNNs		3-Image-Classification-with-CNNs
4-Object-Detection-with-F-RCNN		4-Object-Detection-with-F-RCNN
5-Audio-Features-Selection		5-Audio-Features-Selection
6-Deep-Learning-for-ASR		6-Deep-Learning-for-ASR
.gitattributes		.gitattributes
.gitignore		.gitignore
.jupyter		.jupyter
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio processing, Video processing and Computer vision - UC3M

Table of contents

Description

Installation

Installation PyTorch for CUDA 11.3

PROJECTS

1. Scale-Space Blob Detector

2. Melanoma Segmentation

3. Melanoma Classification with CNNs

4. Object Detection with Faster-RCNN

5. Feature Selection for Audio Classification

6. Audio Speech Recognition with DeepSpeech2

About

Languages

saizk/computer-vision-uc3m

Folders and files

Latest commit

History

Repository files navigation

Audio processing, Video processing and Computer vision - UC3M

Table of contents

Description

Installation

Installation PyTorch for CUDA 11.3

PROJECTS

1. Scale-Space Blob Detector

2. Melanoma Segmentation

3. Melanoma Classification with CNNs

4. Object Detection with Faster-RCNN

5. Feature Selection for Audio Classification

6. Audio Speech Recognition with DeepSpeech2

About

Topics

Resources

Stars

Watchers

Forks

Languages