Implementation-of-CNN-for-action-recognition-in-videos

A Deep Learning in Computer Vision project that builds a convolutional neural network (CNN) with the Keras framework in Python to recognize actions in videos. The UCF101 dataset, which consists of YouTube videos grouped into 101 categories, is used to train and test the proposed approach. A siamese convolutional network has been developed, formed by two branches that share the same weights. Each branch has the same architecture, and the two work in tandem on two different input vectors to compute comparable output vectors. One branch receives the actual frame of the video (RGB frame), while the other receives its optical flow. By examining both inputs at the same time and merging the results, we obtain the probability of the video belonging to each of the 101 classes.

CNN-for-action-recognition-in-videos

Author: Awet Haileslassie Gebrehiwot ([email protected])

If you have any questions on the code or README, please feel free to contact me.

Requirements

Python, TensorFlow, NumPy, Matplotlib

Running the code

First, download the UCF101 dataset; then start the training.

Open a terminal, navigate to the directory that contains the code, and type:

python skeleton.py

Datasets

The CNN model uses the UCF101 dataset; some examples are shown below:

The network is trained and evaluated on the UCF101 dataset, with classification accuracy as the metric.
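As a minimal sketch of what classification accuracy means here: it is the fraction of videos whose highest-probability class matches the true label. The function name `classification_accuracy` and the tiny three-class example are assumptions made for illustration; they are not taken from the repository code.

```python
import numpy as np

def classification_accuracy(probs, labels):
    """Fraction of samples whose most probable class equals the true label.

    probs  : array of shape (num_videos, num_classes) with class probabilities
    labels : array of shape (num_videos,) with integer class indices
    """
    predicted = np.argmax(probs, axis=1)  # most probable class per video
    return float(np.mean(predicted == labels))

# Toy example with 3 classes instead of 101 (illustrative values only).
probs = np.array([
    [0.7, 0.2, 0.1],
    [0.1, 0.8, 0.1],
    [0.3, 0.3, 0.4],
    [0.6, 0.3, 0.1],
])
labels = np.array([0, 1, 2, 1])

acc = classification_accuracy(probs, labels)
print(acc)  # 0.75 (three of the four predictions match)
```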

Part 3: CNN model

The siamese CNN model contains two branches. One branch corresponds to the actual frame of the video (RGB frame), while the other corresponds to its optical flow. The outputs of both branches are then concatenated, which results in better performance for action recognition across the 101 classes.
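The data flow described above can be sketched without any deep learning framework. The following is an illustration in plain NumPy, not the repository's Keras model: a single dense layer stands in for the convolutional stack, the layer sizes are arbitrary, and both inputs are assumed to be already flattened to vectors of the same length so that one set of weights can serve both branches. All names (`branch`, `two_stream_forward`, `params`) are hypothetical.

```python
import numpy as np

NUM_CLASSES = 101  # number of UCF101 categories

def branch(x, w, b):
    """Shared feature extractor (dense + ReLU), a stand-in for the conv stack."""
    return np.maximum(x @ w + b, 0.0)

def softmax(z):
    """Numerically stable softmax over the last axis."""
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def two_stream_forward(rgb, flow, params):
    """Run both inputs through the same weights, concatenate, then classify."""
    f_rgb = branch(rgb, params["w"], params["b"])    # RGB branch
    f_flow = branch(flow, params["w"], params["b"])  # flow branch, same weights
    merged = np.concatenate([f_rgb, f_flow], axis=-1)
    logits = merged @ params["w_out"] + params["b_out"]
    return softmax(logits)

# Random weights and inputs, only to show the shapes involved (assumed sizes).
rng = np.random.default_rng(0)
in_dim, feat_dim, batch = 64, 32, 4
params = {
    "w": rng.normal(scale=0.1, size=(in_dim, feat_dim)),
    "b": np.zeros(feat_dim),
    "w_out": rng.normal(scale=0.1, size=(2 * feat_dim, NUM_CLASSES)),
    "b_out": np.zeros(NUM_CLASSES),
}
rgb = rng.normal(size=(batch, in_dim))
flow = rng.normal(size=(batch, in_dim))

probs = two_stream_forward(rgb, flow, params)
print(probs.shape)  # (4, 101)
```

Each row of `probs` sums to one, so it can be read as the per-class probability for one video. Because the two branches reuse `params["w"]` and `params["b"]`, any update to those weights during training would affect both streams at once.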

1: Extract RGB
The first step is to obtain the RGB content of each video sequence. To do so, we use the extract RGB function from the provided code, which extracts the RGB information from the video content.

2: Calculating optical flow
The second step is to compute the optical flow for each video sequence. To do so, we use the compute flow function from the provided code, which computes the optical flow from the video.
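To make the notion of optical flow concrete, here is a small sketch of how motion between two consecutive grayscale frames can be estimated. It is not the repository's compute flow function, whose implementation is not shown here. As a simplifying assumption, it estimates a single global motion vector for the whole frame with a Lucas-Kanade-style least-squares fit, whereas flow inputs for action recognition are usually dense (one vector per pixel). The name `global_flow` and the synthetic frames are hypothetical.

```python
import numpy as np

def global_flow(prev_gray, next_gray):
    """Estimate one (u, v) motion vector, in pixels, for the whole frame."""
    prev_gray = prev_gray.astype(np.float64)
    next_gray = next_gray.astype(np.float64)
    # Spatial gradients; np.gradient returns d/dy (rows) first, then d/dx (columns).
    iy, ix = np.gradient(prev_gray)
    # Temporal gradient between the two frames.
    it = next_gray - prev_gray
    # Brightness constancy: ix*u + iy*v + it = 0, solved in the least-squares sense.
    a = np.stack([ix.ravel(), iy.ravel()], axis=1)
    (u, v), *_ = np.linalg.lstsq(a, -it.ravel(), rcond=None)
    return float(u), float(v)

# Synthetic frame pair: a smooth blob that moves one pixel to the right.
yy, xx = np.mgrid[0:64, 0:64]
frame0 = np.exp(-((xx - 32) ** 2 + (yy - 32) ** 2) / (2 * 6.0 ** 2))
frame1 = np.roll(frame0, 1, axis=1)

u, v = global_flow(frame0, frame1)
print(u, v)  # close to 1.0 horizontally and 0.0 vertically
```

The linearization behind this fit only holds for small displacements on smooth images, which is one reason practical pipelines rely on dense, multi-scale flow methods instead.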


Architecture

Model Loss during Training for 30 epochs

Model Accuracy during Training for 30 epochs

Model Loss during Testing for 30 epochs

Model Accuracy during Testing for 30 epochs
