Computer Vision Final Project

Computer Vision (CS1430) Final Project by Jakobi Haskell, Anh Duong, Ayman Benjelloun Touimi & Adam Mroueh. Full Colab notebook here: https://colab.research.google.com/drive/1tCy18ThUYPCqvCGPS7Sx6-C2hfNiaveT?authuser=1#scrollTo=yYxtkRxM_Hdn

Description

The project is a mini version of Google Translate by images.

We wrote our own scripts to generate and prepare data (generating masks) to comply to COCO format. The data is a list of thousands of images with randomly sized, colored and fonted alphabetical lowercase characters. Example:

We then trained Mask-RCNN on character detection & classification:

Finally, we wrote our own parsing algorithm that parses the character into words, and words into string. These strings are then translated using Google Translate API, and finally overlaid on top of the original image, also using another algorithm we wrote.

Poster

Text Recognition, Translation, and Transformation with Mask-RCNN.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
Mask_RCNN		Mask_RCNN
data_generated_mask/1024px-jean_cocteau_b_meurisse_1923		data_generated_mask/1024px-jean_cocteau_b_meurisse_1923
.DS_Store		.DS_Store
.gitignore		.gitignore
Miniconda3-latest-MacOSX-x86_64.sh		Miniconda3-latest-MacOSX-x86_64.sh
README.md		README.md
Screen Shot 2022-11-30 at 10.28.50 PM.png		Screen Shot 2022-11-30 at 10.28.50 PM.png
Screen Shot 2022-11-30 at 10.32.58 PM.png		Screen Shot 2022-11-30 at 10.32.58 PM.png
billboard.jpg		billboard.jpg
billboard2.jpeg		billboard2.jpeg
character_recognition.py		character_recognition.py
data_generation.py		data_generation.py
french_image.jpeg		french_image.jpeg
get-pip.py		get-pip.py
keras_ocr_synthetic_training_public.py		keras_ocr_synthetic_training_public.py
main.py		main.py
screenshot.png		screenshot.png
simple_text_example.jpeg		simple_text_example.jpeg
test.png		test.png
test.py		test.py
test2.png		test2.png
utf8.txt		utf8.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Computer Vision Final Project

Description

Poster

About

Releases

Packages

Contributors 3

Languages

jahaskell53/cv-finalproject

Folders and files

Latest commit

History

Repository files navigation

Computer Vision Final Project

Description

Poster

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages