This repository contains code for multiple computer-vision approaches to identifying the vendor that manufactured a semiconductor chip. Chips from different vendors can look fairly similar, and telling them apart can be challenging. This project tackles the problem with convolutional neural networks, a tried-and-tested method for image classification. The code is a mix of Keras and PyTorch.
I largely rely on VGG16 as a backbone, reusing its ImageNet weights. That model was originally trained to classify 1000 object classes in images; here I repurpose its feature-extraction capabilities to identify 27 manufacturer classes.
- Post-VGG: Use VGG16 purely as a frozen feature extractor to obtain [2 x 2 x 512] image representations; flatten these and pass them through a 2-layer classification neural net (the classification head).
- Post-VGG-Aug: Use VGG16 as a feature extractor, but to counter the small dataset size and overfitting tendencies, add data augmentation (random horizontal flipping and random rotation) before the input layer.
- VGG-FT-Aug: Keep the VGG16 feature extractor and the augmentation step, but instead of backpropagating only through the classification head, also backpropagate the loss through the last 4–6 layers of VGG16, fine-tuning them for this application.
Model | Accuracy (%) | Top-3 Accuracy (%) | Macro F1 (%) |
---|---|---|---|
Post-VGG | 47.999 | 70.431 | 38.854 |
Post-VGG-Aug | 54.274 | 75.137 | 44.843 |
VGG-FT-Aug | 75.843 | 88.941 | 68.355 |
Fine-tuning (VGG-FT-Aug) performs the best. Some other things learnt through experimentation:
- Initialization: Xavier initialization leads to lower initial loss and smoother training
- Data augmentation: Significantly improves generalization, helps with validation loss immediately
- Preprocessing: Scaling RGB values from [0,255] to [0,1] improves training stability and performance
- Learning rate: a low learning rate for more epochs does much better than a high learning rate with weight-decay methods for fewer epochs
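A few of these tweaks can be expressed in a short snippet (assumed Keras API; the layer sizes and learning rate are examples, not the repo's exact values):

```python
from tensorflow import keras
from tensorflow.keras import layers

head = keras.Sequential([
    layers.Rescaling(1.0 / 255),                       # scale [0,255] -> [0,1]
    layers.Dense(256, activation="relu",
                 kernel_initializer="glorot_uniform"),  # Xavier initialization
    layers.Dense(27, activation="softmax",
                 kernel_initializer="glorot_uniform"),
])

# A low learning rate over many epochs worked better than a high LR
# with weight decay over fewer epochs.
head.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-5),
             loss="sparse_categorical_crossentropy")
```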
Public training logs for VGG-FT-Aug are available at this WandB dashboard.
- `train.ipynb` contains training code for all three models. If you re-run it, models will be exported to the `models/` directory.
- `evaluate.ipynb` has handy functions to re-run evaluations and build a comparison table
- `visualize.ipynb` has utilities to visualize training and test data
- The `data/` directory contains train, val and test data downloaded from IC-ChipNet
- `data.py` and `config.py` contain classes for the dataset and training configuration for quick experimentation
Credits: Many thanks to Reza and Crandall (2020) for putting together the IC-ChipNet dataset used here. Image credits: Wikipedia, François Chollet. The general idea and many code snippets for the three approaches come from the computer vision chapter of Deep Learning with Python.