Hello! My name is Prithvi Nuthanakalva, and I'm currently a Machine Learning Engineer at a startup. In my previous career, I helped build and ship GPS Positioning and Navigation software, and have extensive experience in a technical customer facing role.
Feel free to connect with me on LinkedIn
As a personal project, I recently built an end to end ML application which uses a ML model to provide fantasy recommendations for cricket games based on the players involved, also provides odds of a team winning. For every game, predictions are available 30 minutes prior to game start time (100s of simualtions are run in the background once the teams are announced). The website is live at betmancric.com
Sharing some personal ML projects I've worked on
-
Language Models
- Simple Seq2Seq using Transformers (2023)
- To get a better understanding of how Transformers work, I experimented with a simple example to sort strings using Transformers. Trained a network using Pytorch
- Patient Clinical Notes (2022)
- Finetuned a Huggingface BERT model in two stages (a) Masked Language Modeling task using the unlabeled patient notes and (b) Named Entity Recognition task using the labeled Description -> Symptom dataset
- Simple Seq2Seq using Transformers (2023)
-
Neural Network from first principles using only Numpy (2018)
- Implemented a Fully connected dense layer architecture using just simple Numpy with functions to backpropagate and update gradients
- Trained on MNIST dataset using Fully connected layers with reasonable accuracy (~98%)
- Implemented a Fully connected dense layer architecture using just simple Numpy with functions to backpropagate and update gradients
-
Recurrent Neural Networks
- Pricing Optimization using LSTM Encoder-Decoder Architecture (2023)
- Using synthetic data from a Google blog, built a simple demand forecasting model to drive price optimization decisions
- Simple Sequence2Sequence Network using RNNs (2018)
- Built using Keras using the Encoder <-> Decoder architecture and trained on a simple task of sorting characters. For eg. 'adgbf' would output the sorted sequence 'abdfg'. This architecture can be easily expanded to do language translation for eg.
- Built using Keras using the Encoder <-> Decoder architecture and trained on a simple task of sorting characters. For eg. 'adgbf' would output the sorted sequence 'abdfg'. This architecture can be easily expanded to do language translation for eg.
- CharacterLevelRNN (2018)
- Trained a simple character level RNN on a random C code base
- Trained a simple character level RNN on a random C code base
- Pricing Optimization using LSTM Encoder-Decoder Architecture (2023)
-
Convolutional Neural Networks
- Carvana Image Segmentation (2018)
- Kaggle contest to segment the outline of a car in an image. Used the UNet architecture and built in Keras
- Kaggle contest to segment the outline of a car in an image. Used the UNet architecture and built in Keras
- Yelp Restaurant Photo Classification (2018)
- 16th placed solution to classify crowd sourced images of restaurants
- 16th placed solution to classify crowd sourced images of restaurants
- Kuzushiji Character Recognition (2020)
- 22nd place in a contest to read characters from 1000 year old Kuzushiji documents. Source code to be uploaded soon
- 22nd place in a contest to read characters from 1000 year old Kuzushiji documents. Source code to be uploaded soon
- Carvana Image Segmentation (2018)
-
Generative Adverserial Networks
- Speech Sample Generation (2018)
- Using Small Speech samples, built a GAN that can generate speech. Built using Keras on a dataset provided by Google
- Using Small Speech samples, built a GAN that can generate speech. Built using Keras on a dataset provided by Google
- Speech Sample Generation (2018)