Data Scientist | Automation Specialist | LLMs, Cloud Platforms
- 🌱 Currently working on instituting model pipelines on Cloud Platforms.
- 👨💻 Find my projects here
- 📫 Reach me out at [email protected]
- 📄 Refer my Resume for more information.
- 💼 I can also be reached out at - Portfolio
While working at Mayo Clinic, I operationalized an ETL pipeline integrating VertexAI, BigQuery, and Cloud Storage to process lab reports of patients with Lupus Anticoagulant and generate interpretations which is further streamed to a Dash Application through BigQuery enabling easier analysis for Hematopathologists. Furthermore, I acquired further experience with Data Cleaning and Pre-Processing by migrating 5000+ unstructured Word files into a GeoDatabase for Panelboards Circuit Reports.
As a Research Assistant at Northeastern University, I worked on formulating a dialogue-based voice assistant titled Auxel that enable blind and low vision individuals in performing data analysis efficiently through interactions with a GPT-3.5 turbo model. This groomed my skills in Natural Language Processing and understanding of Large Language Models. Later during Spring 2024, I took a course titled DS 5983: Large Language Models where I gained in-depth understanding of LLMs and became aware of the current trends in this field.
Previously a Data Engineer at FiftyFive Technologies, I worked on preparing a SQL-based ETL pipeline for business insights and enabled data-driven decision-making. I'm passionate about transforming data into actionable insights to address real-world challenges. I completed my undergraduate degree in Computer Science from Medi-Caps University. My professional journey in Machine Learning (ML) and Data Science began during my sophomore year when I joined a technical club titled Students' Technical and Innovation Club (STIC). Since then, I have undertaken numerous projects and internships, honing my skills and contributing to impactful solutions. Driven by a commitment to continuous learning, I pursued a master's degree to deepen my understanding of ML and Data Science and explore how these technologies drive business innovation. I am particularly excited about leveraging my expertise to improve real-world outcomes through data-driven approaches.
As a Machine Learning Intern at the prestigious Indian Institute of Technology Kharagpur, I spearheaded a comprehensive urban development analysis project across 10 major Indian metropolitan areas. I developed sophisticated Land Use classification models using Quantum Geographic Information System (QGIS) to analyze decade-long urbanization patterns from 2009 to 2019. Additionally, I engineered robust web scraping solutions using Selenium and Beautiful Soup to extract and correlate tourism data, enabling data-driven insights into the relationship between urban development and tourism trends. This analysis provided valuable insights for urban planning and tourism development strategies.
Fall 2024
- DS6983: Trustworthy GenAI
Spring 2024
- DS5983: Large Language Models
- DS5500: Capstone
Summer 2023
- DS5230: Unsupervised Machine Learning and Data Mining
Spring 2023
- DS6120: Natural Language Processing
- DS5220: Supervised Machine Learning
Fall 2022
- CS5800: Algorithms
- DS5110: Data Management and Processing
I'm always open to discussing data science, ML, healthcare innovations, or potential collaborations. Feel free to connect with me on LinkedIn or check out my latest work on Kaggle.