sarthakforwet/sarthakforwet

Hi 👋, I'm Sarthak Khandelwal

Instagram | LinkedIn | Google Scholar | Kaggle

Data Scientist | Automation Specialist | LLMs, Cloud Platforms

  • 🌱 Currently building model pipelines on cloud platforms.
  • 👨‍💻 Find my projects here
  • 📫 Reach me at [email protected]
  • 📄 Refer to my Resume for more information.
  • 💼 I can also be reached through my Portfolio

While working at Mayo Clinic, I operationalized an ETL pipeline integrating Vertex AI, BigQuery, and Cloud Storage to process lab reports of patients with Lupus Anticoagulant and generate interpretations. The interpretations are streamed through BigQuery to a Dash application, which makes analysis easier for hematopathologists. I also gained experience in data cleaning and preprocessing by migrating 5000+ unstructured Word files into a geodatabase for panelboard circuit reports.

As a Research Assistant at Northeastern University, I worked on building a dialogue-based voice assistant named Auxel that enables blind and low-vision individuals to perform data analysis efficiently through interactions with a GPT-3.5 Turbo model. This work strengthened my skills in Natural Language Processing and my understanding of Large Language Models. Later, in Spring 2024, I took the course DS5983: Large Language Models, where I gained an in-depth understanding of LLMs and of current trends in the field.

Previously, as a Data Engineer at FiftyFive Technologies, I built a SQL-based ETL pipeline for business insights that enabled data-driven decision-making. I'm passionate about transforming data into actionable insights to address real-world challenges. I completed my undergraduate degree in Computer Science at Medi-Caps University. My professional journey in Machine Learning (ML) and Data Science began during my sophomore year, when I joined a technical club called the Students' Technical and Innovation Club (STIC). Since then, I have undertaken numerous projects and internships, honing my skills and contributing to impactful solutions. Driven by a commitment to continuous learning, I pursued a master's degree to deepen my understanding of ML and Data Science and to explore how these technologies drive business innovation. I am particularly excited about applying my expertise to improve real-world outcomes through data-driven approaches.

As a Machine Learning Intern at the Indian Institute of Technology Kharagpur, I led an urban development analysis project across 10 major Indian metropolitan areas. I developed Land Use classification models using Quantum Geographic Information System (QGIS) to analyze urbanization patterns from 2009 to 2019. I also built web scrapers with Selenium and Beautiful Soup to extract tourism data and correlate it with urban development, which provided insights for urban planning and tourism development strategies.



Sarthak Khandelwal's GitHub Stats

Kaggle Badges


Programming Workbench:

Python, SQL, R


Data Science Workbench:

Database

MySQL, MongoDB, BigQuery

Data Science Frameworks

NumPy, Pandas, TensorFlow, Scikit-Learn, PyTorch, OpenCV, Matplotlib, Seaborn, XGBoost, Hugging Face, LangChain, Plotly, SciPy

Cloud Platforms

Relevant Coursework

Fall 2024

  • DS6983: Trustworthy GenAI

Spring 2024

  • DS5983: Large Language Models
  • DS5500: Capstone

Summer 2023

  • DS5230: Unsupervised Machine Learning and Data Mining

Spring 2023

  • DS6120: Natural Language Processing
  • DS5220: Supervised Machine Learning

Fall 2022

  • CS5800: Algorithms
  • DS5110: Data Management and Processing

Let's Connect!

I'm always open to discussing data science, ML, healthcare innovations, or potential collaborations. Feel free to connect with me on LinkedIn or check out my latest work on Kaggle.

About

Config files for my GitHub profile.
