sarthakforwet/Readme.md

Hi 👋, I'm Sarthak Khandelwal

Instagram | LinkedIn | Google Scholar | Kaggle

Data Scientist | Automation Specialist | LLMs, Cloud Platforms

  • 🌱 Currently working on building model pipelines on cloud platforms.
  • 👨‍💻 Find my projects here
  • 📫 Reach me at [email protected]
  • 📄 Refer to my Resume for more information.
  • 💼 I can also be reached through my Portfolio

While working at Mayo Clinic, I operationalized an ETL pipeline integrating Vertex AI, BigQuery, and Cloud Storage to process lab reports of patients with Lupus Anticoagulant and generate interpretations, which are then streamed through BigQuery to a Dash application, enabling easier analysis for hematopathologists. I also gained experience with data cleaning and pre-processing by migrating 5000+ unstructured Word files into a geodatabase for Panelboards Circuit Reports.

As a Research Assistant at Northeastern University, I worked on building a dialogue-based voice assistant called Auxel that enables blind and low-vision individuals to perform data analysis efficiently through interactions with a GPT-3.5 Turbo model. This strengthened my skills in Natural Language Processing and my understanding of Large Language Models. Later, in Spring 2024, I took the course DS5983: Large Language Models, where I gained an in-depth understanding of LLMs and of current trends in the field.

Previously, as a Data Engineer at FiftyFive Technologies, I built a SQL-based ETL pipeline for business insights that enabled data-driven decision-making. I'm passionate about transforming data into actionable insights to address real-world challenges. I completed my undergraduate degree in Computer Science at Medi-Caps University. My professional journey in Machine Learning (ML) and Data Science began during my sophomore year, when I joined a technical club called the Students' Technical and Innovation Club (STIC). Since then, I have undertaken numerous projects and internships, honing my skills and contributing to impactful solutions. Driven by a commitment to continuous learning, I pursued a master's degree to deepen my understanding of ML and Data Science and to explore how these technologies drive business innovation.

As a Machine Learning Intern at the prestigious Indian Institute of Technology Kharagpur, I spearheaded a comprehensive urban development analysis project across 10 major Indian metropolitan areas. I developed sophisticated Land Use classification models using Quantum Geographic Information System (QGIS) to analyze decade-long urbanization patterns from 2009 to 2019. Additionally, I engineered robust web scraping solutions using Selenium and Beautiful Soup to extract and correlate tourism data, enabling data-driven insights into the relationship between urban development and tourism trends. This analysis provided valuable insights for urban planning and tourism development strategies.



Sarthak Khandelwal's GitHub Stats

Kaggle Badges


Programming Workbench:

Python, SQL, R


Data Science Workbench:

Database

MySQL, MongoDB, BigQuery

Data Science Frameworks

NumPy, pandas, TensorFlow, scikit-learn, PyTorch, OpenCV, Matplotlib, Seaborn, XGBoost, Hugging Face, LangChain, Plotly, SciPy

Cloud Platforms

Relevant Coursework

Fall 2024

  • DS6983: Trustworthy GenAI

Spring 2024

  • DS5983: Large Language Models
  • DS5500: Capstone

Summer 2023

  • DS5230: Unsupervised Machine Learning and Data Mining

Spring 2023

  • DS6120: Natural Language Processing
  • DS5220: Supervised Machine Learning

Fall 2022

  • CS5800: Algorithms
  • DS5110: Data Management and Processing

Let's Connect!

I'm always open to discussing data science, ML, healthcare innovations, or potential collaborations. Feel free to connect with me on LinkedIn or check out my latest work on Kaggle.

Popular repositories

  1. YoloV3_Object_Detector (Public)

    Repo for Custom Object Detection

    Python 3

  2. Foodie_Partners (Public)

    An application that helps people find their foodie partner!

    HTML 3

  3. CyclicGAN_API (Public)

    This repository contains an implementation of CyclicGAN based on the official research paper.

    Python 3 1

  4. FakeNews (Public)

    [ WIP ] Model to detect Fake News

    Jupyter Notebook 1 1

  5. DetectGPT (Public)

    A repository implementing the original DetectGPT paper using Python and PyTorch.

    Python 1 1

  6. OncoScanAI-Image-Segmentation-for-Gastrointestinal-Tract-Cancer (Public)

    This repository contains the codebase for the image segmentation task completed as part of the capstone project.

    Jupyter Notebook 1 1