Welcome to my GitHub! I'm Sparsh, a Data Engineer with over three years of experience, specializing in Azure, Big Data technologies, and data-driven software solutions. Here, youβll find projects that showcase my skills in data engineering, ETL processes, machine learning, and video analytics.
I'm a recent MSc graduate in Big Data Science from Queen Mary University of London and currently working as a Data Science Intern at Assentian Limited. I have a background in Azure Big Data Engineering, SQL Server, and Data Vault modeling. My work focuses on data pipelines, cloud environments, and scalable solutions for data processing.
- Description: Developed a video analytics-based solution for construction site productivity.
- Technologies: Computer Vision, YOLOv9, LSTM, Python, Azure
- Highlights: Real-time personnel and PPE detection; deployed across several UK construction sites.
- Description: Led a team in designing databases using Data Vault principles and implemented a multi-layer architecture for data migration.
- Technologies: SQL Server, SSIS, Azure Synapse, Snowflake
- Highlights: Created robust ETL processes; migrated data seamlessly across different architectures.
- Description: Built a data pipeline to analyze NYC rideshare data, focusing on traffic patterns and customer demographics.
- Technologies: Hadoop, Spark, Python, Power BI
- Highlights: Leveraged big data to derive insights; visualized findings in Power BI.
- Programming: Python, SQL, T-SQL
- Big Data: Hadoop, Spark, MapReduce
- Databases: SQL Server, Azure Synapse, Snowflake
- ETL: SSIS, Azure Data Factory
- Cloud Platforms: Microsoft Azure, AWS
- Data Modeling: Data Vault, Dimensional Modeling
- Generative AI Fundamentals β Databricks
- Advanced SQL Certification β HackerRank
- Big Data Engineer Certification β Trendy Tech
- Global Agile Certification β Infosys