Data Engineer | Data Analyst | Cloud Architect | AWS Certified
π Portland, OR 97229
π 503.567.9964
π§ [email protected]
π LinkedIn Profile
I am a Data Engineer with 3+ years of experience in building scalable data solutions, optimizing ETL workflows, and ensuring data availability for business decision-making. I excel at using AWS, Snowflake, Python, and SQL to transform complex datasets into actionable insights. My expertise spans real-time data integration, cloud data platforms, and data warehouse solutions.
Data Engineer
July 2023 β Present
- Designed and implemented 10+ REST APIs for customer data, financial management, and payment systems.
- Optimized ETL workflows using AWS Glue and Python, reducing ingestion time by 30%.
- Built Snowflake data warehouses supporting analytics and business intelligence.
- Enhanced data integrity and query performance by 20% with dbt.
Data Analyst
January 2021 β July 2022
- Migrated workloads to Databricks, reducing costs by 35% and improving efficiency by 30%.
- Developed Python data quality packages to clean and ensure integrity of datasets.
- Created Power BI dashboards, enhancing executive decision-making by 20%.
Arizona State University - Tempe, AZ
Master of Science in Information Technology
August 2022 β May 2024
Vellore Institute of Technology - Vellore, India
Bachelor of Technology in Electronics and Communication Engineering (Specialization in IoT & Sensors)
- Programming: Python, SQL, R, Java, Scala
- Data Visualization: Power BI, Tableau
- Databases: Redshift, RDS, DynamoDB, PostgreSQL, MongoDB
- Libraries: NumPy, Pandas, Matplotlib, Scikit-learn, TensorFlow, PySpark
- ETL & Data Integration: AWS Glue, Databricks, dbt, Snowflake
- Big Data: Hadoop, Spark, Kafka
- Cloud Platforms: AWS (S3, Glue, Redshift), Azure (Data Factory, Synapse Analytics)
- CI/CD & Tools: Jenkins, Git, Docker, Kubernetes, Apache Airflow
- Certifications: AWS Certified Solution Architect, AWS Certified Developer, Snowflake Core Pro
- Implemented a PySpark pipeline on Azure Databricks to transform racing data from the Ergast API.
- Created Power BI dashboards to visualize driver and team performance metrics.
- Led the migration of raw data from Azure Data Lake into Snowflake using a Bronze-Silver-Gold architecture.
- Implemented Snowflake Streams for change data capture, optimizing reporting capabilities.