This repository contains a collection of data analysis projects that apply a range of techniques, methods, and tools to real-world datasets. The aim is to demonstrate proficiency in data analysis, data visualization, statistical modeling, and machine learning using R and Python.
- Description: A comprehensive business analysis in R, focusing on data-driven decision-making through statistical analysis and visualization.
- Key Features:
- Data preprocessing and cleaning
- Exploratory Data Analysis (EDA)
- Regression analysis and predictive modeling
- Technologies: R, dplyr, ggplot2, tidyverse, forecast
- More Details
- Description: This project analyzes flight delay data for American Airlines flights departing from New York John F. Kennedy International Airport (JFK). The analysis explores patterns in flight delays, identifies peak delay times, and provides insight into the factors contributing to delays.
- Selected Parameters:
- Carrier: American Airlines
- Airport: New York John F. Kennedy International Airport (JFK)
- Date Range: 2019-07-01 to 2024-07-01
- Technologies used: Python, Pandas, Matplotlib, Jupyter Notebook.
The dataset comes from the Bureau of Transportation Statistics (BTS), which publishes comprehensive data on U.S. flights, including delay statistics. It covers American Airlines departures from JFK over the selected date range.
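
As a rough illustration of the kind of filtering and aggregation this analysis involves, the sketch below keeps only American Airlines departures from JFK and averages the departure delay per scheduled hour. It is not the notebook's actual code. The column names (`OP_UNIQUE_CARRIER`, `ORIGIN`, `CRS_DEP_TIME`, `DEP_DELAY`), the `hhmm` time format, and the helper `mean_delay_by_hour` are assumptions for illustration, and the sample rows are made up. It uses only the standard library so it runs without any setup.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Column names are assumptions modeled on BTS on-time exports;
# adjust them to match the headers in the file you actually download.
CARRIER_COL = "OP_UNIQUE_CARRIER"
ORIGIN_COL = "ORIGIN"
SCHEDULED_DEP_COL = "CRS_DEP_TIME"  # assumed format: scheduled departure as hhmm
DEP_DELAY_COL = "DEP_DELAY"         # assumed unit: minutes (negative = early)


def mean_delay_by_hour(rows, carrier="AA", origin="JFK"):
    """Return {scheduled departure hour: mean departure delay in minutes}."""
    delays = defaultdict(list)
    for row in rows:
        # Keep only the selected carrier and origin airport.
        if row[CARRIER_COL] != carrier or row[ORIGIN_COL] != origin:
            continue
        raw_delay = row[DEP_DELAY_COL].strip()
        if not raw_delay:
            # Skip rows with no recorded delay (for example, cancelled flights).
            continue
        hour = int(row[SCHEDULED_DEP_COL].strip().zfill(4)[:2])
        delays[hour].append(float(raw_delay))
    return {hour: mean(values) for hour, values in sorted(delays.items())}


# Tiny made-up sample standing in for a real BTS file.
SAMPLE_CSV = """OP_UNIQUE_CARRIER,ORIGIN,CRS_DEP_TIME,DEP_DELAY
AA,JFK,0815,10
AA,JFK,0840,20
AA,JFK,1730,45
AA,JFK,1755,
DL,JFK,0830,5
AA,LGA,0900,60
"""

hourly = mean_delay_by_hour(csv.DictReader(io.StringIO(SAMPLE_CSV)))
peak_hour = max(hourly, key=hourly.get)
print(hourly)
print("Peak delay hour:", peak_hour)
```

In a notebook, the same step would typically be a pandas boolean filter followed by a `groupby` on the departure hour.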
- Clone the repository:
git clone https://github.com/yourusername/data-portfolio.git