This project aims to to predict the quality of wine using linear regression.
- The dataset used for this project can be found on Kaggle under the name "Red Wine Quality".
- The main aim of the red wine quality dataset is to predict which of the physiochemical features make good wine. With 11 variables and 1 output variable (quality) given.
- Python 3.6 or above
- NumPy
- Pandas
- Matplotlib
- Scikit-learn
- Define the Problem
- Data Gathering
- Data Cleaning
- Data Exploration and Visualization
- Train the algorithm
- Download the datset from kaggle and save it in the project directory.
- Run the redwine_quality_prediction.ipynb file, which contains the code for data preprocessing, model training, and evaluation.
- The script will load the dataset, preprocess the data, split it into training and testing sets, train a linear regression model, and evaluate its performance.
- After the model training is completed, it will predict quality of the wine.
The dataset used in this project is subject to the licensing terms provided by Kaggle. Please refer to the dataset's license for more details.
Contributions to this project are welcome. If you find any issues or have suggestions for improvements, please feel free to open an issue or submit a pull request.
For any questions or inquiries, please contact [[email protected]].