This project contains an analysis of Stack Overflow datasets, both old and new, using various data science and machine learning techniques.
This Jupyter notebook contains the analysis performed on the old Stack Overflow dataset. It includes data loading, exploration, and possibly some preprocessing steps.
This Jupyter notebook contains the analysis performed on the new Stack Overflow dataset. It includes data loading, exploration, preprocessing, and model-building steps.
- pandas
- seaborn
- wordcloud
- nltk
- re (Python standard library)
- string (Python standard library)
- scikit-learn (imported as sklearn)
- tensorflow
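Before opening the notebooks, it can help to confirm that the listed modules are importable. The following is a minimal sketch, not part of the project itself: the helper name `find_missing` and the constant `REQUIRED_MODULES` are illustrative assumptions, and the check only confirms that each module can be located, not that its version matches what the notebooks expect.

```python
import importlib.util

# Import names taken from the dependency list above.
# `re` and `string` ship with Python, so they should always be found.
REQUIRED_MODULES = [
    "pandas",
    "seaborn",
    "wordcloud",
    "nltk",
    "re",
    "string",
    "sklearn",
    "tensorflow",
]


def find_missing(modules):
    """Return the module names that cannot be located in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules are importable.")
```

Any module reported as missing can be installed with pip. Note that the `sklearn` import name is provided by the `scikit-learn` package on PyPI.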
To run the notebooks, you need Jupyter installed in your environment. If it is not installed, you can install it by running:
pip install jupyter
After installing Jupyter, you can start it by running:
jupyter notebook
This starts the Jupyter Notebook server and opens its interface in your browser. From there, you can navigate to the location of the notebooks and open them.
- Atharva Werulkar - ([email protected])
- Parshav Sharma - ([email protected])
- Sanket Verma - ([email protected])
- Arihant Paroha - ([email protected])
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please adhere to this project's code of conduct.
Distributed under the MIT License. See License for more information.