Skip to content

This program was built over a 9 week period to leveragee machine learning and regression to predict the rankings of March Madness basketball tournaments. We used scikit-learn for the machine learning, Pandas for the pre-processing of data. During this nine week period we explored the theory behind different machine learning algorithms to help de…

Notifications You must be signed in to change notification settings

hirish99/sports-tournament-prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Predicting March Madness 2018

We plan on creating a machine learning program in order to simulate college basketball March Madness results for 2018. It can be used for bets on different games or even for building your own bracket in a chance to win the jackpot.

Abstract

Use a Kaggle dataset containing tournament and regular season data alongside seedings and other variables to obtain potential standings in the 2018 March Madness tournament. Experiment with different classifiers and different types of data to produce the best machine learning algorithm to best reach our goal.

Timeline

Week 3

Figure out how to import data via pandas or other related data packages found in python like numpy. Mabye graph our data to see trends and isolate good labels (input) data for our program. Also perhaps try to weed out labels that might be too correlated.

Week 4

Learn how to use scikit-learn for simpler classifiers, also think about its applications in our project

Week 5

Continue to get adept with scikit-learn by using more advanced classifiers and start thinking about which one to use.

Week 6

Begin creating a python program using our chosen classifier and input data.

Week 7

Continue coding until finished, debug errors, finalize product.

Week 8

" "

Week 9

" "

Week 10

Hopefully finish. If we finish earlier, we can also pick up another sport related data set do more with the predictions, such as comparing tournaments. WIP

Data Sources

Kaggle

Technologies

Python (pandas, scikit-learn, matplotlib) for algorithm and loading/manipulating data

Club Resources needed

Free Food? :D and maybe more members who want to learn or know machine learning.

About

This program was built over a 9 week period to leveragee machine learning and regression to predict the rankings of March Madness basketball tournaments. We used scikit-learn for the machine learning, Pandas for the pre-processing of data. During this nine week period we explored the theory behind different machine learning algorithms to help de…

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages