Skip to content

Latest commit

 

History

History
41 lines (27 loc) · 1.66 KB

README.md

File metadata and controls

41 lines (27 loc) · 1.66 KB

Predicting March Madness 2018

We plan on creating a machine learning program in order to simulate college basketball March Madness results for 2018. It can be used for bets on different games or even for building your own bracket in a chance to win the jackpot.

Abstract

Use a Kaggle dataset containing tournament and regular season data alongside seedings and other variables to obtain potential standings in the 2018 March Madness tournament. Experiment with different classifiers and different types of data to produce the best machine learning algorithm to best reach our goal.

Timeline

Week 3

Figure out how to import data via pandas or other related data packages found in python like numpy. Mabye graph our data to see trends and isolate good labels (input) data for our program. Also perhaps try to weed out labels that might be too correlated.

Week 4

Learn how to use scikit-learn for simpler classifiers, also think about its applications in our project

Week 5

Continue to get adept with scikit-learn by using more advanced classifiers and start thinking about which one to use.

Week 6

Begin creating a python program using our chosen classifier and input data.

Week 7

Continue coding until finished, debug errors, finalize product.

Week 8

" "

Week 9

" "

Week 10

Hopefully finish. If we finish earlier, we can also pick up another sport related data set do more with the predictions, such as comparing tournaments. WIP

Data Sources

Kaggle

Technologies

Python (pandas, scikit-learn, matplotlib) for algorithm and loading/manipulating data

Club Resources needed

Free Food? :D and maybe more members who want to learn or know machine learning.