March Madness Predictions for NCAA Tournament 2017

In this deep learning project, we are going to predict which team will win the NCAA basketball tournament of coming 2017 based on past historical data.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

  • Understanding the problem statement

  • Importing the dataset and initializing the libraries

  • Applying basic EDA and checking or null values

  • Visualizing relationship between different variables using barplot

  • Plotting scatter plot for visualization

  • Plotting and visualizing a time series plot

  • Merging different dataset for evaluation

  • Understanding seasonality and trend

  • Spearman Correlation and calculation

  • Visualizing correlation using histogram

  • Basics of Neural Networks

  • Installing and initializing Keras framework

  • Creating a model from scratch , embedding and adding bias

  • Training the Artificial Neural Networks on Dataset

  • Plotting graphs as for loss versus the number of epochs

Project Description

Another year, another chance to predict the upsets, call the probabilities and put your bracketology skills to the leaderboard test. In this keras deep learning project, we will once again attempt to predict the outcomes of this year's US men's college basketball tournament. But unlike most deep learning projects, we will pick the winners and losers using a combination of rich historical data and computing power, while the ground truth unfolds on national television.

Data Introduction:

If you are unfamiliar with the format and intricacies of the NCAA tournament, we encourage reading the wikipedia page before diving into the data. The data description and schema may seem daunting at first but is not as complicated as it appears.

As a reminder, you are encouraged to incorporate your own sources of data. We have provided team-level historical data to jump-start the modeling process, but there is also player-level and game-level data that may be useful.

We extend our gratitude to Kenneth Massey for providing much of the historical data.

What to predict:

  1. We will predict the probabilities for every possible matchup in the past 4 NCAA tournaments (2013-2016).
  2. We will predict the probabilities for every possible matchup before the 2017 tournament begins.

Similar Projects

Big Data Project Sequence Classification with LSTM RNN in Python with Keras
In this project, we are going to work on Sequence to Sequence Prediction using IMDB Movie Review Dataset​ using Keras in Python.
Big Data Project Deep Learning with Keras in R to Predict Customer Churn
In this deep learning project, we will predict customer churn using Artificial Neural Networks and learn how to model an ANN in R with the keras deep learning package.
Big Data Project Human Activity Recognition Using Smartphones Data Set
In this deep learning project, you will build a classification system where to precisely identify human fitness activities.

Curriculum For This Mini Project

02h 10m
02h 34m