Data Science Project-TalkingData AdTracking Fraud Detection

Data Science Project-TalkingData AdTracking Fraud Detection

Machine Learning Project in R-Detect fraudulent click traffic for mobile app ads using R data science programming language.

Videos

Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

Understanding the problem statement
Importing a training dataset and testing
Installing necessary libraries and understanding its use
Performing basic EDA and checking for null values
Timestamping the necessary columns
Checking for unique values and data types
Checking for the relationship between different variables
Visualizing the distribution through ggplot
Understanding Decision tree, random forest, logistic regression,SVM , boosting ,bagging models ,cart ,and neural network
Defining the evaluation metrics
Prediction using all the features by applying Logistic Regression
Dividing the dataset into train and test dataset
Data Balancing using Smote
Applying Cross-Validation to avoid overfitting
Using "varImp" function in R to get the best features fo the model
Applying ensemble method Random Forest model
Applying bagging model Decision Tree
Applying linear model Logistic Regression
Using the confusion matrix to visualize the predictions
Selecting the final model and making predictions on the test dataset

Project Description

Fraud risk is everywhere, but for companies that advertise online, click fraud can happen at an overwhelming volume, resulting in misleading click data and wasted money. Ad channels can drive up costs by simply clicking on the ad at a large scale. With over 1 billion smart mobile devices in active use every month, China is the largest mobile market in the world and therefore suffers from huge volumes of fraudulent traffic. 

In this machine learning project, you will build a machine learning model to determine whether a click is fraud or not.

Similar Projects

Machine Learning Project - Work with KKBOX's Music Recommendation System dataset to build the best music recommendation engine.

In this machine learning project, you will uncover the predictive value in an uncertain world by using various artificial intelligence, machine learning, advanced regression and feature transformation techniques.

There are different time series forecasting methods to forecast stock price, demand etc. In this machine learning project, you will learn to determine which forecasting method to be used when and how to apply with time series forecasting example.

Curriculum For This Mini Project

Problem Statement
03m
Data Set
02m
Install Libraries
02m
Import Data Set
04m
Data Set Overview
03m
Next Steps
02m
Recap in Rstudio
10m
Missing Data
04m
Analyse Click Time variable
04m
Analyse Features
01m
Convert variables to correct Data types
03m
Exploratory Data Analysis
26m
Model Creation using all Features
08m
Recap
03m
Selecting Important Features
02m
Tune length Parameter
11m
Model Creation using selected Features
01m
Split Data into Training and Testing
05m
Data Balancing using SMOTE
11m
Model Creation - Decision Tree
15m
Model Creation - Random Forest
20m