Analysis of Community Interactions using Spark GraphX

Analysis of Community Interactions using Spark GraphX

The goal of this spark project is to analyse the level and strength of interactions across areas of coverage of a telecom provider between different areas in the city of Milan.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

Customer Love

Read All Reviews

Prasanna Lakshmi T

Advisory System Analyst at IBM

Initially, I was unaware of how this would cater to my career needs. But when I stumbled through the reviews given on the website. I went through many of them and found them all positive. I would... Read More

Camille St. Omer

Artificial Intelligence Researcher, Quora 'Most Viewed Writer in 'Data Mining'

I came to the platform with no experience and now I am knowledgeable in Machine Learning with Python. No easy thing I must say, the sessions are challenging and go to the depths. I looked at graduate... Read More

What will you learn

Introduction to Graphs and why are they essential
Simple Graphs, BIpirate Graphs, Directed, Undirected graphs, Multi Graphs and Property Graphs
Understanding Apache Spark
Understanding Hadoop MapReduce framework
Introduction to Spark GraphX
Visualizations restrictions related to Spark GraphX
The use case for Graph storage and Graph processing
Building a graph structure from our dataset
Downloading the dataset and performing basic EDA
Visualizing edges and nodes of the graph by writing queries in Scala
Use of Spark GraphX graph operators
Analysis of graph using GraphX
Exploring functions and features of Spark GraphX
Performing parallel algorithmic computing like page rank on Graph using GraphX

Project Description

For all telecommunication providers, using the call details record (CDR), it is quite easy to measure the level and strength of interaction across areas of coverage. Such measurement enables the providers to make further decisions regarding sales, promotions and engineering details to improve the quality of service amongst other things.

In this spark graphx project, we will be doing an analysis of the level and strength of interactions between different areas in the city of Milan. Our dataset for this big data project include datasets from Dandelion Open big data website which contains aggregated anonymized call records based on the calls exchanged between Telecom Italia Mobile users.

Similar Projects

In this big data project, we will look at how to mine and make sense of connections in a simple way by building a Spark GraphX Algorithm and a Network Crawler.

In this Neo4j project, you will do network analysis using a graph database to find patterns on how a social network affects business reviews and ratings.

Curriculum For This Mini Project

03h 33m
01h 49m