In this machine learning project, you will use the video clip of an IPL match played between CSK and RCB to forecast key performance indicators like the number of appearances of a brand logo, the frames, and the shortest and longest area percentage in the video.
In this PySpark project, you will simulate a complex real-world data pipeline based on messaging. This project is deployed using the following tech stack - NiFi, PySpark, Hive, HDFS, Kafka, Airflow, Tableau and AWS QuickSight.
This project will cover the understanding of Apache Spark with main focus on one of its components, Spark SQL. We will understand how Spark and Spark SQL works, its internal functioning, its capabilities and advantages over other data processing tools. We are going to take up one business problem in the area of Supply Chain. Our tech stack will be Databricks and the latest Spark 3.0 for this project. We will use Spark SQL to understand the business data and generate insights from it which must help us frame a solution for our business problem.
In this Databricks Azure project, you will use Spark & Parquet file formats to analyse the Yelp reviews dataset. As part of this you will deploy Azure data factory, data pipelines and visualise the analysis.
Estimating churners before they discontinue using a product or service is extremely important. In this ML project, you will develop a churn prediction model in telecom to predict customers who are most likely subject to churn.