DeZyre - live hands on training
  • Home
  • Mini Projects
  • Blog
  • Sign In
  • FREE PROJECT RECIPES

Tutorials

Learn how you can build Big Data Projects


Apache Zookepeer Tutorial: Centralized Configuration Management

Establishing effective configuration management is an important step for building a distributed system. It is a complex process which helps in planning, identifying, tracking and verifying changes in the software. It is important to maintain configuration integrity throughout the life cycle of the system, this can be done by having good configuration management system.

Hadoop Zookeeper Tutorial

Zookeeper Apache is a distributed coordination service for distributed applications. It is designed to help users focus more on the functionality of the distributed application rather than worrying about the architecture. The centralized infrastructure and services provide synchronization across a Hadoop cluster.

Hadoop Sqoop Tutorial

Sqoop is a combination of SQL and Hadoop. Sqoop is a data transfer command line utility designed for efficiently importing and exporting data between RDBMS and HDFS. The data can be imported from any RDBMS like Oracle, MySQL into HDFS.

Hadoop PIG Tutorial

Apache Pig is designed to handle any kind of data. Apache Pig is a high level extensible language designed to reduce the complexities of coding MapReduce applications. Pig was developed at Yahoo to help people use Hadoop to emphasize on analysing large unstructured data sets by minimizing the time spent on writing Mapper and Reducer functions.

Hadoop Oozie Tutorial

Oozie is a server based job coordination system and workflow engine that runs in Java servlet-container. It is designed for executing workflow jobs with actions that trigger Pig jobs or MapReduce jobs. Oozie helps you string together a workflow of various coordinated jobs like Pig job, MapReduce job and a Hive Query.

Hadoop NoSQL Database Tutorial

A database that can be modelled through any other means apart from the traditional tabular relations is generally referred to as a NoSQL database. A NoSQL database organizes large distributed data sets into tuples - key value pairs and objects.

Hadoop Hive Tutorial

Apache Hive is a Hadoop run time component developed at Facebook. The data warehouse infrastructure is built on top of Hadoop stack to help users with querying, analysis and summarization. Apache Hive is a subset of SQL-92 plus Hive specific extensions. 

Hadoop HDFS Tutorial

Hadoop HDFS is a java based distributed file system for storing large unstructured data sets. Hadoop HDFS is designed to provide high performance access to data across large Hadoop clusters of commodity servers. It is referred to as the “Secret Sauce” of Apache Hadoop components as the data can be stored in blocks on the file system until the organization’s wants to leverage it for big data analytics.

Hadoop hBase Tutorial

Hadoop HBase is a real time, open source, column oriented, distributed database written in Java. HBase is modelled after Google’s BigTable and represents a key value column family store. It is built on top of Apache Hadoop and Zookeeper.

Hadoop Flume Tutorial

Apache Flume is an agent for data collection. It is generally used for log data. Flume takes data from several sources like Avro, Syslog’s, and files and delivers to various destinations like Hadoop HDFS or HBase.

1
2
3
4
5
6
7

Big Data and Hadoop Training Courses in Popular Cities

  • Hadoop Training in Texas
  • Hadoop Training in California
  • Hadoop Training in Dallas
  • Hadoop Training in Chicago
  • Hadoop Training in Charlotte
  • Hadoop Training in Dubai
  • Hadoop Training in Edison
  • Hadoop Training in Fremont
  • Hadoop Training in San Jose
  • Hadoop Training in Washington
  • Hadoop Training in New Jersey
  • Hadoop Training in New York
  • Hadoop Training in Atlanta
  • Hadoop Training in Canada
  • Hadoop Training in Abu Dhabi
  • Hadoop Training in Detroit
  • Hadoop Trainging in Germany
  • Hadoop Training in Houston
  • Hadoop Training in Virginia
  • Promotional Price
  • Microsoft Track
    Microsoft Professional Hadoop Certification Program
  • Hackerday

Online courses

  • Hadoop Training
  • Spark Certification Training
  • Data Science in Python
  • Data Science in R
  • Data Science Training
  • Contact Us
  • Mini Projects
  • Free Recipes
  • Blog
  • Tutorials
  • Privacy Policy
  • Disclaimer
Copyright 2019 Iconiq Inc. All rights reserved. All trademarks are property of their respective owners.