How does a Cloudera environment look in a real project?

DeZyre Support

1. In a real project, the following sets of machines can be part of a hadoop setup:
a. Cloudera cluster: several machines acting as datanodes/tasktrackers and one or more machines acting as namenode/jobtracker.
b. Client machines: machines from which the cluster is accessed to run hadoop commands [a client machine can itself be part of the hadoop cluster].
c. User machine: the machine from which a user connects remotely to the client machines to perform hadoop-specific operations.

2. To deploy a map-reduce program on a cloudera environment:
a. Prepare a jar containing the MapReduce code, along with any dependency jar files.
b. Add the jar to HADOOP_CLASSPATH, e.g. export HADOOP_CLASSPATH=$HADOOP_CLASSPATH:
c. Run the command: hadoop jar
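
As a rough illustration of steps b and c, the commands on a client machine might look like the sketch below. The jar names, the driver class com.example.WordCount and the HDFS paths are all made-up placeholders, not names from this thread; substitute your own. The script only submits the job when a hadoop client is actually on the PATH.

```shell
#!/bin/sh
# Hypothetical names throughout: the jars, the driver class, and the HDFS paths.
JOB_JAR="wordcount.jar"
DEP_JARS="lib/commons-example.jar:lib/parser-example.jar"
MAIN_CLASS="com.example.WordCount"

# Step b: append the job jar and its dependency jars to HADOOP_CLASSPATH,
# keeping whatever value was already set (no stray leading colon if it was empty).
export HADOOP_CLASSPATH="${HADOOP_CLASSPATH:+$HADOOP_CLASSPATH:}$JOB_JAR:$DEP_JARS"

# Step c: submit the job. When no hadoop client is installed, just print the command.
if command -v hadoop >/dev/null 2>&1; then
    hadoop jar "$JOB_JAR" "$MAIN_CLASS" /user/demo/input /user/demo/output
else
    echo "hadoop not found; would run: hadoop jar $JOB_JAR $MAIN_CLASS /user/demo/input /user/demo/output"
fi
```

The general form is hadoop jar followed by the jar file, then optionally the main class, then the program's own arguments (here an input and an output path, which is an assumption about what the driver expects).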

3. To start on a POC, a single [windows/linux] machine with the Cloudera VM installed on it would suffice.
Link : http://www.cloudera.com/content/support/en/downloads/download-components/download-products.html?productID=F6mO278Rvo
This VM comes with pseudo-distributed hadoop components and eclipse pre-installed.
Eclipse can be used to write the java-based map-reduce code, and the pre-installed cloudera hadoop can be used to run the program.
Once the functionality is complete, deploy the program on a 5-node hadoop cluster with a sizeable data set to showcase the difference in run time.
To deploy the 5-node cluster, you can refer to the dezyre administrator course material.
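
To put numbers on the time difference between the VM and the 5-node cluster, one simple option is to wrap the job submission in a small timing helper and run it once in each environment. This is only a sketch: time_job is a helper name invented here, and the jar, class and paths in the example call are placeholders.

```shell
# time_job: run the given command, print its wall-clock duration in whole seconds,
# and return the command's own exit status.
time_job() {
    tj_start=$(date +%s)
    tj_status=0
    "$@" || tj_status=$?
    tj_end=$(date +%s)
    echo "elapsed_seconds=$((tj_end - tj_start))"
    return "$tj_status"
}

# Example call (hypothetical jar, class, and paths); run it on the VM and on the cluster:
# time_job hadoop jar wordcount.jar com.example.WordCount /user/demo/input /user/demo/output
```

Use the same input data in both runs, otherwise the two timings are not comparable.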

May 07 2014 03:47 PM

Preethi

Hi Neha,
Thanks for your inputs.

Regards
Preethi

May 07 2014 07:42 PM

DeZyre Support

You are welcome Preethi.
If you need any further help, post here.
Please vote up the answer if you found it useful.

May 15 2014 12:34 AM
