Why do we need to upload the job to HDFS and then also submit the job to the JobTracker?


3 Answer(s)


Hi Sree,
Once the JobTracker assigns tasks to the various TaskTrackers, every TT needs access to the codebase (the job JAR and configuration) in order to run the Map/Reduce steps. That is why Steps 5 and 6 are both required and not redundant: Step 5 stages the code in HDFS, where every node can reach it, and Step 6 tells the JobTracker to schedule the job.
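To make the two steps concrete, here is a minimal driver sketch in the org.apache.hadoop.mapreduce API (the class name SubmitDemo is my own illustration, not from the course material; no mapper or reducer is set, so the identity defaults run):

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
    import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

    public class SubmitDemo {
        public static void main(String[] args) throws Exception {
            Job job = new Job(new Configuration(), "submit-demo");

            // Tell the client which JAR contains the map/reduce code.
            job.setJarByClass(SubmitDemo.class);

            FileInputFormat.addInputPath(job, new Path(args[0]));
            FileOutputFormat.setOutputPath(job, new Path(args[1]));

            // waitForCompletion() first stages the job JAR, the serialized
            // configuration, and the split metadata into an HDFS staging
            // directory (Step 5), and only then submits the job to the
            // JobTracker (Step 6). Each TaskTracker later pulls the JAR
            // from HDFS before launching its tasks.
            System.exit(job.waitForCompletion(true) ? 0 : 1);
        }
    }

Staging through HDFS means the JobTracker never has to ship the code itself; every TaskTracker pulls the JAR from a location all nodes can reach.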

So the TaskTrackers go to the job's location in HDFS to fetch the algorithm code?

I'm not sure about this block diagram; I remember Shoban flipped through it quickly in class.
One other question. The Apache API doc says "The Hadoop Map-Reduce framework spawns one map task for each InputSplit generated by the InputFormat for the job", but the block diagram shows the file splits being created by the client. Are the framework and the client the same thing?
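As far as I can tell the two descriptions are consistent: in classic MRv1 the split computation is framework code that runs inside the client JVM during submission, so "the client creates the splits" and "the framework spawns one map task per InputSplit" describe the same step. Here is a sketch (hypothetical input path in args[0]) that makes the same getSplits() call the submission client makes:

    import java.util.List;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapreduce.InputSplit;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
    import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;

    public class SplitDemo {
        public static void main(String[] args) throws Exception {
            Job job = new Job(new Configuration(), "split-demo");
            FileInputFormat.addInputPath(job, new Path(args[0]));

            // During submission the client calls InputFormat.getSplits()
            // and writes the resulting split metadata to HDFS; the
            // JobTracker then schedules one map task per split, ideally
            // on a node that holds that block.
            List<InputSplit> splits = new TextInputFormat().getSplits(job);
            for (InputSplit split : splits) {
                System.out.println(split);   // e.g. path:offset+length
            }
            System.out.println(splits.size() + " splits -> "
                    + splits.size() + " map tasks");
        }
    }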