
**Assignment Module 3 Nasdaq Data Set Output**

Does a Hadoop job have only one standard output?
And is part-00000 the most important file in every Hadoop output directory, the one that holds the results?

Where can I find the physical location of the output file?

[cloudera@localhost nasdaq]$ hadoop fs -ls /nasdaq/output/
Found 3 items
-rw-r--r-- 3 cloudera supergroup 0 2014-08-04 15:58 /nasdaq/output/_SUCCESS
drwxr-xr-x - cloudera supergroup 0 2014-08-04 15:54 /nasdaq/output/_logs
-rw-r--r-- 3 cloudera supergroup 4546 2014-08-04 15:57 /nasdaq/output/part-00000
[cloudera@localhost nasdaq]$ hadoop fs -cat /nasdaq/output/part-00000

2 Answer(s)



part-00000 is the output file, not a directory. _logs is a directory that holds the execution log details. On successful completion of a job, the MapReduce runtime creates a _SUCCESS file, which is useful for applications that need to check whether a result set is complete just by inspecting HDFS.
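
As a rough sketch of the _SUCCESS check described above, a script could scan the `hadoop fs -ls` listing for the marker file. The helper name `output_complete` is hypothetical and not part of Hadoop; it only assumes the listing format shown in the question, where each line ends with the full path.

```shell
# Hypothetical helper (not a Hadoop command): reads an `hadoop fs -ls`
# listing on stdin and exits with status 0 if a _SUCCESS entry is present.
output_complete() {
  grep -q '/_SUCCESS$'
}

# Intended use against the directory from the question
# (requires a working hadoop client, so it is left commented out):
#   hadoop fs -ls /nasdaq/output/ | output_complete && echo "result set is complete"
```

The check relies only on the exit status, so it can be dropped into an `if` or chained with `&&` before reading part-00000.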


part-00001 will be the next file. The number of part files depends on the number of reducers, with one file per reducer.
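
To illustrate the numbering, here is a small sketch that prints the part file names one would expect for a given number of reducers. It assumes the naming seen in the listing above (part- followed by a five-digit, zero-padded index); the function name `part_names` is made up for this example.

```shell
# Hypothetical helper: print the expected part file names for N reducers,
# assuming the part-NNNNN naming seen in the listing from the question.
part_names() {
  n="$1"
  i=0
  while [ "$i" -lt "$n" ]; do
    printf 'part-%05d\n' "$i"   # zero-pad the reducer index to five digits
    i=$((i + 1))
  done
}

# Example: a job configured with 3 reducers
part_names 3
```

With a single reducer, as in the job from the question, only part-00000 appears.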
