**Assignment Module 3 Nasdaq Data Set Output **

Hadoop has one standard output only?
and the most important file is part-00000 on every hadoop output which has the results?

Where can I find the physical location of the output file?

cloudera@localhost nasdaq]$ hadoop fs -ls /nasdaq/output/
Found 3 items
-rw-r--r-- 3 cloudera supergroup 0 2014-08-04 15:58 /nasdaq/output/_SUCCESS
drwxr-xr-x - cloudera supergroup 0 2014-08-04 15:54 /nasdaq/output/_logs
-rw-r--r-- 3 cloudera supergroup 4546 2014-08-04 15:57 /nasdaq/output/part-00000
[cloudera@localhost nasdaq]$ hadoop fs -cat /nasdaq/output/part-00000

2 Answer(s)



part-00000 is the output. It is not a directory. _logs is a folder which has the execution log details.On the successful completion of a job, the MapReduce runtime creates a _SUCCESS file which may be useful for applications that need to see if a result set is complete just by inspecting HDFS.


-00001 will be the next no based on the no of reducers.