This Case study contains examples of Apache Pig commands to query and perform analysis on web server report. The log reports used in this example is generated by various web servers. The log reports contains time-stamped details of requested links, IP address, request type, server response and other data. The same data set is used for analysis in the MapReduce case studies and this case studies illustrated the simplicity of processing and analytics for Apache Pig over Hadoop MapReduce.
The analysis done in this case study reveals visits of specific user, visits per unit time, failed request. This analysis will be carried out by executing queries on the web server log report database.