Storing Internet generate traffic data and processing to get useful insights could help us in understanding the customer behavior and to serve the users better to make lives better in the advanced  world. Data that has been generating over the network is increasing exponentially. But the existing data warehouse systems does not provide much scalability at less cost with higher performance. Instead of using costly warehouse systems, with the help of commodity hardware and distribution process we can serve the customers at any scale. Even if the Data generated is exponential to 10, it could be scalable simply by using Hadoop. In this case we just need to add few more nodes to increase the size of the cluster. Because, storage is cheaper than processor.