Apache LogAnalysis using Pig

http://developer.amazonwebservices.com/connect/entry.jspa?externalID=2728

Analyze your Apache logs using Pig and Amazon Elastic MapReduce.

* Total bytes transferred per hour
* A list of the top 50 IP addresses by traffic per hour
* A list of the top 50 external referrers
* The top 50 search terms in referrals from Bing and Google

You can modify the Pig script to generate additional information.

Leave a Reply