... more stuff
at
php-app-engine.com

Building Hadoop Clusters On Linux In EC2

http://www.higherpass.com/linux/Tutorials/Building-Hadoop-Clusters-On-Linux-In-Ec2/

Learn to build and use multi-node Hadoop clusters running in Amazon EC2. A few bits of knowledge are assumed in this article, first a basic knowledge of Hadoop. If you haven’t used hadoop before you probably want to read Intro To Hadoop Article first.

http://www.higherpass.com/java/Tutorials/Building-Hadoop-Mapreduce-Jobs-In-Java/

Hadoop is a parallel job processing framework from the Apache foundation. The hadoop framework is written in java and supports jar files for job execution. This tutorial is going to cover building a MapReduce job in java. The dataset being used will be the 2000 US Census available as an EBS volume snapshot on Amazon EC2. The census dataset is extemely large, and only a small part of the overall dataset will be explained.

One Response to “Building Hadoop Clusters On Linux In EC2”

  1. Tweets that mention Building Hadoop Clusters On Linux In EC2 › ec2base -- Topsy.com Says:

    [...] This post was mentioned on Twitter by AlbertOFF FabianON, php-app-engine. php-app-engine said: #ec2: Building Hadoop Clusters On Linux In EC2 http://php-app-engine.com/2009/how-to/building-hadoop-clusters-on-linux-in-ec2/ [...]

Leave a Reply