Building Hadoop Clusters On Linux In EC2
http://www.higherpass.com/linux/Tutorials/Building-Hadoop-Clusters-On-Linux-In-Ec2/
Learn to build and use multi-node Hadoop clusters running in Amazon EC2. A few bits of knowledge are assumed in this article, first a basic knowledge of Hadoop. If you haven’t used hadoop before you probably want to read Intro To Hadoop Article first.
http://www.higherpass.com/java/Tutorials/Building-Hadoop-Mapreduce-Jobs-In-Java/
Hadoop is a parallel job processing framework from the Apache foundation. The hadoop framework is written in java and supports jar files for job execution. This tutorial is going to cover building a MapReduce job in java. The dataset being used will be the 2000 US Census available as an EBS volume snapshot on Amazon EC2. The census dataset is extemely large, and only a small part of the overall dataset will be explained.

December 22nd, 2009 at 11:56 am
[...] This post was mentioned on Twitter by AlbertOFF FabianON, php-app-engine. php-app-engine said: #ec2: Building Hadoop Clusters On Linux In EC2 http://php-app-engine.com/2009/how-to/building-hadoop-clusters-on-linux-in-ec2/ [...]