The Hadoop Tutorial Series

hadoop+elephant_rgb

[Note: this was written in 2009 (!) when hadoop was just starting to get popular, so it is pretty old but hopefully some of the concepts are still usuful]

A progressive set of tutorials written along the way around the Hadoop Apache Project:

Issue #1: Setting Up Your MapReduce Learning Playground

Issue #2: Getting Started With (Customized) Partitioning

Issue #3: Counters In Action

Issue #4: To Use Or Not To Use A Combiner

Your comments/critics/remarks are more than welcomed.