The Hadoop Tutorial Series

 

[Note: this was written in 2009 (!) when Hadoop was just starting to get popular, so it is pretty old but hopefully some of the concepts are still useful]

This series is a progressive set of tutorials on the Apache Hadoop project, written as I went along:

Hadoop Tutorial Series, Issue #1: Setting Up Your MapReduce Learning Playground

Update: Instructions updated for Hadoop 0.20.2. This is the first post of a series of small Hadoop tutorials introducing progressively ...

Hadoop Tutorial Series, Issue #2: Getting Started With (Customized) Partitioning

In Issue #1 of this series, we set up the "learning playground" (based on the Cloudera Virtual Machine) in ...

Hadoop Tutorial Series, Issue #3: Counters In Action

Note: This post has been updated with code that works with Hadoop 0.20.1. In this 3rd issue of the Hadoop ...

Hadoop Tutorial Series, Issue #4: To Use Or Not To Use A Combiner

Welcome to the fourth issue of the Hadoop Tutorial Series. Combiners are another important Hadoop feature that every Hadoop developer ...

Your comments, criticisms, and remarks are more than welcome.