The Hadoop Tutorial Series


[Note: this was written in 2009 (!) when Hadoop was just starting to get popular, so it is pretty old but hopefully some of the concepts are still useful]

In that series, you’ll find a progressive set of tutorials written along the way around the Hadoop Apache Project:

Hadoop Tutorial Series, Issue #1: Setting Up Your MapReduce Learning Playground

Update: Instructions updated for hadoop 0.20.2. This is the first post of a series of small hadoop tutorials introducing progressively ...
Read More

Hadoop Tutorial Series, Issue #2: Getting Started With (Customized) Partitioning

In the Issue #1 of this series, we set up the "learning playground" (based on the Cloudera Virtual Machine) in ...
Read More

Hadoop Tutorial Series, Issue #3: Counters In Action

Note: This post has been updated with a code working for hadoop 0.20.1. In this 3rd issue of the hadoop ...
Read More

Hadoop Tutorial Series, Issue #4: To Use Or Not To Use A Combiner

Welcome to the fourth issue of the Hadoop Tutorial Series. Combiners are another important Hadoop's feature that every hadoop developer ...
Read More

Your comments/critics/remarks are more than welcomed.