mrjob

mrjob lets you write MapReduce jobs in Python 2.5+ and run them on several platforms. You can:

  • Write multi-step MapReduce jobs in pure Python
  • Test on your local machine
  • Run on a Hadoop cluster
  • Run in the cloud using Amazon Elastic MapReduce (EMR)

To get started, install with pip:

pip install mrjob

Indices and tables

Table Of Contents

Next topic

What’s New

This Page