mrjob lets you write MapReduce jobs in Python 2.6+ and run them on several platforms. You can:

  • Write multi-step MapReduce jobs in pure Python
  • Test on your local machine
  • Run on a Hadoop cluster
  • Run in the cloud using Amazon Elastic MapReduce (EMR)
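To make the first bullet concrete, here is a minimal word-count sketch of the map, shuffle, and reduce flow using only the standard library. It does not import mrjob. In an actual mrjob job, the two functions would typically be written as `mapper` and `reducer` methods on a subclass of `MRJob` (from `mrjob.job`), and mrjob would handle the sorting and grouping between them. The `run_local` driver and the sample input are hypothetical and exist only for illustration.

```python
from itertools import groupby
from operator import itemgetter


def mapper(_, line):
    """Emit (word, 1) for every whitespace-separated word in a line."""
    for word in line.split():
        yield word.lower(), 1


def reducer(word, counts):
    """Sum all the counts collected for a single word."""
    yield word, sum(counts)


def run_local(lines):
    """Hypothetical driver (not part of mrjob): map, sort by key, reduce."""
    # Map phase: apply the mapper to every input line.
    mapped = [pair for line in lines for pair in mapper(None, line)]
    # "Shuffle" phase: sort so that identical keys sit next to each other.
    mapped.sort(key=itemgetter(0))
    # Reduce phase: hand each key and its values to the reducer.
    results = {}
    for word, group in groupby(mapped, key=itemgetter(0)):
        values = (count for _key, count in group)
        for key, total in reducer(word, values):
            results[key] = total
    return results


if __name__ == "__main__":
    print(run_local(["the quick brown fox", "the lazy dog"]))
```

Keeping the mapper and reducer as plain generators means the same logic can be checked on a small list of lines before it is moved into a job class and run on a cluster.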

mrjob is licensed under the Apache License, Version 2.0.

To get started, install with pip:

    pip install mrjob

and begin reading the tutorial below.


This documentation is for 0.5.0-dev, which is currently in development. Documentation for the stable version of mrjob is hosted at


