A complete and detailed guide by Arten Yankov to setting up a Hadoop cluster with MongoDB support on EC2. It uses the MongoDB Hadoop adapter, mongo-hadoop, which provides input and output adapters, support for InputSplits, and write-only Pig support.
What is covered in the tutorial:
- Creating an AMI with custom settings (Hadoop and mongo-hadoop installed)
- Launching a Hadoop cluster on EC2
- Adding more nodes to the cluster
- Running some sample jobs
Original title and link: NoSQL Tutorial: Setting Up a Hadoop Cluster with MongoDB Support on EC2 (©myNoSQL)