We’ve already seen the analytics MongoDB case study before when looking how Eventbrite is tracking page views with MongoDB, but also in a MongoDB-based real time web traffic visualization tool called Hummingbird.
But Jared Rosoff’s presentation contains a series of slides which are identifying possible issues in each scaling approach:
- single database
- master-slave database
- sharded database
- key-value stores
- key-value store with Hadoop for reporting
The only part I don’t really understand is how is using Hadoop
more complex than scaling MongoDB:
Maybe someone could explain?
Meanwhile, Jared Rosoff’s complete slidedeck below.