NoSQL Benchmarks NoSQL use cases NoSQL Videos NoSQL Hybrid Solutions NoSQL Presentations Big Data Hadoop MapReduce Pig Hive Flume Oozie Sqoop HDFS ZooKeeper Cascading Cascalog BigTable Cassandra HBase Hypertable Couchbase CouchDB MongoDB OrientDB RavenDB Jackrabbit Terrastore Amazon DynamoDB Redis Riak Project Voldemort Tokyo Cabinet Kyoto Cabinet memcached Amazon SimpleDB Datomic MemcacheDB M/DB GT.M Amazon Dynamo Dynomite Mnesia Yahoo! PNUTS/Sherpa Neo4j InfoGrid Sones GraphDB InfiniteGraph AllegroGraph MarkLogic Clustrix CouchDB Case Studies MongoDB Case Studies NoSQL at Adobe NoSQL at Facebook NoSQL at Twitter



AOL: All content tagged as AOL in NoSQL databases and polyglot persistence

AOL AdLead Powered by Riak Components

A story of using Riak for its Dynamo-like building blocks:

I made use of riak-kv, riak-search, and riak-core to help distribute my application’s data, state, and processing, respectively. These technologies provided me with a framework I could use instead of trying to build the functionality myself. This allowed me to put the entire application behind a dumb splitter and call it a day. A horizontal scale out is a simple matter of standing up a new node, joining it to the cluster, and adding an entry to the splitter.

The usage of riak-search as a distributed file index is interesting, given there’s also Luwak — but that Riak library is meant for large files.

As the author mentions this is an internal application of AOL Advertising. The public facing AOL Advertising is using Membase and Hadoop.

Original title and link: AOL AdLead Powered by Riak Components (NoSQL databases © myNoSQL)


Hadoop and Membase Case Study: AOL Advertising Architecture

Combining Hadoop and Membase to solve these challenges:

  1. How to analyze billions of user-related events, presented as a mix of structured and unstructured data, to infer demographic, psychographic and behavioral characteristics that are encapsulated into hundreds of millions of “cookie profiles”
  2. How to make hundreds of millions of cookie profiles available to their ad targeting platform with sub-millisecond, random read latency
  3. How to keep the user profiles fresh and current

AOL Advertising Hadoop Membase Case Study

In a much simplified form:

  • crunch (nb: read it as pre-process and prepare) tons of data with Hadoop
  • feed the results in a low latency, high throughput key-value store for serving them online

Original title and link: Hadoop and Membase Case Study: AOL Advertising Architecture (NoSQL databases © myNoSQL)


Videos from Hadoop World

There was one NoSQL conference that I’ve missed and I was really pissed off: Hadoop World. Even if I’ve followed and curated the Twitter feed, resulting in Hadoop World in tweets, the feeling of not being there made me really sad. But now, thanks to Cloudera I’ll be able to watch most of the presentations. Many of them have already been published and the complete list can be found ☞ here.

Based on the twitter activity on that day, I’ve selected below the ones that seemed to have generated most buzz. The list contains names like Facebook, Twitter, eBay, Yahoo!, StumbleUpon, comScore, Mozilla, AOL. And there are quite a few more …