AOL: All content tagged as AOL in NoSQL databases and polyglot persistence
Wednesday, 23 March 2011
AOL AdLead Powered by Riak Components
A story of using Riak for its Dynamo-like building blocks:
I made use of riak-kv, riak-search, and riak-core to help distribute my application’s data, state, and processing, respectively. These technologies provided me with a framework I could use instead of trying to build the functionality myself. This allowed me to put the entire application behind a dumb splitter and call it a day. A horizontal scale out is a simple matter of standing up a new node, joining it to the cluster, and adding an entry to the splitter.
The usage of riak-search as a distributed file index is interesting, given there’s also Luwak — but that Riak library is meant for large files.
As the author mentions this is an internal application of AOL Advertising. The public facing AOL Advertising is using Membase and Hadoop.
Original title and link: AOL AdLead Powered by Riak Components (NoSQL databases © myNoSQL)
Thursday, 17 February 2011
Hadoop and Membase Case Study: AOL Advertising Architecture
Combining Hadoop and Membase to solve these challenges:
- How to analyze billions of user-related events, presented as a mix of structured and unstructured data, to infer demographic, psychographic and behavioral characteristics that are encapsulated into hundreds of millions of “cookie profiles”
- How to make hundreds of millions of cookie profiles available to their ad targeting platform with sub-millisecond, random read latency
- How to keep the user profiles fresh and current
In a much simplified form:
- crunch (nb: read it as pre-process and prepare) tons of data with Hadoop
- feed the results in a low latency, high throughput key-value store for serving them online
Original title and link: Hadoop and Membase Case Study: AOL Advertising Architecture (NoSQL databases © myNoSQL)
Saturday, 13 November 2010
Videos from Hadoop World
There was one NoSQL conference that I’ve missed and I was really pissed off: Hadoop World. Even if I’ve followed and curated the Twitter feed, resulting in Hadoop World in tweets, the feeling of not being there made me really sad. But now, thanks to Cloudera I’ll be able to watch most of the presentations. Many of them have already been published and the complete list can be found ☞ here.
Based on the twitter activity on that day, I’ve selected below the ones that seemed to have generated most buzz. The list contains names like Facebook, Twitter, eBay, Yahoo!, StumbleUpon, comScore, Mozilla, AOL. And there are quite a few more …
