NoSQL Benchmarks NoSQL use cases NoSQL Videos NoSQL Hybrid Solutions NoSQL Presentations Big Data Hadoop MapReduce Pig Hive Flume Oozie Sqoop HDFS ZooKeeper Cascading Cascalog BigTable Cassandra HBase Hypertable Couchbase CouchDB MongoDB OrientDB RavenDB Jackrabbit Terrastore Amazon DynamoDB Redis Riak Project Voldemort Tokyo Cabinet Kyoto Cabinet memcached Amazon SimpleDB Datomic MemcacheDB M/DB GT.M Amazon Dynamo Dynomite Mnesia Yahoo! PNUTS/Sherpa Neo4j InfoGrid Sones GraphDB InfiniteGraph AllegroGraph MarkLogic Clustrix CouchDB Case Studies MongoDB Case Studies NoSQL at Adobe NoSQL at Facebook NoSQL at Twitter



membase: All content tagged as membase in NoSQL databases and polyglot persistence

Why the Cloudera - Membase partnership?

For those scenarios that require both scalable low latency data access and batch analytics to complete the application’s mission. This kind of hybrid, bidirectional data integration is the topological requirement of new applications – AOL Advertising and ShareThis are joint customers with these requirements. A Flume interface provides a streaming interface from Membase to Hadoop;  a Sqoop utility can be used for batch transfers between the two. Both of these utilities will be familiar to Hadoop watchers.

Basically OLTP (Membase) and OLAP (Cloudera/Hadoop). And I told you everybody Flumes.

Original title and link: Why the Cloudera - Membase partnership? (NoSQL databases © myNoSQL)


Membase and Redis

Attila Kisko, author and maintainer of a Membase .NET client:

Membase is awesome technology, but in many cases I have to be able to interact with my data more directly in the database. I use hash and counter data structures in Redis to implement a number of features in I’d probably switch to Membase if it had that sort of functionality.

Or in other words, scalable key-value stores are cool, but sometimes smart data structures fit better. Plus when such advice comes from a person close to your project, it’s usually a good idea to listen to it.

Original title and link: Membase and Redis (NoSQL databases © myNoSQL)


Microsoft coaches NoSQL options for Azure cloud

The Register writing about Microsoft initiative to bring NoSQL databases to the Azure cloud, Membase and MongoDB being mentioned in the article[1]:

The addition of NoSQL suits Microsoft - by bringing more people to Azure - and it suits the NoSQLers, because they get more Windows devs to support.

You can run NoSQL options like Mongo and Memcached on Azure after some fiddling and configuring. The goal now is to deliver a development, deployment, and management experience already familiar to those on Windows, SQL Server, and Visual Basic.

Is VMWare/Spring making the same bet for the Java world? Judging by the Spring Data initiative, plus Grails support for Redis, Grails support for MongoDB, I’d say they are.

A question that I’d like to clarify to myself is how popular is memcached in the Java world? My impression is that Java people have stayed away from memcached so far, using Java based solutions like EHCache or Terracotta, but I might be completely wrong.

Original title and link: Microsoft coaches NoSQL options for Azure cloud (NoSQL databases © myNoSQL)


Membase: Quick Getting Started Videos

Two short videos on setting up ☞ a single Membase node (aprox.2 min) and ☞ a Membase cluster (aprox.3 min). As you can imagine this is just the very very basics. But on the other hand it is kind of cool to know that’s so easy to get it up and running.

Original title and link: Membase: Quick Getting Started Videos (NoSQL databases © myNoSQL)

Membase Intro

An intro slidedeck to Membase:

Actually I’d say that the second part is much more than an intro, providing details on:

  • Membase data flow
  • data buckets
  • vBuckets (nb see more about virtual buckets)
  • persistence behavior (memory for hot data, disk for cold data)

Original title and link: Membase Intro (NoSQL databases © myNoSQL)

Memcached/Membase: Writing Your Own Storage Engine

Not sure how many will need to implement their own storage engine, but knowing there’re a couple of projects that support pluggable engines (Project Voldemort, Riak) it might be that for special scenarios special engines could perform better. Now you can learn how to do it for Membase:

Right now we’ve got an engine capable of running get and set load, but it is doing synchronous filesystem IO. We can’t serve our client faster than we can read the item from disk, but we might serve other connections while we’re reading the item off disk.

Original title and link: Memcached/Membase: Writing Your Own Storage Engine (NoSQL databases © myNoSQL)


FameTown - A New AWS-Powered Facebook Game Using Membase

Any particular reason for Membase appearing mostly in game usecases? Or is it more like: “if Zynga is using Membase, then we will too”?

Under the hood, this Facebook game makes good use of AWS and a number of other technologies. Here’s a summary:

  • The game is written in Sinatra, a DSL (domain specific language) used to create Ruby apps with minimal effort. The code runs on Amazon EC2.
  • Traffic to the EC2 instances is load balanced using the Nginx load balancer.
  • Membase is used for data storage, hosted on a number of Elastic Block Storage (EBS) volumes.
  • Scalr is used to scale and manage the application.

Original title and link: FameTown - A New AWS-Powered Facebook Game Using Membase (NoSQL databases © myNoSQL)


Why Membase Uses Erlang

On what makes Erlang one of the best environments for building distributed systems:

At the end of the day, the real question isn’t whether it would have been possible for us to implement our cluster management in another language; it’s really a question of effort and maintainability of the result. With any other environment, we would have had to reimplement at least part of what Erlang/OTP provides, while we haven’t really found ourselves reimplementing features provided by any other environment.

A NoSQL database per language ranking would look like:

  • C: 2 Redis, Tokyo Cabinet
  • C++: 3 Hypertable, MongoDB, Kyoto Cabinet
  • C#: 2 RavenDB, sones GraphDB
  • Erlang: 3 4 CouchDB, Membase, Riak, Hibari
  • Java: 8 Cassandra, Hadoop, HBase, OrientDB, Terrastore, Project Voldemort, Neo4j, Hypergraph

Original title and link: Why Membase Uses Erlang (NoSQL databases © myNoSQL)


Membase and Cloudera with Flume and Sqoop

James Phillips (Membase):

On the technology integration front, we have built and are making available to customers two mechanisms for integrating Membase and Cloudera Distribution for Hadoop (CDH). The first is a Membase NodeCode module that can stream data from Membase to CDH in real-time. As new operational data enters Membase, it can be massaged in real time and pumped into a CDH cluster for processing. The second is a Sqoop-derived batch loader utility that enables loading of data from Membase to CDH, and vice versa.

Real-time integration using Flume. Batch integration using Sqoop. Sounds like Cloudera’s tools are delivering.

Original title and link: Membase and Cloudera with Flume and Sqoop (NoSQL databases © myNoSQL)


Hadoop Spreading through Cloudera Parternships

Cloudera in its attempt to Hadoopize the world goes on partnership spree:

Many of you may have read about some of the recent announcements of partnerships between Cloudera and some of the leading data management software companies like Teradata, Netezza, Greenplum (EMC), Quest and Aster Data. We established these partnerships because Hadoop is increasingly serving as an open platform that many different applications and complimentary technologies work with. Our goal is to to make this as easy and as standardized as possible.

Checking the ☞ press release section turns out the following parnerships:

  • Membase
  • Talend
  • Quest
  • Pentaho
  • NTT Data
  • Aster Data
  • EMC Greenplum
  • Teradata
  • Netezza

Quite a few companies from the non-relational market.

Original title and link: Hadoop Spreading through Cloudera Parternships (NoSQL databases © myNoSQL)