NoSQL Benchmarks NoSQL use cases NoSQL Videos NoSQL Hybrid Solutions NoSQL Presentations Big Data Hadoop MapReduce Pig Hive Flume Oozie Sqoop HDFS ZooKeeper Cascading Cascalog BigTable Cassandra HBase Hypertable Couchbase CouchDB MongoDB OrientDB RavenDB Jackrabbit Terrastore Amazon DynamoDB Redis Riak Project Voldemort Tokyo Cabinet Kyoto Cabinet memcached Amazon SimpleDB Datomic MemcacheDB M/DB GT.M Amazon Dynamo Dynomite Mnesia Yahoo! PNUTS/Sherpa Neo4j InfoGrid Sones GraphDB InfiniteGraph AllegroGraph MarkLogic Clustrix CouchDB Case Studies MongoDB Case Studies NoSQL at Adobe NoSQL at Facebook NoSQL at Twitter



nosql tutorial: All content tagged as nosql tutorial in NoSQL databases and polyglot persistence

MongoDB for PHP Developers

A two-part MongoDB tutorial for PHP developers: first part covering installation and the second giving a brief intro to MongoDB PHP API for queries and indexes. While not the tutorial that will get you up and running, it’s still a good resource for beginners.

Original title and link: MongoDB for PHP Developers (NoSQL database©myNoSQL)

Short Intro to Graph Databases, Manipulating and Traversing With Gremlin

A slide deck by Pierre De Wilde with a short theoretical intro to property graphs and graph databases and an extensive set of examples of manipulating and traversing graph data with Gremlin. Good reference material.

NoSQL Books: Riak Handbook and the Little Redis Book

A couple of recent books that I’ll be adding to the list of NoSQL books:

  1. Mathias Meyer’s Riak Handbook. You can get an idea of the book by checking Consistent Hashing Explained: The What and the Why, the free sample chapter, and the table of contents.

  2. Karl Seguin’s The Little Redis Book. Karl is at the second free NoSQL book after the The Little MongoDB Book.

Original title and link: NoSQL Books: Riak Handbook and the Little Redis Book (NoSQL database©myNoSQL)

Measuring User Retention With Hadoop and Hive

A very practical example of how Hive and Hadoop could deliver value when applied to clickstreams, the most common data for each web property:

Hadoop, Hive, and related tech­nologies are formi­dable tools for unlocking value from data. […] Retention measure­ments are partic­u­larly signif­icant because they paint a detailed picture about the overall stick­iness of a product across the entire userbase.

The same clickstream data can be used to calculate visitors’ conversion with the Bayesian discriminant using Hadoop.

Original title and link: Measuring User Retention With Hadoop and Hive (NoSQL database©myNoSQL)


How to Run a MapReduce Job Against Common Crawl Data Using Amazon Elastic MapReduce

Steve Salevan’s 7 step guide to setting up, compiling, deploying, and running a basic MapReduce job.

When Google unveiled its MapReduce algorithm to the world in an academic paper in 2004, it shook the very foundations of data analysis. By establishing a basic pattern for writing data analysis code that can run in parallel against huge datasets, speedy analysis of data at massive scale finally became a reality, turning many orthodox notions of data analysis on their head.

Google published the paper. Yahoo open sourced this. And Amazon is offering (unlimited) resources.

Update: The Hacker News thread where the main question answered is what other corporations are using MapReduce (besides the Internet companies). The answer is unfortunately extremely short: too many to be able to enumerate them all.

Original title and link: How to Run a MapReduce Job Against Common Crawl Data Using Amazon Elastic MapReduce (NoSQL database©myNoSQL)


MarkLogic Querying for SQL People

Inspired by the MongoDB MapReduce translated to SQL and Neo4j Cypher Querying for SQL People, MarkLogic’s Jason Hunter and Eric Bloch put together a page mapping SQL terms and queries to MarkLogix terms and XQuery queries respectively.

Here is how SQL statements translate to MarkLogic XQuery expressions:

NoSQL Screencast: Android Couchbase Tutorial

A 12 minutes screencast introducing the basics of Couchbase Mobile for Android applications:

Neo4j Tutorial: The Power of the Daleks

[…] we’ve built up a Doctor Who data model that shows how Neo4j can be used to address several different data and domain concerns. For example, part of the dataset includes timeline data, comprising seasons, stories and episodes; elsewhere we’ve social network-like data, with characters connected to one another through being companions, allies or enemies of the Doctor. It’s a messy and densely-connected dataset – much like the data you might find in a real-world enterprise. Some of it is of high quality, some of it is lacking in detail. And for every seeming invariant in the domain, there’s an awkward exception – again, much like the real world.

Ian Robinson’s post will walk you through the details of structuring and querying the data with Neo4j. The complete source code is available on GitHub.

Original title and link: Neo4j Tutorial: The Power of the Daleks (NoSQL database©myNoSQL)


Optimizing MongoDB: Lessons Learned at Localytics

These slides have generated quite a reaction on Twitter. I’ll let you decide for yourself the reasons:

While there have been lots of retweets, here’s just a glimpse of what type of reactions I’m referring to:

The Little MongoDB Book

Karl Seguin has put together a 32 page ebook answering some common questions related to MongoDB:

  • getting up and running basic CRUD operations
  • data modeling for MongoDB
  • when to use MongoDB
  • MongoDB MapReduce
  • performance and tools

I’ve noticed that a lot of people still have some fundamental questions about MongoDB. Questions like, where does it fit and how do you model your data? I initially thought about writing a couple blog posts, but I felt that a short ebook might be a better format.

Original title and link: The Little MongoDB Book (NoSQL databases © myNoSQL)


NoSQL tutorial: Redis with Spring Data

This is first milestone release, so there are minor bugs, documentation isn’t perfect as we used to, and current version need no stable Redis server, but for sure it is great library which allow us to use all this cool NoSQL stuff in “standard” Spring Data Access manner.

A short, but nice explanation on setting up Redis with Spring Data and the main classes you’ll interact with. Everything seems to have the Spring flavor, so if you ever used Spring for accessing your database, starting to use Redis should not be too difficult. Indeed you’ll not have SQL queries and you’ll have to be aware of the different data types and operations Redis handles.

Original title and link: NoSQL tutorial: Redis with Spring Data (NoSQL databases © myNoSQL)