cassandra: All content tagged as cassandra in NoSQL databases and polyglot persistence
Sunday, 5 February 2012
Cassandra at SocialFlow with Drew Robb - Powered by NoSQL
To alternate a bit after yesterday’s educational CQL: SQL for Cassandra in the Cassandra NYC 2011 video series from DataStax, today’s video is Drew Robb covering Cassandra usage at SocialFlow for capturing real-time data from Twitter and Bit.ly.
Saturday, 4 February 2012
CQL: SQL for Cassandra with Eric Evans - NoSQL videos
The fine folks from DataStax have made available the presentations from their Cassandra NYC 2011 event.
The first video to post here is Eric Evans’s presentation on Cassandra Query Language.
Friday, 3 February 2012
Modeling A/B Tests With Cassandra
A Cassandra data modeling session around a real-life scenario: tracking data for A/B tests:
With most things in life data modeling in Cassandra can be compared to learning to ride a bike. It can be scary, you might fall off, but in the end once you learn a few fundamental concepts everything will be easier to do. The goal of this article is to get you comfortable with a basic data modeling scenario that you will likely see in the real world.
Original title and link: Modeling A/B Tests With Cassandra (©myNoSQL)
via: http://www.datastax.com/dev/blog/ab-testing-with-apache-cassandra
What does DynamoDB mean for the enteprise world?
Tim Moreton1:
I expect not much in the short term, until some key issues are solved. First, the sources and consumers of data are still on-site. These guys are tackling a specific technical limitation, not necessarily looking to re-architect their wider systems, which are often complex and inter-dependent. Second, security and regulatory concerns may need addressing. Third, the TCO needs to stack up. A quick and dirty back of the envelope calculation suggests that although it’s free to get started with DynamoDB, for the sort of deployment sizes we’re seeing, DynamoDB works out considerably more expensive than alternatives like Acunu deployed on hardware (even after accounting for typical full costing for outsourced data centers).
Keeping my eyes on all things NoSQL for more than 2 years, I’d say that NoSQL databases in general do not mean much for the enterprise world2. Yet.
-
Tim Moreton: CEO Acunu ↩
Original title and link: What does DynamoDB mean for the enteprise world? (©myNoSQL)
via: http://www.acunu.com/blogs/tim-moreton/welcome-party-dynamodb/
DataStax's CEO thoughts on the NoSQL Market and Competition
Billy Bosworth1:
Personally, I have never believed that other post-relational (aka NoSQL/Hadoop) database companies were our primary competition. The brute fact of the matter is that if you put us all together, we are still not statistically relevant compared to the overall DBMS market.
I had only one real personal fear coming into this market: That I would sink a big portion of my life into something that would never take hold in the mainstream. I suspect that would be a truly awful ending for all of us in this space. But thanks to companies like Amazon and Oracle, that feels highly unlikely now, and that is a great thing.
Just to play the devil advocate for a second: Oracle won’t lose much in the NoSQL market if things don’t work out well and Amazon’s DynamoDB is part of a larger plan. But for all the NoSQL database companies it is an all-or-nothing game2.
-
Billy Bosworth: CEO DataStax ↩
-
An all-or-nothing game is not the same with a winner-takes-all game ↩
Original title and link: DataStax’s CEO thoughts on the NoSQL Market and Competition (©myNoSQL)
via: http://www.datastax.com/2012/01/my-thoughts-on-amazons-dynamodb
Thoughts on SimpleDB, DynamoDB and Cassandra
Adrian Cockcroft:
So the lesson here is that for a first step into NoSQL, we went with a hosted solution so that we didn’t have to build a team of experts to run it, and we didn’t have to decide in advance how much scale we needed. Starting again from scratch today, I would probably go with DynamoDB. It’s a low “friction” and developer friendly solution.
You can look at this in two ways: 1) a biased opinion of someone that has already betted on Amazon with the infrastructure of a multi-billion business; 2) the opinion of someone that has accumulated a ton of experience in the NoSQL space and that is successfully1 running the infrastructure of a multi-billion business on NoSQL solutions. I’d strongly suggest you to think of it as the latter.
-
Netflix was one of the few companies that continued to operate during Amazon’s EBS major failure. ↩
Original title and link: Thoughts on SimpleDB, DynamoDB and Cassandra (©myNoSQL)
via: http://perfcap.blogspot.com/2012/01/thoughts-on-simpledb-dynamodb-and.html
Thursday, 2 February 2012
NoSQL Market from Couchbase Perspective
James Philips (Couchbase) for Curt Monash:
- MongoDB is the big competition. He believes Couchbase has an excellent win rate vs. 10gen for actual paying accounts.
- DataStax/Cassandra wins over Couchbase only when multi-data-center capability is important. Naturally, multi-data-center capability is planned for Couchbase. (Indeed, that’s one of the benefits of swapping in CouchDB at the back end.)
- Redis has “dropped off the radar”, presumably because there’s no particular persistence strategy for it.
- Riak doesn’t show up much.
I assume this is sort of a pre-sales/sales department 100k feet overview.
Original title and link: NoSQL Market from Couchbase Perspective (©myNoSQL)
Tuesday, 24 January 2012
Latest NoSQL Releases: HBase 0.92, DataStax Community Server, Hortonworks Data Platform, SolrCloud
Just a quick roundup of the latest releases and announcements.
Hortonworks Data Platform (HDP) version 2
HDP v2 will include:
- NextGen MapReduce architecture
- HDFS NameNode HA
- HDFS Federation
- up-to-date HCatalog, HBase, Hive, Pig
According to the announcement:
In order to avoid confusion, let me explain the two versions of HDP:
- HDP v1 is based upon Apache Hadoop 1.0 (which comes from the 0.20.205 branch). It the most stable, production-ready version of Hadoop that is currently found in many large enterprise deployments. HDP v1 is currently available as a private technology preview. A public technology preview will be made available later this quarter.
- HDP v2 is based upon Apache Hadoop 0.23, which includes the next generation advancements mentioned above. It’s an important step forward in terms of scalability, performance, high availability and data integrity. A technology preview will also be made publicly available later in Q1.
SolrCloud Completes Phase 2
Mark Miller about the completion of phase 2:
The second phase of SolrCloud has been in full swing for a couple of months now and it looks like we are going to be able to commit this work to trunk very soon! In Phase1 we built on top of Solr’s distributed search capabilities and added cluster state, central config, and built-in read side fault tolerance. Phase 2 is even more ambitious and focuses on the write side. We are talking full-blown fault tolerance for reads and writes, near real-time support, real-time GET, true single node durability, optimistic locking, cluster elasticity, improvements to the Phase 1 features, and more.
Not there yet, but it’s coming.
DataStax Community Server 1.0.7
A new release of DataStax’s distribution of Cassandra incorporating Cassandra 1.0.7
HBase 0.92
Don’t let the version number trick you. This is an important release for HBase featuring:
- coprocessors
- security
- new (self-migrating) file format
- AWS improvements: EBS support, building a HA cluster
The list of new features, improvements, and bug fixes in HBase 0.92 is impressive. But the highlight of this release is in my opinion HBase coprocessors (Jira entry HBASE-200).
I’m leaving you with Andrew Purtell’s slides about HBase Coprocessors:
Monday, 23 January 2012
NoSQL Tutorials: Getting Started With Cassandra
All the basic steps of installing Cassandra, creating a keyspace and column families, inserting and manipulating data, querying data with CQL on a single page. Even if a bit old, you could continue next with my tutorial on getting started with Cassandra.
Original title and link: NoSQL Tutorials: Getting Started With Cassandra (©myNoSQL)
via: http://www.datastax.com/resources/articles/getting-started-with-apache-cassandra
Most Popular Articles
- Translate SQL to MongoDB MapReduce
- Tutorial: Getting Started With Cassandra
- CouchDB vs MongoDB: An attempt for a More Informed Comparison
- Cassandra @ Twitter: An Interview with Ryan King
- A Couple of Nice GUI Tools for MongoDB
- NoSQL benchmarks and performance evaluations
- Ehcache: Distributed Cache or NoSQL Store?
- Document Databases Compared: CouchDB, MongoDB, RavenDB
- Quick Review of Existing Graph Databases
- NoSQL Data Modeling