NoSQL: All content tagged as NoSQL in NoSQL databases and polyglot persistence
Infochimps put together a comprehensive Venn diagram of the database world in the TechCrunch article Big Data Right Now: Five Trendy Open Source Technologies
Original title and link: The Database World in a Venn Diagram ( ©myNoSQL)
Nice screenshot by TechCrunch people of the slide talking about the data lifecycle at Facebook:
Based on this you’ll now have a better picture of how Facebook data ingestion numbers correlate to their architecture.
Original title and link: Life of Data at Facebook ( ©myNoSQL)
To navigate in an index, a database user had to know the physical structure of the index.
Obvious deficiencies of the approach led to introduction of separation of data model and data representation. Relational model is one and still the most popular way to do it.
One of the most well known deficiencies of a relational model is the so-called object-relational impedance mismatch: there is more than one way to map objects to relations, and none of them fits all access patterns well.
It has as well a number of advantages: simplicity, ease of analytical processing, and, let’s not forget, performance: by normalizing data, a user is forced to tell the DBMS more about data constraints, distribution, future access patterns.
This makes building efficient and to-the-point data representation structures easier.
Unfortunately, the past generations of database management systems did not address one of the main architecture drawbacks, which plagues the relational model: rigidity of schema change. Very few mainstream DBMS allow to change the structure of a relational database quickly, without downtime or significant performance penalty. This is not a drawback of the relational model, but of one which relates to the implementation.
It should also be kept in mind that in many cases a relational model is an overkill, and a simple key to value mapping is sufficient.
And of course no single model can fit all needs (e.g. graph databases build around the notion of nodes & edges, yet, good luck trying to quickly calculate CUBE on a bunch of nodes in a graph database).
Unfortunately, the world of NoSQL, when it comes to the data model, often simply takes us back to the 60s: there is minimal abstraction of data access from data representation, and once a certain representation has been chosen, there is no way to change it without rewriting your application (e.g. to fit the new performance profile).
Scalability is an answer, but a silly one: throwing more hardware at a problem is not always economical.
Original title and link: A Short History of NoSQL, SQL, NoSQL ( ©myNoSQL)
There’s been a lot of speculation about the announcements coming from Oracle’s OpenWorld event. A first part was revealed during the keynote in the form of an in-memory analytics appliance called Exalytics . But there’s talk about a Big Data Appliance and an Oracle NoSQL database.
Here’re my predictions
Oracle became very aggressive in selling products based on hardware, software, and services. So they’ll announce a Hadoop appliance integrated with an existing Oracle product. It could be either the Oracle Exadata or even the newly announced Exalytics.
This appliance will place Oracle in competition with all other Hadoop appliance sellers: EMC, NetApp, IBM. Also these days most of the analytics databases try to integrate with Hadoop.
Oracle already has a couple of non-relational solutions in their portfolio: BerkleyDB, TimesTen, Coherence. And they’ve already started to test the NoSQL market by announcing the MySQL and MySQL Cluster NoSQL hybrid systems.
I don’t expect Oracle NoSQL database to be a new product. Just a rebranding or repackaging of one of the above mentioned ones. Probably the TimesTen.
Oracle will invest more into integrating its line of products with Hadoop. Having both a Hadoop and an in-memory analytics appliance will make them very competitive in this space.
Oracle will extend the support for NoSQLish interfaces (memcached) to its other database products.
What are your predictions?
Original title and link: The Oracle NoSQL Database and Big Data Appliance ( ©myNoSQL)
Last evening I was trying to catch up with the news in the NoSQL and Big Data space—it looks like nobody wants to pick up the job I’m doing here, except maybe GigaOm’s Infrastructure Curator Derick Harris.
After skimming for a while through the links I’ve bookmarked, I’ve started to realize that this month, September 2011, is looking like the most exciting month in the data space, including but not limited to NoSQL and NewSQL, Big Data, data analytics etc. Partnerships, fundings, acquisitions, major releases. Every couple of days I had a news about a very interesting announcement.
You’ve probably read about some of these, but I thought I should group them together so you could get the same feeling I got:
BigData Market: IBM Acquires Two Analytics Companies: data analysts would call this consolidation. I’d say it’s the beginning of the acquisition spree.
MongoDB Selected as Core Content Management Component of SAP’s Platform-As-A-Service: it’s not very often to hear about a giant chosing a newcomer’s solution as a core component of its services.
R and Hadoop: Revolution Analytics and Cloudera Partnership Announced: R developers can access Hadoop data stores and program MapReduce jobs in R
For a while I’ll keep updating this post to point to the most interesting news this month.
Original title and link: The Best Month for NoSQL, Big Data, and the Data Space? ( ©myNoSQL)
scaling data systems in real life has humbled me. I would not dare to criticize an architecture that holds the social graphs of 750M and works
So if you feel like watching an action movie featuring A-class actors, Todd Hoff has summarized the whole conversation paraphrazing a comment about Lady Gaga:
You know, there’s a difference between not liking someone’s music and not recognizing their talent. If€ you can’t recognize the fact that Lady GaGa is, in fact, extremely talented in many ways, then you may want to try to look at her with less of a bias. There’s plenty of artists I can’t stand, but still respect their talent.
Even if you don’t like Lada Gaga’s schtick, that is a great performance. I get the feeling a lot SQL people don’t recognize the talent of NoSQL, whereas NoSQL people are generally use the best tool for the job types who have no problem with you using SQL if that works for you.
Original title and link: Is Nosql a Premature Optimization That’s Worse Than Death? Or the Lady Gaga of the Database World? ( ©myNoSQL)