NoSQL Benchmarks NoSQL use cases NoSQL Videos NoSQL Hybrid Solutions NoSQL Presentations Big Data Hadoop MapReduce Pig Hive Flume Oozie Sqoop HDFS ZooKeeper Cascading Cascalog BigTable Cassandra HBase Hypertable Couchbase CouchDB MongoDB OrientDB RavenDB Jackrabbit Terrastore Amazon DynamoDB Redis Riak Project Voldemort Tokyo Cabinet Kyoto Cabinet memcached Amazon SimpleDB Datomic MemcacheDB M/DB GT.M Amazon Dynamo Dynomite Mnesia Yahoo! PNUTS/Sherpa Neo4j InfoGrid Sones GraphDB InfiniteGraph AllegroGraph MarkLogic Clustrix CouchDB Case Studies MongoDB Case Studies NoSQL at Adobe NoSQL at Facebook NoSQL at Twitter



HP: All content tagged as HP in NoSQL databases and polyglot persistence

Cloudera and HP Partnership to Simplify Hadoop Deployments

As I was expecting after the series of announcements coming from MapR, Cloudera is announcing its partnership with HP:

Under the terms of the joint development and licensing agreement, the two companies will deliver open standards-based reference architectures that simplify management and accelerate deployment of Hadoop Cluster environments. Clients can purchase the Cloudera Enterprise platform and future Cloudera products either directly from HP or bundled in HP AppSystem for Apache Hadoop.

The new HP reference architecture for Apache Hadoop for Cloudera and HP AppSystem for Apache Hadoop—Cloudera are based on HP Converged Infrastructure. They include the Cloudera Enterprise platform and HP Insight Cluster Manager Utility (CMU) software.

Original title and link: Cloudera and HP Partnership to Simplify Hadoop Deployments (NoSQL database©myNoSQL)

Hadoop: Answering the Basic Questions: Why, What, How, Where

A post on the HP blog answering 5 questions about Hadoop:

  1. Why Hadoop?
  2. What is Hadoop?
  3. What does it do?
  4. What is it good for?
  5. What’s the future?

With the current momentum behind Hadoop, there’s no question that it’s here to stay. But it’s best to think of Hadoop as a starting point for interesting developments to come.

As you probably know, HP owns Vertica, a column-oriented SQL database for data warehousing and BI, but when talking about Big Data HP includes both Vertica and Hadoop.

Original title and link: Hadoop: Answering the Basic Questions: Why, What, How, Where (NoSQL database©myNoSQL)


Data Is the New Currency. But Who’s Leading the Way?

In 2005, Tim O’Reilly said: “data is the next Intel Inside“. Today IDC Mario Morales (VP of semiconductor research) says data is the new currency. All’s good until you read the continuation:

And the companies that understand this are the ones already developing the analytics and infrastructure to extract that value—companies like IBM, HP, Intel, Microsoft, TI, Freescale and Oracle.

The article (nb: may require registration) continues by looking at what each of these companies are doing in the Big Data space, but focuses a large part on IBM Watson.

Going back to the question “who’s leading the Big Data way“, let’s take a quick look at the technology behind Watson. According to Jeopardy Goes to Hadoop and About Watson, Watson technology is based on Apache Hadoop, using an IBM language technology built on the Apache UIMA platform[1] and running Linux on IBM boxes.

To me it looks like open source is leading the advances in Big Data and these large organizations are just connecting the dots (as in packaging these technologies for enterprise environments and contributing missing pieces here and there)[2]. When did this happen before?

  1. Dmitriy Ryaboy taught me that UIMA came out of IBM in the first place and they’ve been critical in its development.  

  2. Or they are very secretive about their internal initiatives and research.  

Original title and link: Data Is the New Currency. But Who’s Leading the Way? (NoSQL database©myNoSQL)

BI Pentaho Integrates Hadoop, NoSQL Databases, and Analytic Databases


  • The ability to orchestrate execution of Hadoop related tasks (i.e., executing a Hive Query, Pig Script, or M/R job) as part of a broader IT workflow.
  • The ability to setup dependencies, so if a step fails the job can branch down a recovery path or send a notification, or if it’s a success it goes on to subsequent dependent tasks. Likewise it supports initiating several tasks in parallel.
  • New integration for Pig — so that developers have the ability to execute a Pig job from a PDI Job flow, integrate the execution of Pig jobs in broader IT workflows through PDI Jobs, take advantage of our out of the box scheduler, and so on.

The list of tools Pentaho 4 integrates with is quite long:

  • a long list of traditional RDBMS
  • analytics databases (Greenplum, Vertica, Netezza, Teradata, etc.)
  • NoSQL databases (MongoDB, HBase, etc.)
  • Hadoop variants
  • LexisNexis HPCC

This is the world of polyglot persistence and hybrid data storage.

Original title and link: BI Pentaho Integrates Hadoop, NoSQL Databases, and Analytic Databases (NoSQL database©myNoSQL)

RainStor Database Technology Embedded Within HP Investigation Solution

Markus Perdrizat about RainStor and HP announcement:

I don’t know how many people have a need for investigation solutions, but there are certainly manz who have some requirements that point into the same direction, namely providing on-line (SQL) access to large amounts of relatively structured information (think logs or messages) for a long time (up to ten years or more). The announcement is also interested given that HP is probably still looking to grow its DB portfolio, so maybe there’s a new acquisition ahead if this partnership works out?

HP is looking for acquisitions in the data storage and processing space.

Original title and link: RainStor Database Technology Embedded Within HP Investigation Solution (NoSQL databases © myNoSQL)


HP Mergers & Acquisitions: MongoDB Next?

M&A speculations are on:

HP would do well to position itself for the post-relational world by acquiring expertise for a next-generation database. 10gen has experience in developing, installing, and supporting the open source database MongoDB, which would position HP well as both a database software provider and as a cloud-based database services provider. HP desperately needs an enterprise database solution. Expect a move in 2011.

Original title and link: HP Mergers & Acquisitions: MongoDB Next? (NoSQL databases © myNoSQL)


HP CEO about Relational Databases

James Governor reporting from the HP CEO Leo Apotheker keynote at the HP Analyst Summit:

“traditional relational databases are becoming less and less relevant to the future stack”

Even if HP acquired the real-time analytics platform Vertica I haven’t heard of HP in the NoSQL space, so my first thought was this is just the usual attack on competitors.

But it could also express HP’s interest in getting into the NoSQL market. The games of speculations about HP’s acquisitions are open.

  1. James Governor: Co-founder of RedMonk, @monkchips  

Original title and link: HP CEO about Relational Databases (NoSQL databases © myNoSQL)

Cloudera: A Business Inteligence Leader

The Informatica accord is Cloudera’s second partnership this year with a leading DI player. Back in August, Cloudera cemented a deal with open source software (OSS) data integration (DI) specialist Talend. It also has partnerships with Teradata Corp., the former Netezza Inc., the former Greenplum Software Corp., Aster Data Systems Inc., Vertica Inc., and Pentaho.

One thing’s for sure: Cloudera is certainly attracting attention.

The strategy is surprisingly simple: make it easy to put data in and get it out.

Original title and link: Cloudera: A Business Inteligence Leader (NoSQL databases © myNoSQL)


HP Acquires Vertica

The official announcement of HP’s acquisition of Vertica is here. And I think this is also explaining the disappearing of the last article Michael Stonebraker published on Vertica blog about 6 criteria for real column stores.

Original title and link: HP Acquires Vertica (NoSQL databases © myNoSQL)