Pivotal HD: All content tagged as Pivotal HD in NoSQL databases and polyglot persistence
Tuesday, 12 March 2013
Proprietary Hadoop Is a Losing Strategy
Matt Asay (10gen) for ReadWrite adding to the long discussion around EMC’s Pivotal HD announcement:
EMC has seemingly bottomless resources to throw at Hadoop, and every incentive to do so. It’s a smart, highly successful company and no doubt will prove successful with Pivotal HD. However, I can’t see it ever dominating an open-source infrastructure market with a proprietary distribution. Open source is the foundation for today’s most interesting markets, from Big Data to mobile to cloud computing. It’s unlikely that EMC will somehow stem this tide with a proprietary product, no matter its short- term performance or functionality advantages.
While I’ve linked to different perspectives about this topic, I’m not sure anyone outside our bubble actually came to a conclusion.
- Dan Woods (Forbes): Why SQL Matters, the Limits of Open Source, and Other Lessons of EMC Greenplum’s Pivotal HD
- Matthew Aslett (the451group): What It Means to Be “all In” on Hadoop
- Michael Hausenblas (MapR): Hadoop: What Matters Are Open and Standardized Interfaces
- Merv Adrian (Gartner): Open Source “Purity”, Hadoop, and Market Realities
- Steve Loughran: Hadoop Distributions: If There Is a Problem in the Hadoop JARs, How Are You Going to Fix It?
- Shaun Connolly (Hortonworks): Did EMC Just Say Fork You To The Hadoop Community?
What I know, though, is that EMC is benefiting from this. A lot. Three weeks ago, I wasn’t reading anything about EMC and Hadoop. Today all major websites have at least a couple of articles about it.
Original title and link: Proprietary Hadoop Is a Losing Strategy (©myNoSQL)
via: http://readwrite.com/2013/03/12/proprietary-hadoop-is-a-losing-strategy
Monday, 11 March 2013
What It Means to Be “all In” on Hadoop
Another post about the Pivotal HD and the accompanying statements, this time from Matthew Aslett:
Pivotal HD is not Hadoop
Neither is Cloudera’s Distribution, including Apache Hadoop.
Nor the Hortonworks Data Platform.
Nor the MapR Distribution.
Nor IBM’s InfoSphere BigInsights.
Nor the WANdisco Distro.
Nor Intel’s Distribution for Apache Hadoop.
Original title and link: What It Means to Be “all In” on Hadoop (©myNoSQL)
via: http://blogs.the451group.com/information_management/2013/03/11/all-in-on-hadoop/
Monday, 4 March 2013
How Many Hadoops?
The short answer is there is only one Apache Hadoop distribution.
The long answer is that there are many distributions that include Apache Hadoop or are claiming compatibility with Apache Hadoop.
The oldest and probably most popular: Cloudera’s Distribution of Hadoop (CDH)
The 100% open source: Hortonworks Data Platform.
The prioprietary: MapR.
The blue one: IBM InfoSphere BigInsights.
The latest: WANdisco Hadoop WDD, Intel Distribution of Hadoop and Pivotal HD from EMC Greenplum.
There’s also the version Facebook’s running on their cluster which includes Facebook Corona: a different approach to job scheduling and resource management.
But this list is not complete as it doesn’t include appliances featuring Hadoop. In this category we have:
- Oracle’s Big Data appliance featuring Cloudera’s Distribution of Hadoop
- Netapp’s Hadooplers
- EMC Greenplum DCA
- Teradata Aster Discovery Platform featuring Hortonworks’s Hadoop Data Platform
- Data Direct Networks (DDN)
I hope I didn’t miss any important ones1. As a conclusion for this list, my question is: who is actually benefiting from all these distributions?
-
I left aside for now Hadoop-as-a-Service. ↩
Original title and link: How Many Hadoops? (©myNoSQL)