ALL COVERED TOPICS

NoSQL Benchmarks NoSQL use cases NoSQL Videos NoSQL Hybrid Solutions NoSQL Presentations Big Data Hadoop MapReduce Pig Hive Flume Oozie Sqoop HDFS ZooKeeper Cascading Cascalog BigTable Cassandra HBase Hypertable Couchbase CouchDB MongoDB OrientDB RavenDB Jackrabbit Terrastore Amazon DynamoDB Redis Riak Project Voldemort Tokyo Cabinet Kyoto Cabinet memcached Amazon SimpleDB Datomic MemcacheDB M/DB GT.M Amazon Dynamo Dynomite Mnesia Yahoo! PNUTS/Sherpa Neo4j InfoGrid Sones GraphDB InfiniteGraph AllegroGraph MarkLogic Clustrix CouchDB Case Studies MongoDB Case Studies NoSQL at Adobe NoSQL at Facebook NoSQL at Twitter

NAVIGATE MAIN CATEGORIES

Close

The Rise of Hadoop: How many Hadoop-related solutions exist?

The CMSWire commented list of Hadoop-related solutions:

  1. Apache Hadoop
  2. Appistry CloudIQ Storage Hadoop Edition: a HDFS replacement improving on the single NameNode ( here).

    Shipping.

  3. IBM Distribution of Apache Hadoop: Apache Hadoop, a 32-bit Linux version of the IBM SDK for Java 6 SR 8, and an easy-to-use installer that will install and configure both Hadoop (including SSH setup) and Java (here).

    Shipping, but in alphaWorks

  4. IBM Global Parallel File System (GPFS): a high-performance shared-disk clustered file system developed by IBM (here).

    Shipping.

  5. Cloudera’s Distribution including Apache Hadoop: Cloudera’s packaging for Hadoop and Hadoop toolkit (here).

    Shipping.

  6. DataStax Brisk: using Apache Cassandra for Hadoop (and Hive) core services (here).

    Announced, but not released yet

  7. Amazon Elastic MapReduce: Amazon hosted Hadoop framework running on the infrastructure of Amazon EC2 and Amazon S3 (here).

    Shipping.

  8. Mapr: proprietary replacement for HDFS.

    Talked about

  9. CloudStore: the former Kosmos open-source distributed filesystem (here).

    Shipping[1]

  10. Pervasive DataRush: parallel data processing optimization for Hadoop jobs (here).

    Shipping.

  11. Cascading: query API and query Planner.

    Shipping.

  12. Apache Hive: data warehouse on top of Hadoop.

    Shipping

  13. Yahoo Pig: high-level data-flow language and execution framework for parallel computation.

    Shipping.

  14. Hadapt: hybrid architecture combining relational databases and Hadoop (here).

    Announced.

Some others are in the Hadoop toolkit.

Instead of “shipping” another criteria that can be used is number of users and amount of processed data.


  1. Kosmos current release is 0.5 dating June, 2010  

Original title and link: The Rise of Hadoop: How many Hadoop-related solutions exist? (NoSQL databases © myNoSQL)

via: http://www.cmswire.com/cms/information-management/open-source-hadoop-framework-faces-challenges-as-big-data-interest-rises-010983.php