Monday, 29 October 2012
Overview of Dremel-Like Solutions: Moving Beyond Hadoop for Big Data Needs
Until I learn more about the recently announced Cloudera Impala and Druid from Metamarkets, this article by Jaikumar Vijayan should offer a good overview, despite some inherent mistakes [1], of the solutions aiming to provide alternatives to the batch-processing nature of Hadoop:
- Google Dremel (BigQuery)
- Cloudera Impala
- Metamarkets Druid
- Nodeable StreamReduce
- SAP HANA integrated with Hadoop, etc.
[1] Just one example: “If you can stand latencies of a few seconds, Hadoop is fine. But Hadoop MapReduce is never going to be useful for sub-second latencies”. Then: “The technology [i.e., Google Dremel] can run queries over trillion-row data tables in seconds…”
Maybe just one more: consider the title “Moving beyond Hadoop” and then the quote from Google’s Ju-kay Kwek: “Google uses Dremel in conjunction with MapReduce. […] Hadoop and Dremel are distributed computing technologies, but each was built to address very different problems.”