An overview of the three SQL-on-Hadoop execution models: batch (tens of minutes and up), interactive (up to minutes), and operational (sub-second). It covers which kinds of applications each model suits and the main characteristics of the tools and frameworks in each category:
Within the big data landscape there are multiple approaches to accessing,
analyzing, and manipulating data in Hadoop. Each depends on key
considerations such as latency, ANSI SQL completeness (and the ability to
tolerate machine-generated SQL), developer and analyst skillsets, and
The usual suspects are included: Hive, Impala, Presto, Spark/Shark, Drill.
Original title and link: SQL on Hadoop: An overview of frameworks and their applicability