A very long tutorial by Istvan Szegedi on how to integrate R with Cloudera Impala, through the ODBC driver:
Cloudera Impala is an exciting new technology to provide real-time,
interactive queries in Hadoop environment. It supports ODBC connectors and
this makes it possible to integrate it with many popular BI tools and
statistical software such as R. Together R and Impala provide an excellent
combination for data analyst to process massive data sets efficiently and
they can also support graphical representation of the result sets.
This blog is called myNoSQL and it is written by me, Alex Popescu, a software architect with a passion for open source and communities.
It records my readings, learnings, and opinions on NoSQL databases, polyglot persistence, and distributed systems -- subjects that I'm passionate about.
The opinions expressed here are my own, and no other party necessarily agrees with them.
If you feel I'm biased, I probably am.