Microsoft PolyBase: Unifying Relational and Non-Relational Data

Announced today, Microsoft PolyBase is:

a fundamental breakthrough in data processing used in SQL Server 2012 Parallel Data Warehouse to enable truly integrated query across Hadoop and relational data.

This is a direction almost all analytic databases are taking, either independently or by using a well-established distribution of Hadoop. Separately, companies like Hadapt are offering similar products.

Getting back to PolyBase, according to the docs:

  • it supports standard T-SQL queries that can join tables stored in a relational engine with data stored in Hadoop
  • it integrates with Microsoft BI tools

