Announced 2 hours ago, by Twitter’s analytics infrastructure engineer Dmitriy Ryaboy, here comes Parquet:
We created Parquet to make the advantages of compressed, efficient columnar
data representation available to any project in the Hadoop ecosystem,
regardless of the choice of data processing framework, data model, or
The Parquet format page describes the details of the Apache Thrift metadata encoding, supported types, Thrift definitions, etc.
Original title and link: Parquet - Columnar Storage Format for Hadooop by Twitter and Cloudera ( ©myNoSQL)