Parquet: All content tagged as Parquet in NoSQL databases and polyglot persistence
Tuesday, 12 March 2013
Parquet - Columnar Storage Format for Hadooop by Twitter and Cloudera
Announced 2 hours ago, by Twitter’s analytics infrastructure engineer Dmitriy Ryaboy, here comes Parquet:
We created Parquet to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem, regardless of the choice of data processing framework, data model, or programming language.
The Parquet format page describes the details of the Apache Thrift metadata encoding, supported types, Thrift definitions, etc.
Original title and link: Parquet - Columnar Storage Format for Hadooop by Twitter and Cloudera (©myNoSQL)