NoSQL Benchmarks NoSQL use cases NoSQL Videos NoSQL Hybrid Solutions NoSQL Presentations Big Data Hadoop MapReduce Pig Hive Flume Oozie Sqoop HDFS ZooKeeper Cascading Cascalog BigTable Cassandra HBase Hypertable Couchbase CouchDB MongoDB OrientDB RavenDB Jackrabbit Terrastore Amazon DynamoDB Redis Riak Project Voldemort Tokyo Cabinet Kyoto Cabinet memcached Amazon SimpleDB Datomic MemcacheDB M/DB GT.M Amazon Dynamo Dynomite Mnesia Yahoo! PNUTS/Sherpa Neo4j InfoGrid Sones GraphDB InfiniteGraph AllegroGraph MarkLogic Clustrix CouchDB Case Studies MongoDB Case Studies NoSQL at Adobe NoSQL at Facebook NoSQL at Twitter



Storage Internal Data Formats

(Overheard) Kevin Weil[1]:

one petabyte for a trillion Tweets might become 10 petabytes for a trillion Tweets

So far I’ve ignored the topic of internal data formats. Now I’m wondering how important is it for midscale (nb non Google, non-Facebook, non-Twitter scale) applications?

Update: Make sure you are checking the comments below for more details on why data format is important.

  1. Kevin Weil: Analytics Lead at Twitter, @kevinweil  ()

Original title and link: Storage Internal Data Formats (NoSQL databases © myNoSQL)