HPC: All content tagged as HPC in NoSQL databases and polyglot persistence

Stranger in a strange land: HPC and Big Data

Paul Mineiro sharing his notes and thoughts after attending an HPC event:

My plan was to observe the HPC community, try to get a feel how their worldview differs from my internet-centric “Big Data” mindset, and broaden my horizons. Intriguingly, the HPC guys are actually busy doing the opposite. They’re aware of what we’re up to, but they talk about Hadoop like it’s some giant livin’ in the hillside, comin down to visit the townspeople. Listening to them mapping what we’re up to into their conceptual landscape was very enlightening, and helped me understand them better.

No more ivory towers.

Original title and link: Stranger in a Strange Land: HPC and Big Data (NoSQL database©myNoSQL)


Big Data vs HPC

Found this great slide showing the fundamental differences between Big Data platforms and HPC in Steve Loughran's “My other computer is a datacenter” presentation (embedded at the end):

[Slide: Big Data vs HPC]

If there are just two things to remember from this slide:

  1. Big Data focuses on data, while HPC focuses on computation (petabytes vs. petaflops).
  2. Big Data platforms assume failure is inevitable and are architected for it.

Garth Gibson About HPC and Big Data Applications

Garth Gibson[1] discusses the similarities and differences between HPC and big data applications in terms of data scale and the challenges each faces.

Most of the time I take filesystems for granted. But that’s exactly what NoSQL databases do not do. As a quick example, Basho has designed a special backend for Riak whose main focus is minimizing disk access in order to provide consistent performance.