Six Rules That Should Govern Your Big Data Existence

I cannot say I particularly enjoyed Avinash Kaushik’s presentation style (you can see his recorded presentation after the break), but his post summarizing and expanding on his Strata talk, A Big Data Imperative: Driving Big Action, contains this very thought-provoking list of rules to apply to Big Data strategies:

  1. Don’t buy the hype of big data and throw millions of dollars away. But don’t stand still.

    Take 15% of your decision making budget and give it to one really, really smart person (Ninja! OK, Data Scientist) and give that person the freedom to experiment in the cloud with big data possibilities for your company.

  2. Big thinking about what big data should be solving for is supremely important.

    When you have access to all this data, the answers you find will be surprising, the insights you deliver will be brilliant, and your impact on the business will be huge. But that can only happen if there is a model that defines the purpose of your sweet big data adventures.

  3. The 10/90 rule for magnificent data success still holds true.

    For every $100 you have available to invest in making smart decisions, invest $10 in tools and vendor services, and invest $90 in big brains (aka people, aka analysis ninjas, aka you!).

  4. Shoot for right time data, not real time data.

    Understand when is the right time for data in your organization. Shoot for systems and processes that match delivery of data (better still, insights) to that time frame. You’ll have less stress. You’ll focus on big, important, strategic things (real time data is really good at driving the best companies to do tactical silly things). And you’ll save a lot of money, because real time everything is really expensive!

  5. “Data quality sucks, just get over it.”

  6. Eliminating noise is even more important than finding a signal.

    With big data, it is so much more important to be magnificent at knowing what to ignore. You must know how to separate out all the noise in the disparate huge datasets to even have a fighting chance to start to look for the signal.
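The arithmetic behind rule 3 is simple enough to sketch. The snippet below is only an illustration of the 10/90 split, not anything from Kaushik's post: the function name `split_budget` and its `tools_share` parameter are my own assumptions.

```python
def split_budget(total, tools_share=0.10):
    """Split a decision-making budget per the 10/90 rule.

    `tools_share` is a hypothetical knob (default 10%) for the portion
    spent on tools and vendor services; the rest goes to people.
    """
    if total < 0:
        raise ValueError("budget must be non-negative")
    if not 0 <= tools_share <= 1:
        raise ValueError("tools_share must be between 0 and 1")
    tools = round(total * tools_share, 2)
    # People get whatever is left, so the two parts always sum to the total.
    people = round(total - tools, 2)
    return {"tools": tools, "people": people}


# For every $100: $10 in tools, $90 in big brains.
print(split_budget(100))
```

Computing the people share as the remainder, rather than as a separate percentage, keeps the two parts adding up to the budget even after rounding.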

Original title and link: Six Rules That Should Govern Your Big Data Existence (NoSQL database©myNoSQL)