<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0"><channel><atom:link rel="hub" href="http://tumblr.superfeedr.com/" xmlns:atom="http://www.w3.org/2005/Atom"/><description>NoSQL Databases and Polyglot Persistence: A Curated Guide</description><title>myNoSQL</title><generator>Tumblr (3.0; @nosql)</generator><link>http://nosql.mypopescu.com/</link><item><title>Cassandra at SocialFlow with Drew Robb - Powered by NoSQL</title><description>&lt;p&gt;To alternate a bit after yesterday’s educational &lt;a href="http://" target="_blank"&gt;CQL: SQL for Cassandra&lt;/a&gt; in the &lt;a href="http://nosql.mypopescu.com/tagged/Cassandra-NYC-2011" target="_blank"&gt;Cassandra NYC 2011 video series&lt;/a&gt;  from DataStax, today’s video is Drew Robb covering Cassandra usage at SocialFlow for capturing real-time data from Twitter and Bit.ly.&lt;/p&gt;
&lt;!-- more --&gt;
&lt;div class="embedded smartembed youtube"&gt;
&lt;iframe class="youtube-player" type="text/html" width="520" height="390" src="http://www.youtube.com/embed/KlCMkBGeI8Y" frameborder="0"&gt;&lt;/iframe&gt;
&lt;/div&gt;
&lt;p&gt;For watching more videos from this event follow the &lt;a href="http://nosql.mypopescu.com/tagged/Cassandra-NYC-2011" target="_blank"&gt;Cassandra NYC 2011 tag&lt;/a&gt;. &lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;Cassandra at SocialFlow with Drew Robb - Powered by NoSQL&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:595348292657290cd0a43c1268b37bd2e0029064--&gt;</description><link>http://nosql.mypopescu.com/post/17085273933</link><guid>http://nosql.mypopescu.com/post/17085273933</guid><pubDate>Sun, 05 Feb 2012 12:14:05 +0200</pubDate><category>Cassandra</category><category>Powered by NoSQL</category><category>NoSQL presentations</category><category>NoSQL videos</category><category>DataStax</category><category>NoSQL Conference</category><category>Cassandra NYC 2011</category></item><item><title>XFS: the filesystem of the future?</title><description>&lt;p&gt;Jonathan Corbet &lt;a href="http://lwn.net/Articles/476263/" rel="external nofollow" target="_blank"&gt;summarizing&lt;/a&gt; a presentation about the present and future of XFS by Dave Chinner:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;XFS is often seen as the filesystem for people with massive amounts of data. It serves that role well, Dave said, and it has traditionally performed well for a lot of workloads. Where things have tended to fall down is in the  writing of metadata; support for workloads that generate a lot of metadata writes has been a longstanding weak point for the filesystem. In short, metadata writes were slow, and did not really scale past even a single CPU.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;After the break the video of Dave Chinner’s presentation, “XFS: Recent and Future Adventures in Filesystem scalability”.&lt;/p&gt;

&lt;!-- more --&gt;

&lt;div class="embedded smartembed youtube"&gt;
&lt;iframe class="youtube-player" type="text/html" width="520" height="390" src="http://www.youtube.com/embed/FegjLbCnoBw" frameborder="0"&gt;&lt;/iframe&gt;
&lt;/div&gt;
&lt;p&gt;Even if it’s very long, make sure you check the &lt;a href="http://lwn.net/Articles/476263/" rel="external nofollow" target="_blank"&gt;comment thread&lt;/a&gt;.&lt;/p&gt;
&lt;p class="x"&gt;&lt;a href="http://twitter.com/spyced" rel="external nofollow" target="_blank"&gt;Jonathan Ellis&lt;/a&gt; &lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16926987171" rel="permalink" style="color:red" target="_blank"&gt;XFS: the filesystem of the future?&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/17036994009</link><guid>http://nosql.mypopescu.com/post/17036994009</guid><pubDate>Sat, 04 Feb 2012 19:12:05 +0200</pubDate><category>XFS</category><category>file system</category></item><item><title>CQL: SQL for Cassandra with Eric Evans - NoSQL videos</title><description>&lt;p&gt;The fine folks from DataStax have made available the presentations from their &lt;a href="http://www.datastax.com/events/cassandranyc2011" rel="external nofollow" target="_blank"&gt;Cassandra NYC 2011 event&lt;/a&gt;. &lt;/p&gt;
&lt;p&gt;The first video to post here is Eric Evans’s presentation on Cassandra Query Language. &lt;/p&gt;

&lt;!-- more --&gt;

&lt;div class="embedded smartembed youtube"&gt;
&lt;iframe class="youtube-player" type="text/html" width="520" height="390" src="http://www.youtube.com/embed/_l_ymGJ3oPk" frameborder="0"&gt;&lt;/iframe&gt;
&lt;/div&gt;
&lt;p&gt;&lt;/p&gt;&lt;div class="embedded smartembed slideshare" style="width:520px" id="__ss_10489017"&gt;
&lt;iframe class="smartembed slideshare" src="http://www.slideshare.net/slideshow/embed_code/10489017?rel=0" width="520" height="435" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"&gt;&lt;/iframe&gt;
&lt;div class="smartembed-ref-slideshare"&gt;&lt;a href="http://www.slideshare.net/jericevans/cql-sql-in-cassandra" rel="nofollow external" target="_blank"&gt;CQL: SQL for Cassandra&lt;/a&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;p&gt;For watching more videos from this event follow the &lt;a href="http://nosql.mypopescu.com/tagged/Cassandra-NYC-2011" target="_blank"&gt;Cassandra NYC 2011 tag&lt;/a&gt;. &lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;CQL: SQL for Cassandra with Eric Evans - NoSQL videos&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:78bb1dc8118feae6456dc65b5658d6b237f4e228--&gt;</description><link>http://nosql.mypopescu.com/post/17024051739</link><guid>http://nosql.mypopescu.com/post/17024051739</guid><pubDate>Sat, 04 Feb 2012 12:14:05 +0200</pubDate><category>Cassandra</category><category>CQL</category><category>NoSQL presentations</category><category>NoSQL videos</category><category>DataStax</category><category>NoSQL Conference</category><category>Cassandra NYC 2011</category></item><item><title>A bit of history around Hadoop Companies</title><description>&lt;a href="http://www.businessinsider.com/this-former-yahoo-ers-startup-is-so-hot-even-the-cia-invested-in-it-2012-1"&gt;A bit of history around Hadoop Companies&lt;/a&gt;: &lt;p&gt;Imagine this story 5 years from now… with all the scars from the &lt;del&gt;battle&lt;/del&gt; competition with the other companies trying to monetize Hadoop and … the millions in the bank.&lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;A bit of history around Hadoop Companies&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:912c8eceed6213262823899af93481558a9eccba--&gt;</description><link>http://nosql.mypopescu.com/post/16972932787</link><guid>http://nosql.mypopescu.com/post/16972932787</guid><pubDate>Fri, 03 Feb 2012 15:10:05 +0200</pubDate><category>Hadoop</category></item><item><title>Designing HBase Schema to Best Support Specific Queries</title><description>&lt;a href="http://stackoverflow.com/questions/8983503/designing-hbase-schema-to-best-support-specific-queries"&gt;Designing HBase Schema to Best Support Specific Queries&lt;/a&gt;: &lt;p&gt;Real scenario, very good analysis of different data access requirements, and three possible solutions. What’s your pick?&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The problem is fairly simple - I am storing “notifications” in hbase, each of which has a status (“new”, “seen”, and “read”). Here are the API’s I need to provide:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Get all notifications for a user&lt;/li&gt;
&lt;li&gt;Get all “new” notifications for a user&lt;/li&gt;
&lt;li&gt;Get the count of all “new” notifications for a user&lt;/li&gt;
&lt;li&gt;Update status for a notification&lt;/li&gt;
&lt;li&gt;Update status for all of a user’s notifications&lt;/li&gt;
&lt;li&gt;Get all “new” notifications accross the database&lt;/li&gt;
&lt;li&gt;Notifications should be scannable in reverse chronological order and allow pagination.&lt;/li&gt;
&lt;/ul&gt;&lt;/blockquote&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;Designing HBase Schema to Best Support Specific Queries&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:69de95d4ddaae99618573e6b6fa2aefd3290c964--&gt;</description><link>http://nosql.mypopescu.com/post/16969934679</link><guid>http://nosql.mypopescu.com/post/16969934679</guid><pubDate>Fri, 03 Feb 2012 12:30:05 +0200</pubDate><category>HBase</category><category>data modeling</category></item><item><title>Modeling A/B Tests With Cassandra</title><description>&lt;a href="http://www.datastax.com/dev/blog/ab-testing-with-apache-cassandra"&gt;Modeling A/B Tests With Cassandra&lt;/a&gt;: &lt;p&gt;A Cassandra data modeling session around a real-life scenario: tracking data for A/B tests: &lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;With most things in life data modeling in Cassandra can be compared to learning to
ride a bike. It can be scary, you might fall off, but in the end once you learn a
few fundamental concepts everything will be easier to do. The goal of this article
is to get you comfortable with a basic data modeling scenario that you will likely
see in the real world.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;Modeling A/B Tests With Cassandra&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:8ba5b08d048548afb37fa14ecdf02c3125d55268--&gt;</description><link>http://nosql.mypopescu.com/post/16969720380</link><guid>http://nosql.mypopescu.com/post/16969720380</guid><pubDate>Fri, 03 Feb 2012 12:15:05 +0200</pubDate><category>Cassandra</category><category>data modeling</category></item><item><title>What's the big deal about Big Data?</title><description>&lt;a href="http://informationarbitrage.com/post/16121669634/whats-the-big-deal-about-big-data"&gt;What's the big deal about Big Data?&lt;/a&gt;: &lt;p&gt;Roger Ehrenberg (Founder and Managing Partner of IA Ventures):&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Every so often a term becomes so beloved by media that it moves from “instructive” to “hackneyed” to “worthless,” and Big Data is one of those terms. […] But since this time the term Big Data has become diluted. Very diluted. So much so that it is almost totally meaningless. Does Big Data mean new kinds of databases? Sure. Does it mean innovative ways to visualize data to create actionable intelligence? Absolutely. Can it be applied to the health care sector? Without question. Has it contributed to the rise of the Data Scientist? Mos def.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And I thought &lt;a href="http://nosql.mypopescu.com/post/16576613628/analysts-predictions-for-hadoop-market" target="_blank"&gt;I was off&lt;/a&gt;.&lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;What’s the big deal about Big Data?&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:ab15b1edec271ee1d8fb5c606313cb8380b9a93f--&gt;</description><link>http://nosql.mypopescu.com/post/16967616680</link><guid>http://nosql.mypopescu.com/post/16967616680</guid><pubDate>Fri, 03 Feb 2012 10:00:05 +0200</pubDate><category>BigData</category></item><item><title>The time for NoSQL is now</title><description>&lt;a href="http://osintegrators.com/node/76"&gt;The time for NoSQL is now&lt;/a&gt;: &lt;p&gt;Andrew C. Oliver:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The transition to NoSQL databases will take time. We still don’t have TOAD, Crystal Reports, query language standardization and other essential tools needed for mass adoption. There will be missteps (i.e. I may need a different type of database for reporting than for my operational system), but I truly think this is one technology that isn’t just marketing.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This coming from someone that was happy to discover back in 1998 all the knobs in Oracle.&lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;The time for NoSQL is now&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:6aa983232188630b18986e363798088d94375413--&gt;</description><link>http://nosql.mypopescu.com/post/16967451179</link><guid>http://nosql.mypopescu.com/post/16967451179</guid><pubDate>Fri, 03 Feb 2012 09:51:06 +0200</pubDate><category>NoSQL databases</category><category>RDBMS</category><category>Oracle</category><category>NoSQL future</category></item><item><title>What does DynamoDB mean for the enteprise world?</title><description>&lt;a href="http://www.acunu.com/blogs/tim-moreton/welcome-party-dynamodb/"&gt;What does DynamoDB mean for the enteprise world?&lt;/a&gt;: &lt;p&gt;Tim Moreton&lt;sup id="fnref:2-fn-Moreton"&gt;&lt;a href="#fn:2-fn-Moreton" rel="footnote" target="_blank"&gt;1&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I expect not much in the short term, until some key issues are solved. First, the sources and consumers of data are still on-site. These guys are tackling a specific technical limitation, not necessarily looking to re-architect their wider systems, which are often complex and inter-dependent. Second, security and regulatory concerns may need addressing. Third, the TCO needs to stack up. A quick and dirty back of the envelope calculation suggests that although it’s free to get started with DynamoDB, for the sort of deployment sizes we’re seeing, DynamoDB works out considerably more expensive than alternatives like Acunu deployed on hardware (even after accounting for typical full costing for outsourced data centers).&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Keeping my eyes on all things NoSQL for more than 2 years, I’d say that NoSQL databases in general do not mean much for the enterprise world&lt;sup id="fnref:2-fn-world"&gt;&lt;a href="#fn:2-fn-world" rel="footnote" target="_blank"&gt;2&lt;/a&gt;&lt;/sup&gt;. &lt;em&gt;Yet&lt;/em&gt;.&lt;/p&gt;
&lt;div class="footnote"&gt;
&lt;hr&gt;&lt;ol&gt;&lt;li id="fn:2-fn-Moreton"&gt;
&lt;p&gt;Tim Moreton: CEO Acunu &lt;a href="#fnref:2-fn-Moreton" rev="footnote" title="Jump back to footnote 1 in the text" target="_blank"&gt;↩&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id="fn:2-fn-world"&gt;
&lt;p&gt;&lt;a href="http://nosql.mypopescu.com/post/16943400029/datastaxs-ceo-thoughts-on-the-nosql-market-and" target="_blank"&gt;DataStax’s CEO, Billy Bosworth seems to agree&lt;/a&gt;. &lt;a href="#fnref:2-fn-world" rev="footnote" title="Jump back to footnote 2 in the text" target="_blank"&gt;↩&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16944216575" rel="permalink" style="color:red" target="_blank"&gt;What does DynamoDB mean for the enteprise world?&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/16944216575</link><guid>http://nosql.mypopescu.com/post/16944216575</guid><pubDate>Fri, 03 Feb 2012 01:48:17 +0200</pubDate><category>DynamoDB</category><category>Acunu</category><category>Cassandra</category><category>NoSQL databases</category><category>NoSQL market</category></item><item><title>DataStax's CEO thoughts on the NoSQL Market and Competition</title><description>&lt;a href="http://www.datastax.com/2012/01/my-thoughts-on-amazons-dynamodb"&gt;DataStax's CEO thoughts on the NoSQL Market and Competition&lt;/a&gt;: &lt;p&gt;Billy Bosworth&lt;sup id="fnref:2-fn-Bosworth"&gt;&lt;a href="#fn:2-fn-Bosworth" rel="footnote" target="_blank"&gt;1&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Personally, I have never believed that other post-relational (aka NoSQL/Hadoop) database companies were our primary competition.  The brute fact of the matter is that if you put us all together, we are still not statistically relevant compared to the overall DBMS market.&lt;/p&gt;
&lt;p&gt;I had only one real personal fear coming into this market: That I would sink a big portion of my life into something that would never take hold in the mainstream.  I suspect that would be a truly awful ending for all of us in this space. But thanks to companies like Amazon and Oracle, that feels highly unlikely now, and that is a great thing.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Just to play the devil advocate for a second: Oracle won’t lose much in the NoSQL market if things don’t work out well and &lt;a href="http://nosql.mypopescu.com/post/16942975288/get-them-by-the-data" target="_blank"&gt;Amazon’s DynamoDB is part of a larger plan&lt;/a&gt;. But for all the NoSQL database companies it is an all-or-nothing game&lt;sup id="fnref:2-fn-game"&gt;&lt;a href="#fn:2-fn-game" rel="footnote" target="_blank"&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;div class="footnote"&gt;
&lt;hr&gt;&lt;ol&gt;&lt;li id="fn:2-fn-Bosworth"&gt;
&lt;p&gt;Billy Bosworth: CEO DataStax &lt;a href="#fnref:2-fn-Bosworth" rev="footnote" title="Jump back to footnote 1 in the text" target="_blank"&gt;↩&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id="fn:2-fn-game"&gt;
&lt;p&gt;An all-or-nothing game is not the same with &lt;a href="http://nosql.mypopescu.com/post/16164435190/the-state-of-nosql-in-2012" target="_blank"&gt;a winner-takes-all game&lt;/a&gt;  &lt;a href="#fnref:2-fn-game" rev="footnote" title="Jump back to footnote 2 in the text" target="_blank"&gt;↩&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16943400029" rel="permalink" style="color:red" target="_blank"&gt;DataStax’s CEO thoughts on the NoSQL Market and Competition&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/16943400029</link><guid>http://nosql.mypopescu.com/post/16943400029</guid><pubDate>Fri, 03 Feb 2012 01:36:44 +0200</pubDate><category>NoSQL market</category><category>Cassandra</category><category>DynamoDB</category><category>DataStax</category><category>Amazon</category></item><item><title>Get them by the data</title><description>&lt;a href="http://www.theregister.co.uk/2012/01/25/amazon_cloud_enterprise_storage/"&gt;Get them by the data&lt;/a&gt;: &lt;p&gt;Gavin Clarke and Chris Mellor about AWS Storage Gateway:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Once you’ve got them by the data, of course, their hearts and minds will follow, and Amazon’s using the AWS Storage Gateway beta as a sampler for the rest of its compute cloud.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The Storage Gateway is another piece, &lt;a href="http://nosql.mypopescu.com/post/16517845270/using-amazon-elastic-mapreduce-with-dynamodb-nosql" target="_blank"&gt;together with S3, DynamoDB, SimpleDB, Elastic MapReduce&lt;/a&gt;, in Amazon’s great strategical puzzle of a complete polyglot platform. &lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16942975288" rel="permalink" style="color:red" target="_blank"&gt;Get them by the data&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/16942975288</link><guid>http://nosql.mypopescu.com/post/16942975288</guid><pubDate>Fri, 03 Feb 2012 01:27:00 +0200</pubDate><category>Amazon</category><category>DynamoDB</category><category>SimpleDB</category><category>Elastic MapReduce</category><category>RDS</category><category>Storage Gateway</category></item><item><title>MongoDB Tips and Tricks: You Only Wish MongoDB Wasn't Relational</title><description>&lt;a href="http://seanhess.github.com/2012/02/01/mongodb_relational.html"&gt;MongoDB Tips and Tricks: You Only Wish MongoDB Wasn't Relational&lt;/a&gt;: &lt;blockquote&gt;
&lt;p&gt;So when you read that MongoDB is a document store, you might get the wonderful idea to store your relationships in a big document. Since mongo lets you reach into objects, you can query against them, right?&lt;/p&gt;
&lt;p&gt;Several times, we’ve excitedly begun a schema this way, only to be forced to pull the nested documents out into their own collection. I’ll show you why, and why it’s not a big deal.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I’m no MongoDB expert, but the suggested solution requires: 1) two network roundtrips; 2) an additional index. &lt;/p&gt;
&lt;p&gt;The way I’d look at this problem is:&lt;/p&gt;
&lt;ol&gt;&lt;li&gt;what is the most frequent operation: reading a blog post and all its comments or displaying all the comments of a specific user?&lt;/li&gt;
&lt;li&gt;assuming the answer is reading a blog post and all its comments, I’d ask myself how frequent is the other operation. &lt;ol&gt;&lt;li&gt;if it’s something that I need to perform just once in a while, I’d  consider using MongoDB MapReduce, even if that would be suboptimal. &lt;/li&gt;
&lt;li&gt;if it’s a frequent operation, then I’d consider adding a separate collection for comments. Even better, I’d add a per user collection for comments.&lt;/li&gt;
&lt;/ol&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;MongoDB Tips and Tricks: You Only Wish MongoDB Wasn’t Relational&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:fed4b4047537b6552fefc0983b15a114e5cc71aa--&gt;</description><link>http://nosql.mypopescu.com/post/16942611058</link><guid>http://nosql.mypopescu.com/post/16942611058</guid><pubDate>Fri, 03 Feb 2012 01:20:06 +0200</pubDate><category>MongoDB</category></item><item><title>Thoughts on SimpleDB, DynamoDB and Cassandra</title><description>&lt;a href="http://perfcap.blogspot.com/2012/01/thoughts-on-simpledb-dynamodb-and.html"&gt;Thoughts on SimpleDB, DynamoDB and Cassandra&lt;/a&gt;: &lt;p&gt;Adrian Cockcroft:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;So the lesson here is that for a first step into NoSQL, we went with a hosted solution so that we didn’t have to build a team of experts to run it, and we didn’t have to decide in advance how much scale we needed. Starting again from scratch today, I would probably go with DynamoDB. It’s a low “friction” and developer friendly solution.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;You can look at this in two ways: 1) a biased opinion of someone that has already betted on Amazon with the infrastructure of a multi-billion business; 2)  the opinion of someone that has accumulated a ton of experience in the NoSQL space and that is successfully&lt;sup id="fnref:2-fn-successfully"&gt;&lt;a href="#fn:2-fn-successfully" rel="footnote" target="_blank"&gt;1&lt;/a&gt;&lt;/sup&gt; running the infrastructure of a multi-billion business on NoSQL solutions. I’d  strongly suggest you to think of it as the latter.&lt;/p&gt;
&lt;div class="footnote"&gt;
&lt;hr&gt;&lt;ol&gt;&lt;li id="fn:2-fn-successfully"&gt;
&lt;p&gt;Netflix was one of the few companies that continued to operate during Amazon’s EBS major failure. &lt;a href="#fnref:2-fn-successfully" rev="footnote" title="Jump back to footnote 1 in the text" target="_blank"&gt;↩&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16941671515" rel="permalink" style="color:red" target="_blank"&gt;Thoughts on SimpleDB, DynamoDB and Cassandra&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/16941671515</link><guid>http://nosql.mypopescu.com/post/16941671515</guid><pubDate>Fri, 03 Feb 2012 01:05:28 +0200</pubDate><category>DynamoDB</category><category>SimpleDB</category><category>Cassandra</category><category>AWS</category></item><item><title>Neo4j 1.6 GA Release: Heroku, Cypher, Lucene 3.5</title><description>&lt;p&gt;&lt;a href="http://java.dzone.com/news/neo4j-cloud-neo4j-16-ga" rel="external nofollow" target="_blank"&gt;Announced last week&lt;/a&gt;, Jörn Kniv aka Neo4j 1.6 features:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Improved Cypher (the query language)&lt;/li&gt;
&lt;li&gt;Web admin - Full Neo4j Shell commands, including versioned Cypher syntax.&lt;/li&gt;
&lt;li&gt;Kernel improvements&lt;/li&gt;
&lt;li&gt;Upgraded Lucene version to 3.5.&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;Also the Neo guys have been &lt;a href="http://neo4j-challenge.herokuapp.com/" rel="external nofollow" target="_blank"&gt;pushing quite a bit&lt;/a&gt; their public beta Heroku add-on.&lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16933010342" rel="permalink" style="color:red" target="_blank"&gt;Neo4j 1.6 GA Release: Heroku, Cypher, Lucene 3.5&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/16933010342</link><guid>http://nosql.mypopescu.com/post/16933010342</guid><pubDate>Thu, 02 Feb 2012 22:40:04 +0200</pubDate><category>Neo4j</category><category>NoSQL releases</category></item><item><title>NoSQL Market from Couchbase Perspective</title><description>&lt;a href="http://www.dbms2.com/2012/02/01/couchbase-update/"&gt;NoSQL Market from Couchbase Perspective&lt;/a&gt;: &lt;p&gt;James Philips (Couchbase) for Curt Monash:&lt;/p&gt;
&lt;blockquote&gt;
&lt;ul&gt;&lt;li&gt;MongoDB is the big competition. He believes Couchbase has an excellent win rate vs. 10gen for actual paying accounts.&lt;/li&gt;
&lt;li&gt;DataStax/Cassandra wins over Couchbase only when multi-data-center capability is important. Naturally, multi-data-center capability is planned for Couchbase. (Indeed, that’s one of the benefits of swapping in CouchDB at the back end.)&lt;/li&gt;
&lt;li&gt;Redis has “dropped off the radar”, presumably because there’s no particular persistence strategy for it.&lt;/li&gt;
&lt;li&gt;Riak doesn’t show up much.&lt;/li&gt;
&lt;/ul&gt;&lt;/blockquote&gt;
&lt;p&gt;I assume this is sort of a pre-sales/sales department 100k feet overview.&lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16922331195" rel="permalink" style="color:red" target="_blank"&gt;NoSQL Market from Couchbase Perspective&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/16922331195</link><guid>http://nosql.mypopescu.com/post/16922331195</guid><pubDate>Thu, 02 Feb 2012 17:53:11 +0200</pubDate><category>Couchbase</category><category>MongoDB</category><category>Cassandra</category><category>Redis</category><category>Riak</category></item><item><title>MongoDB Tips &amp; Tricks: Using MongoDB ObjectIds as created-on timestamps</title><description>&lt;a href="http://blog.fiesta.cc/post/16470048697/using-and-abusing-mongodb-objectids-as-created-on"&gt;MongoDB Tips &amp; Tricks: Using MongoDB ObjectIds as created-on timestamps&lt;/a&gt;: &lt;blockquote&gt;
&lt;p&gt;One of my favorite MongoDB tricks is the ability to use an ObjectId (the default type for MongoDB’s _id primary key) as a timestamp for when a document was created. Here’s how it works:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;  $ import pymongo
  $ db = pymongo.Connection().test
  $ db.test.insert({'hello': 'world'})
  ObjectId('4f202e64e6fb1b56ff000000')
  $ doc = db.test.find_one()
  $ doc['_id'].generation_time
  datetime.datetime(2012, 1, 25, 16, 31, 32, tzinfo
&lt;/code&gt;&lt;/pre&gt;
&lt;/blockquote&gt;
&lt;p&gt;Mike Dirolf used to work for 10gen so he probably knows quite a few such MongoDB tips &amp; tricks.&lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16921636810" rel="permalink" style="color:red" target="_blank"&gt;MongoDB Tips &amp; Tricks: Using MongoDB ObjectIds as created-on timestamps&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/16921636810</link><guid>http://nosql.mypopescu.com/post/16921636810</guid><pubDate>Thu, 02 Feb 2012 17:32:42 +0200</pubDate><category>MongoDB</category></item><item><title>Hadoop and Seismic Data Processing</title><description>&lt;a href="http://www.cloudera.com/blog/2012/01/seismic-data-science-hadoop-use-case/"&gt;Hadoop and Seismic Data Processing&lt;/a&gt;: &lt;blockquote&gt;
&lt;p&gt;Geophysicists have been pushing the limits of high-performance computing for more than three decades; they were early adopters of the first Cray supercomputers as well as the massively parallel Connection Machine. Today, the most challenging seismic data processing tasks are performed on custom compute clusters that take advantage of multiple GPUs per node, high-performance networking and storage systems for fast data access.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;How many fields we’ve never heard of have handcrafted over years their own solutions to deal with big data that would fit so nicely in Hadoop today?&lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16921084348" rel="permalink" style="color:red" target="_blank"&gt;Hadoop and Seismic Data Processing&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/16921084348</link><guid>http://nosql.mypopescu.com/post/16921084348</guid><pubDate>Thu, 02 Feb 2012 17:13:05 +0200</pubDate><category>Hadoop</category><category>Powered by Hadoop</category></item><item><title>NoSQL tutorials: Storing User Preference in Amazon DynamoDB using the Mobile SDKs</title><description>&lt;a href="http://aws.amazon.com/articles/7439603059327617"&gt;NoSQL tutorials: Storing User Preference in Amazon DynamoDB using the Mobile SDKs&lt;/a&gt;: &lt;p&gt;Just a CRUD tutorial for DynamoDB but based on a scenario that makes sense and demoing the API with two languages (Objective-C and Java): &lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The sample mobile application described here demonstrates how to store user preferences in Amazon DynamoDB. Because more and more people are using multiple mobile devices, connecting these devices to the cloud and storing user preferences in the cloud enables developers to provide a more uniform cross-device experience for their users.&lt;/p&gt;
&lt;p&gt;This article shows sample code for both the iOS and Android platforms.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;NoSQL tutorials: Storing User Preference in Amazon DynamoDB using the Mobile SDKs&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:c58a677f07fee95c7486f84b3028377d8d547082--&gt;</description><link>http://nosql.mypopescu.com/post/16916864311</link><guid>http://nosql.mypopescu.com/post/16916864311</guid><pubDate>Thu, 02 Feb 2012 14:03:06 +0200</pubDate><category>DynamoDB</category><category>iOS</category><category>Android</category><category>AWS</category><category>Objective C</category><category>Java</category></item><item><title>Doug Cutting About Hadoop's Adoption</title><description>&lt;a href="http://radar.oreilly.com/2012/02/hadoop-doug-cutting-apache-data-processing.html"&gt;Doug Cutting About Hadoop's Adoption&lt;/a&gt;: &lt;p&gt;Doug Cutting expressing his suprise with Hadoop’s growth in an interview with Audrey Watters over O’Reillly Radar:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Yes. I didn’t expect Hadoop to become such a central component of data processing. I recognized that Google’s techniques would be useful to other search engines and that open source was the best way to spread these techniques. But I did not realize how many other folks had big data problems nor how many of these Hadoop applied to.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Hadoop is not Doug Cutting’s first widely successful open source project, so I’m tempted to think this is just pure modesty. &lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/post/16916520425" rel="permalink" style="color:red" target="_blank"&gt;Doug Cutting About Hadoop’s Adoption&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;</description><link>http://nosql.mypopescu.com/post/16916520425</link><guid>http://nosql.mypopescu.com/post/16916520425</guid><pubDate>Thu, 02 Feb 2012 13:46:11 +0200</pubDate><category>Hadoop</category><category>NoSQL future</category></item><item><title>Getting off the CouchDB... or Lessons Learned while Experimenting in Production</title><description>&lt;a href="http://blog.signalhq.com/2012/01/24/getting-off-the-couchdb/"&gt;Getting off the CouchDB... or Lessons Learned while Experimenting in Production&lt;/a&gt;: &lt;blockquote&gt;
&lt;p&gt;The move to CouchDB went well. Pages in our web application that would occasionally time out were now loading in a couple of seconds. And, our MySQL database was much, much happier. We liked CouchDB so much that we started planning a feature that would make heavy use of CouchDB’s schema-less nature.&lt;/p&gt;
&lt;p&gt;And that’s when the wheels came off.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Word of caution: this is not the “CouchDB sucks so we went with MongoDB” type of post. It’s   more of “we thought CouchDB can solve one of our problems, but then got confused and thought it can solve world hunger. So we decided to throw a bunch of data to it to see if it sticks. Surprise! It didn’t.” &lt;/p&gt;
&lt;p&gt;Just to be clear, I’m not defending CouchDB and everything John Wood writes about it is correct. It’s just that experimenting with CouchDB in a non-production environment or at least reading myNoSQL would have already offered all those answers.&lt;/p&gt;

&lt;p class="cc" style="font-style: italic; font-size: 0.9em;"&gt;Original title and link: &lt;a href="http://nosql.mypopescu.com/" rel="permalink" style="color:red" target="_blank"&gt;Getting off the CouchDB… or Lessons Learned while Experimenting in Production&lt;/a&gt; (&lt;a href="http://nosql.mypopescu.com" style="display:none;visibility:hidden;" target="_blank"&gt;NoSQL database&lt;/a&gt;©myNoSQL)&lt;/p&gt;

&lt;!--quid:ee2b968f1ee6ee19a123de27dd09d04c11e05c0f--&gt;</description><link>http://nosql.mypopescu.com/post/16912580137</link><guid>http://nosql.mypopescu.com/post/16912580137</guid><pubDate>Thu, 02 Feb 2012 09:36:05 +0200</pubDate><category>CouchDB</category><category>Lucene</category><category>Solr</category><category>ElasticSearch</category><category>MongoDB</category></item></channel></rss>

