At the time I’m reading this Ask HN: To everybody who uses MapReduce: what problems do you solve?, there aren’t many interesting answers.
✚ Compare it with AskReddit: What is an invention that the human race is fully capable of making, but hasn’t been made yet?
Original title and link: To everybody who uses MapReduce: what problems do you solve? ( ©myNoSQL)
Thanks to Murat Demirbas, I got a link to the SOSP‘13 papers. Maybe is because of my myopic interest, but it seems like quite a few papers this year were focus on the topic of transactions and consistency:
- Speedy Transactions in Multicore In-Memory Databases by Stephen Tu, Wenting Zheng (MIT), Eddie Kohler (Harvard), Barbara Liskov, Samuel Madden (MIT)
- From ARIES to MARS: Transaction Support for Next-Generation, Solid-State Drives by Joel Coburn, Trevor Bunker, Meir Schwarz, Rajesh K. Gupta, Steven Swanson (University of California, San Diego)
- Transaction Chains: Achieving Serializability with Low Latency in Geo-Distributed Storage Systems1 by Yang Zhang, Russell Power, Siyuan Zhou, Yair Sovran (NYU), Marcos K. Aguilera (Microsoft Research), Jinyang Li (NYU)
- Consistency-Based Service Level Agreement for Cloud Storage by Douglas B. Terry, Vijayan Prabhakaran, Ramakrishna Kotla, Mahesh Balakrishnan, Marcos K. Aguilera (Microsoft Research), Hussam Abu-Libdeh (Cornell University)
- Everything You Always Wanted to Know about Synchronization but Were Afraid to Ask by Tudor David, Rachid Guerraoui, Vasileios Trigonakis (EPFL)
This SOSP‘13 page has download links for all papers.
Original title and link: Papers on transactions and consistency from SOSP’13 ( ©myNoSQL)
Even if there’s been almost 3 weeks since the announcement, Apache Hadoop 2 is too big of a news not to mention it here. If you want to read something about it, here are a couple of links:
What started out a few years ago as a scalable batch processing system for Java programmers has now emerged as the kernel of the operating system for big data.
A short interview with Rohit Bakhshi (product manager at Hortonwork) YARN Brings New Capabilities To Hadoop:
By turning Apache Hadoop 2.0 into a multi- application data system, YARN enables the Hadoop community to address a generation of new requirements IN Hadoop. YARN responds to these enterprise challenges by addressing the actual requirements at a foundational level rather than being commercial bolt-ons that complicate the environment for customers.
Mike Miller’s post on GigaOm: Why the world should care about Hadoop 2:
This might be surprising, because Hadoop 2 is not a blow-your-socks-off release. It is not packed with revolutionary new features from a user perspective. Instead, its greatest innovation is a glorious refactoring of some internal plumbing. But that plumbing grants the community of Hadoop developers the pathways they need to address some of Hadoops greatest shortcomings in comparison to both the commercial and the internal Google tools that Hadoop was derived from.
Last but not least, any article you can find about YARN and signed Aarun C. Murthy will be well worth reading (e.g. Apache Hadoop YARN – Background and an Overview, old but very very details series about YARN’s objectives, or Moving Hadoop Beyond Batch with Apache YARN
Original title and link: Apache Hadoop 2 - YARN is GA ( ©myNoSQL)
Aerospike is the only “Visionary” in the Gartner Magic Quadrant for Operational Database Management Systems [sponsor]
Words from this week’s sponsor:
Gartner has published the 2013 Magic Quadrant for Operational Database Management Systems and recognized Aerospike in the Visionaries Quadrant. The report examines the key capabilities of information management for transaction processing.
Access the full report http://www.aerospike.com/only-visionary-in-gartner-mq-2013/.
Original title and link: Aerospike is the only “Visionary” in the Gartner Magic Quadrant for Operational Database Management Systems [sponsor] ( ©myNoSQL)
A short (6‘25”) talk by Dan Kuebrich about the importance of using the right abstractions and visualizations when analyzing performance:
Speaking about performance analysis and visualization, Brendan Gregg’s “Systems Performance” book is available now.
Original title and link: Beyond averages ( ©myNoSQL)