What is the big deal about Sort? Sort is fundamental to the MapReduce
framework, the data is sorted between the Map and Reduce phases (see below).
Syncsort’s contribution allows native Hadoop sort to be replaced by an
alternative sort implementation, for both Map and Reduce sides, i.e. it
makes Sort phase pluggable.
This blog is called myNoSQL and it is written by me, Alex Popescu, a software architect with a passion for open source and communities.
It records my readings, learnings, and opinions on NoSQL databases, polyglot persistence, and distributed systems -- subjects that I'm passionate about.
The opinions expressed here are my own, and no other party necessarily agrees with them.
If you feel I'm biased, I probably am.