Building on Eric Baldeschwieler’s defense of HDFS, Cloudera’s Charles Zedlewski adds a couple of HDFS advantages:
- Choice: Customers get to work with any leading hardware vendor and let the best possible price / performer win the decision, not whatever the vendor decided to bundle in.
- Portability: It is possible for customers running Hadoop distributions based on HDFS to move between those different distributions without having to reformat the cluster or copy massive amounts of data. When you’re talking about petabytes of data, this kind of portability is vital. Without it, your vendor has incredible leverage when it comes time to negotiate the next purchase.
- Shared industry R&D We at Cloudera are proud of our employee’s own contributions to HDFS, and they collaborate with their colleagues at Hortonworks. But today you will find that IBM, Microsoft and VMware are also contributing to HDFS to make it work better with their products. In the future I predict you’ll find hard drive, networking and server manufacturers also add patches to HDFS to ensure their technologies run optimally with it.
Original title and link: Defending Hadoop’s HDFS - Cloudera Version ( ©myNoSQL)