Philip Russom clarifies some myths about Hadoop and MapReduce circulating inside the BI community:
- Hadoop consists of multiple products.
- Hadoop is open source but available from vendors, too.
- Hadoop is an ecosystem, not a single product.
- HDFS is a file system, not a database management system (DBMS).
- Hive resembles SQL but is not standard SQL.
- Hadoop and MapReduce are related but don’t require each other.
- MapReduce provides control for analytics, not analytics per se.
- Hadoop is about data diversity, not just data volume.
- Hadoop complements a DW; it’s rarely a replacement.
- Hadoop enables many types of analytics, not just Web analytics.
I do hope this lack of information and misconceptions are not real as otherwise some BI careers would really be endangered.
Original title and link: Busting 10 Myths About Hadoop ( ©myNoSQL)