Cloudera, Hortonworks, and MapR are the most popular Hadoop distributions available today. Yet even for this short list, unbiased comparisons of their cluster performance are hard to find.
We’ve prepared a 65-page research paper that gives a vendor-independent overview of the Cloudera, Hortonworks, and MapR distributions. It includes 83 diagrams that explore performance under 7 types of workloads. Download your copy to learn:
- detailed performance results for 4-, 8-, 12-, and 16-node clusters
- how the size of a cluster affects data processing speed
- how different clusters behave under CPU- and disk-bound workloads (including Bayes, DFSIO, Hive aggregation, PageRank, Sort, TeraSort, and WordCount)
- what issues slow down deployment and how to maximize Hadoop processing speed