Cloudera, the global provider of the fastest, easiest, and most secure data management and analytics platform built on Apache Hadoop and the latest open source technologies, today released benchmark results that validate that Cloudera’s modern analytic database solution, powered by Apache Impala (incubating), not only delivers unprecedented capabilities for cloud-native workloads but does so at better cost performance compared to alternatives.
Impala uniquely offers elastic scalability, better flexibility, and direct Amazon S3 query ability unavailable from traditionally-architected systems such as Redshift. With a modern design, Impala decouples data and compute to provide the same high-performance SQL analytics whether cloud-natively over data in S3 or across a wide range of on-premise and cloud storage options. Furthermore, Impala enables all these capabilities while also delivering up to 275% more cost-efficiency and up to 10x greater performance compared to Amazon’s analytic database, Redshift, equating to more value all within an open platform
Using queries from the TPC-DS industry standard benchmark, Cloudera compared Impala running on the cloud (both cloud-natively over S3 and over local EBS storage) to Amazon Redshift (only able to run over its own storage on dedicated AWS instances). Results from the benchmark show:
- Impala is over 200% less costly and over 10x faster on S3 compared to a general purpose tuned Redshift
- Impala is still 8% less costly and 90% faster on S3 compared to a pre-tuned Redshift for specific fixed reporting queries
- Impala is 28-275% less costly and 42-400% faster on EBS compared to either pre-tuned or general purpose tuned Redshift