Blaze
Welcome to your VuePress site
Key Features
Performance
- Supports most native operators/expressions and fine-grained failback.
- Powered by Rust, 2x faster on TPC-DS benchmark.
- Performs significantly better in production environments.
Production ready
- Verified on production environments with exabytes of data.
- Supports complex production scenarios like JSON parsing, UDF/UDTF, etc.
- Resolved various stability and data consistency issues.
Easy to Use
- Simple to build and install to Spark.
- Easy to configuration.
- Full-featured execution metrics.
Compatibility
- Adapted to Spark mainline versions.
- Supports different storage systems like HDFS, S3, etc.
Ecosystem
- Supports data lake system like Hudi, Paimon.
- Supports Remote Shuffle Service like Apache Celeborn.
Community
- Some cooperators have applied Blaze on production.
- More are researching and evaluating Blaze.
Benchmarks
Blaze has passed all TPC-DS/TPC-H benchmark cases. Comparing to Spark-3.5, Blaze is running ~2x faster and save ~50% cluster resources. See Benchmark Details.
Cooperators
Blaze currently has some users and contributors. You are invited to join the list by emailing blaze@kwai.com.