Blaze

Welcome to your VuePress site

Key Features

Performance

Supports most native operators/expressions and fine-grained failback.
Powered by Rust, 2x faster on TPC-DS benchmark.
Performs significantly better in production environments.

Production ready

Verified on production environments with exabytes of data.
Supports complex production scenarios like JSON parsing, UDF/UDTF, etc.
Resolved various stability and data consistency issues.

Easy to Use

Simple to build and install to Spark.
Easy to configuration.
Full-featured execution metrics.

Compatibility

Adapted to Spark mainline versions.
Supports different storage systems like HDFS, S3, etc.

Ecosystem

Supports data lake system like Hudi, Paimon.
Supports Remote Shuffle Service like Apache Celeborn.

Community

Some cooperators have applied Blaze on production.
More are researching and evaluating Blaze.

Benchmarks

Blaze has passed all TPC-DS/TPC-H benchmark cases. Comparing to Spark-3.5, Blaze is running ~2x faster and save ~50% cluster resources. See Benchmark Details.

Cooperators

Blaze currently has some users and contributors. You are invited to join the list by emailing blaze@kwai.com.