BlazeBlaze
Introduction
  • Getting Started
  • Configuration
  • Benchmarks
  • v5.0.0
  • v4.0.1
  • v4.0.0
  • All Archived Releases
Blogs
GitHub
Introduction
  • Getting Started
  • Configuration
  • Benchmarks
  • v5.0.0
  • v4.0.1
  • v4.0.0
  • All Archived Releases
Blogs
GitHub
  • Introduction
  • Documents

    • Getting-Started
    • Configurations
    • Benchmarks
  • Archives

    • v5.0.0
    • v4.0.1
    • v4.0.0
    • All Archived Releases
  • Blogs

New features

  • supports spark3.0/3.1/3.2/3.3/3.4/3.5.
  • supports integrating with Apache Celeborn.
  • supports native ORC input format.
  • supports bloom filter join introduced in spark 3.5.
  • supports forceShuffledHashJoin for running tpch/tpcds benchmarks.
  • new supported native expression/functions: year, month, day, md5.

Bug fixes

  • add missing UDTF.terminate() invokes.
  • fix NPE while executing some native spark physical plans.

Performance

  • use custom implemented hash table for faster joining, supporting SIMD, bulk searching, memory prefetching, etc.
  • improve shuffle write performance.
  • reuse FSDataInputStream for same input file.

Download

VersionDateSourceBinaryRelease Notes
4.0.0Oct 10 2024v4.0.0.zip
v4.0.0.tar.gz
blaze-engine-spark-3.0-release-4.0.0-SNAPSHOT.jar
blaze-engine-spark-3.1-release-4.0.0-SNAPSHOT.jar
blaze-engine-spark-3.2-release-4.0.0-SNAPSHOT.jar
blaze-engine-spark-3.3-release-4.0.0-SNAPSHOT.jar
blaze-engine-spark-3.4-release-4.0.0-SNAPSHOT.jar
blaze-engine-spark-3.5-release-4.0.0-SNAPSHOT.jar
release notes
Prev
v4.0.1
Next
All Archived Releases