Skip to content

[VL] Add default compress for VeloxColumnarBatchSerializer to reduce memory usage#12373

Open
HaoChen-ch wants to merge 4 commits into
apache:mainfrom
HaoChen-ch:velox-serializer-compress
Open

[VL] Add default compress for VeloxColumnarBatchSerializer to reduce memory usage#12373
HaoChen-ch wants to merge 4 commits into
apache:mainfrom
HaoChen-ch:velox-serializer-compress

Conversation

@HaoChen-ch

@HaoChen-ch HaoChen-ch commented Jun 26, 2026

Copy link
Copy Markdown

This PR adds a new config option, spark.gluten.sql.columnar.backend.velox.columnarBatchSerializerCompression, which enables compression in the VeloxColumnarBatchSerializer. The serializer is used to serialize columnar batches (e.g. for broadcast), and applying compression reduces the serialized size, which lowers broadcast memory usage.
The config defaults to none and supports none, zstd, zlib, snappy, lz4, and gzip.
before
img_v3_02131_39f1f43e-97dd-45f1-a89a-4d1ce6ec15hu
after set spark.gluten.sql.columnar.backend.velox.columnarBatchSerializerCompression=zstd
image
bytesSize decrease from 1160593862 to 29097045

@github-actions github-actions Bot added CORE works for Gluten Core VELOX labels Jun 26, 2026
@github-actions

Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@github-actions

Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@zjuwangg zjuwangg left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

L GTM

@taiyang-li taiyang-li left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@github-actions

Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CORE works for Gluten Core VELOX

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants