-
Notifications
You must be signed in to change notification settings - Fork 169
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Support Binary in shuffle writer #106
Conversation
I'm not sure why this issue doesn't happen before... Seems like binary data are already used in various shuffle suites. |
// TODO: this is not accurate, but should be good enough for now | ||
DataType::Binary => len * 100 + len * 4, | ||
DataType::LargeBinary => len * 100 + len * 8, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh, I missed binary type.
core/src/execution/datafusion/shuffle_writer/shuffle_writer_test.rs
Outdated
Show resolved
Hide resolved
They use columnar shuffle, I think, which has much more coverage. |
I see. Thanks, let me try to add a test case in the spark side that leverages native shuffle. |
Fixed. Please let me know that whether we should put test code in the shuffle_writer.rs or in a different file. |
The TPCDS CI pipeline failure is going to be fixed at #108. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Merged, thanks! |
Force using the provided Comet version by purging previous snapshots from the local repository. This avoids inadvertently picking up the wrong snapshot. This only occurs if `dev/install-comet-spark.sh` is explicitly provided a Comet version to use.
Which issue does this PR close?
Closes #105
Rationale for this change
bug fixes
What changes are included in this PR?
Add binary pattern matching in shuffle_writer.rs
How are these changes tested?
Add new test cases in rust and spark side