Extend support for BYTE_STREAM_SPLIT to FIXED_LEN_BYTE_ARRAY, INT32, and INT64 primitive types #6048
Labels
enhancement
Any new improvement worthy of a entry in the changelog
parquet
Changes to the parquet crate
Please correct me if I'm wrong! It seems arrow-rs has added BYTE_STREAM_SPLIT support for float types, but not for other numerical data types like INT32.
Since then, the Parquet spec has been expanded to extend BYTE_STREAM_SPLIT encoding to other numerical primitive types: apache/parquet-format#229. The C++ PoC is here: apache/arrow#40094.
It would be good for arrow-rs to support BYTE_STREAM_SPLIT encoding additionally for FIXED_LEN_BYTE_ARRAY, INT32, and INT64.
The text was updated successfully, but these errors were encountered: