You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There is the experimental ArrayReader interface, however, this is very hard to use correctly and exposes a lot of what should probably remain implementation details.
Describe the solution you'd like
I would like an interface, perhaps similar in spirit to that added to the write side by #4871, that achieves the following:
Makes it easy to parallelise both:
The decoding of the parquet leaf columns
The re-assembly of the arrow data from the dremel encodings
Facilitates overriding the data source, e.g. by exposing the RowGroups trait
Avoids exposing too many internal implementation details
Describe alternatives you've considered
Additional context
The text was updated successfully, but these errors were encountered:
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Currently we provide
ParquetRecordBatchReader
andParquetRecordBatchStream
as interfaces to read parquet data.These interfaces are relatively straightforward to use, but have limitations:
Arc<dyn Array>
in parquet record batch reader. #4864)There is the experimental
ArrayReader
interface, however, this is very hard to use correctly and exposes a lot of what should probably remain implementation details.Describe the solution you'd like
I would like an interface, perhaps similar in spirit to that added to the write side by #4871, that achieves the following:
Describe alternatives you've considered
Additional context
The text was updated successfully, but these errors were encountered: