Releases: kedro-org/kedro-plugins
Releases · kedro-org/kedro-plugins
kedro-datasets-1.3.0
Major features and improvements
- Added pandas 2.0 support.
- Added SQLAlchemy 2.0 support (and dropped support for versions below 1.4).
- Added a save method to the APIDataSet
- Reduced constructor arguments for
APIDataSet
by replacing most arguments with a single constructor argumentload_args
. This makes it more consistent with other Kedro DataSets and the underlyingrequests
API, and automatically enables the full configuration domain: stream, certificates, proxies, and more. - Relaxed Kedro version pin to
>=0.16
- Added
metadata
attribute to all existing datasets. This is ignored by Kedro, but may be consumed by users or external plugins. - Added
ManagedTableDataSet
for managed delta tables on Databricks.
Bug fixes and other changes
- Relaxed
delta-spark
upper bound to allow compatibility with Spark 3.1.x and 3.2.x. - Upgraded required
polars
version to 0.17. - Renamed
TensorFlowModelDataset
toTensorFlowModelDataSet
to be consistent with all other plugins in kedro-datasets.
Community contributions
Many thanks to the following Kedroids for contributing PRs to this release:
kedro-telemetry v0.2.4
Release 0.2.4
What's Changed
- Fix telemetry bug - add checking for project statistics by @noklam in #156
- Kedro-Telemetry release 0.2.4 by @noklam in #157
Full Changelog: kedro-datasets-1.2.0...kedro-telemetry-0.2.4
kedro-datasets-1.2.0
Major features and improvements:
- Added
fsspec
resolution inSparkDataSet
to support more filesystems. - Added the
_preview
method to the PandasExcelDataSet
andCSVDataSet
classes.
Bug fixes and other changes
- Fixed a docstring in the Pandas
SQLQueryDataSet
as part of the Sphinx revamp on Kedro.
kedro-datasets-1.1.1
Bug fixes and other changes
- Fixed problematic docstrings causing Read the Docs builds on Kedro to fail.
kedro-datasets-1.1.0
Major features and improvements:
- Added the following new datasets:
Type | Description | Location |
---|---|---|
polars.CSVDataSet |
A CSVDataSet backed by polars, a lighting fast dataframe package built entirely using Rust. |
kedro_datasets.polars |
snowflake.SnowparkTableDataSet |
Work with Snowpark DataFrames from tables in Snowflake. | kedro_datasets.snowflake |
Bug fixes and other changes
- Add
mssql
backend to theSQLQueryDataSet
DataSet usingpyodbc
library. - Added a warning when the user tries to use
SparkDataSet
on Databricks without specifying a file path with the/dbfs/
prefix.
kedro-datasets-1.0.2
- Change reference to
kedro.pipeline.Pipeline
object throughout test suite withkedro.modular_pipeline.pipeline
factory. - Relaxed
PyArrow
range in line withPandas
- Fixed outdated links to the dill package documentation
kedro-datasets-1.0.1
- Fixed doc string formatting in
VideoDataSet
causing the documentation builds to fail.
kedro-docker v0.3.1
- Introduced two-stage builds with separated runtime environment and project code.
- Optimized image layers that store project code.
- Redundant Spark and Hadoop libs removed from Spark's template.
kedro-telemetry v0.2.3
- Modified the process for reading a Kedro project's package name and version to avoid a failed run when no
pyproject.toml
can be read. - Report the version of Kedro used to run a project, no longer report the project's name.
kedro-datasets v1.0.0
Release 1.0.0
- Sync datasets with
kedro.extras.datasets
by @noklam in #69 - Change
ParquetDataSet
to load using pandas instead of parquet by @SajidAlamQB in #89 - Release
kedro-datasets 1.0.0
by @ankatiyar in #90
Full Changelog: kedro-airflow-0.5.1...kedro-datasets-1.0.0