Release Release · snowflakedb/snowpark-python

1.24.0 (2024-10-28)

Updated Session class to be thread-safe. This allows concurrent DataFrame transformations, DataFrame actions, UDF and stored procedure registration, and concurrent file uploads when using the same Session object.
- The feature is disabled by default and can be enabled by setting FEATURE_THREAD_SAFE_PYTHON_SESSION to True for account.
- Updating session configurations, like changing database or schema, when multiple threads are using the session may lead to unexpected behavior.
- When enabled, some internally created temporary table names returned from DataFrame.queries API are not deterministic, and may be different when DataFrame actions are executed. This does not affect explicit user-created temporary tables.
Added support for 'Service' domain to session.lineage.trace API.
Added support for copy_grants parameter when registering UDxF and stored procedures.
Added support for the following methods in DataFrameWriter to support daisy-chaining:
- option
- options
- partition_by
Added support for snowflake_cortex_summarize.

Improved the following new capability for function snowflake.snowpark.functions.array_remove it is now possible to use in python.
Disables sql simplification when sort is performed after limit.
- Previously, df.sort().limit() and df.limit().sort() generates the same query with sort in front of limit. Now, df.limit().sort() will generate query that reads df.limit().sort().
- Improve performance of generated query for df.limit().sort(), because limit stops table scanning as soon as the number of records is satisfied.

Fixed a bug where the automatic cleanup of temporary tables could interfere with the results of async query execution.
Fixed a bug in DataFrame.analytics.time_series_agg function to handle multiple data points in same sliding interval.
Fixed a bug that created inconsistent casing in field names of structured objects in iceberg schemas.

Deprecated warnings will be triggered when using snowpark-python with Python 3.8. For more details, please refer to https://docs.snowflake.com/en/developer-guide/python-runtime-support-policy.

Added support for np.subtract, np.multiply, np.divide, and np.true_divide.
Added support for tracking usages of __array_ufunc__.
Added numpy compatibility support for np.float_power, np.mod, np.remainder, np.greater, np.greater_equal, np.less, np.less_equal, np.not_equal, and np.equal.
Added numpy compatibility support for np.log, np.log2, and np.log10
Added support for DataFrameGroupBy.bfill, SeriesGroupBy.bfill, DataFrameGroupBy.ffill, and SeriesGroupBy.ffill.
Added support for on parameter with Resampler.
Added support for timedelta inputs in value_counts().
Added support for applying Snowpark Python function snowflake_cortex_summarize.
Added support for DataFrame.attrs and Series.attrs.
Added support for DataFrame.style.

Improved generated SQL query for head and iloc when the row key is a slice.
Improved error message when passing an unknown timezone to tz_convert and tz_localize in Series, DataFrame, Series.dt, and DatetimeIndex.
Improved documentation for tz_convert and tz_localize in Series, DataFrame, Series.dt, and DatetimeIndex to specify the supported timezone formats.
Added additional kwargs support for df.apply and series.apply ( as well as map and applymap ) when using snowpark functions. This allows for some position independent compatibility between apply and functions where the first argument is not a pandas object.
Improved generated SQL query for iloc and iat when the row key is a scalar.
Removed all joins in iterrows.
Improved documentation for Series.map to reflect the unsupported features.
Added support for np.may_share_memory which is used internally by many scikit-learn functions. This method will always return false when called with a Snowpark pandas object.

Fixed a bug where DataFrame and Series pct_change() would raise TypeError when input contained timedelta columns.
Fixed a bug where replace() would sometimes propagate Timedelta types incorrectly through replace(). Instead raise NotImplementedError for replace() on Timedelta.
Fixed a bug where DataFrame and Series round() would raise AssertionError for Timedelta columns. Instead raise NotImplementedError for round() on Timedelta.
Fixed a bug where reindex fails when the new index is a Series with non-overlapping types from the original index.
Fixed a bug where calling __getitem__ on a DataFrameGroupBy object always returned a DataFrameGroupBy object if as_index=False.
Fixed a bug where inserting timedelta values into an existing column would silently convert the values to integers instead of raising NotImplementedError.
Fixed a bug where DataFrame.shift() on axis=0 and axis=1 would fail to propagate timedelta types.
DataFrame.abs(), DataFrame.__neg__(), DataFrame.stack(), and DataFrame.unstack() now raise NotImplementedError for timedelta inputs instead of failing to propagate timedelta types.

Fixed a bug where DataFrame.alias raises KeyError for input column name.
Fixed a bug where to_csv on Snowflake stage fails when data contains empty strings.