From 76b8fafbe143d8c31c30d953e8bd00b9ce335c6a Mon Sep 17 00:00:00 2001 From: Danny Mccormick Date: Fri, 21 Nov 2025 10:36:39 -0500 Subject: [PATCH] Update CHANGES to mention extras changes --- CHANGES.md | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/CHANGES.md b/CHANGES.md index 222d4b82cb25..4077f3f7d19f 100644 --- a/CHANGES.md +++ b/CHANGES.md @@ -120,7 +120,7 @@ - This change only affects pipelines that explicitly use the `pickle_library=dill` pipeline option. - While `dill==0.3.1.1` is still pre-installed on the official Beam SDK base images, it is no longer a direct dependency of the apache-beam Python package. This means it can be overridden by other dependencies in your environment. - If your pipeline uses `pickle_library=dill`, you must manually ensure `dill==0.3.1.1` is installed in both your submission and runtime environments. - - Submission environment: Install the dill extra in your local environment `pip install apache-beam[gcpdill]`. + - Submission environment: Install the dill extra in your local environment `pip install apache-beam[gcp,dill]`. - Runtime (worker) environment: Your action depends on how you manage your worker's environment. - If using default containers or custom containers with the official Beam base image e.g. `FROM apache/beam_python3.10_sdk:2.69.0` - Add `dill==0.3.1.1` to your worker's requirements file (e.g., requirements.txt) @@ -134,6 +134,9 @@ * (Python) The deterministic fallback coder for complex types like NamedTuple, Enum, and dataclasses now normalizes filepaths for better determinism guarantees. This affects streaming pipelines updating from 2.68 to 2.69 that utilize this fallback coder. If your pipeline is affected, you may see a warning like: "Using fallback deterministic coder for type X...". To update safely sepcify the pipeline option `--update_compatibility_version=2.68.0` ([#36345](https://github.com/apache/beam/pull/36345)). * (Python) Fixed transform naming conflict when executing DataTransform on a dictionary of PColls ([#30445](https://github.com/apache/beam/issues/30445)). This may break update compatibility if you don't provide a `--transform_name_mapping`. +* (Python) Split some extras out from the core Beam package. ([#30445](https://github.com/apache/beam/issues/30445)). + - If you use Enrichment with redis, Hadoop FileSystem, TFRecord, or some other packages, you may need to install some extras. + - To retain identical behavior to before, instead of `pip install apache-beam`, use `pip install apache-beam[hadoop,gcp,interactive,redis,test,tfrecord]`. * Removed deprecated Hadoop versions (2.10.2 and 3.2.4) that are no longer supported for [Iceberg](https://github.com/apache/iceberg/issues/10940) from IcebergIO ([#36282](https://github.com/apache/beam/issues/36282)). * (Go) Coder construction on SDK side is more faithful to the specs from runners without stripping length-prefix. This may break streaming pipeline update as the underlying coder could be changed ([#36387](https://github.com/apache/beam/issues/36387)). * Minimum Go version for Beam Go updated to 1.25.2 ([#36461](https://github.com/apache/beam/issues/36461)).