Make cloudpickle the default pickle library #21298

Open
damccorm opened this issue Jun 4, 2022 · 3 comments

@damccorm
Contributor

damccorm commented Jun 4, 2022

Beam currently depends on a very old version of dill because upgrading it proved challenging. This causes significant inconvenience to existing users; see:

#22893, #32842, #32617, #24458.

We plan to resolve this by making cloudpickle the default pickler and removing the mandatory dependency on dill.

Migration plan: s.apache.org/beam-cloudpickle-next-steps

Prior work (for historical reasons/additional context): 
#15472, https://s.apache.org/beam-picklers, BEAM-13386.

@tvalentyn
Contributor

Blocking issue: #26209

We should also investigate whether unpickling the same DoFn twice (e.g. in two different threads) results in unwanted shared state.

Until this is addressed, cloudpickle support should be considered experimental.
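
A minimal sketch of the kind of check meant above: unpickle one payload in two threads and test whether the copies share mutable state. It makes several assumptions. It uses the standard-library `pickle` as a stand-in, not cloudpickle, and `CountingFn` is a hypothetical plain class, not a Beam DoFn. It does not reproduce #26209. Standard `pickle` stores classes by reference, so class-level attributes are shared here by construction; how cloudpickle behaves in the same situation is what would need investigating.

```python
import pickle
import threading


class CountingFn:
    """Hypothetical stand-in for a DoFn; not part of Beam."""

    seen = []  # class-level and mutable: one list for every instance

    def __init__(self):
        self.buffer = []  # instance-level: serialized inside the payload

    def process(self, element):
        self.seen.append(element)    # mutates the class-level list
        self.buffer.append(element)  # mutates this instance only


payload = pickle.dumps(CountingFn())
results = {}


def worker(name):
    fn = pickle.loads(payload)  # each thread unpickles its own copy
    fn.process(name)
    results[name] = fn


threads = [threading.Thread(target=worker, args=(n,)) for n in ("a", "b")]
for t in threads:
    t.start()
for t in threads:
    t.join()

a, b = results["a"], results["b"]
shares_instance_state = a.buffer is b.buffer  # are the per-instance lists one object?
shares_class_state = a.seen is b.seen         # are the class-level lists one object?
print(shares_instance_state, shares_class_state)
```

With standard `pickle` the two copies get separate `buffer` lists but share `seen`. Running the same check with the payload produced and loaded by cloudpickle, on a class defined in `__main__` or inside a function, would show whether the behavior differs.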

@tvalentyn
Contributor

tvalentyn commented Nov 4, 2024

The blocking issue has been resolved in the 2.61.0 release.

@tvalentyn
Contributor

Updated the description with the current plan and linked duplicate issues.

@tvalentyn tvalentyn added the P1 label Mar 18, 2025