-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
remote: remove local fallback for remote execution #7202
Comments
remote_local_fallback has incidentally been broken since 1532df0, where the catches with fallback responses (and the reasonable error handling and reporting) are not catching the now-RuntimeError that were previously RetryException, and caught by IOException handlers. This is incredibly broken even without the fallback, and could be addressed, but will not improve the error handling here, which now dies in the skyframe AbstractParallelEvaluator when, for instance, a DEADLINE_EXCEEDED occurs during download. |
Link to the email announcement: proposal One of the use cases discovered for a local fallback is when cache download/upload reached a Timeout. |
I think local fallback is valuable, if only in situations with poor network connectivity. Instead of removing the flag altogether, we can make it so If set explicitly, we should also allow |
I like the idea! In general, discussion was paused for a while, because we realized that a One of the main concerns is a risk of a cache poisoning, but I don't want to completely disable caching for locally executed actions. Especially considering that there are only a few use cases of the feature known to us and for them caching local actions is a safe thing to do.
The only concern here is how could we identify whether |
Why would you need to do that? |
In order to know whether the flag was set explicitly or not. You've suggested to toggle it to |
Thank you for contributing to the Bazel repository! This issue has been marked as stale since it has not had any activity in the last 2.5 years. It will be closed in the next 14 days unless any other activity occurs or one of the following labels is added: "not stale", "awaiting-bazeler". Please reach out to the triage team ( |
I think this is still valid @coeuvre? |
That's interesting. I believe local and remote should be identical otherwise dynamic execution wouldn't work. cc @tjgq for the work on multiple platform dynamic execution. |
The
--remote_local_fallback
flag falls back to local execution if there was some error (i.e. network) running the action remotely. This is inherently unsafe as the local and remote execution environment might not be identical.The text was updated successfully, but these errors were encountered: