-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Build failing on remote cache problems unexpectedly #22119
Comments
--remote_local_fallback
@guw Could you please provide complete steps to reproduce this issue? |
@iancha1992 I am not sure how. This seems to rely on network issues within Google Cloud. All we do is running We did notice a detail, it seems to be failing only when compiling unit tests (within Example:
But the one that fail are usually compiling a unit test class and they also don't print a stacktrace. |
I think this is a duplicate of #20123. |
"unable to finalize action" and |
Is "unable to finalize action" failing the build when remote cache fails expected at this point? The behavior we see is inconsistent. Sometimes its a warning and sometimes it fails. |
it's inconsistent because an action can access the remote cache at different stages. Before executing the action locally, Bazel first checks the AC in remote cache. If this check failed (e.g. connection reset), it treats it as CacheNotFound (as well as print the warning as in #22119 (comment)) and continue with local execution. Otherwise, Bazel will retrieve the outputs from CAS in remote cache. If the connection is broken at this point, you will see build error in #22119 (comment). It would be nice to have it fallback to local execution, but it requires non-trivial changes. |
Would it be possible to return with an exit code that encourages retries? We implemented a retry logic by parsing the Bazel output for |
Description of the bug:
Our build is unreliable. The culprit seems to be remote cache problems (we use Google Cloud storage).
This is unexpected because we have the following in
.bazelrc
:Remote cache reliability issues should not impact the Bazel build. Especially intermittent network issues should not fail a Bazel build. Those are expensive. If a cache upload or download fails the build should consider the remote cache unreliable and continue without problems.
Which operating system are you running Bazel on?
Linux
What is the output of
bazel info release
?release 7.1.1
The text was updated successfully, but these errors were encountered: