Skip to content

Conversation

@mergify
Copy link
Contributor

@mergify mergify bot commented Oct 8, 2024

Type of change

  • Bug

Proposed commit message

In cases where GCS reports an error, we were logging the error but not
otherwise making use of the information. When GCS was reporting that a
requested object does not exist this was causing unnecessary log-spam and
not updating the set of failed jobs to remove the object, resulting in future
futile re-attempts to collect the object. When any other error was reported,
the set of failed jobs was not being updated, resulting in continued
re-attempts, even past the maximum retry count.

This fixes both cases by differentiating the two situations, logging only at
debug level when the object is reported to be missing and removing the object
from the failed rework set, and logging at error and increasing the failure
count for all other error cases.

Checklist

  • My code follows the style guidelines of this project
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have made corresponding change to the default configuration files
  • I have added tests that prove my fix is effective or that my feature works
  • I have added an entry in CHANGELOG.next.asciidoc or CHANGELOG-developer.next.asciidoc.

Disruptive User Impact

Author's Checklist

  • [ ]

How to test this PR locally

Related issues

Use cases

Screenshots

Logs


This is an automatic backport of pull request #41142 done by [Mergify](https://mergify.com).

…e error logs (#41142)

[filebeat][GCS] fix failed job handling and remove false-positive error logs

In cases where GCS report an error, we were logging the error but not
otherwise making use of the information. When GCS was reporting that a
requested object does note exist this was causing unnecessary log-spam and
not updating the set of failed jobs to remove the object, resulting in future
futile re-attempts to collect the object. When any other error was reported,
the set of failed jobs was not being updated, resulting in continued
re-attempts, even past the maximum retry count.

This fixes both cases by differentiating the two situations, logging only at
debug level when the object is reported to be missing and removing the object
from the failed rework set, and logging at error and increasing the failure
count for all other error cases.

(cherry picked from commit fbff318)
@mergify mergify bot added the backport label Oct 8, 2024
@mergify mergify bot requested a review from a team as a code owner October 8, 2024 05:27
@botelastic botelastic bot added the needs_team Indicates that the issue/PR needs a Team:* label label Oct 8, 2024
@botelastic
Copy link

botelastic bot commented Oct 8, 2024

This pull request doesn't have a Team:<team> label.

@ShourieG ShourieG merged commit ef40cad into 8.x Oct 8, 2024
@ShourieG ShourieG deleted the mergify/bp/8.x/pr-41142 branch October 8, 2024 13:25
@khushijain21 khushijain21 mentioned this pull request Jun 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

backport needs_team Indicates that the issue/PR needs a Team:* label

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants