UI hangs when attempting to view Sensor logs if pod has more than one container #9459

Closed
3 tasks done
jsvk opened this issue Aug 28, 2022 · 0 comments · Fixed by #9438
Comments

jsvk (Contributor) commented Aug 28, 2022

Checklist

  • Double-checked my configuration.
  • Tested using the latest version.
  • Used the Emissary executor.

Summary

What happened/what you expected to happen?

We're using a Kubernetes mutating webhook to inject a fluent-bit container into certain running pods in order to collect their logs and ship them to Splunk. When attempting to view Sensor logs in the UI, the Argo Workflows API hangs and eventually returns a 504 Gateway Timeout.

This seems to happen because the API hits this error when querying the container logs:

time="2022-08-25T14:30:21.960Z" level=error msg="a container name must be specified for pod adobe-platform--[snip], choose one of: [main fluent-bit]" namespace=ns-team-adobe-platform--[snip] podName=adobe-platform--[snip]

The API already supports a podLogOptions.container parameter: by appending podLogOptions.container=main to the API call, I confirmed the call returns instantly with the logs we expect. The fix is therefore to pass the container name from the UI.
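For illustration, a sketch of the workaround at the API level. The host, token, namespace, sensor name, and the exact endpoint path are assumptions/placeholders; the relevant part is the podLogOptions.container=main query parameter:

# Hypothetical call against the Argo Server sensor log endpoint; $ARGO_SERVER,
# $ARGO_TOKEN, $NAMESPACE, and $SENSOR_NAME are placeholders, and the endpoint
# path is assumed. Without podLogOptions.container the request hangs; with it,
# the logs stream back immediately.
curl -s -H "Authorization: Bearer $ARGO_TOKEN" \
  "https://$ARGO_SERVER/api/v1/stream/sensors/$NAMESPACE/logs?name=$SENSOR_NAME&podLogOptions.container=main"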

The container name main is set here: https://github.com/argoproj/argo-events/blob/56196143ecbf8b451d5ecb02a0ca13835e20954c/controllers/sensor/resource.go#L267

A potential fix is here: #9438

What version are you running?

Argo Workflows v3.3.8

Diagnostics

Paste the smallest workflow that reproduces the bug. We must be able to run the workflow.

This happens with any workflow, as long as the Sensor pod has more than one container. A quick check for the multi-container condition is sketched below the diagnostics commands.
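To confirm the multi-container condition, something like the following lists the containers of the Sensor pod (the sensor-name label selector is an assumption; adjust it to however your sensor pods are labelled):

# List container names in the Sensor pod; expect more than one, e.g. "main fluent-bit".
# The sensor-name label selector is an assumption; substitute your own values.
kubectl get pod -n $NAMESPACE -l sensor-name=$SENSOR_NAME \
  -o jsonpath='{.items[*].spec.containers[*].name}'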

# Logs from the workflow controller:
kubectl logs -n argo deploy/workflow-controller | grep ${workflow} 

# If the workflow's pods have not been created, you can skip the rest of the diagnostics.

# The workflow's pods that are problematic:
kubectl get pod -o yaml -l workflows.argoproj.io/workflow=${workflow},workflow.argoproj.io/phase!=Succeeded

# Logs from in your workflow's wait container, something like:
kubectl logs -c wait -l workflows.argoproj.io/workflow=${workflow},workflow.argoproj.io/phase!=Succeeded

Message from the maintainers:

Impacted by this bug? Give it a 👍. We prioritise the issues with the most 👍.

jsvk added a commit to jsvk/argo-workflows that referenced this issue Aug 29, 2022
jsvk added a commit to jsvk/argo-workflows that referenced this issue Aug 29, 2022
jsvk added a commit to jsvk/argo-workflows that referenced this issue Aug 29, 2022
@alexec alexec added the area/ui label Sep 5, 2022
alexec pushed a commit that referenced this issue Sep 5, 2022
juchaosong pushed a commit to juchaosong/argo-workflows that referenced this issue Nov 3, 2022