Skip to content

amplify testSudo flake#19328

Closed
jelly wants to merge 1 commit intocockpit-project:mainfrom
jelly:TestReauthorize.testSudo-debug
Closed

amplify testSudo flake#19328
jelly wants to merge 1 commit intocockpit-project:mainfrom
jelly:TestReauthorize.testSudo-debug

Conversation

@jelly jelly added the no-test For doc/workflow changes, or experiments which don't need a full CI run, label Sep 14, 2023
@jelly
Copy link
Member Author

jelly commented Sep 14, 2023

@allisonkarlitskaya wew, this reproduces quite a lot. We have debug logs of the bridge now in the journal.

Seeing:

> warning: Peer exited with status 1
> warning: Peer exited with status 1
> warning: Peer exited with status 1
> warning: Peer exited with status 1
> warning: Peer exited with status 1
> log: fail
> warning: ListUnitsFiltered failed:  Your session has been terminated.

It seems something kills the bridge, and that's why we get terminated?

Relevant journal?

Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: writing to transport <cockpit.transports.StdioTransport object at 0x7fcfcc074450>
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: sending message 1:4!4 {'reply': [], 'id': '20'}
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: writing to transport <cockpit.transports.StdioTransport object at 0x7fcfcc074450>
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit._vendor.systemd_ctypes.bus-DEBUG: Doing async method call org.freedesktop.systemd1 /org/freedesktop/systemd1/unit/chronyd_2eservice org.freedesktop.DBus.Introspectable Introspect  ()
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.transports-DEBUG: Read ready on <cockpit.transports.SubprocessTransport object at 0x7fcfcbacc9d0> <cockpit.superuser.SuperuserPeer object at 0x7fcfcbb0e9d0> 19
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.transports-DEBUG:   got EOF
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.peer-DEBUG: Peer SuperuserPeer received unexpected EOF
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.transports-DEBUG: Process exited with status 1
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.peer-DEBUG: Peer SuperuserPeer exited, status 1
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: connection_lost
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.transports-DEBUG: won't attempt 15 to process 1732.  It exited already.
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.peer-DEBUG: Peer SuperuserPeer connection lost <class 'cockpit.peer.PeerExited'> 1
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.router-DEBUG: shutdown_endpoint(<cockpit.superuser.SuperuserPeer object at 0x7fcfcbb0e9d0>, {'problem': 'terminated', 'message': 'Peer exited with status 1'}) will close {'1:4!9', '1:2!4', '1:4!7', '1:3!2', '1:4!10', '1:4!6'}
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: sending message  {'command': 'close', 'channel': '1:4!9', 'problem': 'terminated', 'message': 'Peer exited with status 1'}
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: writing to transport <cockpit.transports.StdioTransport object at 0x7fcfcc074450>
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.router-DEBUG: router dropped channel 1:4!9
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: sending message  {'command': 'close', 'channel': '1:2!4', 'problem': 'terminated', 'message': 'Peer exited with status 1'}
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: writing to transport <cockpit.transports.StdioTransport object at 0x7fcfcc074450>
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.router-DEBUG: router dropped channel 1:2!4
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: sending message  {'command': 'close', 'channel': '1:4!7', 'problem': 'terminated', 'message': 'Peer exited with status 1'}
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: writing to transport <cockpit.transports.StdioTransport object at 0x7fcfcc074450>
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.router-DEBUG: router dropped channel 1:4!7
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: sending message  {'command': 'close', 'channel': '1:3!2', 'problem': 'terminated', 'message': 'Peer exited with status 1'}
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: writing to transport <cockpit.transports.StdioTransport object at 0x7fcfcc074450>
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.router-DEBUG: router dropped channel 1:3!2
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: sending message  {'command': 'close', 'channel': '1:4!10', 'problem': 'terminated', 'message': 'Peer exited with status 1'}
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: writing to transport <cockpit.transports.StdioTransport object at 0x7fcfcc074450>
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.router-DEBUG: router dropped channel 1:4!10
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: sending message  {'command': 'close', 'channel': '1:4!6', 'problem': 'terminated', 'message': 'Peer exited with status 1'}
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.protocol-DEBUG: writing to transport <cockpit.transports.StdioTransport object at 0x7fcfcc074450>
Sep 14 08:34:23 fedora-coreos-127-0-0-2-2301 ws[1656]: cockpit.router-DEBUG: router dropped channel 1:4!6

@martinpitt
Copy link
Member

@martinpitt martinpitt added the blocked Don't land until something else happens first (see task list) label Nov 13, 2023
@martinpitt
Copy link
Member

@allisonkarlitskaya says that this will completely change with her ongoing work to use the current FernyTransport API in cockpit and unify the implementation of our SSH and sudo channels. Let's put this on the shelf and revisit when it still happens afterwards.

@martinpitt
Copy link
Member

For the record: This flake still happens with @allisonkarlitskaya 's FernyTransport rewrite in PR #19668.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

blocked Don't land until something else happens first (see task list) no-test For doc/workflow changes, or experiments which don't need a full CI run,

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants