Display the runner type during CLI operations #2795

izaaklauer · 2021-12-03T14:12:46Z

Waypoint will soon automatically choose if operations will occur locally or remotely. Currently though, users have no visibility into where their operations take place. If there is some kind of error during the operation, users will find it valuable to know where that error occurred. For example, if it's a permissions error, they'll need to either update their local credentials providers, or go modify an ODR profile, depending on where the job ran.

With this change, the runner proto gets a new field, kind, which is either local, odr, or remote. Runners will know their own type on startup, and communicate that type back to the server when they first register.

When the CLI creates a job, it watches the job events to see when the job's assigned_runner ref is set. It then uses that runner ID to query the server for the complete runner proto, and displays the runner location.
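
To illustrate the flow described above, here is a minimal sketch of how a CLI could turn a runner's kind into the status line it prints. The types `RunnerKind`, `LocalRunner`, `RemoteRunner`, `ODRRunner`, and `Runner`, and the function `describeRunner`, are hypothetical stand-ins and not the generated protobuf types from this PR. The `PluginType` and `ProfileName` fields are also assumptions; how the real CLI resolves them is not shown in this description.

```go
package main

import "fmt"

// RunnerKind is a hypothetical stand-in for the "kind" oneof on the runner
// proto. The real generated names may differ.
type RunnerKind interface{ isRunnerKind() }

// LocalRunner, RemoteRunner, and ODRRunner model the three kinds named in
// the description: local, remote, and odr.
type LocalRunner struct{}
type RemoteRunner struct{}
type ODRRunner struct {
	PluginType  string // assumed field, e.g. "docker"
	ProfileName string // assumed field, e.g. "test"
}

func (LocalRunner) isRunnerKind()  {}
func (RemoteRunner) isRunnerKind() {}
func (ODRRunner) isRunnerKind()    {}

// Runner is a simplified runner record as the server might return it.
type Runner struct {
	ID   string
	Kind RunnerKind
}

// describeRunner returns the line a CLI could print once the job's
// assigned runner is known.
func describeRunner(r Runner) string {
	switch k := r.Kind.(type) {
	case LocalRunner:
		return "Performing operation locally"
	case RemoteRunner:
		return fmt.Sprintf("Performing this operation on a remote runner with id %q", r.ID)
	case ODRRunner:
		return fmt.Sprintf("Performing operation on %s with runner profile %s", k.PluginType, k.ProfileName)
	default:
		// Kind was never set, or is a kind this client does not know about.
		return fmt.Sprintf("Performing operation on a runner of unknown kind (id %q)", r.ID)
	}
}

func main() {
	fmt.Println(describeRunner(Runner{Kind: LocalRunner{}}))
	fmt.Println(describeRunner(Runner{ID: "01FP0N64TDN3QN4D0M83RGRPD4", Kind: RemoteRunner{}}))
	fmt.Println(describeRunner(Runner{Kind: ODRRunner{PluginType: "docker", ProfileName: "test"}}))
}
```

The `default` branch matters for older runners that registered before the field existed: their kind would be unset, and the CLI should still print something useful rather than fail.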

How to verify

Run a local build. Note the line Performing operation locally

$ waypoint build -local

» Building web...
Performing operation locally
✓ Initializing Docker client...
✓ Building image...
 │ Step 1/3 : FROM nginx:stable
 │  ---> c8d03f6b8b91
 │ Step 2/3 : COPY ./public/ /var/www
 │  ---> Using cache

Turn on a remote runner (but don't set a default or project runner profile), and run the op again with -local=false to force the job to execute on the "static" remote runner.
Note the line Performing this operation on a remote runner with id "01FP0N64TDN3QN4D0M83RGRPD4"

$ waypoint build -local=false

» Building learn-waypoint-lambda...
  Performing this operation on a remote runner with id "01FP0N64TDN3QN4D0M83RGRPD4"

» Cloning data from Git
  URL: https://github.com/hashicorp/waypoint-examples
  Ref: us-east-1

Set up an ODR profile.
Note the line Performing operation on docker with runner profile test

$ ~/dev/waypoint/waypoint build -local=false

» Building learn-waypoint-lambda...
  Performing operation on docker with runner profile test

» Cloning data from Git
  URL: https://github.com/hashicorp/waypoint-examples
  Ref: us-east-1
...

Future considerations

If an ODR operation fails, there's a good chance there will be some useful context in the ODR pod's logs, its env vars, or its platform-specific setup. Even with these changes, there is no good way to figure out which ODR instance (i.e. k8s pod) in particular executed the op. The best thing I've found is to compare launch timestamps, or grep the logs of all ODR pods for strings I expect to see.

I don't see any obvious path to get that context.

Hacky idea: We could have the task launcher plugin set some WAYPOINT_TASK_INSTANCE_NAME env var, and teach the runner to discover that and report it back to the server.

Better (but harder) idea: We could have the task plugin report DeclaredResources via an out parameter (similar to how status works), save those on a new task protobuf, and then tie that task id back to the runner proto somehow. I think this would probably require an RFC.

If anyone has any better ideas, I'm all ears.

izaaklauer · 2021-12-03T14:20:15Z

internal/server/singleprocess/service_job.go

@@ -337,7 +337,7 @@ func (s *service) onDemandRunnerStartJob(

 	// Arguments for the runner image. Waypoint is ALWAYS assumed to be
 	// the entrypoint for ODR images.
-	args := []string{"runner", "agent", "-vv", "-id", runnerId, "-odr"}
+	args := []string{"runner", "agent", "-vv", "-id", runnerId, "-odr", "-odr-profile-id", od.Id}


I don't love this way of setting metadata on the runner - it feels very indirect. It would feel much cleaner to me if we called a server api here-ish to create a new "pending" runner record of type odr, then just start the runner with the predetermined ID and no other flags. Then, when the runner registered itself to the server, it would just set its state from "pending" to "active", or something.

It also feels a little weird from a security perspective - fresh runners have the opportunity to lie about their profile id and odr-ness if we rely on them to tell the server.

If and when it's time to add the next runner flag like this, I think we should revisit this pattern.
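
As a rough sketch of the alternative described in this comment: the server records a "pending" runner before launching it, and the runner later activates that record using only its ID. Everything here (`registry`, `runnerRecord`, `expect`, `activate`, the state names) is hypothetical and not part of Waypoint; it only illustrates why the runner would no longer need to report its own kind or profile ID.

```go
package main

import (
	"errors"
	"fmt"
)

// runnerState is a hypothetical lifecycle state for a runner record.
type runnerState int

const (
	statePending runnerState = iota
	stateActive
)

// runnerRecord holds metadata the server decided on before launch, so the
// runner never has to (and never gets to) claim it for itself.
type runnerRecord struct {
	kind      string
	profileID string
	state     runnerState
}

// registry is an in-memory stand-in for server-side runner storage.
// It is not safe for concurrent use; a real server would need locking.
type registry struct {
	records map[string]*runnerRecord
}

func newRegistry() *registry {
	return &registry{records: map[string]*runnerRecord{}}
}

var errUnknownRunner = errors.New("no pending runner with that id")

// expect is called by the server just before it launches an ODR runner
// with a predetermined ID.
func (r *registry) expect(id, profileID string) {
	r.records[id] = &runnerRecord{kind: "odr", profileID: profileID, state: statePending}
}

// activate is called when the runner registers. The runner supplies only
// its ID; kind and profile come from the record the server already wrote.
func (r *registry) activate(id string) (*runnerRecord, error) {
	rec, ok := r.records[id]
	if !ok || rec.state != statePending {
		return nil, errUnknownRunner
	}
	rec.state = stateActive
	return rec, nil
}

func main() {
	reg := newRegistry()
	reg.expect("runner-1", "profile-1")

	rec, err := reg.activate("runner-1")
	fmt.Println(rec.kind, rec.profileID, err)

	// A second activation of the same ID is rejected.
	_, err = reg.activate("runner-1")
	fmt.Println(err)
}
```

In this shape the only flag the runner needs is its ID, which addresses both the indirection and the concern about runners misreporting their own metadata.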

We need to add some sort of runner "adoption" or "activation" step for some future work we have planned for exactly the reason you stated. Ideally: runners can relatively easily ATTEMPT to register with the server, but a human (or machine via more privileged API) has to "adopt" the runner to activate it. I think we can do this using some sort of certificate exchange mechanism (so if the runner restarts or something it can prove its adoption for a period of time). I have various thoughts here.

For now, I think this is okay.

internal/server/proto/server.proto

izaaklauer · 2021-12-03T17:43:11Z

internal/cli/base.go

@@ -471,12 +471,7 @@ func remoteOpPreferred(ctx context.Context, client pb.WaypointClient, project *p
 	}
 	hasRemoteRunner := false
 	for _, runner := range runnersResp.Runners {
-		if !runner.Odr {


This is a great side-effect of this change! We can be more confident now about deciding if a remote op is likely to work.

krantzinator

Code paths look generally good! I can merge this next week when you're out if you'd like to wait for @evanphx's feedback.

internal/cli/runner_agent.go

krantzinator · 2021-12-07T22:14:07Z

internal/client/job.go

+				if err != nil {
+					c.UI.Output("Failed to inspect the runner (id %q) assigned for this operation: %s", assignedRunner.Id, err, terminal.WithErrorStyle())
+				}
+				switch runnerType := runner.Kind.(type) {


I love this because it's like self-documentation of the different runners in our runner system.

mitchellh

One critical thing, the rest are just things to think about. Looks wonderful. Great job!

mitchellh · 2021-12-08T00:25:37Z

internal/cli/base.go

-			// that as a remote runner, and this will return a false positive.
-
-			// Also note that this is designed to run before se start our own CLI runner.
+		if _, ok := runner.Kind.(*pb.Runner_Remote_); ok {


I've always done v, ok := ... ; ok && v != nil. What you did is PROBABLY FINE, but talking out loud I've always wondered if this is totally fine haha. You can keep it how it is, just thinking out loud.

Oh ha, interesting. I assume v == nil when runner.Kind has never been set. I think it's OK here because nil will still bypass this block.

Well, you can do runner.Kind = (*pb.Runner_Remote_)(nil) to make runner.Kind != nil but once you cast it is == nil. I can’t think of any time this would happen without someone acting weird though. (This is the interface vs typed nil)
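
The interface-versus-typed-nil distinction discussed in this thread can be reproduced in isolation. The sketch below uses hypothetical stand-ins (`Kind`, `Remote`) rather than the generated `pb.Runner_Remote_` type, and compares the two assertion styles mentioned above.

```go
package main

import "fmt"

// Kind and Remote are hypothetical stand-ins for the generated oneof
// interface and its wrapper type.
type Kind interface{ isKind() }

type Remote struct{}

func (*Remote) isKind() {}

// isRemoteLoose mirrors the `_, ok := x.(*T); ok` form used in the diff.
func isRemoteLoose(k Kind) bool {
	_, ok := k.(*Remote)
	return ok
}

// isRemoteStrict mirrors the `v, ok := x.(*T); ok && v != nil` form.
func isRemoteStrict(k Kind) bool {
	v, ok := k.(*Remote)
	return ok && v != nil
}

func main() {
	var unset Kind                     // nil interface: no dynamic type, no value
	var typedNil Kind = (*Remote)(nil) // non-nil interface holding a nil pointer
	var set Kind = &Remote{}           // non-nil interface holding a real pointer

	fmt.Println(unset == nil, isRemoteLoose(unset), isRemoteStrict(unset))
	// prints: true false false
	fmt.Println(typedNil == nil, isRemoteLoose(typedNil), isRemoteStrict(typedNil))
	// prints: false true false
	fmt.Println(set == nil, isRemoteLoose(set), isRemoteStrict(set))
	// prints: false true true
}
```

The two forms only disagree in the typed-nil case, which requires someone to explicitly assign a nil pointer of the wrapper type to the field.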

internal/client/job.go

internal/server/proto/server.proto

@briancain

Good idea @briancain!

We don't expect users to ever set this when they use the `runner agent` command, even if they build their own custom images. It's only used by us internally.

briancain

Looks good to me! 👍🏻

krantzinator · 2021-12-13T16:38:54Z

Per discussion with Izaak last week, I'm going to merge this in since it has all the necessary approvals and tests are passing! 🎉

github-actions bot added core plugin plugin/docker labels Dec 3, 2021

izaaklauer commented Dec 3, 2021

izaaklauer force-pushed the store-runner-location branch from 304e14b to 52c1d5d Compare December 3, 2021 17:06

vercel bot temporarily deployed to Preview December 3, 2021 17:06 Inactive

github-actions bot removed plugin/docker plugin labels Dec 3, 2021

vercel bot temporarily deployed to Preview December 3, 2021 17:40 Inactive

izaaklauer added the core/cli label Dec 3, 2021

izaaklauer commented Dec 3, 2021

izaaklauer force-pushed the store-runner-location branch from 940ad21 to efce813 Compare December 3, 2021 18:16

vercel bot temporarily deployed to Preview December 3, 2021 18:16 Inactive

vercel bot temporarily deployed to Preview December 3, 2021 18:18 Inactive

izaaklauer marked this pull request as ready for review December 3, 2021 18:20

izaaklauer changed the title ~~DRAFT: display the runner type during CLI operations~~ Display the runner type during CLI operations Dec 3, 2021

izaaklauer requested a review from a team December 3, 2021 18:20

vercel bot temporarily deployed to Preview December 3, 2021 18:22 Inactive

vercel bot temporarily deployed to Preview December 3, 2021 18:25 Inactive

vercel bot deployed to Preview December 3, 2021 18:27 View deployment

github-actions bot added the website label Dec 3, 2021

izaaklauer requested a review from evanphx December 7, 2021 16:19

izaaklauer mentioned this pull request Dec 7, 2021

Improved project/remoteness selection UX #2704

Closed

11 tasks

krantzinator approved these changes Dec 7, 2021

mitchellh suggested changes Dec 8, 2021

vercel bot temporarily deployed to Preview December 8, 2021 02:06 Inactive

izaaklauer requested a review from mitchellh December 8, 2021 02:08

izaaklauer force-pushed the store-runner-location branch from 5c8d341 to a6a53c0 Compare December 8, 2021 02:13

vercel bot temporarily deployed to Preview December 8, 2021 02:13 Inactive

izaaklauer force-pushed the store-runner-location branch from a6a53c0 to 14e2c04 Compare December 8, 2021 02:37

vercel bot temporarily deployed to Preview December 8, 2021 02:37 Inactive

github-actions bot added plugin plugin/aws ui labels Dec 8, 2021

izaaklauer added 3 commits December 7, 2021 21:42

Logging the runner type during operations

5f3261d

Renaming proto runner.type -> runner.kind

e0b3568

Good idea @briancain!

changelog

3fab33b

izaaklauer force-pushed the store-runner-location branch from 14e2c04 to 5948abc Compare December 8, 2021 02:45

vercel bot temporarily deployed to Preview December 8, 2021 02:45 Inactive

github-actions bot removed plugin ui plugin/aws labels Dec 8, 2021

izaaklauer added 7 commits December 7, 2021 21:50

fmt

8e84fb8

Hiding the runner agent -odr flag

5f12d35

We don't expect users to ever set this when they use the `runner agent` command, even if they build their own custom images. It's only used by us internally.

More runner comments

90d788b

Updating tests

72f3b7a

website-mdx (removing the -odr flag from docs)

974af03

Using the ui argument rather than the stored one

24af3be

Reodering protos, adding comments to deprecated field

8b2c504

izaaklauer force-pushed the store-runner-location branch from 5948abc to 8b2c504 Compare December 8, 2021 02:51

vercel bot temporarily deployed to Preview December 8, 2021 02:51 Inactive

briancain approved these changes Dec 8, 2021

izaaklauer removed the request for review from evanphx December 10, 2021 13:38

mitchellh approved these changes Dec 10, 2021

krantzinator added this to the 0.7.0 milestone Dec 13, 2021

krantzinator merged commit a9db39e into main Dec 13, 2021

krantzinator deleted the store-runner-location branch December 13, 2021 16:40
