[POC] RCS 2.0 - fulfilling cluster handles cross cluster requests #92089

n1v0lg · 2022-12-05T11:00:18Z

A proof of concept for fulfilling cluster handling of cross cluster requests under the new security model. This is not meant to merge, but rather a point of departure to validate the high level approach. I will split parts of this PR out into smaller, polished PRs.

This PR handles detecting requests to which the new security model applies (via the transport port/profile), and authentication and authorization for these.

Feedback points

The implementation details are not polished but there are several implementation choices I'm looking for feedback on (other observations & comments are of course also welcome):

Detecting RCS 2.0 requests based on the associated transport profile when we initialize transport filters inside the transport interceptor, we can detect if we're dealing with the new remote cluster profile and pass a boolean flag to ServerTransportFilter to toggle its behavior around authentication. @ywangd we had briefly discussed using the port instead, however, after our conversation I realized that we can instead run this check at transport filter initiation. It feels more natural to rely on the profile name here instead of the port. Open to switching this around still, ofc.

Toggling RCS 2.0 behavior inside ServerTransportFilter this looks like a natural place for toggling authentication behavior -- let me know if you disagree.

RemoteAccessAuthenticator is stand-alone and separate from AuthenticatorChain I don't think there is a use case for remote access authentication to be part of the authenticator chain, and the logic is different enough to completely split it out into its own class.

High level authentication model is not finalized, but I want to make sure we agree on the high level structure: we will have a new authentication type, and store role descriptor bytes (both for the remote access key, and querying-cluster-side role descriptors) in authc metadata, which will in turn be used during role building.

High level approach to role building again, the code is not cleaned up but the high level gist is to build role references from the role descriptor bytes we've stored in the authc metadata.

Things that are missing

As mentioned on Slack, sniff mode doesn't work since requests still go through the regular transport port instead of the remote port in some cases. How we handle this is TBD; I want to keep the focus primarily on proxy mode since that's what we've identified as "MVP".

Scroll, PIT, and async_search probably don't work, but I don't see any big hurdles in making them work. Skipping to keep the scope down.

Clear and correct error messages: currently, the code fails where it should but the error messages are not cleaned up. A bigger, specific example of this is when we have a run-as request on the QC; the error message construction in this case is not correct. This is a somewhat complex and specific scenario so it may be better to leave handling it out of this PR, and deal with it separately.

No custom audit logging. That's out of scope for this PR.

Handling around TLS being optional on the remote cluster port. I did not get to looking into this yet but it seems minor enough, so I didn't want to block an initial round of feedback because of it.

n1v0lg · 2023-01-20T11:37:45Z

Cheers @ywangd!

But instead of a boolean flag, I'd make it an enum. I was even considering a subclass of ServerTransportFilter. But that might be overkill for now.

Yup, the boolean flag was just the simplest way to make it work for the POC. We'll find the best option in the context of a stand-alone PR.

I like your suggestion around re-using AuthenticatorChain. I've pushed a tentative implementation of this. The added functionality is a single method inside AuthenticationService -- in the "real" PR I will likely refactor parts of it into its own class(es). I think the exact structure is something we can iterate on outside of this POC.

What we have in the remote cluster credentials is a generic authorization field which has the potential to support authentication other than API key. The above composition can allow this to happen more easily. This is just a hunch. But PKI authentication could be a potential future enhancement.

I agree that your proposed structure lends itself to supporting other credentials in the future, however, I'm not sure this is on the horizon, or immediately feasible. API keys aren't just authentication credentials, they also allow us to specify privileges, and that is pretty central to the RCS 2.0 design. Off the top of my head, I don't see how this would work for e.g., PKI. Either way, I'm in favor of the structure you proposed since it's cleaner and allows for more code re-use.

Agreed also on your suggestions around the authentication model changes. I've pushed these as well. I think for finer details it's also best to iterate in the context of a separate PR.

...n/core/src/main/java/org/elasticsearch/xpack/core/security/authz/permission/LimitedRole.java

n1v0lg · 2023-01-20T11:54:42Z

...gin/security/src/main/java/org/elasticsearch/xpack/security/authc/AuthenticationService.java

+        final boolean allowAnonymous,
+        final ActionListener<Authentication> authenticationListener
+    ) {
+        if (false == (threadContext.getHeader(AuthenticationField.AUTHENTICATION_KEY) == null)) {


All this is just a rough draft -- will polish in a stand-alone PR if we're happy with the overall approach.

ywangd · 2023-01-24T03:12:25Z

...gin/security/src/main/java/org/elasticsearch/xpack/security/authc/AuthenticationService.java

+        authenticatorChain.authenticateAsync(context, ActionListener.wrap(authentication -> {
+            final RemoteAccessAuthentication remoteAccessAuthentication = RemoteAccessAuthentication.readFromContext(threadContext);
+            final Map<String, String> existingRequestHeaders = threadContext.getRequestHeadersOnly();
+            try (ThreadContext.StoredContext ignored = threadContext.stashContext()) {
+                // drop authentication and remote access authentication headers
+                existingRequestHeaders.forEach((k, v) -> {
+                    if (false == Set.of(
+                        AuthenticationField.AUTHENTICATION_KEY,
+                        SecurityServerTransportInterceptor.REMOTE_ACCESS_CLUSTER_CREDENTIAL_HEADER_KEY,
+                        RemoteAccessAuthentication.REMOTE_ACCESS_AUTHENTICATION_HEADER_KEY
+                    ).contains(k)) {
+                        threadContext.putHeader(k, v);
+                    }
+                });


I think we should use ContextPreservingActionListener here so that original context is restored once authentication finishes with authenticatorChain so that we can write the final authentication object, as oppose to nest another threadContext inside which has the risk that the old authentication may be mis-used if downstream code pops the nested threadContext.

I'll give this a shot; the problem is that ideally we also want to remove both remote access headers from the context, once we've read them -- I think it's a nice invariant to maintain that we either have _remote_access_authentication or _xpack_security_authentication in thread context but not both. https://github.com/elastic/elasticsearch/blob/main/x-pack/plugin/security/src/main/java/org/elasticsearch/xpack/security/transport/SecurityServerTransportInterceptor.java#L172 we for instance have an assertion on this, which will trip if we don't explicitly clear the remote access headers.

This is all doable, just requires a little more fiddling.

ywangd · 2023-01-24T04:07:04Z

API keys aren't just authentication credentials, they also allow us to specify privileges, and that is pretty central to the RCS 2.0 design. Off the top of my head, I don't see how this would work for e.g., PKI.

Any of the authentication mechansims can allow users to specifiy privileges. API keys just allow them to be specified directly on creation. But the same effect can be achieved with things like username/password and named roles. These roles will then be intersected with the remote index privileges bytes. API keys are not any special in this process. I think it is chosen because it is our default machine-to-machine communication mechanism but that's not a technical constraint. If we have decided to support specialized API keys, it may make a difference here. But we are not prioritizing the work.

It is possible that once the feature is released, users would ask for a more "centrally managed credentials" in the place of the remote access API keys. PKI is our earliest support for "centrally managed credentials" and I just used it as an example. But the main idea is about "centrally managed credentials" as opposed to local to a cluster (API keys).

I do realise all these are distant guesses. Just jotting down as thinking exercises.

This PR implements the necessary changes to the Authentication class, to support remote access authentication under the new remote cluster security model. Upon successful authentication, a new authentication instance will be constructed by the fulfilling cluster which combines information from the remote access API key used and the user authentication and role info sent by the querying cluster with a cross cluster request. Remote access authentication is modeled in way that exposes (and assumes) that the underlying authentication method is an API key; for example, it includes the metadata associated with API keys in its metadata directly, re-using existing metadata field keys. I chose this approach instead of trying to generalize away from API keys because there are no medium-term plans to support any other authentication forms for remote access; generalizing would have made the change more complex. This change is stand-alone and not wired up to active code flows yet. A proof of concept in #92089 highlights how the model change in this PR fits into the broader context of the fulfilling cluster processing cross cluster requests.

This PR adds support for building roles for remote_access authentication instances, under the new remote cluster security model. This change is stand-alone and not wired up to active code flows yet. A proof of concept in #92089 highlights how the model change in this PR fits into the broader context of the fulfilling cluster processing cross cluster requests.

This PR adds support for building roles for remote_access authentication instances, under the new remote cluster security model. This change is stand-alone and not wired up to active code flows yet. A proof of concept in elastic#92089 highlights how the model change in this PR fits into the broader context of the fulfilling cluster processing cross cluster requests.

n1v0lg · 2023-02-09T14:48:56Z

Closing. This is all in main now.

n1v0lg added 24 commits November 22, 2022 17:00

Clean up

c2abab4

Checkstyle

27a8679

Add assertion

67df73b

Merge branch 'main' into remote-access-authentication-header

ed6dd51

Clean up and test parse bytes

d4f399d

Visibility

b39c861

Merge branch 'main' into remote-access-authentication-header

0d1bb46

Nit

5b53b3d

Typo

8d0d67e

Merge branch 'main' into remote-access-authentication-header

7c6b127

Merge branch 'main' into remote-access-authentication-header

567d652

List instead of collection

33b9eac

Merge branch 'main' into remote-access-authentication-header

bf17175

WIP send requests with remote access headers

ceb631d

Merge branch 'main' into send-remote-access-headers

36bfadc

Merge branch 'main' into remote-access-authentication-header

461fab2

WIP remote access authenticator

9928890

WIP role building

c88a776

Subject authc type

64d0ff4

WIP role references

0b1c04c

Hacky but it works

97d6dca

Lint

badff5c

Merge branch 'main' into remote-access-authentication-header

cd691b5

Refactor access header

3646d27

n1v0lg added >non-issue :Security/Authentication Logging in, Usernames/passwords, Realms (Native/LDAP/AD/SAML/PKI/etc) :Security/Authorization Roles, Privileges, DLS/FLS, RBAC/ABAC labels Dec 5, 2022

n1v0lg self-assigned this Dec 5, 2022

elasticsearchmachine added the v8.7.0 label Dec 5, 2022

Not a supplier

2491e98

Merge

97b0681

n1v0lg commented Jan 20, 2023

View reviewed changes

...n/core/src/main/java/org/elasticsearch/xpack/core/security/authz/permission/LimitedRole.java Outdated Show resolved Hide resolved

Remove other test

0adb15e

n1v0lg commented Jan 20, 2023

View reviewed changes

n1v0lg added 2 commits January 20, 2023 13:03

Clean up

10252c0

Nit

7bfa646

n1v0lg requested a review from ywangd January 23, 2023 08:40

n1v0lg added 2 commits January 23, 2023 16:05

Merge branch 'main' into poc/remote-access-authorization

5738221

Merge branch 'main' into poc/remote-access-authorization

83e1fda

n1v0lg mentioned this pull request Jan 23, 2023

Authentication model changes for remote access #93151

Merged

ywangd reviewed Jan 24, 2023

View reviewed changes

n1v0lg added 2 commits January 26, 2023 17:17

Merge branch 'main' into poc/remote-access-authorization

a535210

Fixes

3fc93e5

n1v0lg mentioned this pull request Jan 30, 2023

Build role for remote access authentication #93316

Merged

n1v0lg added 4 commits January 31, 2023 11:40

Merge branch 'main' into poc/remote-access-authorization

a358ffa

Merge and clean up and better encapsulate remote access authc

fea0ee0

More clean up

6be3ca9

Dont need api key service as field

33d3353

rjernst added v8.8.0 and removed v8.7.0 labels Feb 8, 2023

n1v0lg closed this Feb 9, 2023

n1v0lg deleted the poc/remote-access-authorization branch February 9, 2023 14:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[POC] RCS 2.0 - fulfilling cluster handles cross cluster requests #92089

[POC] RCS 2.0 - fulfilling cluster handles cross cluster requests #92089

Uh oh!

n1v0lg commented Dec 5, 2022 •

edited

Loading

Uh oh!

n1v0lg commented Jan 20, 2023

Uh oh!

Uh oh!

n1v0lg Jan 20, 2023

Uh oh!

ywangd Jan 24, 2023

Uh oh!

n1v0lg Jan 31, 2023 •

edited

Loading

Uh oh!

ywangd commented Jan 24, 2023

Uh oh!

n1v0lg commented Feb 9, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[POC] RCS 2.0 - fulfilling cluster handles cross cluster requests #92089

[POC] RCS 2.0 - fulfilling cluster handles cross cluster requests #92089

Uh oh!

Conversation

n1v0lg commented Dec 5, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Feedback points

Things that are missing

Uh oh!

n1v0lg commented Jan 20, 2023

Uh oh!

Uh oh!

n1v0lg Jan 20, 2023

Choose a reason for hiding this comment

Uh oh!

ywangd Jan 24, 2023

Choose a reason for hiding this comment

Uh oh!

n1v0lg Jan 31, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ywangd commented Jan 24, 2023

Uh oh!

n1v0lg commented Feb 9, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

n1v0lg commented Dec 5, 2022 •

edited

Loading

n1v0lg Jan 31, 2023 •

edited

Loading