[8.0] Support multiple endpoints #966

michalpristas · 2021-12-07T15:51:37Z

What is the problem this PR solves?

What this PR solves is a problem when agent got unenrolled on heavier load when agent managing fleet server cannot checkin to it's own server so it will fallback to unenroll.
Closes #741

How does this PR solve the problem?

Problem is solved by adding internal endpoint which is used for communication on local network (with agent handling fleet server)
It lets FS to spin up 2 set of handlers, one on public 8220 and one on port defined in config.

How to test this PR locally

This needs to be tested with work on elastic-agent Link: elastic/beats#28993

Start stack
Install agent with FS in a policy
Check ports

sh-3.2# lsof -i -P | grep LISTEN | grep fleet
fleet-ser  7056            root   19u  IPv4 0xba7881a9227099a5      0t0    TCP localhost:{random_port} (LISTEN)
fleet-ser  7056            root   21u  IPv6 0xba7881a91284721d      0t0    TCP *:8220 (LISTEN)

run wireshark, set filter to random port, there should be some comm
set filter to 8220 port, there should be no comm
enroll new agent, from another VM
there should be some comm on both ports

Checklist

I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have made corresponding change to the default configuration files
I have added tests that prove my fix is effective or that my feature works
I have added an entry in CHANGELOG.next.asciidoc or CHANGELOG-developer.next.asciidoc.

…ng (elastic#829) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#833) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#842) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#849) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#852) Co-authored-by: apmmachine <[email protected]>

(cherry picked from commit 8a4855b) Co-authored-by: Sean Cunningham <[email protected]>

This was coming out of the debugging session around fleet-server where some of the log messages were not too clear to me on what these mean.

…ticsearch Fleet APIs, remove holes detection and refreshes (elastic#814) (elastic#863) * Switch to the new _fleet/_fleet_search and _fleet/_fleet_msearch Elasticsearch Fleet APIs, remove holes detection and refreshes * Switch to the new _fleet/_fleet_msearch and _fleet/_fleet_search Fleet APIs endpoints for the searches that required refreshes and wait for checkpoints. The new API handles refreshes and checkpoints waits. * Separate queues for _msearch and _fleet_msearch, to avoid delays on searches without checkpoints wait. Use _fleet/_fleet_msearch endpoint if search is requested with wait_for_checkpoints. Use _fleet/_fleet_search for the monitor hits fetch. * Had to copy over the search and msearch wrappers from go-elasticsearch library and customize them for _fleet_search and _fleet_msearch. These could be removed once the library is updated for these new endpoints. * Removed the holes detection and refresh op code as it's not longer used. (cherry picked from commit a2fb073) Co-authored-by: Aleksandr Maus <[email protected]>

…elastic#864) * Do not depend on agent.Id ad that field was not added until 7.15 (cherry picked from commit 6382114) * Migrate agent.id field from 7.14 to 7.15+ (cherry picked from commit aeb4b66) * Handle 404 on .fleet-agent index as a noop during migration. (cherry picked from commit 130056a) Co-authored-by: Sean Cunningham <[email protected]>

…ng (elastic#867) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#870) Co-authored-by: apmmachine <[email protected]>

* Periodic expired actions cleanup * Fix make check * Fix TestConfig unit test * Put back WithRefresh in integration tests actions setup * Switch the actions cleanup to use bulker.MDelete instead of Delete * Improve 404 status handling (cherry picked from commit 6694c08) Co-authored-by: Aleksandr Maus <[email protected]>

…ng (elastic#878) Co-authored-by: apmmachine <[email protected]>

…astic#881) (cherry picked from commit 9d8666e) Co-authored-by: Aleksandr Maus <[email protected]>

* use ecs zerolog lib for logging (cherry picked from commit 6627876) * update checksums (cherry picked from commit 998db6a) * run check on 1.17 (cherry picked from commit 72dccaa) Co-authored-by: bryan <[email protected]>

…ndex (elastic#882) (elastic#889) (cherry picked from commit 3f02142) Co-authored-by: Aleksandr Maus <[email protected]>

…ng (elastic#891) Co-authored-by: apmmachine <[email protected]>

…lastic#895) (cherry picked from commit 7937c63) Co-authored-by: Aleksandr Maus <[email protected]>

* Add default_api_key_history field to the agent schema * Append agent.default_api_key_history on API key change and invalidate the keys on ack (cherry picked from commit dff3595) Co-authored-by: Aleksandr Maus <[email protected]>

…lastic#897) (cherry picked from commit d643e6b) Co-authored-by: Aleksandr Maus <[email protected]>

…ng (elastic#902) Co-authored-by: apmmachine <[email protected]>

Adds support to enable instrumentation via the APM Go agent. New config options have been added to the `Server` input which could be set up in the `fleet-server` integration configuration. The added instrumentation covers the `fleet-server` http server and the Adds support to enable instrumentation via the APM Go agent. New config options have been added to the `Server` input which could be set up in the `fleet-server` integration configuration. The added instrumentation covers the `fleet-server` http server and the `go-elasticsearch` client. A sample of the configuration that's been added (`instrumentation`): ```yaml inputs: - type: fleet-server server: instrumentation: enabled: true hosts: ["localhost:8200"] environment: production secret_token: token api_key: apikey ``` Signed-off-by: Marc Lopez Rubio <[email protected]> (cherry picked from commit ade74c7) Co-authored-by: Marc Lopez Rubio <[email protected]>

…c#906) (elastic#910) * Improve expired actions cleanup, use _delete_by_query instead (cherry picked from commit fae23a3) Co-authored-by: Aleksandr Maus <[email protected]>

…ng (elastic#914) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#918) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#921) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#925) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#933) Co-authored-by: apmmachine <[email protected]>

) * keep trucking on ES availability errors; more tests to come (cherry picked from commit 7fb0138) * don't attempt to distinguish between errors, just keep retrying (cherry picked from commit 2c75552) * move error blackholing up the stack so the monitor will never crash, added additional logging (cherry picked from commit f5fead9) * pr feedback (cherry picked from commit 1886dc5) * upped logging level, properly wrapped errors (cherry picked from commit 97524dc) Co-authored-by: bryan <[email protected]>

…ng (elastic#938) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#945) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#951) Co-authored-by: apmmachine <[email protected]>

…ng (elastic#959) Co-authored-by: apmmachine <[email protected]>

…ic#964) Adds TLS configuration options for the APM instrumentation, using env vars to configure the APM HTTP Tracer since it currently doesn't support setting those values in Golang. We'll follow up on this once the apm tracer has a function to create a new tracer with configurable settings via config struct. Signed-off-by: Marc Lopez Rubio <[email protected]> (cherry picked from commit 155d0e9) Co-authored-by: Marc Lopez Rubio <[email protected]>

Multiple endpoints

mergify · 2021-12-07T15:52:19Z

This pull request is now in conflicts. Could you fix it @michalpristas? 🙏
To fixup this pull request, you can check out it locally. See documentation: https://help.github.com/articles/checking-out-pull-requests-locally/

git fetch upstream
git checkout -b backport_multiple_endpoints-8.0 upstream/backport_multiple_endpoints-8.0
git merge upstream/master
git push upstream backport_multiple_endpoints-8.0

mergify · 2021-12-07T15:52:20Z

This pull request does not have a backport label. Could you fix it @michalpristas? 🙏
To fixup this pull request, you need to add the backport labels for the needed
branches, such as:

backport-v/d./d./d is the label to automatically backport to the 7./d branch. /d is the digit

NOTE: backport-skip has been added to this pull request.

apmmachine and others added 30 commits November 3, 2021 05:28

[Automation] Update elastic stack version to 8.0.0-6b50534b for testi…

2d25807

…ng (elastic#829) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-decb3f1d for testi…

6da3a8f

…ng (elastic#833) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-fc3570a9 for testi…

077556e

…ng (elastic#842) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-be754c25 for testi…

3803706

…ng (elastic#849) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-995d02ee for testi…

73abc5b

…ng (elastic#852) Co-authored-by: apmmachine <[email protected]>

Normalize logging (elastic#858)

944a907

(cherry picked from commit 8a4855b) Co-authored-by: Sean Cunningham <[email protected]>

Improve some of the log message (elastic#844) (elastic#861)

f118d7a

This was coming out of the debugging session around fleet-server where some of the log messages were not too clear to me on what these mean.

[Automation] Update elastic stack version to 8.0.0-683a0b7d for testi…

f2cfd88

…ng (elastic#867) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-31084d02 for testi…

d9036cd

…ng (elastic#870) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-0357b4f0 for testi…

e126a29

…ng (elastic#878) Co-authored-by: apmmachine <[email protected]>

Fix: Fleet Server crashes on expired actions search (elastic#876) (el…

a17a795

…astic#881) (cherry picked from commit 9d8666e) Co-authored-by: Aleksandr Maus <[email protected]>

Add integration test coverage for actions cleanup with non-existing i…

8ff6de9

…ndex (elastic#882) (elastic#889) (cherry picked from commit 3f02142) Co-authored-by: Aleksandr Maus <[email protected]>

[Automation] Update elastic stack version to 8.0.0-83e38099 for testi…

1c252de

…ng (elastic#891) Co-authored-by: apmmachine <[email protected]>

Update go-elasticsearch to the latest 7.15.1 version (elastic#832) (e…

320d1c9

…lastic#895) (cherry picked from commit 7937c63) Co-authored-by: Aleksandr Maus <[email protected]>

API keys cleanup (elastic#884) (elastic#896)

6ea2743

* Add default_api_key_history field to the agent schema * Append agent.default_api_key_history on API key change and invalidate the keys on ack (cherry picked from commit dff3595) Co-authored-by: Aleksandr Maus <[email protected]>

Improve the cleanup code on enrollment handling error (elastic#890) (e…

07e1de9

…lastic#897) (cherry picked from commit d643e6b) Co-authored-by: Aleksandr Maus <[email protected]>

[Automation] Update elastic stack version to 8.0.0-1160a953 for testi…

1900398

…ng (elastic#902) Co-authored-by: apmmachine <[email protected]>

Improve expired actions cleanup, use _delete_by_query instead (elasti…

a5cff0f

…c#906) (elastic#910) * Improve expired actions cleanup, use _delete_by_query instead (cherry picked from commit fae23a3) Co-authored-by: Aleksandr Maus <[email protected]>

[Automation] Update elastic stack version to 8.0.0-28ef013c for testi…

17002b4

…ng (elastic#914) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-ca68b68a for testi…

24c9cc9

…ng (elastic#918) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-89da64e1 for testi…

5e26769

…ng (elastic#921) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-94dc1348 for testi…

fde1cc4

…ng (elastic#925) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-5dc82a0c for testi…

0e8bd66

…ng (elastic#933) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-df873dde for testi…

5ae6754

…ng (elastic#938) Co-authored-by: apmmachine <[email protected]>

apmmachine and others added 5 commits December 2, 2021 05:26

[Automation] Update elastic stack version to 8.0.0-aab6301c for testi…

8420ab1

…ng (elastic#945) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-eda77a0f for testi…

ce7a2a2

…ng (elastic#951) Co-authored-by: apmmachine <[email protected]>

[Automation] Update elastic stack version to 8.0.0-1e314182 for testi…

493e259

…ng (elastic#959) Co-authored-by: apmmachine <[email protected]>

Merge pull request elastic#880 from michalpristas/multiple-endpoints

0379b9d

Multiple endpoints

michalpristas self-assigned this Dec 7, 2021

michalpristas closed this Dec 7, 2021

mergify bot added the backport-skip Skip notification from the automated backport with mergify label Dec 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[8.0] Support multiple endpoints #966

[8.0] Support multiple endpoints #966

Uh oh!

michalpristas commented Dec 7, 2021

Uh oh!

mergify bot commented Dec 7, 2021

Uh oh!

mergify bot commented Dec 7, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[8.0] Support multiple endpoints #966

[8.0] Support multiple endpoints #966

Uh oh!

Conversation

michalpristas commented Dec 7, 2021

What is the problem this PR solves?

How does this PR solve the problem?

How to test this PR locally

Checklist

Uh oh!

mergify bot commented Dec 7, 2021

Uh oh!

mergify bot commented Dec 7, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants