## Summary
This PR introduces the new experimental "Streams" plugin into the Kibana
project. The Streams project aims to simplify workflows around dealing
with messy logs in Elasticsearch. Our current offering is either
extremely opinionated (integrations) or leaves the user alone with the
high flexibility of Elasticsearch concepts like index templates and
component templates, which makes it challenging to configure everything
correctly for good performance and to control search speed and cost.
### Scope of PR
- Provides an API for the user to "enable" the streams framework which
creates the "root" entity `logs` with all the backing Elasticsearch
assets
- Provides an API for the user to "fork" a stream
- Provides an API for the user to "read" a stream and all of it's
Elasticsearch assets.
- Provides an API for the user to upsert a stream (and, implicitly, any
child streams it mentions)
- Part of this API is configuring grok and dissect processing steps, as
well as adding fields to the mapping
- Implements the Stream Naming Schema (SNS) which uses dots to express
the index patterns and stream IDs. Example: `logs.nginx.errors`
- The APIs will fully manage the `index_template`, `component_template`,
and `ingest_pipelines`.
### Out of scope
- Integration tests (coming in a follow-up)
### Reviewer Notes
- I haven't implemented tests beyond a unit test for converting the
filter conditions to Painless. I wanted to get a PR up so we can start
iterating on the interface and functionality before we invest in
testing.
- You might need to add `server.versioned.versionResolution: oldest` to
your `config/kibana.dev.yml` to play with the requests below in the
Kibana "Dev console" (see the snippet below).
### Example API Calls
Enable the root stream (and set the mapping for the internal `.streams`
index)
```
POST kbn:/api/streams/_enable
```
Read the root entity "logs"
```
GET kbn:/api/streams/logs
```
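The response returns the stored stream definition; an illustrative shape
(assumed here, based on the stream objects used in the requests below):
```
{
  "id": "logs",
  "children": [],
  "processing": [],
  "fields": []
}
```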
Fork the "root" entity "logs" and create "logs.nginx" based on a
condition
```
POST kbn:/api/streams/logs/_fork
{
  "stream": {
    "id": "logs.nginx",
    "children": [],
    "processing": [],
    "fields": []
  },
  "condition": {
    "field": "log.logger",
    "operator": "eq",
    "value": "nginx_proxy"
  }
}
```
Fork the entity "logs.nginx" and create "logs.nginx.errors" based on a
condition
```
POST kbn:/api/streams/logs.nginx/_fork
{
  "stream": {
    "id": "logs.nginx.errors",
    "children": [],
    "processing": [],
    "fields": []
  },
  "condition": {
    "or": [
      { "field": "log.level", "operator": "eq", "value": "error" },
      { "field": "log.level", "operator": "eq", "value": "ERROR" }
    ]
  }
}
```
Set some processing on a stream and map the generated field
```
PUT kbn:/api/streams/logs.nginx
{
  "children": [],
  "processing": [
    { "config": { "type": "grok", "patterns": ["^%{IP:ip} - -"], "field": "message" } }
  ],
  "fields": [
    { "name": "ip", "type": "ip" }
  ]
}
```
Field definitions are checked against both descendants and ancestors for
incompatibilities to ensure they stay additive.
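For illustration (a hypothetical request, assuming `logs.nginx` maps
`ip` as type `ip` as in the example above), a `PUT` like this on a
descendant should be rejected because it redefines the field with an
incompatible type:
```
PUT kbn:/api/streams/logs.nginx.errors
{
  "children": [],
  "processing": [],
  "fields": [
    { "name": "ip", "type": "keyword" }
  ]
}
```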
If children are defined in the `PUT /api/streams/<name>` API,
sub-streams are created implicitly (see the sketch below). If a stream
is `PUT`, it is also added to its parent with a condition that is never
true, which can be edited subsequently.
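A sketch of implicit child creation; the shape of the `children`
entries here is an assumption extrapolated from the fork API above:
```
PUT kbn:/api/streams/logs.nginx
{
  "children": [
    {
      "id": "logs.nginx.errors",
      "condition": { "field": "log.level", "operator": "eq", "value": "error" }
    }
  ],
  "processing": [],
  "fields": []
}
```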
`POST /api/streams/_resync` can be used to re-sync all streams from
their metadata in case the Elasticsearch objects were changed by some
external process; I'm not sure whether we want to keep that.
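For example:
```
POST kbn:/api/streams/_resync
```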
### Follow-ups
- API integration tests
- Check read permissions on data streams to determine whether a user is
allowed to read certain streams
---------
Co-authored-by: Joe Reuter <johannes.reuter@elastic.co>
Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>
(cherry picked from commit b86dc81)
# Conflicts:
# .github/CODEOWNERS
### Backport
This will backport the following commits from `main` to `8.x`:

### Questions?
Please refer to the Backport tool documentation.