🌊 LLM-powered parsing suggestions by flash1293 · Pull Request #208777 · elastic/kibana

flash1293 · 2025-01-29T15:22:08Z

Depends on #209985

Add suggestions for grok processing:

The logic for generating suggestions works like this:

Take the current sample
Split it into patterns based on a simple regex-based grouping that replaces runs of numbers with a placeholder, runs of letters with a placeholder, etc.
For the top 5 groups found, pass a couple of messages to the LLM in parallel to come up with a grok pattern
Check whether the grok patterns actually match something and don't break
Report the patterns that have a positive match rate
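
The grouping and filtering steps above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: all function and type names are hypothetical, the placeholder characters are arbitrary, the LLM call is left out entirely, and a plain `RegExp` stands in for real grok evaluation.

```typescript
// Illustrative sketch only: every name here is hypothetical, not taken from the PR.

/** Collapse a message into a coarse shape: digit runs become "0", letter runs become "a". */
function toShape(message: string): string {
  return message
    .replace(/[0-9]+/g, '0')
    .replace(/[A-Za-z]+/g, 'a')
    .replace(/\s+/g, ' ')
    .trim();
}

/** Bucket messages by shape so that similarly structured lines end up together. */
function groupByShape(messages: string[]): Map<string, string[]> {
  const groups = new Map<string, string[]>();
  for (const message of messages) {
    const shape = toShape(message);
    const bucket = groups.get(shape) ?? [];
    bucket.push(message);
    groups.set(shape, bucket);
  }
  return groups;
}

/** Return the `limit` largest groups, biggest first. */
function topGroups(groups: Map<string, string[]>, limit = 5): Array<[string, string[]]> {
  return Array.from(groups.entries())
    .sort((a, b) => b[1].length - a[1].length)
    .slice(0, limit);
}

/** Stand-in for a real grok evaluation; may throw for a broken pattern. */
type Matcher = (pattern: string, message: string) => boolean;

interface Suggestion {
  pattern: string;
  matchRate: number; // fraction of samples matched
}

/** Keep candidates that evaluate without throwing and match at least one sample. */
function keepMatchingPatterns(
  candidates: string[],
  samples: string[],
  matches: Matcher
): Suggestion[] {
  const kept: Suggestion[] = [];
  for (const pattern of candidates) {
    try {
      const matched = samples.filter((sample) => matches(pattern, sample)).length;
      const matchRate = samples.length === 0 ? 0 : matched / samples.length;
      if (matchRate > 0) kept.push({ pattern, matchRate });
    } catch {
      // The pattern "broke" during evaluation, so it is dropped.
    }
  }
  return kept;
}

// Usage with a plain RegExp standing in for grok (an assumption made for the demo).
const sampleMessages = [
  '2025-01-29 INFO user 42 logged in',
  '2025-01-30 WARN user 7 logged out',
  'GET /index.html 200',
];
const largestGroups = topGroups(groupByShape(sampleMessages));
const regexMatcher: Matcher = (pattern, message) => new RegExp(pattern).test(message);
const suggestions = keepMatchingPatterns(
  ['^\\d{4}-\\d{2}-\\d{2} \\w+ user \\d+', '(', '^no match$'],
  largestGroups[0][1],
  regexMatcher
);
console.log(largestGroups[0][0], suggestions);
```

In this sketch the two timestamped lines share one shape and form the largest group. Of the three candidates, the malformed one throws and is dropped, the one that matches nothing is dropped, and only the first is reported.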

For the `Generate patterns` button to show in the UI, make sure a connector is configured and the license level is above basic (a trial license is the easiest to test with).
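
As a rough illustration of that visibility condition, the check could look something like the sketch below. The function name, the `connectorCount` parameter, and the ordering of license levels are assumptions made for the example; none of them come from this PR.

```typescript
// Hypothetical sketch: names and the license ordering below are assumptions.
const LICENSE_ORDER = ['basic', 'standard', 'gold', 'platinum', 'enterprise', 'trial'] as const;
type LicenseType = (typeof LICENSE_ORDER)[number];

/** The button shows only with at least one connector and a license above basic. */
function canShowGeneratePatterns(opts: { connectorCount: number; license: LicenseType }): boolean {
  const aboveBasic = LICENSE_ORDER.indexOf(opts.license) > LICENSE_ORDER.indexOf('basic');
  return opts.connectorCount > 0 && aboveBasic;
}

console.log(canShowGeneratePatterns({ connectorCount: 1, license: 'trial' }));
console.log(canShowGeneratePatterns({ connectorCount: 1, license: 'basic' }));
```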

I did some light refactoring on the processing routes, moving the simulation bits into a separate file - no changes in this area though.

flash1293 · 2025-02-18T16:44:40Z

@tonyghiani I think I addressed all points raised except for advertising the AI feature when it's not enabled. I'm going to add that in a follow-up PR.

This is how the screen looks when no suggestions could be found:

flash1293 · 2025-02-18T16:45:18Z

You probably want to hold off on another round of review until #209985 is merged (I pulled it in here, so the diffs are mixed now).

flash1293 · 2025-02-19T15:37:36Z

@tonyghiani should be rebased with main - there are two things missing that I will address in a follow-up:

Unit tests for the suggestions handler
CTA if no LLM connector is available, but the user has permissions to configure it

tonyghiani

Looks good. There are some client-side parts that will probably change with the state management refactor, but I'll handle that once I rebase this work into my WIP changes.

Agreed on having some API tests for the suggestions: there is a lot of logic going on there, and a test safety net seems very necessary.

elasticmachine · 2025-02-19T16:53:29Z

💚 Build Succeeded

Buildkite Build
Commit: 6b3ce93
Kibana Serverless Image: docker.elastic.co/kibana-ci/kibana-serverless:pr-208777-6b3ce93ddc78

Metrics [docs]

Module Count

Fewer modules lead to a faster build time

id	before	after	diff
`streamsApp`	308	310	+2

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run `node scripts/build_api_docs --plugin [yourplugin] --stats comments` for more detailed information.

id	before	after	diff
`@kbn/streams-schema`	266	268	+2

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id	before	after	diff
`streamsApp`	287.4KB	293.1KB	+5.7KB

Unknown metric groups

API count

id	before	after	diff
`@kbn/streams-schema`	269	271	+2

History

💔 Build #277730 failed 44cba3c
💔 Build #277364 failed 288dc3f
💔 Build #277327 failed 1740676
💛 Build #275066 was flaky 6659dfc
💔 Build #275042 failed 996573e

kibanamachine · 2025-02-20T07:44:28Z

Starting backport for target branches: 8.x

https://github.com/elastic/kibana/actions/runs/13430306734

kibanamachine · 2025-02-20T07:49:51Z

💔 All backports failed

Status	Branch	Result
❌	8.x	Backport failed because of merge conflicts. You might need to backport the following PRs to 8.x: [streams] lifecycle - ingestion and total docs metadata (#210301)

Manual backport

To create the backport manually run:

`node scripts/backport --pr 208777`

Questions ?

Please refer to the Backport tool documentation

# Backport

This will backport the following commits from `main` to `8.x`:

- 🌊 LLM-powered parsing suggestions (#208777)

---------

Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>

This PR takes care of two follow-ups related to the LLM integration:

* Shows CTA if AI assistant can be configured, but isn't (see #208777 (comment))

<img width="505" alt="Screenshot 2025-02-24 at 11 24 30" src="https://github.com/user-attachments/assets/da01e782-6b02-4ec4-91ab-b46009b41e29" />

* Adds some tests

---------

Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>

Depends on elastic#209985

Add suggestions for grok processing:

<img width="594" alt="Screenshot 2025-02-05 at 10 31 27" src="https://github.com/user-attachments/assets/4b717681-aa7d-4952-a4e0-9013d9b8aaf8" />

The logic for generating suggestions works like this:

* Take the current sample
* Split it into patterns based on a simple regex-based grouping that replaces runs of numbers with a placeholder, runs of letters with a placeholder, etc.
* For the top 5 groups found, pass a couple of messages to the LLM in parallel to come up with a grok pattern
* Check whether the grok patterns actually match something and don't break
* Report the patterns that have a positive match rate

For the `Generate patterns` button to show in the UI, make sure a connector is configured and the license level is above basic (trial license is easiest to test with).

I did some light refactoring on the processing routes, moving the simulation bits into a separate file - no changes in this area though.

---------

Co-authored-by: Marco Antonio Ghiani <marcoantonio.ghiani01@gmail.com>
Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>
Co-authored-by: Jean-Louis Leysens <jloleysens@gmail.com>

tonyghiani and others added 24 commits January 23, 2025 12:28

feat(streams): wip enrichment redesign

a79e858

feat(streams): wip redesign

e6650ee

refactor(streams): update copies

6a58780

Merge branch 'main' into 93-update-ui-processing

cb10b5d

refactor(streams): allow text ellipsis

eeb67a7

refactor(streams): reset forms on cancel

e18dd93

refactor(streams): update internal forms structure and typing

4ec0f7d

refactor(streams): update internal state management to track processors status

470a22f

refactor(streams): update discard changes modal

8962b15

refactor(streams): update dissect processor typing

bd81230

refactor(streams): minor changes

c260449

refactor(streams): update sampling condition

7c1c4f0

Merge branch '93-update-ui-processing' of github.com:tonyghiani/kibana into tonyghiani-93-update-ui-processing

78a09a4

Merge branch 'tonyghiani-93-update-ui-processing' into 93-update-ui-processing

731a125

Merge branch 'main' into 93-update-ui-processing

c19f57d

refactor(streams): improvements to simulation

d7fbb25

refactor(streams): update columns rendering for unmatched docs

224c7df

refactor(streams): wip simulation table

39d6a62

refactor(streams): wip simulation table style

10c513f

Merge branch 'main' into 93-update-ui-processing

12e38a1

feat(streams): wip data preview

0040628

start suggestions page

23b80da

refactor(streams): minor cleanup

96549e2

refactor(streams): minor changes

20d849a

flash1293 changed the title ~~wip~~ POC: LLM-powered parsing suggestions Jan 29, 2025

tonyghiani added 4 commits January 30, 2025 11:47

refactor(streams): update live processors udpates

0df5843

refactor(streams): remove import

e48537f

refactor(streams): remove unused props

340d238

fix(streams): disable simulation on existing processors

81e38b9

flash1293 added the Feature:Streams label Jan 30, 2025

flash1293 and others added 6 commits February 18, 2025 18:04

reset errors

288dc3f

refactor(streams): wip new stream enrichment hook

0ef51d2

Merge branch 'main' into 102-refactor-state-management

35edb90

refactor(streams): update usage to state machine

8ff8453

Merge branch 'main' into 102-refactor-state-management

705e53a

Merge remote-tracking branch 'tonyghiani/102-refactor-state-management' into flash1293/llm-parsing-suggestions

c1f3895

flash1293 requested a review from a team as a code owner February 19, 2025 14:51

flash1293 and others added 6 commits February 19, 2025 15:52

Merge remote-tracking branch 'upstream/main' into flash1293/llm-parsing-suggestions

0cade54

Merge remote-tracking branch 'upstream/main' into flash1293/llm-parsing-suggestions

ef7edbc

[CI] Auto-commit changed files from 'node scripts/styled_components_mapping'

44cba3c

revert draft changes

c9d307a

Merge branch 'flash1293/llm-parsing-suggestions' of github.com:flash1293/kibana into flash1293/llm-parsing-suggestions

3a96ea4

fine tuning

b89dad2

flash1293 removed the request for review from a team February 19, 2025 15:36

[CI] Auto-commit changed files from 'node scripts/styled_components_mapping'

6b3ce93

tonyghiani approved these changes Feb 19, 2025

flash1293 merged commit 1f35d7a into elastic:main Feb 20, 2025

flash1293 mentioned this pull request Feb 20, 2025

[8.x] 🌊 LLM-powered parsing suggestions #211869

Merged

flash1293 mentioned this pull request Feb 24, 2025

🌊 LLM integration follow-ups #212208

Merged

flash1293 merged 158 commits into elastic:main from flash1293:flash1293/llm-parsing-suggestions

8 participants
