add evaluation #782

VladimirKadlec · 2025-11-11T10:27:01Z

Description

Add Evaluation slides

Type of change

Related Tickets & Documents

Related Issue #LCORE-872

Summary by CodeRabbit

Documentation
- Expanded evaluation guidance with clearer goals (performance, UX, bias/harm, ethics/legal compliance)
- Added framework overview with evaluation capabilities including multi-framework support and metrics
- Included comprehensive configuration examples demonstrating schema details and setup/cleanup workflows

openshift-ci · 2025-11-11T10:27:04Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

coderabbitai · 2025-11-11T10:27:14Z

Walkthrough

The change restructures and expands the Evaluation subsection in the lcore documentation. It replaces outdated evaluation-related bullets with new sections covering evaluation goals, benefits, and the Lightspeed Evaluation Framework. A YAML example is expanded with detailed schema.

Changes

Cohort / File(s)	Summary
Documentation content restructuring and expansion `docs/demos/lcore/lcore.md`	Replaced outdated Evaluation subsection bullets with new sections: "Why Evaluate an LLM System?" covering performance/UX/bias/ethics goals; "Benefits of Evaluation" describing improvements and benchmarking; and "Lightspeed Evaluation Framework" with framework description and capabilities. Expanded YAML example with conversation_group schema including setup/cleanup scripts, metrics, metadata, turns, per-turn metric overrides, and expected responses/intents.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Single file modification with homogeneous, repetitive nature of changes (documentation content only)
No executable logic or runtime behavior modifications
Main effort involves verifying documentation accuracy, YAML schema correctness, and consistency of new framework descriptions

Pre-merge checks and finishing touches

❌ Failed checks (1 inconclusive)

Check name	Status	Explanation	Resolution
Title check	❓ Inconclusive	The title 'add evaluation' is vague and generic, failing to convey the specific nature of the changes beyond the general addition of evaluation-related content.	Use a more descriptive title that specifies the scope, such as 'Expand Evaluation section with framework overview and YAML examples' or 'Add Lightspeed Evaluation Framework documentation'.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 18ca0fb and 166b168.

📒 Files selected for processing (1)

docs/demos/lcore/lcore.md (1 hunks)

🧰 Additional context used

🪛 markdownlint-cli2 (0.18.1)

docs/demos/lcore/lcore.md

401-401: Heading style
Expected: atx; Actual: setext

(MD003, heading-style)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (4)

GitHub Check: Konflux kflux-prd-rh02 / lightspeed-stack-on-pull-request
GitHub Check: build-pr
GitHub Check: e2e_tests (ci)
GitHub Check: e2e_tests (azure)

🔇 Additional comments (2)

docs/demos/lcore/lcore.md (2)

379-415: Well-structured evaluation narrative and framework documentation.

The new sections provide clear motivation, tangible benefits, and a comprehensive overview of the framework capabilities. The progression from "Why Evaluate" → "Benefits" → "Framework Features" is logical and pedagogically sound for a presentation slide deck. The inline links and bullet structure enhance readability.

417-448: Excellent YAML example with comprehensive schema documentation.

The expanded YAML example provides clear documentation of the evaluation configuration structure, including setup/cleanup scripts, conversation-level and per-turn metrics, and expected response/intent fields. Inline comments effectively clarify the purpose of each section. This is valuable reference documentation for users.

coderabbitai · 2025-11-11T10:30:43Z

docs/demos/lcore/lcore.md

+<font size="10">[https://github.com/lightspeed-core/lightspeed-evaluation/](https://github.com/lightspeed-core/lightspeed-evaluation)</font>
+---


⚠️ Potential issue | 🟡 Minor

Resolve markdown linting violation and avoid deprecated HTML.

Line 401 uses the deprecated  tag and the --- on line 402 creates a setext-style heading, which violates the markdownlint rule MD003 (heading-style). Modernize this by using an atx-style heading (##).

Apply this diff:

-[https://github.com/lightspeed-core/lightspeed-evaluation/](https://github.com/lightspeed-core/lightspeed-evaluation) ---- +## [Lightspeed Evaluation Framework](https://github.com/lightspeed-core/lightspeed-evaluation/)

This replaces the deprecated  tag with a proper markdown heading, aligns with markdown linting standards, and keeps the GitHub link accessible.

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

[https://github.com/lightspeed-core/lightspeed-evaluation/](https://github.com/lightspeed-core/lightspeed-evaluation)

---

## [Lightspeed Evaluation Framework](https://github.com/lightspeed-core/lightspeed-evaluation/)

🧰 Tools

🪛 markdownlint-cli2 (0.18.1)

401-401: Heading style
Expected: atx; Actual: setext

(MD003, heading-style)

🤖 Prompt for AI Agents

In docs/demos/lcore/lcore.md around lines 401 to 402, the content uses a deprecated HTML tag and a setext-style heading (the '---') which triggers markdownlint MD003; replace the HTML and setext underline with a proper ATX-style markdown heading that contains the GitHub link (e.g., start the line with "##" followed by the link text or the link itself) so the heading is valid markdown and the link remains accessible.

tisnik

LGTM

add evaluation

166b168

VladimirKadlec marked this pull request as ready for review November 11, 2025 10:28

coderabbitai bot reviewed Nov 11, 2025

View reviewed changes

tisnik approved these changes Nov 11, 2025

View reviewed changes

tisnik merged commit 3bc69e5 into lightspeed-core:main Nov 11, 2025
21 of 23 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add evaluation #782

add evaluation #782

Uh oh!

VladimirKadlec commented Nov 11, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

openshift-ci bot commented Nov 11, 2025

Uh oh!

coderabbitai bot commented Nov 11, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Nov 11, 2025

Uh oh!

tisnik left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		<font size="10">[https://github.com/lightspeed-core/lightspeed-evaluation/](https://github.com/lightspeed-core/lightspeed-evaluation)</font>
		---

	<font size="10">[https://github.com/lightspeed-core/lightspeed-evaluation/](https://github.com/lightspeed-core/lightspeed-evaluation)</font>
	---
	## [Lightspeed Evaluation Framework](https://github.com/lightspeed-core/lightspeed-evaluation/)

add evaluation #782

add evaluation #782

Uh oh!

Conversation

VladimirKadlec commented Nov 11, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Related Tickets & Documents

Summary by CodeRabbit

Uh oh!

openshift-ci bot commented Nov 11, 2025

Uh oh!

coderabbitai bot commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Pre-merge checks and finishing touches

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

tisnik left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

VladimirKadlec commented Nov 11, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Nov 11, 2025 •

edited

Loading