
Conversation

@summerscope (Contributor) commented on Sep 5, 2025:

This isn't precious - just trying to get something into the Logfire docs to start with, so feel free to slash / edit as you see fit. Mostly I just wanted to make sure people go to the Pydantic AI docs to read about Evals properly.

@cloudflare-workers-and-pages (bot) commented on Sep 5, 2025:

Deploying logfire-docs with Cloudflare Pages

Latest commit: 6bf3134
Status: ✅ Deploy successful!
Preview URL: https://a1fc6e0f.logfire-docs.pages.dev
Branch Preview URL: https://laura-evals-docs.logfire-docs.pages.dev

@summerscope marked this pull request as ready for review on September 10, 2025 at 08:58
@summerscope changed the title from "Evals docs" to "Evals docs in Logfire (points to Pydantic AI)" on Sep 10, 2025

!!! note "Code-First Evaluation"

    Evals are created and run using the [Pydantic Evals](https://ai.pydantic.dev/evals/) a sub-package of Pydantic AI. Logfire serves as a read-only observability layer where you can view and compare results.

A contributor suggested rewording this to:

> Evals are created and run using the [Pydantic Evals](https://ai.pydantic.dev/evals/) package, which is developed in tandem with Pydantic AI. Logfire serves as an observability layer where you can view and compare results.

@@ -0,0 +1,91 @@
# Evals (beta)

View and analyze your evaluation results in Pydantic Logfire's web interface. Evals provides observability into how your AI systems perform across different test cases and experiments over time.
A contributor suggested a grammar fix ("Evals provides" → "Evals provide"):

> View and analyze your evaluation results in Pydantic Logfire's web interface. Evals provide observability into how your AI systems perform across different test cases and experiments over time.


## What are Evals?

Evals help you systematically test and evaluate AI systems by running them against predefined test cases. Each evaluation experiment appears in Logfire automatically when you run the Pydantic Evals package with Logfire integration enabled.
A contributor suggested pointing at the specific method rather than the package:

> Evals help you systematically test and evaluate AI systems by running them against predefined test cases. Each evaluation experiment appears in Logfire automatically when you run the `pydantic_evals.Dataset.evaluate` method with Logfire integration enabled.
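
For context, a minimal sketch of the kind of run this sentence describes might look like the following. The `answer_question` task and the single test case are illustrative placeholders; the `Case`/`Dataset`/`evaluate_sync` usage follows the Pydantic Evals docs, and calling `logfire.configure()` first is assumed to be what "Logfire integration enabled" means here.

```python
import logfire
from pydantic_evals import Case, Dataset

# Assumption: configuring Logfire before running the evaluation is what sends
# the experiment to Logfire.
logfire.configure()


async def answer_question(question: str) -> str:
    # Placeholder task under evaluation -- swap in a call to your AI system.
    return 'Paris' if 'France' in question else "I don't know"


dataset = Dataset(
    cases=[
        Case(
            name='capital_of_france',
            inputs='What is the capital of France?',
            expected_output='Paris',
        ),
    ],
)

# Runs every case against the task; with Logfire configured, the run should be
# recorded as an experiment you can view and compare in the web interface.
report = dataset.evaluate_sync(answer_question)
report.print()
```

Per the doc text above, the resulting experiment would then appear in Logfire's web interface alongside earlier runs for comparison.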
