Agent Observability Blog Post #3209

Merged
merged 25 commits into from
Jul 27, 2024
198efaa
update markdown hyperlinks to stable urls
areibman Jun 21, 2024
7afe9d9
update notebook images and text
areibman Jun 21, 2024
354474c
re-write observability section
areibman Jun 21, 2024
8485c30
Updated section
areibman Jun 21, 2024
46e625b
update wording
areibman Jun 21, 2024
0e45cfd
added newline
areibman Jun 21, 2024
695e74e
update styling in image tags to be jsx compatible
areibman Jun 21, 2024
8ec73d4
added text
areibman Jun 21, 2024
8d63347
update link
areibman Jun 21, 2024
0551e73
simplified text
bboynton97 Jun 21, 2024
4ddda11
Merge pull request #2 from AgentOps-AI/readme_updates_agentops
areibman Jun 21, 2024
edba9de
Merge branch 'main' into main
areibman Jun 21, 2024
33b91cb
Merge branch 'microsoft:main' into main
areibman Jul 23, 2024
09c4190
created blog
areibman Jul 23, 2024
08b28a5
replace flow images with fewer shadows
areibman Jul 24, 2024
f877d03
reformat line
areibman Jul 24, 2024
b30010a
add authors
areibman Jul 25, 2024
4c989e7
updated discord link and direct paths to image URLS
areibman Jul 25, 2024
164355f
removed images since they are not stored in the AgentOps github
areibman Jul 25, 2024
842e5f0
remove trailing whitespaces
areibman Jul 25, 2024
7d67e89
removed newline
areibman Jul 25, 2024
0dd0a41
removed whitespace
areibman Jul 25, 2024
a6dfb12
Update website/blog/2024-07-25-AgentOps/index.mdx
areibman Jul 27, 2024
3186fed
single quotes with double quotes
areibman Jul 27, 2024
ccb77c8
export OPENAI_API_KEY="[REDACTED]…
areibman Jul 27, 2024
70 changes: 70 additions & 0 deletions website/blog/2024-07-25-AgentOps/index.mdx
@@ -0,0 +1,70 @@
---
title: AgentOps, the Best Tool for AutoGen Agent Observability
authors:
- areibman
- bboynton97
tags: [LLM,Agent,Observability,AutoGen,AgentOps]
---

# AgentOps, the Best Tool for AutoGen Agent Observability
<img src="https://github.com/AgentOps-AI/agentops/blob/main/docs/images/external/autogen/autogen-integration.png?raw=true" alt="AgentOps and AutoGen" style={{ maxWidth: "30%" }} />

## TL;DR
* AutoGen® offers detailed multi-agent observability with AgentOps.
* AgentOps offers the best experience for developers building with AutoGen in just two lines of code.
* Enterprises can now trust AutoGen in production with detailed monitoring and logging from AgentOps.

AutoGen is excited to announce an integration with AgentOps, the industry leader in agent observability and compliance. Back in February, [Bloomberg declared 2024 the year of AI Agents](https://www.bloomberg.com/news/newsletters/2024-02-15/tech-companies-bet-the-world-is-ready-for-ai-agents). And it's true! We've seen AI transform from simplistic chatbots into agents that autonomously make decisions and complete tasks on a user's behalf.

However, as with most new technologies, companies and engineering teams can be slow to develop processes and best practices. One part of the agent workflow we're betting on is observability. Letting your agents run wild might work for a hobby project, but if you're building enterprise-grade agents for production, it's crucial to understand where your agents are succeeding and failing. Observability isn't just an option; it's a requirement.

As agents evolve into even more powerful and complex tools, you should increasingly view them as a way to augment your team's capabilities. Agents will take on more prominent roles and responsibilities, take action, and provide immense value. However, this means you must monitor your agents the same way a good manager maintains visibility over their personnel. AgentOps offers developers observability for debugging and detecting failures, and it lets you monitor your agents' key metrics in one easy-to-read dashboard. Monitoring is more than just a “nice to have”; it's a critical component for any team looking to build and scale AI agents.

## What is Agent Observability?

Agent observability, in its most basic form, lets you monitor, troubleshoot, and explain what your agent does while it runs. Observing every detail of your agent's activity, right down to a timestamp, lets you trace its actions precisely, identify areas for improvement, and understand the reasons behind any failures, all of which are key to effective debugging. Beyond enhancing diagnostic precision, this level of observability is integral to your system's reliability. Think of it as the ability to identify and address issues before they spiral out of control. Observability isn't just about keeping things running smoothly and maximizing uptime; it's about strengthening your agent-based solutions.
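
To make the idea of a timestamped trace concrete, here is a minimal sketch in plain Python. The `Event` and `Trace` names are hypothetical and exist only for illustration; they are not part of AgentOps or AutoGen, which record this kind of data for you.

```python
import time
from dataclasses import dataclass, field


@dataclass
class Event:
    """One recorded agent action (hypothetical structure for illustration)."""
    agent: str
    action: str
    ok: bool
    timestamp: float = field(default_factory=time.time)


class Trace:
    """Hypothetical in-memory event log; not an AgentOps or AutoGen class."""

    def __init__(self):
        self.events = []

    def record(self, agent, action, ok=True):
        # Store the action together with the time it was recorded.
        event = Event(agent=agent, action=action, ok=ok)
        self.events.append(event)
        return event

    def failures(self):
        # Return only the events that were marked as failed.
        return [e for e in self.events if not e.ok]


trace = Trace()
trace.record("planner", "draft plan")
trace.record("coder", "run tests", ok=False)
trace.record("planner", "revise plan")

failed = trace.failures()
```

With every action stored alongside its timestamp, answering "what failed, and when?" becomes a simple filter over the trace rather than a search through terminal output.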

<img src="https://github.com/AgentOps-AI/agentops/blob/main/docs/images/external/autogen/flow.png?raw=true" alt="AI agent observability" style={{ maxWidth: "30%" }} />

## Why AgentOps?

AutoGen has simplified the process of building agents, yet we recognized the need for an easy-to-use, native tool for observability. We've previously discussed AgentOps, and now we're excited to partner with AgentOps as our official agent observability tool. Integrating AgentOps with AutoGen simplifies your workflow and gives you clear visibility into how your agents behave, so you can find and fix whatever is holding them back. For more details, check out our [AgentOps documentation](https://microsoft.github.io/autogen/docs/notebooks/agentchat_agentops/).

<img src="https://github.com/AgentOps-AI/agentops/blob/main/docs/images/external/autogen/session-replay.png?raw=true" alt="Agent Session Replay" style={{ maxWidth: "40%" }} />

Enterprises and enthusiasts trust AutoGen as the leader in building agents. With our partnership with AgentOps, developers can now natively debug agents for efficiency and ensure compliance, with a comprehensive audit trail of all of your agents' activities. AgentOps allows you to monitor LLM calls, costs, latency, agent failures, multi-agent interactions, tool usage, session-wide statistics, and more, all from one dashboard.

By combining the agent-building capabilities of AutoGen with the observability tools of AgentOps, we're providing our users with a comprehensive solution that enhances agent performance and reliability. This collaboration means enterprises can confidently deploy AI agents in production environments, knowing they have the best tools to monitor, debug, and optimize their agents.

The best part is that it only takes two lines of code. All you need to do is set an `AGENTOPS_API_KEY` in your environment ([get an API key here](https://app.agentops.ai/account)) and call `agentops.init()`:
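```python
import os
import agentops

# Read the key from the environment (raises KeyError if AGENTOPS_API_KEY is unset)
# and initialize AgentOps with it.
agentops.init(os.environ["AGENTOPS_API_KEY"])
```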
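
If the environment variable is missing, a direct `os.environ[...]` lookup fails with a bare `KeyError`. As a small optional sketch, a helper like the one below can give a clearer message. `require_api_key` is a hypothetical name introduced here for illustration and is not part of the AgentOps SDK.

```python
import os


def require_api_key(env=os.environ, name="AGENTOPS_API_KEY"):
    """Return the key stored under `name`, or raise a clear error if it is unset.

    Hypothetical helper for illustration; not provided by AgentOps.
    """
    key = env.get(name, "").strip()
    if not key:
        raise RuntimeError(f"{name} is not set; export it before calling agentops.init()")
    return key


# Intended usage (assumes the agentops package is installed):
# import agentops
# agentops.init(require_api_key())
```

Accepting the environment mapping as a parameter keeps the helper easy to test without touching real environment variables.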

## AgentOps's Features

AgentOps includes all the functionality you need to ensure your agents are suitable for real-world, scalable solutions.

<img src="https://github.com/AgentOps-AI/agentops/blob/main/docs/images/external/autogen/dashboard.png?raw=true" alt="AgentOps overview dashboard" style={{ maxWidth: "40%" }} />

* **Analytics Dashboard:** The AgentOps Analytics Dashboard allows you to configure and assign agents and automatically track the actions each agent takes, even when several agents run at once. When used with AutoGen, AgentOps is automatically configured for multi-agent compatibility, allowing users to track multiple agents across runs easily. Instead of scrolling through terminal output, you get an intuitive visual interface.
* **Tracking LLM Costs:** Cost tracking is natively set up within AgentOps and provides a rolling total. This allows developers to see and track their run costs and accurately predict future costs.
* **Recursive Thought Detection:** One of the most frustrating aspects of agents is when they get trapped and perform the same task repeatedly for hours on end. AgentOps can identify when agents fall into infinite loops, ensuring efficiency and preventing wasteful computation.
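
To illustrate the ideas behind a rolling cost total and loop detection, here is a simplified sketch. It is not how AgentOps implements these features: `RollingCost`, `LoopDetector`, the window size, and the per-token prices are all assumptions made up for this example.

```python
from collections import deque


class RollingCost:
    """Hypothetical running total of LLM spend across calls."""

    def __init__(self):
        self.total = 0.0

    def add_call(self, prompt_tokens, completion_tokens, prompt_price, completion_price):
        # Prices are USD per 1,000 tokens; the values used below are placeholders.
        cost = (prompt_tokens * prompt_price + completion_tokens * completion_price) / 1000
        self.total += cost
        return self.total


class LoopDetector:
    """Hypothetical detector that flags an agent repeating the same action."""

    def __init__(self, window=3):
        self.window = window
        self.recent = deque(maxlen=window)

    def observe(self, action):
        # True once the last `window` actions are all identical.
        self.recent.append(action)
        return len(self.recent) == self.window and len(set(self.recent)) == 1


costs = RollingCost()
costs.add_call(1000, 500, prompt_price=0.5, completion_price=1.5)

detector = LoopDetector(window=3)
flags = [detector.observe(a) for a in ["search", "search", "search", "summarize"]]
```

Here the third consecutive `"search"` trips the detector, and the flag clears as soon as the agent does something different. A production system would likely compare richer signals than exact action names.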

AutoGen users also have access to the following features in AgentOps:

* **Replay Analytics:** Watch step-by-step agent execution graphs.
* **Custom Reporting:** Create custom analytics on agent performance.
* **Public Model Testing:** Test your agents against benchmarks and leaderboards.
* **Custom Tests:** Run your agents against domain-specific tests.
* **Compliance and Security:** Create audit logs and detect potential threats, such as profanity and leaks of Personally Identifiable Information.
* **Prompt Injection Detection:** Identify potential code injection and secret leaks.
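
As a rough illustration of what flagging Personally Identifiable Information in an audit log can look like, consider the sketch below. The patterns and the `find_pii` and `audit_entry` helpers are hypothetical and deliberately simplistic; they do not reflect how AgentOps performs detection.

```python
import re

# Deliberately simple example patterns; real PII detection needs far more care.
PII_PATTERNS = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def find_pii(text):
    """Return a sorted list of the pattern names that match `text`."""
    return sorted(name for name, pattern in PII_PATTERNS.items() if pattern.search(text))


def audit_entry(agent, message):
    """Build one audit-log record, flagging any PII categories found."""
    return {"agent": agent, "message": message, "pii": find_pii(message)}


entry = audit_entry("support_bot", "Contact me at jane@example.com, SSN 123-45-6789.")
clean = audit_entry("support_bot", "Your ticket has been closed.")
```

The first record is flagged for both categories, while the second has an empty `pii` list and passes through untouched.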

## Conclusion

By integrating AgentOps into AutoGen, we've given you everything you need to build production-grade agents, improve them, and track their performance to ensure they're doing exactly what you need them to do. Without observability, you're operating blindly, unable to tell where your agents are succeeding or failing. AgentOps provides the tools needed to monitor, debug, and optimize your agents for enterprise-level performance. It offers everything developers need to scale their AI solutions, from cost tracking to recursive thought detection.

Did you find this note helpful? Would you like to share your thoughts, use cases, and findings? Please join our observability channel in the [AutoGen Discord](https://discord.gg/hXJknP54EH).
12 changes: 12 additions & 0 deletions website/blog/authors.yml
@@ -140,3 +140,15 @@ marklysze:
  title: AI Freelancer
  url: https://github.com/marklysze
  image_url: https://github.com/marklysze.png

areibman:
  name: Alex Reibman
  title: Co-founder/CEO at AgentOps
  url: https://github.com/areibman
  image_url: https://github.com/areibman.png

bboynton97:
  name: Braelyn Boynton
  title: AI Engineer at AgentOps
  url: https://github.com/bboynton97
  image_url: https://github.com/bboynton97.png