Back to articles

Evaluate LLM and agent quality in Dynatrace AI Observability with dt-evals

Dynatrace news

AI AI Observability AI agents Evaluations LLM-as-judge Observability Product news Smartscape

Evaluate LLM and agent quality in Dynatrace AI Observability with dt-evals

By Kristof Muhi

June 12, 2026

81 views

Summary

`dt-evals` is an open-source CLI tool designed to evaluate the quality and safety of LLMs and agents by analyzing real GenAI traces using a "LLM-as-a-judge" approach. By integrating these evaluation scores directly into Dynatrace AI Observability, the tool enables teams to connect AI performance metrics with operational data for more effective debugging, trend analysis, and automated alerting.

Read the Original Article

This article originally appeared on Dynatrace news.

Read Full Article on Original Site

Related Articles

Beyond LLM-as-a-judge: Establishing LLM evaluations as a foundation for trustworthy agentic AI systems

Beyond LLM-as-a-judge: Establishing LLM evaluations as a foundation for trustworthy agentic AI systems

Kristof Muhi • Jun 26, 2026 • 6 shared categories

AWS publishes Dynatrace-developed blueprint for secure Amazon Bedrock access at scale

AWS publishes Dynatrace-developed blueprint for secure Amazon Bedrock access at scale

Thomas Natschläeger • Nov 19, 2025 • 5 shared categories

Announcing Amazon Bedrock AgentCore Agent Observability

Announcing Amazon Bedrock AgentCore Agent Observability

Kristof Muhi • Nov 18, 2025 • 5 shared categories

Dynatrace Release Radar 06.26

Dynatrace Release Radar 06.26

Michael Winkler • Jul 10, 2026 • 4 shared categories

AI agents are redefining software development—but they’re flying blind without observability

AI agents are redefining software development—but they’re flying blind without observability

Bernd Greifeneder • May 29, 2026 • 4 shared categories

Popular from Dynatrace news

1

dtctl: The Dynatrace observability CLI that’s built for AI agents and humans

dtctl: The Dynatrace observability CLI that’s built for AI agents and humans

Christoph Neumüller • Mar 24, 2026 • 328 views

2

OneAgent release notes version 1.335

OneAgent release notes version 1.335

Malcolm Davidson • Apr 7, 2026 • 192 views

3

What’s new in Dynatrace SaaS version 1.338

What’s new in Dynatrace SaaS version 1.338

Malcolm Davidson • May 5, 2026 • 182 views

4

Bring real-time production insights into Claude Code with the Dynatrace MCP Server

Bring real-time production insights into Claude Code with the Dynatrace MCP Server

Milan Steskal • Mar 31, 2026 • 177 views

5

Dynatrace to acquire Bindplane to bring control to the telemetry lifecycle

Dynatrace to acquire Bindplane to bring control to the telemetry lifecycle

Steve Tack • Apr 9, 2026 • 174 views