Evaluate LLM and agent quality in Dynatrace AI Observability with dt-evals
Dynatrace news


Summary

`dt-evals` is an open-source CLI tool that evaluates the quality and safety of LLMs and agents by analyzing real GenAI traces with an "LLM-as-a-judge" approach. Because the evaluation scores are integrated directly into Dynatrace AI Observability, teams can correlate AI quality scores with operational data for more effective debugging, trend analysis, and automated alerting.
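To make the "LLM-as-a-judge" idea concrete, here is a minimal, generic sketch in Python. It is not `dt-evals` code: the `Trace` shape, the judge prompt, the 0.0 to 1.0 scale, and the `fake_judge` stand-in are all assumptions made for illustration, and the tool's actual prompts, scoring, and output format may differ.

```python
import json
import re
from dataclasses import dataclass
from typing import Callable


@dataclass
class Trace:
    """Hypothetical, simplified GenAI trace: one prompt and one completion."""
    trace_id: str
    prompt: str
    completion: str


# Assumed judge prompt; doubled braces are literal braces for str.format.
JUDGE_TEMPLATE = (
    "You are grading an AI assistant's answer.\n"
    "Question: {prompt}\n"
    "Answer: {completion}\n"
    'Reply with JSON only: {{"score": <number 0.0-1.0>, "reason": "<short reason>"}}'
)


def judge_trace(trace: Trace, call_judge: Callable[[str], str]) -> dict:
    """Ask a judge model to score one trace and return a structured result."""
    reply = call_judge(
        JUDGE_TEMPLATE.format(prompt=trace.prompt, completion=trace.completion)
    )
    # Judges often wrap JSON in extra text, so extract the outermost object.
    match = re.search(r"\{.*\}", reply, re.DOTALL)
    if match is None:
        raise ValueError(f"judge reply contained no JSON: {reply!r}")
    verdict = json.loads(match.group(0))
    # Clamp the score so a misbehaving judge cannot leave the assumed range.
    score = min(1.0, max(0.0, float(verdict["score"])))
    return {
        "trace_id": trace.trace_id,
        "score": score,
        "reason": verdict.get("reason", ""),
    }


def fake_judge(prompt: str) -> str:
    """Stand-in for a real model call so the sketch runs offline."""
    return 'Sure. {"score": 0.9, "reason": "Answer is correct and concise."}'


result = judge_trace(Trace("t-1", "What is 2 + 2?", "4"), fake_judge)
print(result)
```

In a real setup, `call_judge` would wrap a call to whichever model acts as the judge, and each result would be attached to its trace so the score can be queried next to latency, cost, and error data.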
This article originally appeared on Dynatrace news.
