• Documentation
    • About ​ValidMind
    • Get Started
    • Guides
    • Support
    • Releases

    • ValidMind Library
    • Python API
    • Public REST API

    • Training Courses
  • Log In
  1. Add agentic metrics from DeepEval as scorers
  • All releases

  • Current releases
  • Feature highlights
  • ValidMind Platform releases
  • ValidMind Library releases
  • Documentation updates

  • Breaking changes and deprecations

  • Older releases
  • April 24, 2025
  • March 7, 2025
  • January 31, 2025
  • 2024 Releases
    • December 24, 2024
    • December 6, 2024
    • October 22, 2024
    • September 25, 2024
    • September 9, 2024
    • August 13, 2024
    • July 22, 2024
    • June 10, 2024
    • May 22, 2024
    • March 27, 2024
    • February 14, 2024
    • January 26, 2024
    • January 18, 2024
  • 2023 Releases
    • December 13, 2023
    • November 9, 2023
    • October 25, 2023
    • September 27, 2023
    • August 15, 2023
    • July 24, 2023
    • June 22, 2023
    • May 30, 2023

Add agentic metrics from DeepEval as scorers

validmind-library
2.11.1
enhancement
Published

January 12, 2026

Four new agentic evaluation metrics from DeepEval are now available to assess your LLM agents:

  • ArgumentCorrectness: Checks if agents generate correct arguments for tool calls.
  • PlanAdherence: Measures whether agents follow their own execution plans.
  • PlanQuality: Evaluates the logical quality, completeness, and efficiency of agent-generated plans.
  • ToolCorrectness: Verifies if agents invoke the right tools with appropriate arguments.

These metrics broaden evaluation coverage across both tool use (actions) and strategic reasoning (plans). They are accessible via validmind.scorers.llm.deepeval and require validmind[llm].

ValidMind Library Python API

  • ValidMind Logo
    ©
    Copyright 2026 ValidMind Inc.
    All Rights Reserved.
    Cookie preferences
    Legal
  • Get started
    • Model development
    • Model validation
    • Setup & admin
  • Guides
    • Access
    • Configuration
    • Model inventory
    • Model documentation
    • Model validation
    • Workflows
    • Reporting
    • Monitoring
    • Attestation
  • Library
    • For developers
    • For validators
    • Code samples
    • Python API
    • Public REST API
  • Training
    • Learning paths
    • Courses
    • Videos
  • Support
    • Troubleshooting
    • FAQ
    • Get help
  • Community
    • GitHub
    • LinkedIn
    • Events
    • Blog
  • Edit this page
  • Report an issue