cortx provides two essential utilities to help you track and optimize your agents post-deployment:
- Agent Activity – For tracking real-time agent usage across workflows.
- Agent Evaluation – For analyzing the quality and reliability of agent outputs over time.
Where to Find These Tools
- Agent Activity: Left Nav > Agents > Agent Catalog > Agent Activity
- Agent Evaluation: Left Nav > Agents > Agent Catalog > Select Agent > Performance Tab > Evaluate
Agent Activity
The Agent Activity tab shows a detailed log of each agent’s interactions, including tasks triggered, who triggered them, and their current status.
How to Monitor Agent Activity
- Go to the Agents section from the left nav.
- Click Agent Catalog.

- Open the Agent Activity tab.
- Review the table that displays:
- Agent ID & Name
- Activity Description
- Specific Action/Command performed
- Timestamp (Modified On)
- Initiator (Modified By)
- Execution Status (Completed, Pending, Failed)

Use Cases
- Ensure agents are actively being used and triggered correctly.
- Investigate failed or pending tasks.
- Monitor frequency of use by different users.
- Audit behaviour for debugging or reporting.
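The use cases above can also be worked through outside the UI once the table rows are in hand. The following is a minimal Python sketch, assuming the rows have been copied by hand into a list of dictionaries. The field names (`agent`, `action`, `modified_by`, `status`) and the sample values are hypothetical; they mirror the table columns and are not a documented cortx export format.

```python
from collections import Counter

# Hypothetical rows copied from the Agent Activity table.
# Field names mirror the table columns; this is not a cortx export format.
rows = [
    {"agent": "Invoice Agent", "action": "extract_fields", "modified_by": "alice", "status": "Completed"},
    {"agent": "Invoice Agent", "action": "extract_fields", "modified_by": "bob", "status": "Failed"},
    {"agent": "Support Agent", "action": "draft_reply", "modified_by": "alice", "status": "Pending"},
    {"agent": "Support Agent", "action": "draft_reply", "modified_by": "alice", "status": "Completed"},
]


def needs_attention(rows):
    """Return the rows whose execution status is Failed or Pending."""
    return [r for r in rows if r["status"] in {"Failed", "Pending"}]


def usage_by_user(rows):
    """Count how many activities each initiator (Modified By) triggered."""
    return Counter(r["modified_by"] for r in rows)


flagged = needs_attention(rows)
usage = usage_by_user(rows)

for r in flagged:
    print(f'{r["agent"]}: {r["action"]} is {r["status"]} (triggered by {r["modified_by"]})')
print(dict(usage))
```

Splitting the two questions into separate functions keeps the "what needs investigating" list independent of the "who uses the agents" count, so either can be reused in a report.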
Agent Evaluation
Use Agent Evaluation to check how effective and consistent an agent's output is over time, using the performance metrics charted on its Performance tab.
How to Evaluate an Agent
- Go to Agents > Agent Catalog.
- Click on the Agent you wish to evaluate.

- In the agent detail screen, switch to the Performance tab.
- Expand the Self Evaluation section and click the Evaluate button.

- Review the charts for the following metrics:
- Relevance Over Time
- Completeness Over Time
- Approval Rate
- Hallucination Rate
- Tool Accuracy Rate
How to Use This Data
- Analyze spikes or drops in performance.
- Identify root causes for poor ratings (e.g., hallucinations or low completeness).
- Refine prompts, tool configurations, or logic rules based on trend insights.
- Use results to iterate and improve your agents continuously.
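Spotting spikes or drops by eye works for a single chart; with several agents a simple rule can help. The sketch below is one possible approach, not how cortx computes or flags anything: it compares each data point with the mean of the preceding points and flags large deviations. The function name `flag_changes`, the `window` and `threshold` parameters, and the sample scores (assumed to be on a 0 to 1 scale) are all hypothetical.

```python
from statistics import mean


def flag_changes(values, window=3, threshold=0.15):
    """Return the indices where a value differs from the mean of the
    previous `window` values by more than `threshold` (absolute difference)."""
    flagged = []
    for i in range(window, len(values)):
        baseline = mean(values[i - window:i])
        if abs(values[i] - baseline) > threshold:
            flagged.append(i)
    return flagged


# Hypothetical weekly relevance scores read off the Relevance Over Time chart.
relevance = [0.82, 0.80, 0.84, 0.81, 0.60, 0.79, 0.83]

print(flag_changes(relevance))
```

With these sample values only the fifth point (index 4) is flagged, which is the kind of drop worth tracing back to a prompt or tool configuration change made around that time.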
Best Practices
- Monitor Agent Activity weekly for task-level diagnostics.
- Run evaluations after each major update, or monthly if there have been no updates, to maintain output quality.
- Use the evaluation data to inform training loops, refine agent scope, or disable underperforming agents.
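The evaluation cadence above can be expressed as a small check. This is a minimal sketch under stated assumptions: "monthly" is approximated as 30 days, and the function name `evaluation_due` and its parameters are hypothetical rather than part of cortx.

```python
from datetime import date, timedelta


def evaluation_due(last_evaluated, today, last_major_update=None, max_age_days=30):
    """Return True if the agent should be re-evaluated.

    An evaluation is due when a major update happened after the last
    evaluation, or when the last evaluation is older than `max_age_days`
    (30 days stands in for "monthly" here, which is an assumption).
    """
    if last_major_update is not None and last_major_update > last_evaluated:
        return True
    return (today - last_evaluated) > timedelta(days=max_age_days)


# Evaluated 19 days ago, no updates since: not due yet.
print(evaluation_due(date(2024, 1, 1), date(2024, 1, 20)))
# A major update landed after the last evaluation: due.
print(evaluation_due(date(2024, 1, 1), date(2024, 1, 20), last_major_update=date(2024, 1, 10)))
```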