/cortex-eval
Evaluates a deployed model or LLM integration for performance issues. It checks for accuracy degradation against a reference dataset, data distribution drift that may explain behavior changes, latency regression against a baseline, and cost increases from changes in token usage. It then produces a health report with recommended actions.
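To make the four checks concrete, here is a minimal Python sketch of how such a health report could be computed. It is an illustration only, not how Cortex Eval is implemented: the `Snapshot` type, the `health_report` function, the use of the Population Stability Index for drift, and every threshold value are assumptions chosen for the example.

```python
import math
from dataclasses import dataclass
from statistics import mean
from typing import Dict, List


@dataclass
class Snapshot:
    """Hypothetical container for one evaluation run (baseline or current)."""
    correct: List[bool]          # per-example correctness on the reference dataset
    latencies_ms: List[float]    # per-request latency
    tokens_per_call: List[int]   # per-request token usage, a proxy for cost
    feature_values: List[float]  # one numeric input feature, used for the drift check


def psi(expected: List[float], actual: List[float], bins: int = 10) -> float:
    """Population Stability Index between two numeric samples."""
    lo = min(min(expected), min(actual))
    hi = max(max(expected), max(actual))
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 when all values are equal

    def fractions(xs: List[float]) -> List[float]:
        counts = [0] * bins
        for x in xs:
            counts[min(int((x - lo) / width), bins - 1)] += 1
        # Floor each fraction so that log() never sees zero.
        return [max(c / len(xs), 1e-6) for c in counts]

    e, a = fractions(expected), fractions(actual)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))


def p95(xs: List[float]) -> float:
    """95th percentile using the nearest-rank method."""
    s = sorted(xs)
    return s[max(0, math.ceil(0.95 * len(s)) - 1)]


def health_report(baseline: Snapshot, current: Snapshot) -> Dict[str, dict]:
    """Compare the current run to the baseline; thresholds are assumed values."""
    acc_drop = mean(baseline.correct) - mean(current.correct)
    drift = psi(baseline.feature_values, current.feature_values)
    latency_ratio = p95(current.latencies_ms) / p95(baseline.latencies_ms)
    token_ratio = mean(current.tokens_per_call) / mean(baseline.tokens_per_call)
    return {
        "accuracy": {"drop": acc_drop, "flagged": acc_drop > 0.02},
        "drift": {"psi": drift, "flagged": drift > 0.2},
        "latency": {"p95_ratio": latency_ratio, "flagged": latency_ratio > 1.20},
        "cost": {"token_ratio": token_ratio, "flagged": token_ratio > 1.15},
    }
```

Each entry in the returned dictionary pairs a measured value with a flag, so a report can list only the flagged checks and attach a recommended action to each. In practice the thresholds would be tuned per model and per feature.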
Install
This skill
Install Cortex Eval
1. Add to marketplace
2. Install this skill
The agent
Install Cortex
1. Add to marketplace
2. Install Cortex
Want all 31 agents across both teams?
Full installation guide
Invoke this skill
When to use
Use this skill when model outputs seem worse after a retraining run or a model version upgrade, when users report degraded AI feature quality without a clear cause, or on a regular evaluation cadence (monthly or quarterly) once models have been in production long enough to drift.