Skip to main content
C

/cortex-eval

CortexML & AI

Evaluates a deployed model or LLM integration for performance issues: checks for accuracy degradation against a reference dataset, data distribution drift that may explain behavior changes, latency regression compared to baseline, and cost increases from token usage changes. Produces a health report with recommended actions.

Install

This skill

Install Cortex Eval

1. Add to marketplace

$ claude plugin marketplace add tonone-ai/tonone

2. Install this skill

$ claude plugin install cortex-eval@tonone-ai

The agent

Install Cortex

1. Add to marketplace

$ claude plugin marketplace add tonone-ai/tonone

2. Install Cortex

$ claude plugin install cortex@tonone-ai

Want all 31 agents across both teams?

Full installation guide

Invoke this skill

Command|$ /cortex-eval

When to use

When model outputs seem worse after a retraining run or model version upgrade. When users are reporting worse AI feature quality without a clear cause. On a regular evaluation cadence, monthly or quarterly, after models have been in production long enough to drift.

Deep dive

AI Model Evaluation and Drift Check

Models decay silently. /cortex-eval checks accuracy regression against reference data, distribution drift, latency baseline, and token cost shifts.

Read the article

More from Cortex

All Cortex skills

Ready to use Cortex Eval?