VERDICT is an independent evaluation index for AI agents and workflow platforms. No vendor sponsorships. No paid certifications. Scores based on public data, incident history, and behavioral testing.
Layer 0 evaluates public documentation only. E (Effectiveness) requires live behavioral testing (Layer 1) and is excluded from Layer 0 scores.
Layer 0 maximum: 85 points. The shape of each radar chart is the platform's trust fingerprint.
| Code | Dimension | What We Evaluate | Weight |
|---|---|---|---|
| V | Verifiability検証可能性 | Developer identity, OSS code disclosure, version transparency, third-party audits | 20 |
| E | Effectiveness実効性 | Task success rate, cost accuracy, performance degradation — Layer 1 only | 15 |
| R | Resilience耐障害性 | CVE frequency & severity, patch response speed, structural failure patterns, supply chain integrity | 20 |
| D | Data Conductデータ行動規範 | GDPR posture, data minimization, AI training use disclosure, sub-processor transparency | 15 |
| I | Identity & Control主権と制御 | Emergency stop mechanisms, human-in-the-loop availability, permission chain documentation | 10 |
| C | Containment境界遵守 | Sandbox design philosophy (whitelist vs. blocklist), least-privilege defaults, tenant isolation | 10 |
| T | Transparency透明性 | CVE publication posture, incident disclosure speed, AI safety framework adoption | 10 |