The European AI Productivity Index
Assesses whether frontier AI models can perform economically valuable professional tasks in European contexts — EU law, multi-country taxation, industrial standards, and cross-border regulatory analysis.
1,240
Total tasks
87
Expert authors
6
Domains
4
Languages
Leaderboard
Ranked by overall score · last updated March 2026
Task categories
Weighted contribution to the overall score
Interpretation of directives, regulations, and court decisions across member states
VAT, transfer pricing, and multi-jurisdiction compliance tasks
CE marking, EN/ISO compliance, and technical product documentation
MiFID II, DORA, and Basel III application in European contexts
OJEU notices, tender evaluation, and contracting authority obligations
Data subject rights, DPIAs, and cross-border transfer mechanisms
About
KAROKAN-EU is the first benchmark specifically designed to measure AI productivity in European professional settings. Unlike English-centric evaluations, it probes model capabilities on tasks that require deep knowledge of EU institutional frameworks, member-state legal systems, and cross-border regulatory complexity. Tasks were authored by verified domain experts — lawyers, tax advisors, policy analysts, and engineers — and independently validated before inclusion.
Methodology
Each task is evaluated in a closed-book, multi-turn setting. Models receive a realistic professional prompt and are scored on factual accuracy, legal correctness, and contextual completeness by expert human raters. Final scores are aggregated across 8 professional domains using a weighted average reflecting economic activity distribution in the EU.
Cite
@misc{karokan2026eu,
title={KAROKAN-EU: The European AI Productivity Index},
author={Karokan Research Team},
year={2026},
url={https://karokan.com/research/karokan-eu}
}Get involved
Access the dataset, submit a model for evaluation, or collaborate with our research team.
Contact research →Other benchmarks
European Multilingual Evaluation
The first rigorous benchmark evaluating LLM quality beyond English, across all 24 official EU languages in professional and institutional contexts.
AI Act Compliance Benchmark
The first benchmark evaluating whether AI systems satisfy EU AI Act requirements: risk classification, documentation, transparency, and human oversight at model and system level.