KAROKAN-EU
Gemini 2.5 Pro
Google DeepMind · score 47.6% ±3.0%
Assesses whether frontier AI models can perform economically valuable professional tasks in European contexts — EU law, multi-country taxation, industrial standards, and cross-border regulatory analysis.
Benchmark detail: ranked #4 with a normalized performance bar of 75%.