KAROKAN-EU

Gemini 2.5 Pro

Google DeepMind · score 47.6% ±3.0%

Assesses whether frontier AI models can perform economically valuable professional tasks in European contexts — EU law, multi-country taxation, industrial standards, and cross-border regulatory analysis.

Benchmark detail: ranked #4 with a normalized performance bar of 75%.