EUR 150-250
per hour
Review risk scenarios, preference rankings, and regulatory reasoning chains for frontier post-training. Evaluate model outputs on Basel III compliance, AML procedures, and stress testing edge cases.
Role Directory
Public Karokan opportunities that reference French language coverage.
EUR 150-250
per hour
Review risk scenarios, preference rankings, and regulatory reasoning chains for frontier post-training. Evaluate model outputs on Basel III compliance, AML procedures, and stress testing edge cases.
EUR 130-220
per hour
Assess AI Act, procurement, and cross-border legal reasoning in multilingual institutional contexts. Evaluate model responses across complex EU regulatory scenarios.
EUR 100-180
per hour
Design adversarial suites for agentic workflows, tool misuse, multilingual jailbreaks, and instruction hijacking. Work directly with safety researchers on frontier models.
EUR 120-200
per hour
Label preference data and edge-case corrections for clinical note summarization and medical reasoning models deployed in European hospital environments.
EUR 80-140
per hour
Create and validate professional-language benchmark tasks beyond English across public and enterprise workflows. Contribute to the KAROKAN-LANG evaluation suite.
EUR 90-160
per hour
Design verifier-backed reasoning tasks for advanced STEM evaluation and post-training refinement of frontier models.
EUR 110-180
per hour
Support deployment architecture reviews, documentation, and compliance posture for sensitive AI systems under GDPR and the AI Act.
EUR 100-170
per hour
Validate synthetic plant-floor datasets and workflow copilots for industrial deployments in German manufacturing environments.
EUR 140-210
per hour
Drive AI Act readiness, governance design, vendor review, and deployment control frameworks for a Fortune 500 enterprise rollout.
EUR 90-150
per hour
Evaluate outputs on grid resilience, energy forecasting, infrastructure optimization, and scientific reliability for climate AI models.
EUR 80-140
per hour
Design and run end-to-end evaluation pipelines for deployed LLM products, covering instruction tuning quality, regression testing, and performance benchmarking.
EUR 110-190
per hour
Produce high-quality legal synthetic data and contract analysis annotations for NLP models targeting EU cross-border commercial law.