Skip to content

datenlabor-bmz/ai-language-monitor

Repository files navigation

title emoji colorFrom colorTo sdk license short_description datasets models tags
AI Language Monitor
🌍
purple
pink
gradio
cc-by-sa-4.0
Evaluating LLM performance across all human languages.
openlanguagedata/flores_plus
meta-llama/Llama-3.3-70B-Instruct
mistralai/Mistral-Small-24B-Instruct-2501
deepseek-ai/DeepSeek-V3
microsoft/phi-4
leaderboard
submission:manual
test:public
judge:auto
modality:text
modality:artefacts
eval:generation
language:English
language:German

Hugging Face

AI Language Monitor 🌍

Benchmarking all big AI models on all benchmarkable languages.

Sources:

  1. For AI models: OpenRouter
  2. For language benchmarks: FLORES+
  3. For language statistics: Wikidata, Ethnologue

UI sketch