EyesInAI — AI Benchmark Leaderboard

⚡Ping

Latency & availability — single-word reply

1942 ms

🧮Reasoning

Basic math reasoning — show work, give answer

fail

{ }JSON Output

Structured output compliance — valid JSON with required keys

fail

💻Code Gen

Python function generation with docstring

fail

🚀Throughput

Token generation speed — 500-token long-form response

fail

🔍Context Recall

Retrieval from in-context data — 20-item list Q&A

fail

🔧Tool Use

Function/tool calling — get_weather invocation

11859 ms

zai-org/GLM-4.7