EyesInAI — AI Benchmark Leaderboard

Model	Test	Runs	Latency	$ / 1k	Pass
deepseek-v4-pro	🧠 Multi-Step Logic	5	164ms	—	33%
gpt-oss-120b	🚀 Throughput	4	1819ms	$0.132	100%
gpt-oss-120b	🧠 Multi-Step Logic	4	918ms	$0.056	100%
deepseek-v4-pro	🚀 Throughput	4	154ms	—	50%
deepseek-v4-pro	📝 Summarization	4	233ms	—	50%
deepseek-v4-pro	📄 Data Extraction	4	166ms	—	50%
deepseek-v4-pro	📋 Instruction Follow	4	148ms	—	100%
deepseek-v4-pro	✅ Format Compliance	4	194ms	—	50%
deepseek-v4-pro	💻 Code Gen	4	173ms	—	100%
deepseek-v4-pro	{ } JSON Output	4	186ms	—	100%
deepseek-v4-pro	⚡ Ping	4	164ms	—	100%
gpt-oss-120b	🔧 Tool Use	3	415ms	$0.014	100%
gpt-oss-120b	🔍 Context Recall	3	420ms	$0.025	100%
gpt-oss-120b	💻 Code Gen	3	754ms	$0.047	100%
gpt-oss-120b	{ } JSON Output	3	543ms	$0.030	100%
gpt-oss-120b	🧮 Reasoning	3	429ms	$0.020	100%
gpt-oss-120b	⚡ Ping	3	419ms	$0.012	100%
gemma-4-26b-a4b-it:free	🔍 Context Recall	3	1481ms	free	100%
gemma-4-26b-a4b-it:free	🚀 Throughput	3	1264ms	free	100%
gemma-4-26b-a4b-it:free	💻 Code Gen	3	1871ms	free	100%
gemma-4-26b-a4b-it:free	{ } JSON Output	3	1821ms	free	100%
gemma-4-26b-a4b-it:free	🧮 Reasoning	3	1232ms	free	100%
gemma-4-26b-a4b-it:free	⚡ Ping	3	898ms	free	100%
GLM-5.2	🔧 Tool Use	3	19564ms	$0.310	100%
GLM-5.2	🔍 Context Recall	3	12575ms	$1.30	100%
GLM-5.2	🚀 Throughput	3	30756ms	$2.31	100%
GLM-5.2	💻 Code Gen	3	10442ms	$1.08	100%
GLM-5.2	{ } JSON Output	3	4982ms	$0.838	100%
GLM-5.2	🧮 Reasoning	3	9143ms	$1.41	100%
GLM-5.2	⚡ Ping	3	6048ms	$0.595	100%
gpt-oss-20b	🔧 Tool Use	3	1330ms	$0.012	100%
gpt-oss-20b	🔍 Context Recall	3	2551ms	$0.036	100%
gpt-oss-20b	🚀 Throughput	3	7011ms	$0.108	100%
gpt-oss-20b	💻 Code Gen	3	3284ms	$0.050	100%
gpt-oss-20b	{ } JSON Output	3	1874ms	$0.023	100%
gpt-oss-20b	🧮 Reasoning	3	3033ms	$0.035	100%
gpt-oss-20b	⚡ Ping	3	1350ms	$0.011	100%
gpt-oss-120b	✅ Format Compliance	3	1727ms	$0.107	100%
gpt-oss-120b	📋 Instruction Follow	3	1505ms	$0.107	100%
gpt-oss-120b	📄 Data Extraction	3	1072ms	$0.066	100%
gpt-oss-120b	📝 Summarization	3	1077ms	$0.068	100%
deepseek-v4-pro	🪡 Long-Context Needle	3	298ms	—	100%
deepseek-v4-pro	🏷️ Classification	3	154ms	—	100%
deepseek-v4-pro	🔧 Tool Use	3	156ms	—	100%
deepseek-v4-pro	🔍 Context Recall	3	181ms	—	100%
deepseek-v4-pro	code_algo	3	169ms	—	100%
deepseek-v4-pro	code_refactor_preserve	3	155ms	—	100%
deepseek-v4-pro	code_to_spec	3	165ms	—	100%
deepseek-v4-pro	code_fix_bug	3	153ms	—	100%
deepseek-v4-pro	🧮 Reasoning	3	164ms	—	100%
glm-4.6	💻 Code Gen	3	—	—	0%
glm-4.6	{ } JSON Output	3	3379ms	—	50%
glm-4.6	⚡ Ping	3	1252ms	—	100%
palmyra-x5	🔍 Context Recall	2	632ms	$0.476	100%
palmyra-x5	🚀 Throughput	2	632ms	$3.18	100%
palmyra-x5	💻 Code Gen	2	672ms	$0.876	100%
palmyra-x5	{ } JSON Output	2	599ms	$0.437	100%
palmyra-x5	🧮 Reasoning	2	616ms	$0.607	100%
palmyra-x5	⚡ Ping	2	588ms	$0.170	100%
hy3:free	🔍 Context Recall	2	1833ms	free	100%
hy3:free	🚀 Throughput	2	1741ms	free	100%
hy3:free	💻 Code Gen	2	1710ms	free	100%
hy3:free	{ } JSON Output	2	1648ms	free	100%
hy3:free	🧮 Reasoning	2	1821ms	free	100%
hy3:free	⚡ Ping	2	1748ms	free	100%
l3.1-euryale-70b	🔍 Context Recall	2	608ms	$0.201	100%
l3.1-euryale-70b	🚀 Throughput	2	819ms	$0.459	100%
l3.1-euryale-70b	💻 Code Gen	2	—	—	0%
l3.1-euryale-70b	{ } JSON Output	2	350ms	$0.076	100%
l3.1-euryale-70b	🧮 Reasoning	2	422ms	$0.048	100%
l3.1-euryale-70b	⚡ Ping	2	755ms	$0.016	100%
qwen3.7-plus	🔍 Context Recall	2	1102ms	$0.844	100%
qwen3.7-plus	🚀 Throughput	2	735ms	$2.90	100%
qwen3.7-plus	💻 Code Gen	2	713ms	$2.14	100%
qwen3.7-plus	{ } JSON Output	2	1024ms	$0.918	100%
qwen3.7-plus	🧮 Reasoning	2	1282ms	$2.28	100%
qwen3.7-plus	⚡ Ping	2	910ms	$0.311	100%
qwen3.5-plus-20260420	🔍 Context Recall	2	659ms	$0.847	100%
qwen3.5-plus-20260420	🚀 Throughput	2	670ms	$4.60	100%
qwen3.5-plus-20260420	💻 Code Gen	2	712ms	$2.23	100%
qwen3.5-plus-20260420	{ } JSON Output	2	1101ms	$2.03	100%
qwen3.5-plus-20260420	🧮 Reasoning	2	735ms	$1.61	100%
qwen3.5-plus-20260420	⚡ Ping	2	648ms	$0.297	100%
qwen3.5-122b-a10b	🔍 Context Recall	2	382ms	free	100%
qwen3.5-122b-a10b	🚀 Throughput	2	1331ms	free	100%
qwen3.5-122b-a10b	💻 Code Gen	2	767ms	free	100%
qwen3.5-122b-a10b	{ } JSON Output	2	315ms	free	100%
qwen3.5-122b-a10b	🧮 Reasoning	2	387ms	free	100%
qwen3.5-122b-a10b	⚡ Ping	2	1035ms	free	100%
qwen3-vl-235b-a22b-instruct	🔍 Context Recall	2	683ms	$0.109	100%
qwen3-vl-235b-a22b-instruct	🚀 Throughput	2	359ms	$0.958	100%
qwen3-vl-235b-a22b-instruct	💻 Code Gen	2	761ms	$0.210	100%
qwen3-vl-235b-a22b-instruct	{ } JSON Output	2	297ms	$0.072	100%
qwen3-vl-235b-a22b-instruct	🧮 Reasoning	2	343ms	$0.110	100%
qwen3-vl-235b-a22b-instruct	⚡ Ping	2	330ms	<$0.01	100%
qwen3-coder-plus	🔍 Context Recall	2	1128ms	$0.256	100%
qwen3-coder-plus	🚀 Throughput	2	718ms	$1.65	100%
qwen3-coder-plus	💻 Code Gen	2	693ms	$0.355	100%
qwen3-coder-plus	{ } JSON Output	2	729ms	$0.153	100%
qwen3-coder-plus	🧮 Reasoning	2	588ms	$0.177	100%
qwen3-coder-plus	⚡ Ping	2	1264ms	$0.013	100%
qwen3-30b-a3b-thinking-2507	🔍 Context Recall	2	319ms	$0.621	100%
qwen3-30b-a3b-thinking-2507	🚀 Throughput	2	1232ms	$2.21	100%
qwen3-30b-a3b-thinking-2507	💻 Code Gen	2	249ms	$4.23	100%
qwen3-30b-a3b-thinking-2507	{ } JSON Output	2	343ms	$0.587	100%
qwen3-30b-a3b-thinking-2507	🧮 Reasoning	2	266ms	$0.444	100%
qwen3-30b-a3b-thinking-2507	⚡ Ping	2	268ms	$0.161	100%
qwen2.5-vl-72b-instruct	🔍 Context Recall	2	617ms	$0.201	100%
qwen2.5-vl-72b-instruct	🚀 Throughput	2	445ms	$0.539	100%
qwen2.5-vl-72b-instruct	💻 Code Gen	2	730ms	$0.113	100%
qwen2.5-vl-72b-instruct	{ } JSON Output	2	1617ms	$0.078	100%
qwen2.5-vl-72b-instruct	🧮 Reasoning	2	446ms	$0.077	100%
qwen2.5-vl-72b-instruct	⚡ Ping	2	2953ms	$0.023	100%
laguna-xs-2.1:free	🔍 Context Recall	2	405ms	free	100%
laguna-xs-2.1:free	🚀 Throughput	2	453ms	free	100%
laguna-xs-2.1:free	💻 Code Gen	2	465ms	free	100%
laguna-xs-2.1:free	{ } JSON Output	2	539ms	free	100%
laguna-xs-2.1:free	🧮 Reasoning	2	416ms	free	100%
laguna-xs-2.1:free	⚡ Ping	2	375ms	free	100%
sonar-deep-research	🔍 Context Recall	2	—	—	0%
sonar-deep-research	🚀 Throughput	2	10128ms	—	100%
sonar-deep-research	💻 Code Gen	2	—	—	0%
sonar-deep-research	{ } JSON Output	2	—	—	0%
sonar-deep-research	🧮 Reasoning	2	—	—	0%
sonar-deep-research	⚡ Ping	2	10137ms	—	100%
gpt-audio-mini	🔍 Context Recall	2	—	—	0%
gpt-audio-mini	🚀 Throughput	2	—	—	0%
gpt-audio-mini	💻 Code Gen	2	—	—	0%
gpt-audio-mini	{ } JSON Output	2	—	—	0%
gpt-audio-mini	🧮 Reasoning	2	—	—	0%
gpt-audio-mini	⚡ Ping	2	—	—	0%
nemotron-3-super-120b-a12b:free	🔍 Context Recall	2	534ms	free	100%
nemotron-3-super-120b-a12b:free	🚀 Throughput	2	406ms	free	100%
nemotron-3-super-120b-a12b:free	💻 Code Gen	2	458ms	free	100%
nemotron-3-super-120b-a12b:free	{ } JSON Output	2	389ms	free	100%
nemotron-3-super-120b-a12b:free	🧮 Reasoning	2	—	—	0%
nemotron-3-super-120b-a12b:free	⚡ Ping	2	345ms	free	100%
deepseek-r1	🔍 Context Recall	2	1218ms	$1.48	100%
deepseek-r1	🚀 Throughput	2	1324ms	$1.27	100%
deepseek-r1	💻 Code Gen	2	—	—	0%
deepseek-r1	{ } JSON Output	2	748ms	$0.462	100%
deepseek-r1	🧮 Reasoning	2	1170ms	$2.23	100%
deepseek-r1	⚡ Ping	2	755ms	$0.382	100%
command-r-plus-08-2024	🔍 Context Recall	2	855ms	$0.815	100%
command-r-plus-08-2024	🚀 Throughput	2	884ms	$5.07	100%
command-r-plus-08-2024	💻 Code Gen	2	—	—	0%
command-r-plus-08-2024	{ } JSON Output	2	801ms	$0.497	100%
command-r-plus-08-2024	🧮 Reasoning	2	805ms	$0.532	100%
command-r-plus-08-2024	⚡ Ping	2	874ms	$0.037	100%
seed-2.0-lite	🔍 Context Recall	2	911ms	$0.398	100%
seed-2.0-lite	🚀 Throughput	2	403ms	$6.65	100%
seed-2.0-lite	💻 Code Gen	2	1303ms	$1.77	100%
seed-2.0-lite	{ } JSON Output	2	407ms	$0.291	100%
seed-2.0-lite	🧮 Reasoning	2	479ms	$1.18	100%
seed-2.0-lite	⚡ Ping	2	588ms	$0.208	100%
nova-premier-v1	🔍 Context Recall	2	777ms	$1.07	100%
nova-premier-v1	🚀 Throughput	2	766ms	$6.41	100%
nova-premier-v1	💻 Code Gen	2	699ms	$1.91	100%
nova-premier-v1	{ } JSON Output	2	1424ms	$0.620	100%
nova-premier-v1	🧮 Reasoning	2	1506ms	$0.610	100%
nova-premier-v1	⚡ Ping	2	1617ms	$0.128	100%
aion-3.0	🔍 Context Recall	2	2509ms	$2.11	100%
aion-3.0	🚀 Throughput	2	1093ms	$4.22	100%
aion-3.0	💻 Code Gen	2	1083ms	$2.35	100%
aion-3.0	{ } JSON Output	2	762ms	$1.01	100%
aion-3.0	🧮 Reasoning	2	771ms	$1.41	100%
aion-3.0	⚡ Ping	2	913ms	$0.198	100%
gpt-4o-2024-05-13	🔧 Tool Use	2	705ms	$0.338	100%
gpt-4o-2024-05-13	🔍 Context Recall	2	662ms	$0.755	100%
gpt-4o-2024-05-13	🚀 Throughput	2	4812ms	$7.65	100%
gpt-4o-2024-05-13	💻 Code Gen	2	1052ms	$1.64	100%
gpt-4o-2024-05-13	{ } JSON Output	2	725ms	$0.492	100%
gpt-4o-2024-05-13	🧮 Reasoning	2	1296ms	$1.15	100%
gpt-4o-2024-05-13	⚡ Ping	2	735ms	$0.045	100%
gpt-4.1	🔧 Tool Use	2	712ms	$0.270	100%
gpt-4.1	🔍 Context Recall	2	1060ms	$0.604	100%
gpt-4.1	🚀 Throughput	2	7182ms	$6.12	100%
gpt-4.1	💻 Code Gen	2	1619ms	$0.960	100%
gpt-4.1	{ } JSON Output	2	743ms	$0.290	100%
gpt-4.1	🧮 Reasoning	2	911ms	$0.292	100%
gpt-4.1	⚡ Ping	2	936ms	$0.036	100%
gpt-3.5-turbo-0125	🔧 Tool Use	2	1398ms	—	100%
gpt-3.5-turbo-0125	🔍 Context Recall	2	1815ms	—	100%
gpt-3.5-turbo-0125	🚀 Throughput	2	4247ms	—	100%
gpt-3.5-turbo-0125	💻 Code Gen	2	1823ms	—	100%
gpt-3.5-turbo-0125	{ } JSON Output	2	1787ms	—	100%
gpt-3.5-turbo-0125	🧮 Reasoning	2	6264ms	—	100%
gpt-3.5-turbo-0125	⚡ Ping	2	1460ms	—	100%
gpt-oss-20b	🔧 Tool Use	2	282ms	$0.011	100%
gpt-oss-20b	🔍 Context Recall	2	305ms	$0.018	100%
gpt-oss-20b	🚀 Throughput	2	1059ms	$0.109	100%
gpt-oss-20b	💻 Code Gen	2	439ms	$0.034	100%
gpt-oss-20b	{ } JSON Output	2	376ms	$0.024	100%
gpt-oss-20b	🧮 Reasoning	2	381ms	$0.024	100%
gpt-oss-20b	⚡ Ping	2	560ms	<$0.01	100%
Kimi-K2.6	🔧 Tool Use	2	867ms	$0.324	100%
Kimi-K2.6	🔍 Context Recall	2	1232ms	$0.780	100%
Kimi-K2.6	🚀 Throughput	2	5913ms	$2.67	100%
Kimi-K2.6	💻 Code Gen	2	4414ms	$3.04	100%
Kimi-K2.6	{ } JSON Output	2	1351ms	$0.845	100%
Kimi-K2.6	🧮 Reasoning	2	7744ms	$5.01	100%
Kimi-K2.6	⚡ Ping	2	1203ms	$0.323	100%
reka-flash-3	⚡ Ping	2	10119ms	$0.056	100%
qwen3.6-35b-a3b	🔍 Context Recall	2	1123ms	$0.741	100%
qwen3.6-35b-a3b	🚀 Throughput	2	750ms	$0.506	100%
qwen3.6-35b-a3b	💻 Code Gen	2	604ms	$1.54	100%
qwen3.6-35b-a3b	{ } JSON Output	2	1034ms	$0.852	100%
qwen3.6-35b-a3b	🧮 Reasoning	2	310ms	$1.40	100%
qwen3.6-35b-a3b	⚡ Ping	2	532ms	$0.205	100%
qwen3.5-35b-a3b	🔍 Context Recall	2	806ms	$0.808	100%
qwen3.5-35b-a3b	🚀 Throughput	2	471ms	$0.506	100%
qwen3.5-35b-a3b	💻 Code Gen	2	602ms	$0.314	100%
qwen3.5-35b-a3b	{ } JSON Output	2	769ms	$0.717	100%
qwen3.5-35b-a3b	🧮 Reasoning	2	1501ms	$2.03	100%
qwen3.5-35b-a3b	⚡ Ping	2	1153ms	$0.144	100%
qwen3-vl-30b-a3b-instruct	🔍 Context Recall	2	710ms	$0.045	100%
qwen3-vl-30b-a3b-instruct	🚀 Throughput	2	417ms	$0.265	100%
qwen3-vl-30b-a3b-instruct	💻 Code Gen	2	318ms	$0.062	100%
qwen3-vl-30b-a3b-instruct	{ } JSON Output	2	1783ms	$0.023	100%
qwen3-vl-30b-a3b-instruct	🧮 Reasoning	2	355ms	$0.026	100%
qwen3-vl-30b-a3b-instruct	⚡ Ping	2	387ms	<$0.01	100%
qwen3-max	🔍 Context Recall	2	1352ms	$0.206	100%
qwen3-max	🚀 Throughput	2	1170ms	$1.98	100%
qwen3-max	💻 Code Gen	2	1278ms	$0.426	100%
qwen3-max	{ } JSON Output	2	1187ms	$0.160	100%
qwen3-max	🧮 Reasoning	2	1599ms	$0.338	100%
qwen3-max	⚡ Ping	2	1982ms	$0.016	100%
qwen3-8b	🔍 Context Recall	2	527ms	$0.166	100%
qwen3-8b	🚀 Throughput	2	725ms	$0.661	100%
qwen3-8b	💻 Code Gen	2	596ms	$0.511	100%
qwen3-8b	{ } JSON Output	2	697ms	$0.142	100%
qwen3-8b	🧮 Reasoning	2	571ms	$0.368	100%
qwen3-8b	⚡ Ping	2	719ms	$0.058	100%
qwen3-235b-a22b	🔍 Context Recall	2	1049ms	$0.584	100%
qwen3-235b-a22b	🚀 Throughput	2	373ms	$5.04	100%
qwen3-235b-a22b	💻 Code Gen	2	364ms	$4.19	100%
qwen3-235b-a22b	{ } JSON Output	2	470ms	$0.470	100%
qwen3-235b-a22b	🧮 Reasoning	2	419ms	$0.710	100%
qwen3-235b-a22b	⚡ Ping	2	426ms	$0.216	100%
qwen-2.5-7b-instruct	🔍 Context Recall	2	639ms	$0.012	100%
qwen-2.5-7b-instruct	🚀 Throughput	2	955ms	$0.052	100%
qwen-2.5-7b-instruct	💻 Code Gen	2	641ms	$0.013	100%
qwen-2.5-7b-instruct	{ } JSON Output	2	521ms	<$0.01	100%
qwen-2.5-7b-instruct	🧮 Reasoning	2	633ms	<$0.01	100%
qwen-2.5-7b-instruct	⚡ Ping	2	632ms	<$0.01	100%
sonar-pro-search	🔍 Context Recall	2	1120ms	$1.12	100%
sonar-pro-search	🚀 Throughput	2	1073ms	$7.49	100%
sonar-pro-search	💻 Code Gen	2	1014ms	$1.91	100%
sonar-pro-search	{ } JSON Output	2	932ms	$0.510	100%
sonar-pro-search	🧮 Reasoning	2	932ms	$0.789	100%
sonar-pro-search	⚡ Ping	2	1087ms	$0.036	100%
gpt-oss-20b	🔍 Context Recall	2	1306ms	$0.067	100%
gpt-oss-20b	🚀 Throughput	2	1306ms	$0.477	100%
gpt-oss-20b	💻 Code Gen	2	464ms	$0.042	100%
gpt-oss-20b	{ } JSON Output	2	1388ms	$0.026	100%
gpt-oss-20b	🧮 Reasoning	2	962ms	$0.035	100%
gpt-oss-20b	⚡ Ping	2	9844ms	$0.014	100%
nemotron-3-ultra-550b-a55b:free	🔍 Context Recall	2	514ms	free	100%
nemotron-3-ultra-550b-a55b:free	🚀 Throughput	2	508ms	free	100%
nemotron-3-ultra-550b-a55b:free	💻 Code Gen	2	433ms	free	100%
nemotron-3-ultra-550b-a55b:free	{ } JSON Output	2	442ms	free	100%
nemotron-3-ultra-550b-a55b:free	🧮 Reasoning	2	470ms	free	100%
nemotron-3-ultra-550b-a55b:free	⚡ Ping	2	440ms	free	100%
llama-3.3-nemotron-super-49b-v1.5	🔍 Context Recall	2	415ms	free	100%
llama-3.3-nemotron-super-49b-v1.5	🚀 Throughput	2	408ms	free	100%
llama-3.3-nemotron-super-49b-v1.5	💻 Code Gen	2	364ms	free	100%
llama-3.3-nemotron-super-49b-v1.5	{ } JSON Output	2	367ms	free	100%
llama-3.3-nemotron-super-49b-v1.5	🧮 Reasoning	2	379ms	free	100%
llama-3.3-nemotron-super-49b-v1.5	⚡ Ping	2	1523ms	free	100%
nex-n2-mini	🔍 Context Recall	2	774ms	$0.011	100%
nex-n2-mini	🚀 Throughput	2	598ms	$0.051	100%
nex-n2-mini	💻 Code Gen	2	994ms	$0.031	100%
nex-n2-mini	{ } JSON Output	2	974ms	<$0.01	100%
nex-n2-mini	🧮 Reasoning	2	1169ms	<$0.01	100%
nex-n2-mini	⚡ Ping	2	972ms	<$0.01	100%
kimi-k2	🔍 Context Recall	2	1587ms	$0.213	100%
kimi-k2	🚀 Throughput	2	2169ms	$1.17	100%
kimi-k2	💻 Code Gen	2	4310ms	$0.254	100%
kimi-k2	{ } JSON Output	2	2821ms	$0.087	100%
kimi-k2	🧮 Reasoning	2	2468ms	$0.087	100%
kimi-k2	⚡ Ping	2	3326ms	$0.014	100%
mistral-saba	🔍 Context Recall	2	596ms	$0.052	100%
mistral-saba	🚀 Throughput	2	832ms	$0.309	100%
mistral-saba	💻 Code Gen	2	576ms	$0.079	100%
mistral-saba	{ } JSON Output	2	566ms	$0.033	100%
mistral-saba	🧮 Reasoning	2	576ms	$0.055	100%
mistral-saba	⚡ Ping	2	511ms	<$0.01	100%
mistral-large	🔍 Context Recall	2	726ms	$0.644	100%
mistral-large	🚀 Throughput	2	613ms	$3.07	100%
mistral-large	💻 Code Gen	2	600ms	$0.744	100%
mistral-large	{ } JSON Output	2	603ms	$0.360	100%
mistral-large	🧮 Reasoning	2	595ms	$0.338	100%
mistral-large	⚡ Ping	2	590ms	$0.038	100%
minimax-m2.7	🔍 Context Recall	2	659ms	$0.355	100%
minimax-m2.7	🚀 Throughput	2	550ms	$0.517	100%
minimax-m2.7	💻 Code Gen	2	—	—	0%
minimax-m2.7	{ } JSON Output	2	1691ms	$0.337	100%
minimax-m2.7	🧮 Reasoning	2	990ms	$0.515	100%
minimax-m2.7	⚡ Ping	2	2323ms	$0.073	100%
wizardlm-2-8x22b	🔍 Context Recall	2	1120ms	$0.148	100%
wizardlm-2-8x22b	🚀 Throughput	2	2337ms	$0.350	100%
wizardlm-2-8x22b	💻 Code Gen	2	—	—	0%
wizardlm-2-8x22b	{ } JSON Output	2	1397ms	$0.077	100%
wizardlm-2-8x22b	🧮 Reasoning	2	1061ms	$0.099	100%
wizardlm-2-8x22b	⚡ Ping	2	1567ms	$0.027	100%
llama-3.2-3b-instruct:free	🔍 Context Recall	2	—	—	0%
llama-3.2-3b-instruct:free	🚀 Throughput	2	—	—	0%
llama-3.2-3b-instruct:free	💻 Code Gen	2	—	—	0%
llama-3.2-3b-instruct:free	{ } JSON Output	2	—	—	0%
llama-3.2-3b-instruct:free	🧮 Reasoning	2	—	—	0%
llama-3.2-3b-instruct:free	⚡ Ping	2	—	—	0%
kat-coder-pro-v2.5	🔍 Context Recall	2	778ms	$0.570	100%
kat-coder-pro-v2.5	🚀 Throughput	2	525ms	$1.51	100%
kat-coder-pro-v2.5	💻 Code Gen	2	664ms	$1.08	100%
kat-coder-pro-v2.5	{ } JSON Output	2	615ms	$0.703	100%
kat-coder-pro-v2.5	🧮 Reasoning	2	515ms	$0.366	100%
kat-coder-pro-v2.5	⚡ Ping	2	527ms	$0.187	100%
ling-2.6-1t	🔍 Context Recall	2	1616ms	$0.027	100%
ling-2.6-1t	🚀 Throughput	2	1216ms	$0.316	100%
ling-2.6-1t	💻 Code Gen	2	1371ms	$0.079	100%
ling-2.6-1t	{ } JSON Output	2	1401ms	$0.031	100%
ling-2.6-1t	🧮 Reasoning	2	1196ms	$0.034	100%
ling-2.6-1t	⚡ Ping	2	1768ms	<$0.01	100%
deepseek-v3.2	🔍 Context Recall	2	1331ms	$0.057	100%
deepseek-v3.2	🚀 Throughput	2	1005ms	$0.211	100%
deepseek-v3.2	💻 Code Gen	2	1915ms	$0.079	100%
deepseek-v3.2	{ } JSON Output	2	616ms	$0.027	100%
deepseek-v3.2	🧮 Reasoning	2	871ms	$0.032	100%
deepseek-v3.2	⚡ Ping	2	1295ms	<$0.01	100%
deepseek-chat	🔍 Context Recall	2	1059ms	$0.071	100%
deepseek-chat	🚀 Throughput	2	472ms	$0.407	100%
deepseek-chat	💻 Code Gen	2	1231ms	$0.098	100%
deepseek-chat	{ } JSON Output	2	717ms	$0.043	100%
deepseek-chat	🧮 Reasoning	2	878ms	$0.060	100%
deepseek-chat	⚡ Ping	2	893ms	<$0.01	100%
dolphin-mistral-24b-venice-edition:free	🔍 Context Recall	2	—	—	0%
dolphin-mistral-24b-venice-edition:free	🚀 Throughput	2	—	—	0%
dolphin-mistral-24b-venice-edition:free	💻 Code Gen	2	—	—	0%
dolphin-mistral-24b-venice-edition:free	{ } JSON Output	2	—	—	0%
dolphin-mistral-24b-venice-edition:free	🧮 Reasoning	2	—	—	0%
dolphin-mistral-24b-venice-edition:free	⚡ Ping	2	—	—	0%
ernie-4.5-vl-424b-a47b	🔍 Context Recall	2	1600ms	$0.104	100%
ernie-4.5-vl-424b-a47b	🚀 Throughput	2	1651ms	$0.605	100%
ernie-4.5-vl-424b-a47b	💻 Code Gen	2	1486ms	$0.178	100%
ernie-4.5-vl-424b-a47b	{ } JSON Output	2	1545ms	$0.056	100%
ernie-4.5-vl-424b-a47b	🧮 Reasoning	2	1404ms	$0.087	100%
ernie-4.5-vl-424b-a47b	⚡ Ping	2	1578ms	<$0.01	100%
nova-2-lite-v1	🔍 Context Recall	2	664ms	$0.191	100%
nova-2-lite-v1	🚀 Throughput	2	837ms	$1.27	100%
nova-2-lite-v1	💻 Code Gen	2	676ms	$0.338	100%
nova-2-lite-v1	{ } JSON Output	2	672ms	$0.131	100%
nova-2-lite-v1	🧮 Reasoning	2	936ms	$0.151	100%
nova-2-lite-v1	⚡ Ping	2	1004ms	$0.026	100%
gpt-4o-mini-2024-07-18	🔧 Tool Use	2	955ms	$0.020	100%
gpt-4o-mini-2024-07-18	🔍 Context Recall	2	1244ms	$0.048	100%
gpt-4o-mini-2024-07-18	🚀 Throughput	2	12311ms	$0.459	100%
gpt-4o-mini-2024-07-18	💻 Code Gen	2	2163ms	$0.070	100%
gpt-4o-mini-2024-07-18	{ } JSON Output	2	1224ms	$0.029	100%
gpt-4o-mini-2024-07-18	🧮 Reasoning	2	1493ms	$0.030	100%
gpt-4o-mini-2024-07-18	⚡ Ping	2	972ms	<$0.01	100%
gpt-4.1-nano	🔧 Tool Use	2	746ms	$0.013	100%
gpt-4.1-nano	🔍 Context Recall	2	644ms	$0.024	100%
gpt-4.1-nano	🚀 Throughput	2	3748ms	$0.306	100%
gpt-4.1-nano	💻 Code Gen	2	966ms	$0.046	100%
gpt-4.1-nano	{ } JSON Output	2	623ms	$0.018	100%
gpt-4.1-nano	🧮 Reasoning	2	1068ms	$0.017	100%
gpt-4.1-nano	⚡ Ping	2	501ms	<$0.01	100%
gpt-4-0613	🔧 Tool Use	2	1382ms	—	100%
gpt-4-0613	🔍 Context Recall	2	2993ms	—	100%
gpt-4-0613	🚀 Throughput	2	19244ms	—	100%
gpt-4-0613	💻 Code Gen	2	7795ms	—	100%
gpt-4-0613	{ } JSON Output	2	2041ms	—	100%
gpt-4-0613	🧮 Reasoning	2	2445ms	—	100%
gpt-4-0613	⚡ Ping	2	2225ms	—	100%
step-3.5-flash	🔍 Context Recall	2	2770ms	free	100%
step-3.5-flash	🚀 Throughput	2	7195ms	free	100%
step-3.5-flash	💻 Code Gen	2	2132ms	free	100%
step-3.5-flash	{ } JSON Output	2	4948ms	free	100%
step-3.5-flash	🧮 Reasoning	2	18751ms	free	100%
step-3.5-flash	⚡ Ping	2	3809ms	free	100%
nemotron-3-ultra-550b-a55b	🔍 Context Recall	2	1764ms	$0.707	100%
nemotron-3-ultra-550b-a55b	🚀 Throughput	2	15005ms	$2.75	100%
nemotron-3-ultra-550b-a55b	💻 Code Gen	2	1861ms	$0.710	100%
nemotron-3-ultra-550b-a55b	{ } JSON Output	2	3198ms	$0.794	100%
nemotron-3-ultra-550b-a55b	🧮 Reasoning	2	1231ms	$0.259	100%
nemotron-3-ultra-550b-a55b	⚡ Ping	2	15814ms	$0.233	100%
ising-calibration-1-35b-a3b	🔍 Context Recall	2	6667ms	free	100%
ising-calibration-1-35b-a3b	🚀 Throughput	2	—	—	0%
ising-calibration-1-35b-a3b	💻 Code Gen	2	8562ms	free	100%
ising-calibration-1-35b-a3b	{ } JSON Output	2	—	—	0%
ising-calibration-1-35b-a3b	🧮 Reasoning	2	13533ms	free	100%
ising-calibration-1-35b-a3b	⚡ Ping	2	3193ms	free	100%
llama-3.2-3b-instruct	🔍 Context Recall	2	907ms	free	100%
llama-3.2-3b-instruct	🚀 Throughput	2	—	—	0%
llama-3.2-3b-instruct	💻 Code Gen	2	2303ms	free	100%
llama-3.2-3b-instruct	{ } JSON Output	2	9174ms	free	100%
llama-3.2-3b-instruct	🧮 Reasoning	2	804ms	free	100%
llama-3.2-3b-instruct	⚡ Ping	2	3407ms	free	100%
diffusiongemma-26b-a4b-it	🔍 Context Recall	2	1199ms	free	100%
diffusiongemma-26b-a4b-it	🚀 Throughput	2	2469ms	free	100%
diffusiongemma-26b-a4b-it	💻 Code Gen	2	706ms	free	100%
diffusiongemma-26b-a4b-it	{ } JSON Output	2	882ms	free	100%
diffusiongemma-26b-a4b-it	🧮 Reasoning	2	601ms	free	100%
diffusiongemma-26b-a4b-it	⚡ Ping	2	833ms	free	100%
llama-3.3-70b-versatile	🔧 Tool Use	2	311ms	$0.161	100%
llama-3.3-70b-versatile	🔍 Context Recall	2	376ms	$0.166	100%
llama-3.3-70b-versatile	🚀 Throughput	2	2669ms	$0.636	100%
llama-3.3-70b-versatile	💻 Code Gen	2	714ms	$0.192	100%
llama-3.3-70b-versatile	{ } JSON Output	2	282ms	$0.074	100%
llama-3.3-70b-versatile	🧮 Reasoning	2	407ms	$0.061	100%
llama-3.3-70b-versatile	⚡ Ping	2	239ms	$0.026	100%
gemini-pro-latest	🔧 Tool Use	2	2954ms	—	100%
gemini-pro-latest	🔍 Context Recall	2	5028ms	—	100%
gemini-pro-latest	🚀 Throughput	2	—	—	0%
gemini-pro-latest	💻 Code Gen	2	6797ms	—	100%
gemini-pro-latest	{ } JSON Output	2	4865ms	—	100%
gemini-pro-latest	🧮 Reasoning	2	7731ms	—	100%
gemini-pro-latest	⚡ Ping	2	2814ms	—	100%
gemini-3.1-flash-lite-image	🔧 Tool Use	2	695ms	—	100%
gemini-3.1-flash-lite-image	🔍 Context Recall	2	880ms	—	100%
gemini-3.1-flash-lite-image	🚀 Throughput	2	2893ms	—	100%
gemini-3.1-flash-lite-image	💻 Code Gen	2	1009ms	—	100%
gemini-3.1-flash-lite-image	{ } JSON Output	2	783ms	—	100%
gemini-3.1-flash-lite-image	🧮 Reasoning	2	690ms	—	100%
gemini-3.1-flash-lite-image	⚡ Ping	2	780ms	—	100%
gemini-2.5-pro	🔧 Tool Use	2	2920ms	$0.269	100%
gemini-2.5-pro	🔍 Context Recall	2	5027ms	$0.701	100%
gemini-2.5-pro	🚀 Throughput	2	—	—	0%
gemini-2.5-pro	💻 Code Gen	2	13423ms	$2.51	100%
gemini-2.5-pro	{ } JSON Output	2	5189ms	$0.535	100%
gemini-2.5-pro	🧮 Reasoning	2	9979ms	$0.441	100%
gemini-2.5-pro	⚡ Ping	2	2990ms	$0.019	100%
Kimi-K2.6	🔧 Tool Use	2	2921ms	$0.472	100%
Kimi-K2.6	🔍 Context Recall	2	5224ms	$1.30	100%
Kimi-K2.6	🚀 Throughput	2	7202ms	$2.67	100%
Kimi-K2.6	💻 Code Gen	2	8982ms	$3.35	100%
Kimi-K2.6	{ } JSON Output	2	3864ms	$0.821	100%
Kimi-K2.6	🧮 Reasoning	2	10424ms	$4.06	100%
Kimi-K2.6	⚡ Ping	2	1485ms	$0.158	100%
Meta-Llama-3.1-70B-Instruct-Turbo	🔧 Tool Use	2	1344ms	$0.107	100%
Meta-Llama-3.1-70B-Instruct-Turbo	🔍 Context Recall	2	1968ms	$0.096	100%
Meta-Llama-3.1-70B-Instruct-Turbo	🚀 Throughput	2	—	—	0%
Meta-Llama-3.1-70B-Instruct-Turbo	💻 Code Gen	2	5457ms	$0.077	100%
Meta-Llama-3.1-70B-Instruct-Turbo	{ } JSON Output	2	1544ms	$0.036	100%
Meta-Llama-3.1-70B-Instruct-Turbo	🧮 Reasoning	2	1636ms	$0.026	100%
Meta-Llama-3.1-70B-Instruct-Turbo	⚡ Ping	2	1001ms	<$0.01	100%
DeepSeek-V3.1-Terminus	🔧 Tool Use	2	—	—	0%
DeepSeek-V3.1-Terminus	🔍 Context Recall	2	1080ms	$0.070	100%
DeepSeek-V3.1-Terminus	🚀 Throughput	2	7554ms	$0.727	100%
DeepSeek-V3.1-Terminus	💻 Code Gen	2	3626ms	$0.105	100%
DeepSeek-V3.1-Terminus	{ } JSON Output	2	1113ms	$0.046	100%
DeepSeek-V3.1-Terminus	🧮 Reasoning	2	1335ms	$0.062	100%
DeepSeek-V3.1-Terminus	⚡ Ping	2	975ms	<$0.01	100%
MiMo-V2.5	🔧 Tool Use	2	2182ms	$0.278	100%
MiMo-V2.5	🔍 Context Recall	2	1907ms	$0.180	100%
MiMo-V2.5	🚀 Throughput	2	15262ms	$1.53	100%
MiMo-V2.5	💻 Code Gen	2	2881ms	$0.379	100%
MiMo-V2.5	{ } JSON Output	2	1248ms	$0.114	100%
MiMo-V2.5	🧮 Reasoning	2	2443ms	$0.266	100%
MiMo-V2.5	⚡ Ping	2	944ms	$0.048	100%
Qwen3.5-397B-A17B	🔧 Tool Use	2	3833ms	$0.422	100%
Qwen3.5-397B-A17B	🔍 Context Recall	2	14887ms	$1.50	100%
Qwen3.5-397B-A17B	🚀 Throughput	2	—	—	0%
Qwen3.5-397B-A17B	💻 Code Gen	2	—	—	0%
Qwen3.5-397B-A17B	{ } JSON Output	2	20246ms	$2.10	100%
Qwen3.5-397B-A17B	🧮 Reasoning	2	51795ms	$4.76	100%
Qwen3.5-397B-A17B	⚡ Ping	2	7718ms	$0.722	100%
Qwen3-Max-Thinking	🔧 Tool Use	2	2943ms	$0.552	100%
Qwen3-Max-Thinking	🔍 Context Recall	2	2923ms	$0.449	100%
Qwen3-Max-Thinking	🚀 Throughput	2	16870ms	$4.58	100%
Qwen3-Max-Thinking	💻 Code Gen	2	5354ms	$1.14	100%
Qwen3-Max-Thinking	{ } JSON Output	2	2987ms	$0.246	100%
Qwen3-Max-Thinking	🧮 Reasoning	2	5012ms	$0.328	100%
Qwen3-Max-Thinking	⚡ Ping	2	2313ms	$0.024	100%
Qwen3-14B	🔧 Tool Use	2	3491ms	$0.066	100%
Qwen3-14B	🔍 Context Recall	2	4986ms	$0.096	100%
Qwen3-14B	🚀 Throughput	2	11921ms	$0.186	100%
Qwen3-14B	💻 Code Gen	2	—	—	0%
Qwen3-14B	{ } JSON Output	2	—	—	0%
Qwen3-14B	🧮 Reasoning	2	6553ms	$0.089	100%
Qwen3-14B	⚡ Ping	2	2779ms	$0.031	100%
MythoMax-L2-13b	🔧 Tool Use	2	—	—	0%
MythoMax-L2-13b	🔍 Context Recall	2	1495ms	$0.125	100%
MythoMax-L2-13b	🚀 Throughput	2	14182ms	$0.314	100%
MythoMax-L2-13b	💻 Code Gen	2	—	—	0%
MythoMax-L2-13b	{ } JSON Output	2	2074ms	$0.042	100%
MythoMax-L2-13b	🧮 Reasoning	2	—	—	0%
MythoMax-L2-13b	⚡ Ping	2	1009ms	$0.013	100%
NVIDIA-Nemotron-3-Ultra-550B-A55B	🔧 Tool Use	2	559ms	$0.259	100%
NVIDIA-Nemotron-3-Ultra-550B-A55B	🔍 Context Recall	2	545ms	$0.166	100%
NVIDIA-Nemotron-3-Ultra-550B-A55B	🚀 Throughput	2	3036ms	$1.84	100%
NVIDIA-Nemotron-3-Ultra-550B-A55B	💻 Code Gen	2	853ms	$0.409	100%
NVIDIA-Nemotron-3-Ultra-550B-A55B	{ } JSON Output	2	589ms	$0.129	100%
NVIDIA-Nemotron-3-Ultra-550B-A55B	🧮 Reasoning	2	551ms	$0.104	100%
NVIDIA-Nemotron-3-Ultra-550B-A55B	⚡ Ping	2	926ms	$0.019	100%
claude-opus-4-5-20251101	🔧 Tool Use	2	1973ms	$4.34	100%
claude-opus-4-5-20251101	🔍 Context Recall	2	2399ms	$2.66	100%
claude-opus-4-5-20251101	🚀 Throughput	2	14071ms	$19.09	100%
claude-opus-4-5-20251101	💻 Code Gen	2	7028ms	$6.05	100%
claude-opus-4-5-20251101	{ } JSON Output	2	1705ms	$1.09	100%

deepseek-v4-pro

🧠 Multi-Step Logic

5

164ms

—

33%

gpt-oss-120b

🚀 Throughput

4

1819ms

$0.132

100%

gpt-oss-120b

🧠 Multi-Step Logic

4

918ms

$0.056

100%

deepseek-v4-pro

🚀 Throughput

4

154ms

—

50%

deepseek-v4-pro

📝 Summarization

4

233ms

—

50%

deepseek-v4-pro

📄 Data Extraction

4

166ms

—

50%

deepseek-v4-pro

📋 Instruction Follow

4

148ms

—

100%

deepseek-v4-pro

✅ Format Compliance

4

194ms

—

50%

deepseek-v4-pro

💻 Code Gen

4

173ms

—

100%

deepseek-v4-pro

{ } JSON Output

4

186ms

—

100%

deepseek-v4-pro

⚡ Ping

4

164ms

—

100%

gpt-oss-120b

🔧 Tool Use

3

415ms

$0.014

100%

gpt-oss-120b

🔍 Context Recall

3

420ms

$0.025

100%

gpt-oss-120b

💻 Code Gen

3

754ms

$0.047

100%

gpt-oss-120b

{ } JSON Output

3

543ms

$0.030

100%

gpt-oss-120b

🧮 Reasoning

3

429ms

$0.020

100%

gpt-oss-120b

⚡ Ping

3

419ms

$0.012

100%

gemma-4-26b-a4b-it:free

🔍 Context Recall

3

1481ms

free

100%

gemma-4-26b-a4b-it:free

🚀 Throughput

3

1264ms

free

100%

gemma-4-26b-a4b-it:free

💻 Code Gen

3

1871ms

free

100%

gemma-4-26b-a4b-it:free

{ } JSON Output

3

1821ms

free

100%

gemma-4-26b-a4b-it:free

🧮 Reasoning

3

1232ms

free

100%

gemma-4-26b-a4b-it:free

⚡ Ping

3

898ms

free

100%

GLM-5.2

🔧 Tool Use

3

19564ms

$0.310

100%

GLM-5.2

🔍 Context Recall

3

12575ms

$1.30

100%

GLM-5.2

🚀 Throughput

3

30756ms

$2.31

100%

GLM-5.2

💻 Code Gen

3

10442ms

$1.08

100%

GLM-5.2

{ } JSON Output

3

4982ms

$0.838

100%

GLM-5.2

🧮 Reasoning

3

9143ms

$1.41

100%

GLM-5.2

⚡ Ping

3

6048ms

$0.595

100%

gpt-oss-20b

🔧 Tool Use

3

1330ms

$0.012

100%

gpt-oss-20b

🔍 Context Recall

3

2551ms

$0.036

100%

gpt-oss-20b

🚀 Throughput

3

7011ms

$0.108

100%

gpt-oss-20b

💻 Code Gen

3

3284ms

$0.050

100%

gpt-oss-20b

{ } JSON Output

3

1874ms

$0.023

100%

gpt-oss-20b

🧮 Reasoning

3

3033ms

$0.035

100%

gpt-oss-20b

⚡ Ping

3

1350ms

$0.011

100%

gpt-oss-120b

✅ Format Compliance

3

1727ms

$0.107

100%

gpt-oss-120b

📋 Instruction Follow

3

1505ms

$0.107

100%

gpt-oss-120b

📄 Data Extraction

3

1072ms

$0.066

100%

gpt-oss-120b

📝 Summarization

3

1077ms

$0.068

100%

deepseek-v4-pro

🪡 Long-Context Needle

3

298ms

—

100%

deepseek-v4-pro

🏷️ Classification

3

154ms

—

100%

deepseek-v4-pro

🔧 Tool Use

3

156ms

—

100%

deepseek-v4-pro

🔍 Context Recall

3

181ms

—

100%

deepseek-v4-pro

code_algo

3

169ms

—

100%

deepseek-v4-pro

code_refactor_preserve

3

155ms

—

100%

deepseek-v4-pro

code_to_spec

3

165ms

—

100%

deepseek-v4-pro

code_fix_bug

3

153ms

—

100%

deepseek-v4-pro

🧮 Reasoning

3

164ms

—

100%

glm-4.6

💻 Code Gen

3

—

0%

glm-4.6

{ } JSON Output

3

3379ms

—

50%

glm-4.6

⚡ Ping

3

1252ms

—

100%

palmyra-x5

🔍 Context Recall

2

632ms

$0.476

100%

palmyra-x5

🚀 Throughput

2

632ms

$3.18

100%

palmyra-x5

💻 Code Gen

2

672ms

$0.876

100%

palmyra-x5

{ } JSON Output

2

599ms

$0.437

100%

palmyra-x5

🧮 Reasoning

2

616ms

$0.607

100%

palmyra-x5

⚡ Ping

2

588ms

$0.170

100%

hy3:free

🔍 Context Recall

2

1833ms

free

100%

hy3:free

🚀 Throughput

2

1741ms

free

100%

hy3:free

💻 Code Gen

2

1710ms

free

100%

hy3:free

{ } JSON Output

2

1648ms

free

100%

hy3:free

🧮 Reasoning

2

1821ms

free

100%

hy3:free

⚡ Ping

2

1748ms

free

100%

l3.1-euryale-70b

🔍 Context Recall

2

608ms

$0.201

100%

l3.1-euryale-70b

🚀 Throughput

2

819ms

$0.459

100%

l3.1-euryale-70b

💻 Code Gen

2

—

0%

l3.1-euryale-70b

{ } JSON Output

2

350ms

$0.076

100%

l3.1-euryale-70b

🧮 Reasoning

2

422ms

$0.048

100%

l3.1-euryale-70b

⚡ Ping

2

755ms

$0.016

100%

qwen3.7-plus

🔍 Context Recall

2

1102ms

$0.844

100%

qwen3.7-plus

🚀 Throughput

2

735ms

$2.90

100%

qwen3.7-plus

💻 Code Gen

2

713ms

$2.14

100%

qwen3.7-plus

{ } JSON Output

2

1024ms

$0.918

100%

qwen3.7-plus

🧮 Reasoning

2

1282ms

$2.28

100%

qwen3.7-plus

⚡ Ping

2

910ms

$0.311

100%

qwen3.5-plus-20260420

🔍 Context Recall

2

659ms

$0.847

100%

qwen3.5-plus-20260420

🚀 Throughput

2

670ms

$4.60

100%

qwen3.5-plus-20260420

💻 Code Gen

2

712ms

$2.23

100%

qwen3.5-plus-20260420

{ } JSON Output

2

1101ms

$2.03

100%

qwen3.5-plus-20260420

🧮 Reasoning

2

735ms

$1.61

100%

qwen3.5-plus-20260420

⚡ Ping

2

648ms

$0.297

100%

qwen3.5-122b-a10b

🔍 Context Recall

2

382ms

free

100%

qwen3.5-122b-a10b

🚀 Throughput

2

1331ms

free

100%

qwen3.5-122b-a10b

💻 Code Gen

2

767ms

free

100%

qwen3.5-122b-a10b

{ } JSON Output

2

315ms

free

100%

qwen3.5-122b-a10b

🧮 Reasoning

2

387ms

free

100%

qwen3.5-122b-a10b

⚡ Ping

2

1035ms

free

100%

qwen3-vl-235b-a22b-instruct

🔍 Context Recall

2

683ms

$0.109

100%

qwen3-vl-235b-a22b-instruct

🚀 Throughput

2

359ms

$0.958

100%

qwen3-vl-235b-a22b-instruct

💻 Code Gen

2

761ms

$0.210

100%

qwen3-vl-235b-a22b-instruct

{ } JSON Output

2

297ms

$0.072

100%

qwen3-vl-235b-a22b-instruct

🧮 Reasoning

2

343ms

$0.110

100%

qwen3-vl-235b-a22b-instruct

⚡ Ping

2

330ms

<$0.01

100%

qwen3-coder-plus

🔍 Context Recall

2

1128ms

$0.256

100%

qwen3-coder-plus

🚀 Throughput

2

718ms

$1.65

100%

qwen3-coder-plus

💻 Code Gen

2

693ms

$0.355

100%

qwen3-coder-plus

{ } JSON Output

2

729ms

$0.153

100%

qwen3-coder-plus

🧮 Reasoning

2

588ms

$0.177

100%

qwen3-coder-plus

⚡ Ping

2

1264ms

$0.013

100%

qwen3-30b-a3b-thinking-2507

🔍 Context Recall

2

319ms

$0.621

100%

qwen3-30b-a3b-thinking-2507

🚀 Throughput

2

1232ms

$2.21

100%

qwen3-30b-a3b-thinking-2507

💻 Code Gen

2

249ms

$4.23

100%

qwen3-30b-a3b-thinking-2507

{ } JSON Output

2

343ms

$0.587

100%

qwen3-30b-a3b-thinking-2507

🧮 Reasoning

2

266ms

$0.444

100%

qwen3-30b-a3b-thinking-2507

⚡ Ping

2

268ms

$0.161

100%

qwen2.5-vl-72b-instruct

🔍 Context Recall

2

617ms

$0.201

100%

qwen2.5-vl-72b-instruct

🚀 Throughput

2

445ms

$0.539

100%

qwen2.5-vl-72b-instruct

💻 Code Gen

2

730ms

$0.113

100%

qwen2.5-vl-72b-instruct

{ } JSON Output

2

1617ms

$0.078

100%

qwen2.5-vl-72b-instruct

🧮 Reasoning

2

446ms

$0.077

100%

qwen2.5-vl-72b-instruct

⚡ Ping

2

2953ms

$0.023

100%

laguna-xs-2.1:free

🔍 Context Recall

2

405ms

free

100%

laguna-xs-2.1:free

🚀 Throughput

2

453ms

free

100%

laguna-xs-2.1:free

💻 Code Gen

2

465ms

free

100%

laguna-xs-2.1:free

{ } JSON Output

2

539ms

free

100%

laguna-xs-2.1:free

🧮 Reasoning

2

416ms

free

100%

laguna-xs-2.1:free

⚡ Ping

2

375ms

free

100%

sonar-deep-research

🔍 Context Recall

2

—

0%

sonar-deep-research

🚀 Throughput

2

10128ms

—

100%

sonar-deep-research

💻 Code Gen

2

—

0%

sonar-deep-research

{ } JSON Output

2

—

0%

sonar-deep-research

🧮 Reasoning

2

—

0%

sonar-deep-research

⚡ Ping

2

10137ms

—

100%

gpt-audio-mini

🔍 Context Recall

2

—

0%

gpt-audio-mini

🚀 Throughput

2

—

0%

gpt-audio-mini

💻 Code Gen

2

—

0%

gpt-audio-mini

{ } JSON Output

2

—

0%

gpt-audio-mini

🧮 Reasoning

2

—

0%

gpt-audio-mini

⚡ Ping

2

—

0%

nemotron-3-super-120b-a12b:free

🔍 Context Recall

2

534ms

free

100%

nemotron-3-super-120b-a12b:free

🚀 Throughput

2

406ms

free

100%

nemotron-3-super-120b-a12b:free

💻 Code Gen

2

458ms

free

100%

nemotron-3-super-120b-a12b:free

{ } JSON Output

2

389ms

free

100%

nemotron-3-super-120b-a12b:free

🧮 Reasoning

2

—

0%

nemotron-3-super-120b-a12b:free

⚡ Ping

2

345ms

free

100%

deepseek-r1

🔍 Context Recall

2

1218ms

$1.48

100%

deepseek-r1

🚀 Throughput

2

1324ms

$1.27

100%

deepseek-r1

💻 Code Gen

2

—

0%

deepseek-r1

{ } JSON Output

2

748ms

$0.462

100%

deepseek-r1

🧮 Reasoning

2

1170ms

$2.23

100%

deepseek-r1

⚡ Ping

2

755ms

$0.382

100%

command-r-plus-08-2024

🔍 Context Recall

2

855ms

$0.815

100%

command-r-plus-08-2024

🚀 Throughput

2

884ms

$5.07

100%

command-r-plus-08-2024

💻 Code Gen

2

—

0%

command-r-plus-08-2024

{ } JSON Output

2

801ms

$0.497

100%

command-r-plus-08-2024

🧮 Reasoning

2

805ms

$0.532

100%

command-r-plus-08-2024

⚡ Ping

2

874ms

$0.037

100%

seed-2.0-lite

🔍 Context Recall

2

911ms

$0.398

100%

seed-2.0-lite

🚀 Throughput

2

403ms

$6.65

100%

seed-2.0-lite

💻 Code Gen

2

1303ms

$1.77

100%

seed-2.0-lite

{ } JSON Output

2

407ms

$0.291

100%

seed-2.0-lite

🧮 Reasoning

2

479ms

$1.18

100%

seed-2.0-lite

⚡ Ping

2

588ms

$0.208

100%

nova-premier-v1

🔍 Context Recall

2

777ms

$1.07

100%

nova-premier-v1

🚀 Throughput

2

766ms

$6.41

100%

nova-premier-v1

💻 Code Gen

2

699ms

$1.91

100%

nova-premier-v1

{ } JSON Output

2

1424ms

$0.620

100%

nova-premier-v1

🧮 Reasoning

2

1506ms

$0.610

100%

nova-premier-v1

⚡ Ping

2

1617ms

$0.128

100%

aion-3.0

🔍 Context Recall

2

2509ms

$2.11

100%

aion-3.0

🚀 Throughput

2

1093ms

$4.22

100%

aion-3.0

💻 Code Gen

2

1083ms

$2.35

100%

aion-3.0

{ } JSON Output

2

762ms

$1.01

100%

aion-3.0

🧮 Reasoning

2

771ms

$1.41

100%

aion-3.0

⚡ Ping

2

913ms

$0.198

100%

gpt-4o-2024-05-13

🔧 Tool Use

2

705ms

$0.338

100%

gpt-4o-2024-05-13

🔍 Context Recall

2

662ms

$0.755

100%

gpt-4o-2024-05-13

🚀 Throughput

2

4812ms

$7.65

100%

gpt-4o-2024-05-13

💻 Code Gen

2

1052ms

$1.64

100%

gpt-4o-2024-05-13

{ } JSON Output

2

725ms

$0.492

100%

gpt-4o-2024-05-13

🧮 Reasoning

2

1296ms

$1.15

100%

gpt-4o-2024-05-13

⚡ Ping

2

735ms

$0.045

100%

gpt-4.1

🔧 Tool Use

2

712ms

$0.270

100%

gpt-4.1

🔍 Context Recall

2

1060ms

$0.604

100%

gpt-4.1

🚀 Throughput

2

7182ms

$6.12

100%

gpt-4.1

💻 Code Gen

2

1619ms

$0.960

100%

gpt-4.1

{ } JSON Output

2

743ms

$0.290

100%

gpt-4.1

🧮 Reasoning

2

911ms

$0.292

100%

gpt-4.1

⚡ Ping

2

936ms

$0.036

100%

gpt-3.5-turbo-0125

🔧 Tool Use

2

1398ms

—

100%

gpt-3.5-turbo-0125

🔍 Context Recall

2

1815ms

—

100%

gpt-3.5-turbo-0125

🚀 Throughput

2

4247ms

—

100%

gpt-3.5-turbo-0125

💻 Code Gen

2

1823ms

—

100%

gpt-3.5-turbo-0125

{ } JSON Output

2

1787ms

—

100%

gpt-3.5-turbo-0125

🧮 Reasoning

2

6264ms

—

100%

gpt-3.5-turbo-0125

⚡ Ping

2

1460ms

—

100%

gpt-oss-20b

🔧 Tool Use

2

282ms

$0.011

100%

gpt-oss-20b

🔍 Context Recall

2

305ms

$0.018

100%

gpt-oss-20b

🚀 Throughput

2

1059ms

$0.109

100%

gpt-oss-20b

💻 Code Gen

2

439ms

$0.034

100%

gpt-oss-20b

{ } JSON Output

2

376ms

$0.024

100%

gpt-oss-20b

🧮 Reasoning

2

381ms

$0.024

100%

gpt-oss-20b

⚡ Ping

2

560ms

<$0.01

100%

Kimi-K2.6

🔧 Tool Use

2

867ms

$0.324

100%

Kimi-K2.6

🔍 Context Recall

2

1232ms

$0.780

100%

Kimi-K2.6

🚀 Throughput

2

5913ms

$2.67

100%

Kimi-K2.6

💻 Code Gen

2

4414ms

$3.04

100%

Kimi-K2.6

{ } JSON Output

2

1351ms

$0.845

100%

Kimi-K2.6

🧮 Reasoning

2

7744ms

$5.01

100%

Kimi-K2.6

⚡ Ping

2

1203ms

$0.323

100%

reka-flash-3

⚡ Ping

2

10119ms

$0.056

100%

qwen3.6-35b-a3b

🔍 Context Recall

2

1123ms

$0.741

100%

qwen3.6-35b-a3b

🚀 Throughput

2

750ms

$0.506

100%

qwen3.6-35b-a3b

💻 Code Gen

2

604ms

$1.54

100%

qwen3.6-35b-a3b

{ } JSON Output

2

1034ms

$0.852

100%

qwen3.6-35b-a3b

🧮 Reasoning

2

310ms

$1.40

100%

qwen3.6-35b-a3b

⚡ Ping

2

532ms

$0.205

100%

qwen3.5-35b-a3b

🔍 Context Recall

2

806ms

$0.808

100%

qwen3.5-35b-a3b

🚀 Throughput

2

471ms

$0.506

100%

qwen3.5-35b-a3b

💻 Code Gen

2

602ms

$0.314

100%

qwen3.5-35b-a3b

{ } JSON Output

2

769ms

$0.717

100%

qwen3.5-35b-a3b

🧮 Reasoning

2

1501ms

$2.03

100%

qwen3.5-35b-a3b

⚡ Ping

2

1153ms

$0.144

100%

qwen3-vl-30b-a3b-instruct

🔍 Context Recall

2

710ms

$0.045

100%

qwen3-vl-30b-a3b-instruct

🚀 Throughput

2

417ms

$0.265

100%

qwen3-vl-30b-a3b-instruct

💻 Code Gen

2

318ms

$0.062

100%

qwen3-vl-30b-a3b-instruct

{ } JSON Output

2

1783ms

$0.023

100%

qwen3-vl-30b-a3b-instruct

🧮 Reasoning

2

355ms

$0.026

100%

qwen3-vl-30b-a3b-instruct

⚡ Ping

2

387ms

<$0.01

100%

qwen3-max

🔍 Context Recall

2

1352ms

$0.206

100%

qwen3-max

🚀 Throughput

2

1170ms

$1.98

100%

qwen3-max

💻 Code Gen

2

1278ms

$0.426

100%

qwen3-max

{ } JSON Output

2

1187ms

$0.160

100%

qwen3-max

🧮 Reasoning

2

1599ms

$0.338

100%

qwen3-max

⚡ Ping

2

1982ms

$0.016

100%

qwen3-8b

🔍 Context Recall

2

527ms

$0.166

100%

qwen3-8b

🚀 Throughput

2

725ms

$0.661

100%

qwen3-8b

💻 Code Gen

2

596ms

$0.511

100%

qwen3-8b

{ } JSON Output

2

697ms

$0.142

100%

qwen3-8b

🧮 Reasoning

2

571ms

$0.368

100%

qwen3-8b

⚡ Ping

2

719ms

$0.058

100%

qwen3-235b-a22b

🔍 Context Recall

2

1049ms

$0.584

100%

qwen3-235b-a22b

🚀 Throughput

2

373ms

$5.04

100%

qwen3-235b-a22b

💻 Code Gen

2

364ms

$4.19

100%

qwen3-235b-a22b

{ } JSON Output

2

470ms

$0.470

100%

qwen3-235b-a22b

🧮 Reasoning

2

419ms

$0.710

100%

qwen3-235b-a22b

⚡ Ping

2

426ms

$0.216

100%

qwen-2.5-7b-instruct

🔍 Context Recall

2

639ms

$0.012

100%

qwen-2.5-7b-instruct

🚀 Throughput

2

955ms

$0.052

100%

qwen-2.5-7b-instruct

💻 Code Gen

2

641ms

$0.013

100%

qwen-2.5-7b-instruct

{ } JSON Output

2

521ms

<$0.01

100%

qwen-2.5-7b-instruct

🧮 Reasoning

2

633ms

<$0.01

100%

qwen-2.5-7b-instruct

⚡ Ping

2

632ms

<$0.01

100%

sonar-pro-search

🔍 Context Recall

2

1120ms

$1.12

100%

sonar-pro-search

🚀 Throughput

2

1073ms

$7.49

100%

sonar-pro-search

💻 Code Gen

2

1014ms

$1.91

100%

sonar-pro-search

{ } JSON Output

2

932ms

$0.510

100%

sonar-pro-search

🧮 Reasoning

2

932ms

$0.789

100%

sonar-pro-search

⚡ Ping

2

1087ms

$0.036

100%

gpt-oss-20b

🔍 Context Recall

2

1306ms

$0.067

100%

gpt-oss-20b

🚀 Throughput

2

1306ms

$0.477

100%

gpt-oss-20b

💻 Code Gen

2

464ms

$0.042

100%

gpt-oss-20b

{ } JSON Output

2

1388ms

$0.026

100%

gpt-oss-20b

🧮 Reasoning

2

962ms

$0.035

100%

gpt-oss-20b

⚡ Ping

2

9844ms

$0.014

100%

nemotron-3-ultra-550b-a55b:free

🔍 Context Recall

2

514ms

free

100%

nemotron-3-ultra-550b-a55b:free

🚀 Throughput

2

508ms

free

100%

nemotron-3-ultra-550b-a55b:free

💻 Code Gen

2

433ms

free

100%

nemotron-3-ultra-550b-a55b:free

{ } JSON Output

2

442ms

free

100%

nemotron-3-ultra-550b-a55b:free

🧮 Reasoning

2

470ms

free

100%

nemotron-3-ultra-550b-a55b:free

⚡ Ping

2

440ms

free

100%

llama-3.3-nemotron-super-49b-v1.5

🔍 Context Recall

2

415ms

free

100%

llama-3.3-nemotron-super-49b-v1.5

🚀 Throughput

2

408ms

free

100%

llama-3.3-nemotron-super-49b-v1.5

💻 Code Gen

2

364ms

free

100%

llama-3.3-nemotron-super-49b-v1.5

{ } JSON Output

2

367ms

free

100%

llama-3.3-nemotron-super-49b-v1.5

🧮 Reasoning

2

379ms

free

100%

llama-3.3-nemotron-super-49b-v1.5

⚡ Ping

2

1523ms

free

100%

nex-n2-mini

🔍 Context Recall

2

774ms

$0.011

100%

nex-n2-mini

🚀 Throughput

2

598ms

$0.051

100%

nex-n2-mini

💻 Code Gen

2

994ms

$0.031

100%

nex-n2-mini

{ } JSON Output

2

974ms

<$0.01

100%

nex-n2-mini

🧮 Reasoning

2

1169ms

<$0.01

100%

nex-n2-mini

⚡ Ping

2

972ms

<$0.01

100%

kimi-k2

🔍 Context Recall

2

1587ms

$0.213

100%

kimi-k2

🚀 Throughput

2

2169ms

$1.17

100%

kimi-k2

💻 Code Gen

2

4310ms

$0.254

100%

kimi-k2

{ } JSON Output

2

2821ms

$0.087

100%

kimi-k2

🧮 Reasoning

2

2468ms

$0.087

100%

kimi-k2

⚡ Ping

2

3326ms

$0.014

100%

mistral-saba

🔍 Context Recall

2

596ms

$0.052

100%

mistral-saba

🚀 Throughput

2

832ms

$0.309

100%

mistral-saba

💻 Code Gen

2

576ms

$0.079

100%

mistral-saba

{ } JSON Output

2

566ms

$0.033

100%

mistral-saba

🧮 Reasoning

2

576ms

$0.055

100%

mistral-saba

⚡ Ping

2

511ms

<$0.01

100%

mistral-large

🔍 Context Recall

2

726ms

$0.644

100%

mistral-large

🚀 Throughput

2

613ms

$3.07

100%

mistral-large

💻 Code Gen

2

600ms

$0.744

100%

mistral-large

{ } JSON Output

2

603ms

$0.360

100%

mistral-large

🧮 Reasoning

2

595ms

$0.338

100%

mistral-large

⚡ Ping

2

590ms

$0.038

100%

minimax-m2.7

🔍 Context Recall

2

659ms

$0.355

100%

minimax-m2.7

🚀 Throughput

2

550ms

$0.517

100%

minimax-m2.7

💻 Code Gen

2

—

0%

minimax-m2.7

{ } JSON Output

2

1691ms

$0.337

100%

minimax-m2.7

🧮 Reasoning

2

990ms

$0.515

100%

minimax-m2.7

⚡ Ping

2

2323ms

$0.073

100%

wizardlm-2-8x22b

🔍 Context Recall

2

1120ms

$0.148

100%

wizardlm-2-8x22b

🚀 Throughput

2

2337ms

$0.350

100%

wizardlm-2-8x22b

💻 Code Gen

2

—

0%

wizardlm-2-8x22b

{ } JSON Output

2

1397ms

$0.077

100%

wizardlm-2-8x22b

🧮 Reasoning

2

1061ms

$0.099

100%

wizardlm-2-8x22b

⚡ Ping

2

1567ms

$0.027

100%

llama-3.2-3b-instruct:free

🔍 Context Recall

2

—

0%

llama-3.2-3b-instruct:free

🚀 Throughput

2

—

0%

llama-3.2-3b-instruct:free

💻 Code Gen

2

—

0%

llama-3.2-3b-instruct:free

{ } JSON Output

2

—

0%

llama-3.2-3b-instruct:free

🧮 Reasoning

2

—

0%

llama-3.2-3b-instruct:free

⚡ Ping

2

—

0%

kat-coder-pro-v2.5

🔍 Context Recall

2

778ms

$0.570

100%

kat-coder-pro-v2.5

🚀 Throughput

2

525ms

$1.51

100%

kat-coder-pro-v2.5

💻 Code Gen

2

664ms

$1.08

100%

kat-coder-pro-v2.5

{ } JSON Output

2

615ms

$0.703

100%

kat-coder-pro-v2.5

🧮 Reasoning

2

515ms

$0.366

100%

kat-coder-pro-v2.5

⚡ Ping

2

527ms

$0.187

100%

ling-2.6-1t

🔍 Context Recall

2

1616ms

$0.027

100%

ling-2.6-1t

🚀 Throughput

2

1216ms

$0.316

100%

ling-2.6-1t

💻 Code Gen

2

1371ms

$0.079

100%

ling-2.6-1t

{ } JSON Output

2

1401ms

$0.031

100%

ling-2.6-1t

🧮 Reasoning

2

1196ms

$0.034

100%

ling-2.6-1t

⚡ Ping

2

1768ms

<$0.01

100%

deepseek-v3.2

🔍 Context Recall

2

1331ms

$0.057

100%

deepseek-v3.2

🚀 Throughput

2

1005ms

$0.211

100%

deepseek-v3.2

💻 Code Gen

2

1915ms

$0.079

100%

deepseek-v3.2

{ } JSON Output

2

616ms

$0.027

100%

deepseek-v3.2

🧮 Reasoning

2

871ms

$0.032

100%

deepseek-v3.2

⚡ Ping

2

1295ms

<$0.01

100%

deepseek-chat

🔍 Context Recall

2

1059ms

$0.071

100%

deepseek-chat

🚀 Throughput

2

472ms

$0.407

100%

deepseek-chat

💻 Code Gen

2

1231ms

$0.098

100%

deepseek-chat

{ } JSON Output

2

717ms

$0.043

100%

deepseek-chat

🧮 Reasoning

2

878ms

$0.060

100%

deepseek-chat

⚡ Ping

2

893ms

<$0.01

100%

dolphin-mistral-24b-venice-edition:free

🔍 Context Recall

2

—

0%

dolphin-mistral-24b-venice-edition:free

🚀 Throughput

2

—

0%

dolphin-mistral-24b-venice-edition:free

💻 Code Gen

2

—

0%

dolphin-mistral-24b-venice-edition:free

{ } JSON Output

2

—

0%

dolphin-mistral-24b-venice-edition:free

🧮 Reasoning

2

—

0%

dolphin-mistral-24b-venice-edition:free

⚡ Ping

2

—

0%

ernie-4.5-vl-424b-a47b

🔍 Context Recall

2

1600ms

$0.104

100%

ernie-4.5-vl-424b-a47b

🚀 Throughput

2

1651ms

$0.605

100%

ernie-4.5-vl-424b-a47b

💻 Code Gen

2

1486ms

$0.178

100%

ernie-4.5-vl-424b-a47b

{ } JSON Output

2

1545ms

$0.056

100%

ernie-4.5-vl-424b-a47b

🧮 Reasoning

2

1404ms

$0.087

100%

ernie-4.5-vl-424b-a47b

⚡ Ping

2

1578ms

<$0.01

100%

nova-2-lite-v1

🔍 Context Recall

2

664ms

$0.191

100%

nova-2-lite-v1

🚀 Throughput

2

837ms

$1.27

100%

nova-2-lite-v1

💻 Code Gen

2

676ms

$0.338

100%

nova-2-lite-v1

{ } JSON Output

2

672ms

$0.131

100%

nova-2-lite-v1

🧮 Reasoning

2

936ms

$0.151

100%

nova-2-lite-v1

⚡ Ping

2

1004ms

$0.026

100%

gpt-4o-mini-2024-07-18

🔧 Tool Use

2

955ms

$0.020

100%

gpt-4o-mini-2024-07-18

🔍 Context Recall

2

1244ms

$0.048

100%

gpt-4o-mini-2024-07-18

🚀 Throughput

2

12311ms

$0.459

100%

gpt-4o-mini-2024-07-18

💻 Code Gen

2

2163ms

$0.070

100%

gpt-4o-mini-2024-07-18

{ } JSON Output

2

1224ms

$0.029

100%

gpt-4o-mini-2024-07-18

🧮 Reasoning

2

1493ms

$0.030

100%

gpt-4o-mini-2024-07-18

⚡ Ping

2

972ms

<$0.01

100%

gpt-4.1-nano

🔧 Tool Use

2

746ms

$0.013

100%

gpt-4.1-nano

🔍 Context Recall

2

644ms

$0.024

100%

gpt-4.1-nano

🚀 Throughput

2

3748ms

$0.306

100%

gpt-4.1-nano

💻 Code Gen

2

966ms

$0.046

100%

gpt-4.1-nano

{ } JSON Output

2

623ms

$0.018

100%

gpt-4.1-nano

🧮 Reasoning

2

1068ms

$0.017

100%

gpt-4.1-nano

⚡ Ping

2

501ms

<$0.01

100%

gpt-4-0613

🔧 Tool Use

2

1382ms

—

100%

gpt-4-0613

🔍 Context Recall

2

2993ms

—

100%

gpt-4-0613

🚀 Throughput

2

19244ms

—

100%

gpt-4-0613

💻 Code Gen

2

7795ms

—

100%

gpt-4-0613

{ } JSON Output

2

2041ms

—

100%

gpt-4-0613

🧮 Reasoning

2

2445ms

—

100%

gpt-4-0613

⚡ Ping

2

2225ms

—

100%

step-3.5-flash

🔍 Context Recall

2

2770ms

free

100%

step-3.5-flash

🚀 Throughput

2

7195ms

free

100%

step-3.5-flash

💻 Code Gen

2

2132ms

free

100%

step-3.5-flash

{ } JSON Output

2

4948ms

free

100%

step-3.5-flash

🧮 Reasoning

2

18751ms

free

100%

step-3.5-flash

⚡ Ping

2

3809ms

free

100%

nemotron-3-ultra-550b-a55b

🔍 Context Recall

2

1764ms

$0.707

100%

nemotron-3-ultra-550b-a55b

🚀 Throughput

2

15005ms

$2.75

100%

nemotron-3-ultra-550b-a55b

💻 Code Gen

2

1861ms

$0.710

100%

nemotron-3-ultra-550b-a55b

{ } JSON Output

2

3198ms

$0.794

100%

nemotron-3-ultra-550b-a55b

🧮 Reasoning

2

1231ms

$0.259

100%

nemotron-3-ultra-550b-a55b

⚡ Ping

2

15814ms

$0.233

100%

ising-calibration-1-35b-a3b

🔍 Context Recall

2

6667ms

free

100%

ising-calibration-1-35b-a3b

🚀 Throughput

2

—

0%

ising-calibration-1-35b-a3b

💻 Code Gen

2

8562ms

free

100%

ising-calibration-1-35b-a3b

{ } JSON Output

2

—

0%

ising-calibration-1-35b-a3b

🧮 Reasoning

2

13533ms

free

100%

ising-calibration-1-35b-a3b

⚡ Ping

2

3193ms

free

100%

llama-3.2-3b-instruct

🔍 Context Recall

2

907ms

free

100%

llama-3.2-3b-instruct

🚀 Throughput

2

—

0%

llama-3.2-3b-instruct

💻 Code Gen

2

2303ms

free

100%

llama-3.2-3b-instruct

{ } JSON Output

2

9174ms

free

100%

llama-3.2-3b-instruct

🧮 Reasoning

2

804ms

free

100%

llama-3.2-3b-instruct

⚡ Ping

2

3407ms

free

100%

diffusiongemma-26b-a4b-it

🔍 Context Recall

2

1199ms

free

100%

diffusiongemma-26b-a4b-it

🚀 Throughput

2

2469ms

free

100%

diffusiongemma-26b-a4b-it

💻 Code Gen

2

706ms

free

100%

diffusiongemma-26b-a4b-it

{ } JSON Output

2

882ms

free

100%

diffusiongemma-26b-a4b-it

🧮 Reasoning

2

601ms

free

100%

diffusiongemma-26b-a4b-it

⚡ Ping

2

833ms

free

100%

llama-3.3-70b-versatile

🔧 Tool Use

2

311ms

$0.161

100%

llama-3.3-70b-versatile

🔍 Context Recall

2

376ms

$0.166

100%

llama-3.3-70b-versatile

🚀 Throughput

2

2669ms

$0.636

100%

llama-3.3-70b-versatile

💻 Code Gen

2

714ms

$0.192

100%

llama-3.3-70b-versatile

{ } JSON Output

2

282ms

$0.074

100%

llama-3.3-70b-versatile

🧮 Reasoning

2

407ms

$0.061

100%

llama-3.3-70b-versatile

⚡ Ping

2

239ms

$0.026

100%

gemini-pro-latest

🔧 Tool Use

2

2954ms

—

100%

gemini-pro-latest

🔍 Context Recall

2

5028ms

—

100%

gemini-pro-latest

🚀 Throughput

2

—

0%

gemini-pro-latest

💻 Code Gen

2

6797ms

—

100%

gemini-pro-latest

{ } JSON Output

2

4865ms

—

100%

gemini-pro-latest

🧮 Reasoning

2

7731ms

—

100%

gemini-pro-latest

⚡ Ping

2

2814ms

—

100%

gemini-3.1-flash-lite-image

🔧 Tool Use

2

695ms

—

100%

gemini-3.1-flash-lite-image

🔍 Context Recall

2

880ms

—

100%

gemini-3.1-flash-lite-image

🚀 Throughput

2

2893ms

—

100%

gemini-3.1-flash-lite-image

💻 Code Gen

2

1009ms

—

100%

gemini-3.1-flash-lite-image

{ } JSON Output

2

783ms

—

100%

gemini-3.1-flash-lite-image

🧮 Reasoning

2

690ms

—

100%

gemini-3.1-flash-lite-image

⚡ Ping

2

780ms

—

100%

gemini-2.5-pro

🔧 Tool Use

2

2920ms

$0.269

100%

gemini-2.5-pro

🔍 Context Recall

2

5027ms

$0.701

100%

gemini-2.5-pro

🚀 Throughput

2

—

0%

gemini-2.5-pro

💻 Code Gen

2

13423ms

$2.51

100%

gemini-2.5-pro

{ } JSON Output

2

5189ms

$0.535

100%

gemini-2.5-pro

🧮 Reasoning

2

9979ms

$0.441

100%

gemini-2.5-pro

⚡ Ping

2

2990ms

$0.019

100%

Kimi-K2.6

🔧 Tool Use

2

2921ms

$0.472

100%

Kimi-K2.6

🔍 Context Recall

2

5224ms

$1.30

100%

Kimi-K2.6

🚀 Throughput

2

7202ms

$2.67

100%

Kimi-K2.6

💻 Code Gen

2

8982ms

$3.35

100%

Kimi-K2.6

{ } JSON Output

2

3864ms

$0.821

100%

Kimi-K2.6

🧮 Reasoning

2

10424ms

$4.06

100%

Kimi-K2.6

⚡ Ping

2

1485ms

$0.158

100%

Meta-Llama-3.1-70B-Instruct-Turbo

🔧 Tool Use

2

1344ms

$0.107

100%

Meta-Llama-3.1-70B-Instruct-Turbo

🔍 Context Recall

2

1968ms

$0.096

100%

Meta-Llama-3.1-70B-Instruct-Turbo

🚀 Throughput

2

—

0%

Meta-Llama-3.1-70B-Instruct-Turbo

💻 Code Gen

2

5457ms

$0.077

100%

Meta-Llama-3.1-70B-Instruct-Turbo

{ } JSON Output

2

1544ms

$0.036

100%

Meta-Llama-3.1-70B-Instruct-Turbo

🧮 Reasoning

2

1636ms

$0.026

100%

Meta-Llama-3.1-70B-Instruct-Turbo

⚡ Ping

2

1001ms

<$0.01

100%

DeepSeek-V3.1-Terminus

🔧 Tool Use

2

—

0%

DeepSeek-V3.1-Terminus

🔍 Context Recall

2

1080ms

$0.070

100%

DeepSeek-V3.1-Terminus

🚀 Throughput

2

7554ms

$0.727

100%

DeepSeek-V3.1-Terminus

💻 Code Gen

2

3626ms

$0.105

100%

DeepSeek-V3.1-Terminus

{ } JSON Output

2

1113ms

$0.046

100%

DeepSeek-V3.1-Terminus

🧮 Reasoning

2

1335ms

$0.062

100%

DeepSeek-V3.1-Terminus

⚡ Ping

2

975ms

<$0.01

100%

MiMo-V2.5

🔧 Tool Use

2

2182ms

$0.278

100%

MiMo-V2.5

🔍 Context Recall

2

1907ms

$0.180

100%

MiMo-V2.5

🚀 Throughput

2

15262ms

$1.53

100%

MiMo-V2.5

💻 Code Gen

2

2881ms

$0.379

100%

MiMo-V2.5

{ } JSON Output

2

1248ms

$0.114

100%

MiMo-V2.5

🧮 Reasoning

2

2443ms

$0.266

100%

MiMo-V2.5

⚡ Ping

2

944ms

$0.048

100%

Qwen3.5-397B-A17B

🔧 Tool Use

2

3833ms

$0.422

100%

Qwen3.5-397B-A17B

🔍 Context Recall

2

14887ms

$1.50

100%

Qwen3.5-397B-A17B

🚀 Throughput

2

—

0%

Qwen3.5-397B-A17B

💻 Code Gen

2

—

0%

Qwen3.5-397B-A17B

{ } JSON Output

2

20246ms

$2.10

100%

Qwen3.5-397B-A17B

🧮 Reasoning

2

51795ms

$4.76

100%

Qwen3.5-397B-A17B

⚡ Ping

2

7718ms

$0.722

100%

Qwen3-Max-Thinking

🔧 Tool Use

2

2943ms

$0.552

100%

Qwen3-Max-Thinking

🔍 Context Recall

2

2923ms

$0.449

100%

Qwen3-Max-Thinking

🚀 Throughput

2

16870ms

$4.58

100%

Qwen3-Max-Thinking

💻 Code Gen

2

5354ms

$1.14

100%

Qwen3-Max-Thinking

{ } JSON Output

2

2987ms

$0.246

100%

Qwen3-Max-Thinking

🧮 Reasoning

2

5012ms

$0.328

100%

Qwen3-Max-Thinking

⚡ Ping

2

2313ms

$0.024

100%

Qwen3-14B

🔧 Tool Use

2

3491ms

$0.066

100%

Qwen3-14B

🔍 Context Recall

2

4986ms

$0.096

100%

Qwen3-14B

🚀 Throughput

2

11921ms

$0.186

100%

Qwen3-14B

💻 Code Gen

2

—

0%

Qwen3-14B

{ } JSON Output

2

—

0%

Qwen3-14B

🧮 Reasoning

2

6553ms

$0.089

100%

Qwen3-14B

⚡ Ping

2

2779ms

$0.031

100%

MythoMax-L2-13b

🔧 Tool Use

2

—

0%

MythoMax-L2-13b

🔍 Context Recall

2

1495ms

$0.125

100%

MythoMax-L2-13b

🚀 Throughput

2

14182ms

$0.314

100%

MythoMax-L2-13b

💻 Code Gen

2

—

0%

MythoMax-L2-13b

{ } JSON Output

2

2074ms

$0.042

100%

MythoMax-L2-13b

🧮 Reasoning

2

—

0%

MythoMax-L2-13b

⚡ Ping

2

1009ms

$0.013

100%