Models

Model ID                                      Total Weight  Hosts / Inferences / AI Tokens (unseparated)
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8        1,852,302     3145641,088,226
Qwen/Qwen3-32B-FP8                            495,906       110257125,707
Qwen/QwQ-32B                                  62,831        201013,376
Qwen/Qwen2.5-7B-Instruct                      58,655        30425339,790
RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a16  0             0--