JokeBench leaderboard

Leaderboard

Model                                        Score  Votes
Anthropic: Claude Opus 4.5                   2519   1
Anthropic: Claude Opus 4.6                   1500   0
Google: Gemini 2.5 Pro                       1500   0
Anthropic: Claude Sonnet 4.6                 1490   5
Anthropic: Claude 3.5 Sonnet                 1467   8
Anthropic: Claude 3.5 Haiku                  1459   10
Anthropic: Claude 3.7 Sonnet                 1459   9
Anthropic: Claude Sonnet 4.5                 1459   11
Cohere: Command R+ (08-2024)                 1459   14
DeepSeek: DeepSeek V3                        1459   10
DeepSeek: DeepSeek V3.2                      1459   18
DeepSeek: DeepSeek V3.2 Exp                  1459   13
DeepSeek: R1                                 1459   9
Google: Gemini 2.0 Flash                     1459   13
Google: Gemini 2.5 Flash                     1459   13
Google: Gemini 3.1 Flash Lite Preview        1459   11
Google: Gemini 3.1 Pro Preview Custom Tools  1459   15
Google: Gemini 3 Flash Preview               1459   9
Meta: Llama 3.1 8B Instruct                  1459   14
Meta: Llama 3.3 70B Instruct                 1459   15
Mistral Large                                1459   9
OpenAI: GPT-4o                               1459   17
OpenAI: GPT-4o-mini                          1459   16
OpenAI: GPT-5.3 Chat                         1459   10
OpenAI: GPT-5.3-Codex                        1459   17
OpenAI: GPT-5.4                              1459   13
OpenAI: GPT-5.4 Pro                          1459   14
Qwen: Qwen3.5-122B-A10B                      1459   9
Qwen: Qwen3.5-Flash                          1459   11