CrowdBench
Reimagining Large Language Model Evaluation
Integrating Crowd Intelligence and Crowd-Based Metrics
[Figure: one panel per model (Claude AI, Copilot, DeepSeek, ChatGPT, Gemini, Perplexity, Grok), each labeled with "High" and "Low".]