๐ŸŒŸ WritingBench: A Comprehensive Benchmark for Generative Writing

Evaluating LLMs' writing capabilities across 1,000 real-world queries, spanning 6 primary domains and 100 fine-grained subdomains.

[๐Ÿ“ƒ Paper] | [๐Ÿš€ GitHub]

๐Ÿ“… Last Updated: 2025-05-22 | ๐Ÿ“ˆ Scored by: Claude-3.7-Sonnet

10
llama-4-scout-17b-16e-instruct
Mistral AI
81.09
title={WritingBench: A Comprehensive Benchmark for Generative Writing}, 
author={Yuning Wu and Jiahao Mei and Ming Yan and Chenliang Li and Shaopeng Lai and Yuran Ren and Zijia Wang and Ji Zhang and Mengyue Wu and Qin Jin and Fei Huang},
year={2025},
url={https://arxiv.org/abs/2503.05244}, 
}```