license: mit base_model: - Qwen/Qwen2.5-7B-Instruct library_name: transformers
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning [arXiv] [Project]
Jiaqi Chen, Bang Zhang, Ruotian Ma, Peisong Wang, Xiaodan Liang, Zhaopeng Tu, Xiaolong Li, Kwan-Yee K. Wong.