👋

xxx

I am Rui Li, a 3th year Master’s student at the MOE Key Lab of Computational Linguistics, Peking University, advised by Prof. Zhifang Sui. Prior to that, I received my B.S. in Computer Science and Technology, with a major in Artificial Intelligence, from Shandong University in 2023. I have gained practical experience as an intern at the StepFun AI. I have also worked as an intern at the PLUSLAB Group advised by Prof. Nanyun (Violet) Peng.

Please refer to my CV and Google Scholar for more details.

Research

I am broadly interested in Natural Language Processing. My research focuses on:

1. LLM alignment for harmless, honest, and helpful
2. AI-assisted research

🌈 I have a deep and enthusiastic interest in interdisciplinary studies. My educational background has provided me with a foundational understanding and initial professional competence in fields such as law and art.
🌈 I particularly enjoy the exchange and integration of knowledge across different disciplines and am keen to collaborate with individuals from diverse academic backgrounds in the future. For potential interdisciplinary collaborations, I am very willing to offer technical support and am open to discussing any topics related to my research or broader interests.

News

  • [2025.09] Got one papers accepted by NeurIPS 2025! 🎉
  • [2025.08] Got two papers accepted by EMNLP 2025! (1 Main, 1 Findings) 🎉
  • [2025.06] Got two first-authored papers accepted by ACL 2025! (1 Main, 1 Findings) 🎉
  • [2024.11] Contributed as a core contributor to a large-scale open-source data project SuperGPQA, which was organized by M-A-P (Multimodal Art Projection, a non-profit open-source AI research community).
  • [2024.10] My first first-authored paper was accepted by EMNLP 2024 findings, and two other papers I contributed to were also accepted by EMNLP main and EMNLP findings, respectively. 🎉
  • [2023.09] Started my Master study at the MOE Key Laboratory of Computational Linguistics, Peking University, advised by Prof. Zhifang Sui. 😊
  • [2023.05] Guided by Professor Qiong Zeng, I completed my graduation thesis which earned me the Outstanding Undergraduate Graduation Thesis Award and Outstanding Thesis Defense Award from Shandong University. This marked the beginning of my academic journey. 🎨

Publications

图片描述
  • How Far are LLMs from Being Our Digital Twins? A Benchmark for Persona-Based Behavior Chain Simulation

    Rui Li, Heming Xia, Xinfeng Yuan, Qingxiu Dong, Lei Sha, Wenjie Li, Zhifang Sui

    ACL 2025 (findings).

    (🤫 The total data volume of 1001 for this benchmark was inspired by One Thousand and One Nights, a beloved childhood fairy tale.)



图片描述
  • Towards Harmonized Uncertainty Estimation for Large Language Models

    Rui Li, Jing Long, Muge Qi, Heming Xia, Lei Sha, Peiyi Wang, Zhifang Sui

    ACL 2025. (Oral Presentation)



图片描述
  • Be a Multitude to Itself: A Prompt Evolution Framework for Red Teaming

    Rui Li, Peiyi Wang, Jingyuan Ma, Di Zhang, Lei Sha, Zhifang Sui

    EMNLP 2024 (findings).





图片描述
  • SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

    M-A-P (Multimodal Art Projection), Core Contributor

    NeurIPS 2025.



图片描述
  • Layer-Aware Representation Filtering: Purifying Finetuning Data to Preserve LLM Safety Alignment

    Hao Li, Lijun Li, Zhenghao Lu, Xianyi Wei, Rui Li, Jing Shao, Lei Sha

    EMNLP 2025.



图片描述
  • Beyond Single Frames: Can LMMs Comprehend Implicit Narratives in Comic Strip?

    Xiaochen Wang, Heming Xia, Jialin Song, Longyu Guan, Qingxiu Dong, Rui Li, Yixin Yang, Yifan Pu, Weiyao Luo, Yiru Wang, Xiangdi Meng, Wenjie Li, Zhifang Sui

    EMNLP 2025 (findings).

图片描述

  • A Survey on In-context Learning

    Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui

    EMNLP 2024.





图片描述
  • ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

    Zhexin Zhang, Yida Lu, Jingyuan Ma, Di Zhang, Rui Li, Pei Ke, Hao Sun, Lei Sha, Zhifang Sui, Hongning Wang, Minlie Huang
    EMNLP 2024 (findings).





图片描述
  • Large Language Models Struggle with Unreasonability in Math Problems

    Jingyuan Ma, Damai Dai, Zihang Yuan, Rui Li, Weilin Luo, Bin Wang, Qun Liu, Lei Sha, Zhifang Sui
    AAAI 2026.



图片描述
  • SCoRE: Benchmarking Long-Chain Reasoning in Commonsense Scenarios

    Weidong Zhan, Yue Wang, Nan Hu, Liming Xiao, Jingyuan Ma, Yuhang Qin, Zheng Li, Yixin Yang, Sirui Deng, Jinkun Ding, Wenhan Ma, Rui Li, Weilin Luo, Qun Liu, Zhifang Sui
    AAAI 2026 Bridge LMReasoning



Under Review & Preprint

A complete list of publications is available in my CV.

图片描述
  • LLM-REVal: Can We Trust LLM Reviewers Yet?
    Rui Li, Jia-Chen Gu, Po-Nien Kung, Heming Xia, Junfeng Liu, Xiangwen Kong, Zhifang Sui, Nanyun Peng



图片描述
  • Merlin’s Whisper: Enabling Efficient Reasoning in LLMs via Black-box Adversarial Prompting
    Heming Xia, Cunxiao Du, Rui Li, Chak Tou Leong, Yongqi Li, Wenjie Li



图片描述
  • OS-Catalyst: Advancing Computer-Using Agents Efficiency through Adaptive Action

    Xinfeng Yuan, Qiushi Sun, Yinghao Chen, Rui Li, Xuetian Chen, Siyu Yuan, Xintao Wang, Zichen Ding, Zonglin Li, Biqing Qi, Deqing Yang




图片描述
  • SenseJudge: Human-Centric Preference-Driven Judgment Framework

    Rui Li, Junfeng Liu, Xiangwen Kong, Zhifang Sui







图片描述
  • HauntAttack: When Attack Follows Reasoning as a Shadow

    Jingyuan Ma*, Rui Li*, Zheng Li, Junfeng Liu, Heming Xia, Lei Sha, Zhifang Sui





图片描述
  • Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding
    StepFun AI, technical report, Contributor





图片描述
  • MACG: A Multi‑Agent Framework for Thematically Structuring and Generation of Related Work
    Zhuang Liu, Jian Liu, Chenbin Zhang, Rui Li, Chun Kang, Maolin Wang, Lei Sha






图片描述
  • Plug-and-Play Training Framework for Preference Optimization

    Jingyuan Ma, Rui Li, Zheng Li, Lei Sha, Zhifang Sui



  • Text-driven Palette Generation Method

    Rui Li, Qiong Zeng

Technical Skills

Languages: C/C++, Python, Java, Shell, MATLAB, HTML/CSS

Developer Tools: VS Code, PyCharm, Git, Linux, Vim

Minor: Law

Internship Experience

Oct. 2025 - Present, ByteDance, Algorithm Intern

Jul. 2024 - Oct. 2025, Stepfun, Algorithm Intern

Jun. 2023 - Sep. 2023, 36Kr, Intern Reporter

Feb. 2023 - Jun. 2023, Shandong University, Introduction to Artificial Intelligence, Teaching Assistant