Surveys


  • Peng-Yuan Wang, Tian-Shuo Liu, Chenyang Wang, Yi-Di Wang, Shu Yan, Cheng-Xing Jia, Xu-Hui Liu, Xin-Wei Chen, Jia-Cheng Xu, Ziniu Li, Yang Yu. A Survey on Large Language Models for Mathematical Reasoning. CoRR abs/2506.08446.
  • Zhaohan Feng, Ruiqi Xue, Lei Yuan, Yang Yu, Ning Ding, Meiqin Liu, Bingzhao Gao, Jian Sun, Xinhu Zheng, Gang Wang. Multi-agent Embodied AI: Advances and Future Directions. CoRR abs/2505.05108.
  • Rongjun Qin, Yang Yu. Learning in games: a systematic review. SCIENCE CHINA Information Sciences, 67: 171101, 2024.
  • Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu. A survey on model-based reinforcement learning. SCIENCE CHINA Information Sciences, 67(2): 121101, 2024. CoRR abs/2206.09328.
  • Hong Qian, Yang Yu. Derivative-free reinforcement learning: A review. Frontiers of Computer Science, 15(6): 156336, 2022. CoRR abs/2102.05710.
  • Q. Yao, M. Wang, Y. Chen, W. Dai, Y.-F. Li, W.-W. Tu, Q. Yang, Y. Yu. Taking human out of learning applications: A survey on automated machine learning. CoRR abs/1810.13306, 2018.

LAMDA RL LAB
School of Artificial Intelligence
National Key Laboratory for Novel Software Technology
Nanjing University, Nanjing 210023, China

Contact us

yuanl AT lamda DOT nju DOT edu DOT cn

Yi Fu Building, Xianlin Campus