
LAMDA RL LAB is a subgroup of LAMDA that focuses on advancing reinforcement learning (RL) and applying it to build general decision-making intelligence. Key areas we are exploring include model-based RL and world model learning, multi-agent and collaborative RL, and planning and learning with large models. Through both fundamental and applied research, we aim to create RL-based systems that exhibit general decision-making capabilities.

Highlights

Supported Product

REVIVE is Polixir's next-generation intelligent decision-making system that simplifies complex processes into easy workflows, enabling reinforcement learning control algorithms to be applied in real-world industrial scenarios.

Recent News

(in Chinese)
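  1. ICLR 2025 | ADMPO: Dynamics models that predict across an arbitrary number of steps can effectively improve model-based reinforcement learning

    In 2024, under the guidance of Prof. Yu (@俞扬), I completed a piece of work on model-based reinforcement learning (MBRL). The method is simple and effective, and the paper has been accepted at ICLR 2025. It is also the first first-author paper of my PhD, and here I share its main content...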
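  2. NeurIPS'24 Oral | Letting machines learn decision-making from tutorial books (Policy Learning from Tutorial Books)

    A little after the rush, we would like to share our recent NeurIPS'24 oral paper, "Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting". This post is also an edited version of the script for our oral presentation. Why learn policies from books? In recent years...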
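  3. NeurIPS 2024 | KALM: Knowledge from large language models can be used for reinforcement learning training

    Sharing our NeurIPS 2024 work on reinforcement learning driven by large language models, KALM: "Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts"...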
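  4. BWArea Model: Controllable language generation from a decision-making perspective

    Preface: A while ago, under the guidance of Prof. Yu (@俞扬), I worked with my junior labmate Pengyuan, my senior labmate Ziniu (@李子牛), and other junior labmates in the group on an exploration of controllable language models [1], and here I give a brief introduction to that work. As language models develop, expectations of them keep rising, and people hope that large language models can complete more complex and precise tasks...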

LAMDA RL LAB
School of Artificial Intelligence
National Key Laboratory for Novel Software Technology
Nanjing University, Nanjing 210023, China

Contact us

yuanl AT lamda DOT nju DOT edu DOT cn

Yi Fu Building, Xianlin Campus