LAMDA RL LAB

LAMDA RL LAB is a subgroup of LAMDA that focuses on advancing the field of reinforcement learning (RL) and its application to creating general decision-making intelligence. Key areas we are exploring include: model-based RL and world model learning, multi-agent and collaborative RL, planning and learning with large models, etc. Through both fundamental and application research, our aim is to create RL-based systems that exhibit general decision-making capabilities.

Highlights

Supported Product

REVIVE is Polixir's next-generation intelligent decision-making system that simplifies complex processes into easy workflows, enabling reinforcement learning control algorithms to be applied in real-world industrial scenarios.

NeurIPS 2025 | FTR: 在复杂环境中进行高效策略部署的强化学习方法

NeurIPS 2025 | CoPDT: 面对多任务约束以及多约束阈值的离线安全强化学习

ICML 2025 | Lapse: 面对状态可演变环境的强化学习策略复用方法

ICML 2025 | APEC：利用对抗模仿学习过程自动生成偏好数据，提升奖励模型泛化能力

ICML 2025 | CoLA: 基于latent action控制的语言模型

ICML 2025 | 大语言模型辅助的语义层面多样队友生成

NeoRL-2：面向现实场景的离线强化学习基准测试

ICLR 2025| 强化学习中基于视觉语言模型的时序抽象

ICLR 2025 | Q-Adapter: 使用个性化人类偏好定制语言模型的同时避免遗忘

ICLR 2025 | ReViWo：无惧视角切换的机器人控制