LAMDA RL LAB

LAMDA RL LAB is a subgroup of LAMDA that focuses on advancing reinforcement learning (RL) and applying it to build general decision-making intelligence. Key areas we are exploring include model-based RL and world model learning, multi-agent and collaborative RL, and planning and learning with large models. Through both fundamental and applied research, we aim to create RL-based systems that exhibit general decision-making capabilities.

Highlights

Supported Product

REVIVE is Polixir's next-generation intelligent decision-making system that simplifies complex processes into easy workflows, enabling reinforcement learning control algorithms to be applied in real-world industrial scenarios.

ICML 2026 | How can the RL stage of LLM post-training be given a good starting point?

ICML 2026 | Speedup Patch: a plug-and-play "speedup patch" for robot policies

ICML 2026 | Dspic: maximum-entropy multi-agent policy iteration via orthogonal observation partitioning

ICML 2026 | COMAD: skill partitioning and reuse for offline multi-agent continual cooperation

ICML 2026 | ReLAM: letting robots learn to "design their own rewards" from videos

ICLR 2026 | HVD: hierarchical value decomposition for offline reinforcement learning in whole-body control

ICLR 2026 | ADM-v2: reliable full-horizon rollouts in offline model-based reinforcement learning

ICLR'26 | EMFuse: energy-based model fusion

NeurIPS'25 | MAFIS: a unified framework for scalable multi-agent imitation learning

NeurIPS 2025 | FTR: a reinforcement learning method for efficient policy deployment in complex environments