Reinforcement Learning Diagram

AI gets a private tutor for learning human preferences more accurately

Why do artificial intelligence (AI) models so often miss the mark on human intent, no matter how much data they learn from?

Drone Landing and Reinforcement Learning: State-of-Art, Challenges and Opportunities

Abstract: Unmanned aerial vehicles, and especially multirotor drones, have shown great relevance in a plethora of missions that require high affordance, field of view, and precision. Their limited ...

IEEE

Inverse Reinforcement Learning for Discrete-Time Systems With Data Dropouts

Abstract: This article proposes inverse reinforcement learning (IRL) algorithms for tracking control of linear networked control systems under random state dropouts during wireless transmission. The ...

GitHub

Pearl - A Production-ready Reinforcement Learning AI Agent Library

Pearl is a new production-ready Reinforcement Learning AI agent library open-sourced by the Applied Reinforcement Learning team at Meta. Furthering our efforts on open AI innovation, Pearl enables ...

GitHub

verl: Volcano Engine Reinforcement Learning for LLMs

verl is a flexible, efficient, and production-ready RL training library for large language models (LLMs). verl is the open-source version of the paper "HybridFlow: A Flexible and Efficient RLHF Framework".
