Hands-on Reinforcement Learning with Ray and RLlib


In recent years, reinforcement learning (RL) has become a powerful item in our toolbox of machine learning methods. Its ability to produce end-to-end decision-making solutions via learning by doing within a well-defined problem environment makes RL particularly attractive as an alternative to classic supervised learning methods. However, several issues remain problematic when using RL to solve real-world industry problems: 1) RL algorithms are difficult to understand and therefore hard to customize and hypertune, 2) experiments need to run at scale in order to yield useful results within a reasonable time, and 3) often, a safe-to-use and fast simulator of the particular problem does not exist, however, historical sensor- and actor data are abundantly available.

In this tutorial, we will introduce RLlib (http://rllib.io/), an open-source RL library with a proven track record for solving real-life industry problems at scale. We will walk through different industrial RL use cases and the solutions RLlib offers for those. In particular, we will build a recommender system using offline RL, show how to train policies that master complex multi-agent games, and demonstrate how you can connect external simulators to RLlib at scale for faster learning.

This talk is targeted towards data scientists, research engineers, and software developers who are already familiar with machine learning concepts.


Avnish Narayan is an ML Engineer at Anyscale where he works on RLlib. He's passionate about exploring where RL can improve upon existing solutions in industrial applications. He previously received his MS in Computer Science at USC, where he did research on the applications of RL in robotic manipulation problems.

Open Data Science




Open Data Science
One Broadway
Cambridge, MA 02142

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Consent to display content from - Youtube
Consent to display content from - Vimeo
Google Maps
Consent to display content from - Google