Reinforcement
Learning

Agenda

  • Review of Machine Learning
  • Introduction
  • History
  • Demo - Playground
  • Future

Review of Machine Learning

picture picture

Review of Machine Learning

picture
picture
picture
  • Labled Data
  • Unlabled Data
  • No Data

Introduction

When to use Reinforcement Learning

  • Data for learning currently does not exist
  • Or you don’t want to wait to accumulate it
    (because delay might be costly)
  • Or the data may change rapidly causing the
    outcome to change more rapidly than a typical
    model refresh cycle can accommodate.

Introduction

Typical RL Problems

  • Robotic Control
  • AI Game Play

RL - How it Works

picture
  • Environment - all possible values and steps
  • State - current values
  • Reward - benefit from action
  • Agent - RL Algorithm
  • Policy - Solution or Steps to Maximize Reward

The RL Bible

Reinforcement Learning:
An Introduction

Richard S. Sutton and Andrew G. Barto

RL - History

  • 1963 - Tic Tac Toe
  • 1992 - Backgammon
  • 1997 - Deep Blue - Chess
  • 2013 - Deep Mind - Atari
  • 2016 - AlphaGo - Go
  • 2017 - OpenAI Bot - DOTA

Playground - OpenAI

CartPole with ACER

The Future
From MOBA to RTS

picture

Starcraft ][