Min Max Generalization for Deterministic Batch Mode Reinforcement Learning: Relaxation Schemes

Raphael Fonteneau, Damien Ernst, Bernard Boigelot, Quentin Louveaux
University of Liège

Mini-workshop on Reinforcement Learning
Department of Electrical Engineering and Computer Science, University of Liège
September 29th, 2011

Formalization

The batch mode setting

Lipschitz continuity

The worst that can happen

[Figure labels: Liège; "?"; UARS Satellite]

Given:

● The batch collection of trajectories

● The Lipschitz continuity assumptions + two constants
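
The slide lists only the ingredients, so the following is a minimal sketch of what they might look like in code, not the authors' implementation. It assumes the batch is a list of one-step transitions (x, u, r, y) with state x, action u, reward r and successor state y, that the two constants are Lipschitz constants for the dynamics and the reward (named L_f and L_rho here), and that distances are Euclidean. The function names and the toy batch are hypothetical.

```python
import math


def dist(a, b):
    """Euclidean distance between two states given as tuples of floats."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def is_lipschitz_consistent(batch, L_f, L_rho, tol=1e-12):
    """Check that every pair of samples sharing an action satisfies
    |r - r'| <= L_rho * ||x - x'||  and  ||y - y'|| <= L_f * ||x - x'||.
    """
    for i, (x1, u1, r1, y1) in enumerate(batch):
        for x2, u2, r2, y2 in batch[i + 1:]:
            if u1 != u2:
                continue  # the assumptions only compare samples with equal actions
            d = dist(x1, x2)
            if abs(r1 - r2) > L_rho * d + tol:
                return False
            if dist(y1, y2) > L_f * d + tol:
                return False
    return True


def worst_case_reward(batch, x, u, L_rho):
    """Largest lower bound on the reward at (x, u) implied by the batch:
    max over samples with action u of  r - L_rho * ||x - x_sample||.
    Returns -inf when no sample uses action u.
    """
    bounds = [r - L_rho * dist(x, xl) for xl, ul, r, _ in batch if ul == u]
    return max(bounds) if bounds else -math.inf


# Hypothetical toy batch: one-dimensional states, a single action labelled 0.
batch = [
    ((0.0,), 0, 1.0, (0.5,)),
    ((1.0,), 0, 1.5, (1.2,)),
]
```

The bound in worst_case_reward covers a single stage only; the min max problem in the talk concerns the return over several stages, which this sketch does not attempt to solve.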

The T-stage problem

Any suggestions?

So let us start with the 2-stage case...

The 2-stage problem

First results

Relaxation scheme: trust region

Relaxation scheme: Lagrangian dual

Relaxation schemes: synthesis

Illustration

● Uniformly drawn state-action couples

Illustration

[Figure labels: Grid; Average (uniform sampling)]

Tons of future work

T-stage problem

Stochastic frameworks

Exact solution?

Infinite horizon
