Inverse Reinforcement Learning through Structured Classification #5

usajpn · 2019-12-24T23:43:48Z

Edouard Klein, Matthieu Geist, Bilal Piot, Olivier Pietquin
https://papers.nips.cc/paper/4551-inverse-reinforcement-learning-through-structured-classification.pdf
NIPS 2012

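Summary

To avoid solving a reinforcement learning problem inside the inverse reinforcement learning loop, the method takes expert data as input from the outset and solves a multi-class classification problem over the actions taken in each state. The parameters learned by this classifier are used directly as the parameters of the reward function, so the reward function is obtained without running any reinforcement learning.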
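
A minimal sketch of the idea above, not the authors' implementation. It assumes a linear reward `theta @ phi[s]`, estimates the expert's discounted feature expectations `mu(s, a)` by Monte Carlo from the demonstrations, and trains a multi-class linear classifier whose score is `theta @ mu[s, a]` with a structured hinge loss; the learned `theta` is then read off as the reward parameters. The toy chain MDP, the function names, the hyperparameters, and the heuristic used for actions the expert never took are all assumptions made for illustration.

```python
import numpy as np

GAMMA = 0.9  # discount factor (assumed value)

def expert_feature_expectations(trajs, phi, n_actions, gamma=GAMMA):
    """Monte Carlo estimate of mu(s, a) from expert trajectories of (state, action) pairs."""
    n_states, d = phi.shape
    sums = np.zeros((n_states, d))
    counts = np.zeros(n_states)
    expert_action = np.zeros(n_states, dtype=int)
    for traj in trajs:
        ret = np.zeros(d)
        for s, a in reversed(traj):  # accumulate discounted features backwards
            ret = phi[s] + gamma * ret
            sums[s] += ret
            counts[s] += 1
            expert_action[s] = a     # assumes a deterministic expert
    mu_expert = sums / np.maximum(counts, 1)[:, None]
    # Assumed heuristic for actions the expert never took:
    # treat them as delaying the expert's outcome by one step.
    mu = np.repeat(gamma * mu_expert[:, None, :], n_actions, axis=1)
    mu[np.arange(n_states), expert_action] = mu_expert
    return mu

def fit_theta(mu, demos, lr=0.1, reg=1e-3, epochs=200):
    """Subgradient descent on a regularized structured hinge loss with scores theta @ mu[s, a]."""
    n_actions = mu.shape[1]
    theta = np.zeros(mu.shape[2])
    for _ in range(epochs):
        grad = reg * theta
        for s, a_exp in demos:
            margin = np.ones(n_actions)
            margin[a_exp] = 0.0                      # no margin for the expert action
            a_star = int(np.argmax(mu[s] @ theta + margin))
            grad += (mu[s, a_star] - mu[s, a_exp]) / len(demos)
        theta -= lr * grad
    return theta

# Toy 5-state chain: action 1 moves right, and the expert always moves right.
n_states, n_actions = 5, 2
phi = np.eye(n_states)  # one-hot state features, so reward[s] == theta[s]

def expert_traj(start, length=8):
    s, traj = start, []
    for _ in range(length):
        traj.append((s, 1))
        s = min(s + 1, n_states - 1)
    return traj

trajs = [expert_traj(s0) for s0 in range(4)]
demos = [sa for traj in trajs for sa in traj]

mu = expert_feature_expectations(trajs, phi, n_actions)
theta = fit_theta(mu, demos)
reward = phi @ theta  # reward recovered without any RL inner loop
print("recovered reward per state:", np.round(reward, 3))
```

In this toy setting the recovered reward is largest at the right-most state, where the expert ends up, and the classifier's greedy action in every state matches the expert's. No policy is ever optimized during training; only the classification problem is solved.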

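Impressions

Not having to solve the inner-loop reinforcement learning problem is groundbreaking.
However, the paper came out after MaxEnt IRL and uses an SVM-family model rather than a probabilistic one, which may be why it never caught on.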

usajpn added the IRL Inverse Reinforcement Learning label Dec 25, 2019
