電気学会論文誌C(電子・情報・システム部門誌)
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<知能,ロボティクス>
複数のエキスパートから方策推定を行う敵対的逆強化学習
山下 廣大濱上 知樹
著者情報
ジャーナル 認証あり

2021 年 141 巻 12 号 p. 1405-1410

詳細
抄録

Inverse reinforcement learning is used for complex control tasks by using experts. However, since the learning results depend on the expert, it is impossible to imitate ungiven policies from expert when there are multiple optimal polices for the same goal, or when the environment changes from the training. The problems can be solved by giving multiple experts and representing their features in the latent space. the proposed method extends information maximizing generative adversarial imitation learning with adversarial inverse reinforcement learning to deal with such environment. Experiments show that the proposed method can not only imitate multiple experts, but also estimate ungiven polices.

著者関連情報
© 2021 電気学会
前の記事 次の記事
feedback
Top