Best Facebook - Search News

19 hours ago

We study offline reinforcement learning (RL), which seeks to learn a good policy based on a fixed, pre-collected dataset. A fundamental challenge behind this task is the distributional shift due to th ...
