environments.py source code

python

Project: rc-nfq  Author: cosmoharrigan
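from scipy import stats


# Method of the environment class (note the `self` parameter).
def act(self, action):
    """Take `action`, sample the successor state, and return its reward."""
    # Acting after the episode has ended is an error.
    try:
        assert not self.terminal()
    except AssertionError as e:
        e.args += ('Further action not permitted: terminal state '
                   'reached. Episode is over.',)
        raise

    # Row of the transition matrix for this action: P(s' | state, action).
    probs = self.T[action][self.state, :]
    # Discrete distribution over the states with those probabilities.
    pmf = stats.rv_discrete(name='pmf',
                            values=(self.states, probs))
    successor_state = pmf.rvs()
    self.state = successor_state

    # The reward is looked up by the state just entered.
    r = self.rewards[successor_state]
    return r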
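
The method above relies on attributes that are not shown on this page (`T`, `states`, `rewards`, `state`, `terminal()`). As a rough illustration of how they might fit together, here is a minimal sketch. The class name `TinyMDP`, the three-state layout, the probabilities, and the choice of state 2 as terminal are all hypothetical, not taken from rc-nfq. To stay dependency-free, the sketch samples with the standard library's `random.choices` in place of `scipy.stats.rv_discrete`, and raises `RuntimeError` instead of using `assert`.

```python
import random


class TinyMDP:
    """Hypothetical two-action, three-state environment (illustration only)."""

    def __init__(self, seed=None):
        self.states = [0, 1, 2]
        # T[action][state] is the row of successor-state probabilities.
        self.T = {
            0: [[0.9, 0.1, 0.0], [0.0, 0.9, 0.1], [0.0, 0.0, 1.0]],
            1: [[0.1, 0.9, 0.0], [0.0, 0.1, 0.9], [0.0, 0.0, 1.0]],
        }
        self.rewards = [0.0, 0.0, 1.0]  # reward for entering each state
        self.state = 0
        self.rng = random.Random(seed)

    def terminal(self):
        # Assumption: state 2 is the only terminal state in this sketch.
        return self.state == 2

    def act(self, action):
        if self.terminal():
            raise RuntimeError('Episode is over: terminal state reached.')
        probs = self.T[action][self.state]
        # random.choices stands in for scipy.stats.rv_discrete(...).rvs().
        self.state = self.rng.choices(self.states, weights=probs, k=1)[0]
        return self.rewards[self.state]


# Run one episode, always choosing action 1, until the terminal state.
env = TinyMDP(seed=0)
total_reward = 0.0
steps = 0
while not env.terminal():
    total_reward += env.act(1)
    steps += 1
```

Because the only nonzero reward in this sketch is the one for entering state 2, and the episode ends as soon as that state is reached, `total_reward` is 1.0 once the loop exits, whatever the seed.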