valueIterationAgents.py 文件源码

python
阅读 21 收藏 0 点赞 0 评论 0

项目:Reinforcement-Learning 作者: victorgrego 项目源码 文件源码
def __init__(self, mdp, discount = 0.9, iterations = 100):
    """
      Your value iteration agent should take an mdp on
      construction, run the indicated number of iterations
      and then act according to the resulting policy.

      Some useful mdp methods you will use:
          mdp.getStates()
          mdp.getPossibleActions(state)
          mdp.getTransitionStatesAndProbs(state, action)
          mdp.getReward(state, action, nextState)
    """
    self.mdp = mdp
    self.discount = discount
    self.iterations = iterations
    self.values = util.Counter() # A Counter is a dict with default 0

    "*** YOUR CODE HERE ***"
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号