valueIterationAgents.py 文件源码

python

阅读 32 收藏 0 点赞 0 评论 0

项目：Reinforcement-Learning 作者: victorgrego 项目源码文件源码

def getPolicy(self, state):
    """
      The policy is the best action in the given state
      according to the values computed by value iteration.
      You may break ties any way you see fit.  Note that if
      there are no legal actions, which is the case at the
      terminal state, you should return None.
    """
    "*** YOUR CODE HERE ***"
    util.raiseNotDefined()

评论列表正在加载评论...

文章目录

提
问题

写
面经

写
文章

微信
公众号

扫码关注公众号