BlackRobot_SARSA_Trace.py 文件源码

python

阅读 23 收藏 0 点赞 0 评论 0

项目：RaspberryPi-Robot 作者: timestocome 项目源码文件源码

def choose_action(d, c, q_table):

    global epsilon
    state_actions = q_table[d][c][:]

    # random move or no data recorded for this state yet
    if (np.random.uniform() < epsilon) or (np.sum(state_actions) == 0):

        action_chose = np.random.randint(n_actions)

        # decrease random moves over time to a minimum of 10%
        if epsilon >  0.1: epsilon *= 0.9

    else:
        action_chose = state_actions.argmax()

    return action_chose

评论列表正在加载评论...

文章目录

提
问题

写
面经

写
文章

微信
公众号

扫码关注公众号