lrpg_cartpole.py source file

python

Project: cartpoleplusplus  Author: matpalm
def train(self, observations, actions, advantages):
    """Take one training step given observations, actions and subsequent advantages."""
    if VERBOSE_DEBUG:
        # Dump the batch before it is fed to the graph.
        print("TRAIN")
        print("observations", np.stack(observations))
        print("actions", actions)
        print("advantages", advantages)
    # Run one optimizer step and fetch the loss in the same session call.
    _, loss = tf.get_default_session().run(
        [self.train_op, self.loss],
        feed_dict={self.observations: observations,
                   self.actions: actions,
                   self.advantages: advantages})
    return float(loss)
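
The excerpt does not show how `self.loss` is defined. Assuming the `lrpg` in the file name stands for likelihood ratio policy gradient, and assuming discrete action indices, a loss of that family can be sketched in plain NumPy as below. The helper name `lrpg_loss` and its arguments are hypothetical and are not taken from the project; this is only an illustration of what a training step like `train` would typically minimize.

```python
import numpy as np


def lrpg_loss(action_probs, actions, advantages):
    """Hypothetical helper: mean of -log pi(a|s) * advantage over a batch."""
    action_probs = np.asarray(action_probs, dtype=np.float64)
    actions = np.asarray(actions, dtype=np.int64)
    advantages = np.asarray(advantages, dtype=np.float64)
    # Probability the policy assigned to each action that was actually taken.
    chosen = action_probs[np.arange(len(actions)), actions]
    # Negated so that minimizing the loss raises the likelihood of
    # actions with positive advantage and lowers it for negative ones.
    return float(-np.mean(np.log(chosen) * advantages))


# Two-step batch with two possible actions (made-up numbers).
probs = [[0.5, 0.5], [0.25, 0.75]]
loss = lrpg_loss(probs, actions=[0, 1], advantages=[1.0, -2.0])
```

In the method above, the same three quantities arrive through `feed_dict`: the observations produce the action probabilities inside the graph, while `actions` and `advantages` are fed in directly.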