Policy Gradient
class PolicyGradient(model, lr)
Bases: parl.core.paddle.algorithm.Algorithm
__init__(model, lr)
Policy gradient algorithm.
Parameters:
- model (parl.Model) – model defining the forward network of the policy.
- lr (float) – learning rate.
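The roles of the two constructor arguments can be illustrated with a small, dependency-free sketch. This is not PARL's implementation: `TinySoftmaxPolicy`, `TinyPolicyGradient`, and the `learn(action, reward)` method are hypothetical stand-ins, and the update shown is a plain REINFORCE-style gradient step on a softmax policy that ignores observations. It only shows how a policy `model` supplies action probabilities and how `lr` scales each update.

```python
import math


class TinySoftmaxPolicy:
    """Hypothetical stand-in for a parl.Model: one logit per action, no observation input."""

    def __init__(self, num_actions):
        self.logits = [0.0] * num_actions

    def forward(self):
        # Numerically stable softmax over the logits.
        m = max(self.logits)
        exps = [math.exp(z - m) for z in self.logits]
        total = sum(exps)
        return [e / total for e in exps]


class TinyPolicyGradient:
    """Hypothetical sketch mirroring the (model, lr) constructor; not PARL's class."""

    def __init__(self, model, lr):
        self.model = model  # policy whose forward() returns action probabilities
        self.lr = lr        # learning rate: step size of each gradient update

    def learn(self, action, reward):
        probs = self.model.forward()
        # Policy gradient loss for one sample: -log pi(action) * reward.
        loss = -math.log(probs[action]) * reward
        # d(loss)/d(logit_k) = reward * (probs[k] - 1 if k == action else probs[k]).
        for k, p in enumerate(probs):
            grad = reward * (p - (1.0 if k == action else 0.0))
            self.model.logits[k] -= self.lr * grad
        return loss


model = TinySoftmaxPolicy(num_actions=2)
alg = TinyPolicyGradient(model, lr=0.5)

before = model.forward()[1]              # probability of action 1 before the update
loss = alg.learn(action=1, reward=1.0)   # action 1 was rewarded
after = model.forward()[1]               # probability of action 1 after the update
```

In this sketch a positive reward for an action raises that action's probability, and a larger `lr` moves the logits further per call. The real class takes a `parl.Model` and relies on the framework's optimizer rather than hand-written gradients.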