Policy Gradient¶
-
class
PolicyGradient(model, lr)[source]¶ Bases:
parl.core.paddle.algorithm.Algorithm-
__init__(model, lr)[source]¶ Policy gradient algorithm
Parameters: - model (parl.Model) – model defining forward network of policy.
- lr (float) – learning rate.
-