Deep Multi-Agent Reinforcement Learning
9.3 Higher Order Gradients
This section reviews existing methods for computing higher order gradients.
Last updated
5 years ago