Lecture 14 | Deep Reinforcement
2019-11-04 本文已影响0人
Ysgc
value iteration
https://math.stackexchange.com/questions/2639577/why-is-the-gradient-of-this-expectation-intractable
computational efficiency -> low resolution to high resolution
this hard attention -> a lot applications!!! -> improve efficiency
but still need RNN -> may be slow
efficiency depends on the case
high resolution input -> fast by this method
Q learning may be harder to tune