Lecture 14 | Deep Reinforcement Learning

2019-11-04 Ysgc

value iteration
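
A minimal sketch of value iteration, to make the term concrete. The three-state MDP below (states `A`, `B`, `end`, the action names, rewards, and `gamma=0.9`) is a made-up toy, not something from the lecture. Each sweep applies the Bellman optimality backup until the values stop changing.

```python
def value_iteration(transitions, gamma=0.9, tol=1e-8):
    """Compute optimal state values for a small tabular MDP.

    transitions[state][action] is a list of (prob, next_state, reward).
    A state with no actions is treated as terminal (value 0).
    """
    values = {state: 0.0 for state in transitions}
    while True:
        delta = 0.0
        for state, actions in transitions.items():
            if not actions:  # terminal state, nothing to back up
                continue
            # Bellman optimality backup: best expected return over actions
            best = max(
                sum(p * (r + gamma * values[nxt]) for p, nxt, r in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - values[state]))
            values[state] = best
        if delta < tol:
            return values


# Hypothetical toy MDP, for illustration only
toy_mdp = {
    "A": {"go": [(1.0, "B", 0.0)], "quit": [(1.0, "end", 1.0)]},
    "B": {"go": [(1.0, "end", 10.0)]},
    "end": {},
}
values = value_iteration(toy_mdp)
print(values)
```

In this toy case going through `B` is worth more than quitting immediately, so the value of `A` is the discounted value of `B`.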

https://math.stackexchange.com/questions/2639577/why-is-the-gradient-of-this-expectation-intractable

turn a high-dimensional integral into an expectation problem???
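
One way to read this note: the gradient of E[f(x)] can be rewritten as E[f(x) * d/dtheta log p(x; theta)], which is again an expectation, so it can be estimated by sampling instead of integrating. A minimal sketch under assumed toy settings (a Bernoulli variable with logit `theta` and a hypothetical `reward` function, neither taken from the lecture):

```python
import math
import random


def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))


def reward(x):
    # Hypothetical reward, chosen only for illustration
    return 3.0 if x == 1 else 1.0


def score_function_gradient(theta, num_samples, rng):
    """Monte Carlo estimate of d/dtheta E[reward(x)], x ~ Bernoulli(sigmoid(theta))."""
    p = sigmoid(theta)
    total = 0.0
    for _ in range(num_samples):
        x = 1 if rng.random() < p else 0
        # For a Bernoulli with logit theta, d/dtheta log p(x) = x - p
        total += reward(x) * (x - p)
    return total / num_samples


def exact_gradient(theta):
    # E[reward] = reward(0) + (reward(1) - reward(0)) * p, and dp/dtheta = p * (1 - p)
    p = sigmoid(theta)
    return (reward(1) - reward(0)) * p * (1.0 - p)


rng = random.Random(0)
theta = 0.3
estimate = score_function_gradient(theta, 200_000, rng)
exact = exact_gradient(theta)
print(estimate, exact)
```

Here the exact gradient is available only because the toy problem has two outcomes; in high dimensions only the sampled estimate is practical, and its variance is the usual concern.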

computational efficiency -> low resolution to high resolution

hard attention -> a lot of applications!!! -> improves efficiency

but it still needs an RNN -> may be slow

efficiency depends on the case

high resolution input -> fast by this method
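
A rough sketch of why hard attention can help on high-resolution input: if the model only looks at a fixed-size glimpse, the number of pixels it processes per step does not grow with the image size. The helper `extract_glimpse`, the image sizes, and the glimpse size of 8 are assumptions made for illustration, and the sketch assumes the glimpse is no larger than the image.

```python
def extract_glimpse(image, center_row, center_col, size):
    """Return a size x size patch around the given center, clamped to the image."""
    height, width = len(image), len(image[0])
    top = min(max(center_row - size // 2, 0), height - size)
    left = min(max(center_col - size // 2, 0), width - size)
    return [row[left:left + size] for row in image[top:top + size]]


# Two dummy images of very different resolution
low_res = [[0] * 64 for _ in range(64)]
high_res = [[0] * 1024 for _ in range(1024)]

glimpse_small = extract_glimpse(low_res, 10, 10, 8)
glimpse_large = extract_glimpse(high_res, 500, 900, 8)

# Pixels touched per glimpse are the same regardless of input resolution
pixels_small = sum(len(row) for row in glimpse_small)
pixels_large = sum(len(row) for row in glimpse_large)
print(pixels_small, pixels_large)
```

The per-step cost stays fixed, but several glimpses are processed in sequence, which matches the note that the RNN part may still be slow.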

Q-learning may be harder to tune
