Gradient Descent

2018-08-16  本文已影响0人  光华_5206

Gradient descent

cost function: for example, MSE(Mean Square Error) can be expressed as ​. To be more generally, ​ . Its gradient can be formulated as

The calculation of gradient has to iterate all samples and sum them together. If the number of samples is very large, the calculation is very time-consuming.

So, to overcome this problem we need to divide the data into smaller sizes and give it to our computer one by one and update the weights of the neural networks at the end of every step to fit it to the data given.

sgd.gif
上一篇 下一篇

猜你喜欢

热点阅读