1-Bit Stochastic Gradient Descen

2016-05-04  本文已影响0人  世间五彩我执纯白

1. Abstract

2. Intro

3. Data-parallel Deterministically Distributed SGD

3.1. Data-parallel Distributed SGD

3.2. Double Buffering with Half Batches

3.3. Potential Faster-Than-Fixed-Cost Communication

3.4. Relation to Hogwild/ASGD

4. 1-bit SGD with Error Feedback

4.1. Aggregating the Gradients

5. System Description

6. Experimental Results

6.1. Cost Measurements

6.2. Effect of 1-bit Quantization

7. Conclusion

上一篇 下一篇

猜你喜欢

热点阅读