Xception
2017-10-16
信步闲庭v
Approach
Two minor differences between an “extreme” version of an Inception module and a depthwise separable convolution are:
- The order of the operations: depthwise separable convolutions, as usually implemented, first perform a channel-wise spatial convolution and then a 1x1 convolution, whereas Inception performs the 1x1 convolution first.
- The presence or absence of a non-linearity after the first operation. In Inception, both operations are followed by a ReLU non-linearity, whereas depthwise separable convolutions are usually implemented without a non-linearity between the two operations.
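The two differences listed above can be made concrete with a small sketch. It is written in pure Python on nested lists for readability, not speed. The function names (`pointwise_conv`, `depthwise_conv`, `separable_conv`, `extreme_inception`) are hypothetical and do not come from the paper or any library, and the choices of stride 1, "valid" padding, and no bias terms are assumptions made to keep the example short.

```python
def relu(x):
    """Element-wise ReLU on a [channels][height][width] nested list."""
    return [[[max(0.0, v) for v in row] for row in ch] for ch in x]


def pointwise_conv(x, weights):
    """1x1 convolution: mixes channels independently at every pixel.

    x: [c_in][h][w], weights: [c_out][c_in] -> result: [c_out][h][w].
    """
    h, w = len(x[0]), len(x[0][0])
    return [
        [[sum(wk[c] * x[c][i][j] for c in range(len(x))) for j in range(w)]
         for i in range(h)]
        for wk in weights
    ]


def depthwise_conv(x, kernels):
    """Channel-wise spatial convolution (assumed: stride 1, 'valid' padding).

    x: [c][h][w], kernels: [c][kh][kw]; each channel uses only its own kernel.
    """
    out = []
    for ch, k in zip(x, kernels):
        kh, kw = len(k), len(k[0])
        oh, ow = len(ch) - kh + 1, len(ch[0]) - kw + 1
        out.append([
            [sum(k[a][b] * ch[i + a][j + b]
                 for a in range(kh) for b in range(kw))
             for j in range(ow)]
            for i in range(oh)
        ])
    return out


def separable_conv(x, dw_kernels, pw_weights):
    """Depthwise first, then 1x1, with no non-linearity in between."""
    return pointwise_conv(depthwise_conv(x, dw_kernels), pw_weights)


def extreme_inception(x, pw_weights, dw_kernels):
    """1x1 first, then a per-channel spatial convolution, ReLU after each."""
    mixed = relu(pointwise_conv(x, pw_weights))
    return relu(depthwise_conv(mixed, dw_kernels))


# Toy input: 2 channels of size 3x3 (all +1 and all -1).
x = [[[1.0] * 3 for _ in range(3)], [[-1.0] * 3 for _ in range(3)]]
ones_3x3 = [[1.0] * 3 for _ in range(3)]
pw = [[1.0, 2.0]]  # one output channel mixing the two input channels

# Separable: the depthwise step sees the 2 input channels.
sep_out = separable_conv(x, [ones_3x3, ones_3x3], pw)
# Extreme Inception: the spatial step sees the 1 channel produced by the 1x1.
inc_out = extreme_inception(x, pw, [ones_3x3])

print(sep_out)  # [[[-9.0]]]
print(inc_out)  # [[[0.0]]]
```

With the same weights, the two variants give different outputs here: the ReLU after the 1x1 step zeroes the negative intermediate values in the Inception-style ordering, while the separable version passes them through linearly. Note also that the spatial kernels act on input channels in one case and on the 1x1 output channels in the other, so the number of spatial kernels differs.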
Experiment
The paper presents a novel architecture, named Xception, in which Inception modules are replaced with depthwise separable convolutions. Xception has a parameter count similar to that of Inception V3. Compared to Inception V3, it shows small gains in classification performance on the ImageNet dataset and large gains on the JFT dataset.
References:
François Chollet, "Xception: Deep Learning with Depthwise Separable Convolutions", CVPR 2017 (arXiv preprint, 2016).