-----
Convolution is the main building block of convolutional neural networks (CNN). We observe that an optimized CNN often has highly correlated filters as the number of channels increases with depth, reducing the expressive power of feature representations. We propose Tied Block Convolution (TBC) that shares the same thinner filters over equal blocks of channels and produces multiple responses with a single filter. The concept of TBC can also be extended to group convolution and fully connected layers, and can be applied to various backbone networks and attention modules.
Our extensive experimentation on classification, detection, instance segmentation, and attention demonstrates TBC’s significant across-the-board
gain over standard convolution and group convolution. The proposed TiedSE attention module can even use 64
-----
To generate two activation maps, standard convolution requires two full-size filters and group convolution requires two half-size filters,
however, our tied block convolution only requires one half-size filter, that is, the parameters are reduced by 4
Let the input feature be denoted by
where
Group Convolution first divides input feature
where
Tied Block Convolution reduces the effective number of filters by reusing filters across different feature groups with the following formula:
where
The idea of tied block filtering can also be directly applied to group convolution, formulated as:
where
Convolution is a special case of fully connected (FC) layer, just as FC is a special case of convolution. We apply the same tied block filtering idea to FC.
Tied block fully connected layer (TFC) shares the FC connections between equal blocks of input channels.
Like TBC, TFC could reduce
-----
-----
-----
Tied Block Convolution: Leaner and Better CNNs with Shared Thinner Filters
Xudong Wang and Stella X. Yu
The Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI), 2021.
-----
@article{wang2020unsupervised,
title={Tied Block Convolution: Leaner and Better CNNs with Shared Thinner Filters},
author={Wang, Xudong and Yu, Stella X},
journal={arXiv preprint arXiv:2009.12021},
year={2020}
}