Computation-Performance Optimization of Convolutional Neural Networks with Redundant Filter Removal

chih-ting liu, tung-wei lin, yi-heng wu, yu-sheng lin, heng lee, yu tsao, shao-yi chien

PDF

Abstract

Convolutional neural networks (CNNs) are widely employed in modern computer vision algorithms, where the input image is convolved iteratively by many filters to extract the knowledge behind it. However, while the depth of convolutional layers gets deeper and deeper in recent years, the enormous computational complexity makes it difficult to be deployed on embedded systems with limited hardware resources. In this paper, inspired by rate-distortion optimization in image and video coding, we propose a computation-performance optimization (CPO) method to remove the redundant convolution filters in a CNN with performance constraints. To prove the effectiveness of the proposed method, CPO is applied to the networks for image super-resolution and image classification. Under almost the same PSNR drop and accuracy drop for performance evaluation in these two tasks, we can achieve the best parameter and computation reduction when compared with previous works.

Type

Journal article

Publication

IEEE Transactions on Circuits and Systems I: Regular Papers

Date

December, 2018

Filter Pruning Sensitivity

Computation-Performance Optimization of Convolutional Neural Networks with Redundant Filter Removal

Abstract

Related