Spatially-adaptive filter units for compact and efficient deep neural networks

Domen Tabernik; Matej Kristan; Aleš Leonardis

doi:10.1007/s11263-019-01282-1

Spatially-adaptive filter units for compact and efficient deep neural networks

Domen Tabernik^*, Matej Kristan, Aleš Leonardis

^*Corresponding author for this work

Computer Science

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

212 Downloads (Pure)

Abstract

Convolutional neural networks excel in a number of computer vision tasks. One of their most crucial architectural elements is the effective receptive field size, which has to be manually set to accommodate a specific task. Standard solutions involve large kernels, down/up-sampling and dilated convolutions. These require testing a variety of dilation and down/up-sampling factors and result in non-compact networks and large number of parameters. We address this issue by proposing a new convolution filter composed of displaced aggregation units (DAU). DAUs learn spatial displacements and adapt the receptive field sizes of individual convolution filters to a given problem, thus reducing the need for hand-crafted modifications. DAUs provide a seamless substitution of convolutional filters in existing state-of-the-art architectures, which we demonstrate on AlexNet, ResNet50, ResNet101, DeepLab and SRN-DeblurNet. The benefits of this design are demonstrated on a variety of computer vision tasks and datasets, such as image classification (ILSVRC 2012), semantic segmentation (PASCAL VOC 2011, Cityscape) and blind image de-blurring (GOPRO). Results show that DAUs efficiently allocate parameters resulting in up to 4× more compact networks in terms of the number of parameters at similar or better performance.

Original language	English
Pages (from-to)	2049-2067
Number of pages	19
Journal	International Journal of Computer Vision
Volume	128
Issue number	8-9
Early online date	2 Jan 2020
DOIs	https://doi.org/10.1007/s11263-019-01282-1
Publication status	Published - Sept 2020

Keywords

Adjustable receptive fields
Compact ConvNets
Displacement units
Efficient ConvNets

ASJC Scopus subject areas

Software
Computer Vision and Pattern Recognition
Artificial Intelligence

Access to Document

10.1007/s11263-019-01282-1Licence: None: All rights reserved

Tabernik_et_al_Spatially-adaptive_filter_units_International_Journal_of_Computer_Vision_2020
This is a post-peer-review, pre-copyedit version of an article published in International Journal of Computer Vision. The final authenticated version is available online at: https://doi.org/10.1007/s11263-019-01282-1
Accepted author manuscript, 6.09 MBLicence: Other (please specify with Rights Statement)

https://link.springer.com/article/10.1007%2Fs11263-019-01282-1Licence: None: All rights reserved

Cite this

@article{f64fe4cbba89400ba1eed4e627ca9d1c,

title = "Spatially-adaptive filter units for compact and efficient deep neural networks",

abstract = "Convolutional neural networks excel in a number of computer vision tasks. One of their most crucial architectural elements is the effective receptive field size, which has to be manually set to accommodate a specific task. Standard solutions involve large kernels, down/up-sampling and dilated convolutions. These require testing a variety of dilation and down/up-sampling factors and result in non-compact networks and large number of parameters. We address this issue by proposing a new convolution filter composed of displaced aggregation units (DAU). DAUs learn spatial displacements and adapt the receptive field sizes of individual convolution filters to a given problem, thus reducing the need for hand-crafted modifications. DAUs provide a seamless substitution of convolutional filters in existing state-of-the-art architectures, which we demonstrate on AlexNet, ResNet50, ResNet101, DeepLab and SRN-DeblurNet. The benefits of this design are demonstrated on a variety of computer vision tasks and datasets, such as image classification (ILSVRC 2012), semantic segmentation (PASCAL VOC 2011, Cityscape) and blind image de-blurring (GOPRO). Results show that DAUs efficiently allocate parameters resulting in up to 4× more compact networks in terms of the number of parameters at similar or better performance.",

keywords = "Adjustable receptive fields, Compact ConvNets, Displacement units, Efficient ConvNets",

author = "Domen Tabernik and Matej Kristan and Ale{\v s} Leonardis",

year = "2020",

month = sep,

doi = "10.1007/s11263-019-01282-1",

language = "English",

volume = "128",

pages = "2049--2067",

journal = "International Journal of Computer Vision",

issn = "0920-5691",

publisher = "Springer",

number = "8-9",

}

TY - JOUR

T1 - Spatially-adaptive filter units for compact and efficient deep neural networks

AU - Tabernik, Domen

AU - Kristan, Matej

AU - Leonardis, Aleš

PY - 2020/9

Y1 - 2020/9

N2 - Convolutional neural networks excel in a number of computer vision tasks. One of their most crucial architectural elements is the effective receptive field size, which has to be manually set to accommodate a specific task. Standard solutions involve large kernels, down/up-sampling and dilated convolutions. These require testing a variety of dilation and down/up-sampling factors and result in non-compact networks and large number of parameters. We address this issue by proposing a new convolution filter composed of displaced aggregation units (DAU). DAUs learn spatial displacements and adapt the receptive field sizes of individual convolution filters to a given problem, thus reducing the need for hand-crafted modifications. DAUs provide a seamless substitution of convolutional filters in existing state-of-the-art architectures, which we demonstrate on AlexNet, ResNet50, ResNet101, DeepLab and SRN-DeblurNet. The benefits of this design are demonstrated on a variety of computer vision tasks and datasets, such as image classification (ILSVRC 2012), semantic segmentation (PASCAL VOC 2011, Cityscape) and blind image de-blurring (GOPRO). Results show that DAUs efficiently allocate parameters resulting in up to 4× more compact networks in terms of the number of parameters at similar or better performance.

AB - Convolutional neural networks excel in a number of computer vision tasks. One of their most crucial architectural elements is the effective receptive field size, which has to be manually set to accommodate a specific task. Standard solutions involve large kernels, down/up-sampling and dilated convolutions. These require testing a variety of dilation and down/up-sampling factors and result in non-compact networks and large number of parameters. We address this issue by proposing a new convolution filter composed of displaced aggregation units (DAU). DAUs learn spatial displacements and adapt the receptive field sizes of individual convolution filters to a given problem, thus reducing the need for hand-crafted modifications. DAUs provide a seamless substitution of convolutional filters in existing state-of-the-art architectures, which we demonstrate on AlexNet, ResNet50, ResNet101, DeepLab and SRN-DeblurNet. The benefits of this design are demonstrated on a variety of computer vision tasks and datasets, such as image classification (ILSVRC 2012), semantic segmentation (PASCAL VOC 2011, Cityscape) and blind image de-blurring (GOPRO). Results show that DAUs efficiently allocate parameters resulting in up to 4× more compact networks in terms of the number of parameters at similar or better performance.

KW - Adjustable receptive fields

KW - Compact ConvNets

KW - Displacement units

KW - Efficient ConvNets

UR - http://www.scopus.com/inward/record.url?scp=85077567153&partnerID=8YFLogxK

U2 - 10.1007/s11263-019-01282-1

DO - 10.1007/s11263-019-01282-1

M3 - Article

AN - SCOPUS:85077567153

SN - 0920-5691

VL - 128

SP - 2049

EP - 2067

JO - International Journal of Computer Vision

JF - International Journal of Computer Vision

IS - 8-9

ER -

Spatially-adaptive filter units for compact and efficient deep neural networks

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Fingerprint

Cite this