SANet: Smoothed Attention Network for Single Stage Face Detector

Lei Shi; Xiang Xu; Ioannis A. Kakadiaris

doi:10.1109/ICB45273.2019.8987285

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

5 Scopus citations

Abstract

Recently, significant effort has been devoted to exploring the role of feature fusion and enriching contextual information on detecting multi-scale faces. However, simply integrating features of different levels could lead to introducing significant noise. Moreover, recently proposed approaches of enriching contextual information are not efficient or ignore the gridding artifacts produced by dilated convolution. To tackle these issues, we developed a smoothed attention network (dubbed SANet), which introduces an Attention-guided Feature Fusion Module (AFFM) and a Smoothed Context Enhancement Module (SCEM). In particular, the AFFM applies an attention module to high-level semantic features and fuses attention-focused features with low-level semantic features to reduce the noise of the fused feature map. The SCEM stacks dilated convolution and convolution layers alternately to re-learn the relationship among completely separate sets of units produced by dilated convolution to maintain consistency of local information. The SANet achieves promising results on the WIDER FACE validation and testing datasets and is state-of-the-art on the UFDD dataset.
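The gridding artifact mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation of the SCEM. It is a simplified 1-D illustration, assuming a kernel size of 3 and the hypothetical helper names `receptive_offsets` and `coverage_gaps`. It tracks which input positions can influence a single output unit when dilated convolutions are stacked alone, versus when a dilated convolution is followed by an ordinary convolution.

```python
def receptive_offsets(dilations, kernel_size=3):
    """Return the set of input offsets that can influence one output unit
    after stacking 1-D convolutions with the given dilation rates.

    Illustrative assumption: odd kernel size, stride 1, no pooling.
    """
    half = kernel_size // 2
    offsets = {0}
    for d in dilations:
        # Taps of one layer, spaced d positions apart.
        taps = [k * d for k in range(-half, half + 1)]
        offsets = {o + t for o in offsets for t in taps}
    return offsets


def coverage_gaps(offsets):
    """List the positions inside the receptive field that never contribute."""
    lo, hi = min(offsets), max(offsets)
    return sorted(set(range(lo, hi + 1)) - offsets)


# Two stacked dilated layers (rate 2): only even offsets are ever sampled.
dilated_only = receptive_offsets([2, 2])

# A dilated layer followed by an ordinary layer (rate 1): the gaps are filled.
alternating = receptive_offsets([2, 1])

print(coverage_gaps(dilated_only))
print(coverage_gaps(alternating))
```

In this toy setting, stacking dilated layers alone leaves the odd offsets unconnected to the output, while alternating with an ordinary convolution connects every position in the receptive field. That matches the intuition the abstract gives for alternating the two layer types, though the actual module may differ in dimensions, rates, and layer count.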

Original language	English (US)
Title of host publication	2019 International Conference on Biometrics, ICB 2019
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781728136400
DOIs	https://doi.org/10.1109/ICB45273.2019.8987285
State	Published - Jun 2019
Event	2019 International Conference on Biometrics, ICB 2019 - Crete, Greece Duration: Jun 4 2019 → Jun 7 2019

Publication series

Name	2019 International Conference on Biometrics, ICB 2019

Conference

Conference	2019 International Conference on Biometrics, ICB 2019
Country/Territory	Greece
City	Crete
Period	6/4/19 → 6/7/19

ASJC Scopus subject areas

Computer Science Applications
Computer Vision and Pattern Recognition
Signal Processing
Statistics, Probability and Uncertainty
Demography

Cite this

SANet: Smoothed Attention Network for Single Stage Face Detector. / Shi, Lei; Xu, Xiang; Kakadiaris, Ioannis A.
2019 International Conference on Biometrics, ICB 2019. Institute of Electrical and Electronics Engineers Inc., 2019. 8987285 (2019 International Conference on Biometrics, ICB 2019).

Shi, L, Xu, X & Kakadiaris, IA 2019, SANet: Smoothed Attention Network for Single Stage Face Detector. in 2019 International Conference on Biometrics, ICB 2019., 8987285, 2019 International Conference on Biometrics, ICB 2019, Institute of Electrical and Electronics Engineers Inc., 2019 International Conference on Biometrics, ICB 2019, Crete, Greece, 6/4/19. https://doi.org/10.1109/ICB45273.2019.8987285

@inproceedings{ce2c790e90d64c02ab4240ea8bdd94d3,
  title = "SANet: Smoothed Attention Network for Single Stage Face Detector",
  abstract = "Recently, significant effort has been devoted to exploring the role of feature fusion and enriching contextual information on detecting multi-scale faces. However, simply integrating features of different levels could lead to introducing significant noise. Moreover, recently proposed approaches of enriching contextual information are not efficient or ignore the gridding artifacts produced by dilated convolution. To tackle these issues, we developed a smoothed attention network (dubbed SANet), which introduces an Attention-guided Feature Fusion Module (AFFM) and a Smoothed Context Enhancement Module (SCEM). In particular, the AFFM applies an attention module to high-level semantic features and fuses attention-focused features with low-level semantic features to reduce the noise of the fused feature map. The SCEM stacks dilated convolution and convolution layers alternately to re-learn the relationship among completely separate sets of units produced by dilated convolution to maintain consistency of local information. The SANet achieves promising results on the WIDER FACE validation and testing datasets and is state-of-the-art on the UFDD dataset.",
  author = "Lei Shi and Xiang Xu and Kakadiaris, {Ioannis A.}",
  note = "Funding Information: This material is based upon work supported by the U.S. Department of Homeland Security under Grant Award Number 2017-ST-BTI-0001-0201. This grant is awarded to the Borders, Trade, and Immigration (BTI) Institute: A DHS Center of Excellence led by the University of Houston, and includes support for the project EDGE awarded to the University of Houston. Publisher Copyright: {\textcopyright} 2019 IEEE.; 2019 International Conference on Biometrics, ICB 2019 ; Conference date: 04-06-2019 Through 07-06-2019",
  year = "2019",
  month = jun,
  doi = "10.1109/ICB45273.2019.8987285",
  language = "English (US)",
  series = "2019 International Conference on Biometrics, ICB 2019",
  publisher = "Institute of Electrical and Electronics Engineers Inc.",
  booktitle = "2019 International Conference on Biometrics, ICB 2019",
  address = "United States",
}

TY - GEN

T1 - SANet

T2 - 2019 International Conference on Biometrics, ICB 2019

AU - Shi, Lei

AU - Xu, Xiang

AU - Kakadiaris, Ioannis A.

N1 - Funding Information: This material is based upon work supported by the U.S. Department of Homeland Security under Grant Award Number 2017-ST-BTI-0001-0201. This grant is awarded to the Borders, Trade, and Immigration (BTI) Institute: A DHS Center of Excellence led by the University of Houston, and includes support for the project EDGE awarded to the University of Houston. Publisher Copyright: © 2019 IEEE.

PY - 2019/6

Y1 - 2019/6

N2 - Recently, significant effort has been devoted to exploring the role of feature fusion and enriching contextual information on detecting multi-scale faces. However, simply integrating features of different levels could lead to introducing significant noise. Moreover, recently proposed approaches of enriching contextual information are not efficient or ignore the gridding artifacts produced by dilated convolution. To tackle these issues, we developed a smoothed attention network (dubbed SANet), which introduces an Attention-guided Feature Fusion Module (AFFM) and a Smoothed Context Enhancement Module (SCEM). In particular, the AFFM applies an attention module to high-level semantic features and fuses attention-focused features with low-level semantic features to reduce the noise of the fused feature map. The SCEM stacks dilated convolution and convolution layers alternately to re-learn the relationship among completely separate sets of units produced by dilated convolution to maintain consistency of local information. The SANet achieves promising results on the WIDER FACE validation and testing datasets and is state-of-the-art on the UFDD dataset.

AB - Recently, significant effort has been devoted to exploring the role of feature fusion and enriching contextual information on detecting multi-scale faces. However, simply integrating features of different levels could lead to introducing significant noise. Moreover, recently proposed approaches of enriching contextual information are not efficient or ignore the gridding artifacts produced by dilated convolution. To tackle these issues, we developed a smoothed attention network (dubbed SANet), which introduces an Attention-guided Feature Fusion Module (AFFM) and a Smoothed Context Enhancement Module (SCEM). In particular, the AFFM applies an attention module to high-level semantic features and fuses attention-focused features with low-level semantic features to reduce the noise of the fused feature map. The SCEM stacks dilated convolution and convolution layers alternately to re-learn the relationship among completely separate sets of units produced by dilated convolution to maintain consistency of local information. The SANet achieves promising results on the WIDER FACE validation and testing datasets and is state-of-the-art on the UFDD dataset.

UR - http://www.scopus.com/inward/record.url?scp=85081061629&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85081061629&partnerID=8YFLogxK

U2 - 10.1109/ICB45273.2019.8987285

DO - 10.1109/ICB45273.2019.8987285

M3 - Conference contribution

AN - SCOPUS:85081061629

T3 - 2019 International Conference on Biometrics, ICB 2019

BT - 2019 International Conference on Biometrics, ICB 2019

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 4 June 2019 through 7 June 2019

ER -
