DVRNet: Decoupled visible region network for pedestrian detection

Lei Shi; Charles Livermore; Ioannis A. Kakadiaris

doi:10.1109/IJCB48548.2020.9304883

DVRNet: Decoupled visible region network for pedestrian detection

Lei Shi, Charles Livermore, Ioannis A. Kakadiaris

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

Pedestrian detection remains a challenging task due to the problems caused by occlusion variance. Visible-body bounding boxes are typically used as an extra supervision signal to improve the performance of pedestrian detection to predict the full-body. However, visible-body assisted approaches produce a large number of false positives, which result from a lack of adequate and discriminative full-body contextual information. In this paper, we propose a new network, dubbed DVRNet, based on the representative visible-body assisted pedestrian detector named Bi-box. Specifically, we extend Bi-box by adding three modules named the attention-based feature interleaver module (AFIM), the binary mask learning module (BMLM), and the head-aware feature enhancement module (HFEM), which play important roles in employing features learned by the visible-body and the head supervision signals to enrich high discriminative contextual information of the full-body and enhance the power of feature representation. Experimental results indicate that the DVRNet achieves promising results on the CityPersons and the CrowdHuman datasets.

Original language	English (US)
Title of host publication	IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781728191867
DOIs	https://doi.org/10.1109/IJCB48548.2020.9304883
State	Published - Sep 28 2020
Event	2020 IEEE/IAPR International Joint Conference on Biometrics, IJCB 2020 - Virtual, Online, United States Duration: Sep 28 2020 → Oct 1 2020

Publication series

Name	IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics

Conference

Conference	2020 IEEE/IAPR International Joint Conference on Biometrics, IJCB 2020
Country/Territory	United States
City	Virtual, Online
Period	9/28/20 → 10/1/20

ASJC Scopus subject areas

Computer Vision and Pattern Recognition
Biomedical Engineering
Instrumentation

Access to Document

10.1109/IJCB48548.2020.9304883

Cite this

Shi, L., Livermore, C., & Kakadiaris, I. A. (2020). DVRNet: Decoupled visible region network for pedestrian detection. In IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics Article 9304883 (IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IJCB48548.2020.9304883

DVRNet: Decoupled visible region network for pedestrian detection. / Shi, Lei; Livermore, Charles; Kakadiaris, Ioannis A.
IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics. Institute of Electrical and Electronics Engineers Inc., 2020. 9304883 (IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Shi, L, Livermore, C & Kakadiaris, IA 2020, DVRNet: Decoupled visible region network for pedestrian detection. in IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics., 9304883, IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics, Institute of Electrical and Electronics Engineers Inc., 2020 IEEE/IAPR International Joint Conference on Biometrics, IJCB 2020, Virtual, Online, United States, 9/28/20. https://doi.org/10.1109/IJCB48548.2020.9304883

@inproceedings{670840c5a54243838d48477faf141409,

title = "DVRNet: Decoupled visible region network for pedestrian detection",

abstract = "Pedestrian detection remains a challenging task due to the problems caused by occlusion variance. Visible-body bounding boxes are typically used as an extra supervision signal to improve the performance of pedestrian detection to predict the full-body. However, visible-body assisted approaches produce a large number of false positives, which result from a lack of adequate and discriminative full-body contextual information. In this paper, we propose a new network, dubbed DVRNet, based on the representative visible-body assisted pedestrian detector named Bi-box. Specifically, we extend Bi-box by adding three modules named the attention-based feature interleaver module (AFIM), the binary mask learning module (BMLM), and the head-aware feature enhancement module (HFEM), which play important roles in employing features learned by the visible-body and the head supervision signals to enrich high discriminative contextual information of the full-body and enhance the power of feature representation. Experimental results indicate that the DVRNet achieves promising results on the CityPersons and the CrowdHuman datasets.",

author = "Lei Shi and Charles Livermore and Kakadiaris, {Ioannis A.}",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE.; 2020 IEEE/IAPR International Joint Conference on Biometrics, IJCB 2020 ; Conference date: 28-09-2020 Through 01-10-2020",

year = "2020",

month = sep,

day = "28",

doi = "10.1109/IJCB48548.2020.9304883",

language = "English (US)",

series = "IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics",

address = "United States",

}

TY - GEN

T1 - DVRNet

T2 - 2020 IEEE/IAPR International Joint Conference on Biometrics, IJCB 2020

AU - Shi, Lei

AU - Livermore, Charles

AU - Kakadiaris, Ioannis A.

PY - 2020/9/28

Y1 - 2020/9/28

N2 - Pedestrian detection remains a challenging task due to the problems caused by occlusion variance. Visible-body bounding boxes are typically used as an extra supervision signal to improve the performance of pedestrian detection to predict the full-body. However, visible-body assisted approaches produce a large number of false positives, which result from a lack of adequate and discriminative full-body contextual information. In this paper, we propose a new network, dubbed DVRNet, based on the representative visible-body assisted pedestrian detector named Bi-box. Specifically, we extend Bi-box by adding three modules named the attention-based feature interleaver module (AFIM), the binary mask learning module (BMLM), and the head-aware feature enhancement module (HFEM), which play important roles in employing features learned by the visible-body and the head supervision signals to enrich high discriminative contextual information of the full-body and enhance the power of feature representation. Experimental results indicate that the DVRNet achieves promising results on the CityPersons and the CrowdHuman datasets.

AB - Pedestrian detection remains a challenging task due to the problems caused by occlusion variance. Visible-body bounding boxes are typically used as an extra supervision signal to improve the performance of pedestrian detection to predict the full-body. However, visible-body assisted approaches produce a large number of false positives, which result from a lack of adequate and discriminative full-body contextual information. In this paper, we propose a new network, dubbed DVRNet, based on the representative visible-body assisted pedestrian detector named Bi-box. Specifically, we extend Bi-box by adding three modules named the attention-based feature interleaver module (AFIM), the binary mask learning module (BMLM), and the head-aware feature enhancement module (HFEM), which play important roles in employing features learned by the visible-body and the head supervision signals to enrich high discriminative contextual information of the full-body and enhance the power of feature representation. Experimental results indicate that the DVRNet achieves promising results on the CityPersons and the CrowdHuman datasets.

UR - http://www.scopus.com/inward/record.url?scp=85099681848&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85099681848&partnerID=8YFLogxK

U2 - 10.1109/IJCB48548.2020.9304883

DO - 10.1109/IJCB48548.2020.9304883

M3 - Conference contribution

AN - SCOPUS:85099681848

T3 - IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics

BT - IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 28 September 2020 through 1 October 2020

ER -

DVRNet: Decoupled visible region network for pedestrian detection

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this