TY - GEN
T1 - What do I see? Modeling human visual perception for multi-person tracking
AU - Yan, Xu
AU - Kakadiaris, Ioannis A.
AU - Shah, Shishir K.
N1 - Funding Information:
This work was supported in part by the US Department of Justice, grant number 2009-MU-MU-K004. Any opinions, findings, conclusions or recommendations expressed in this paper are those of the authors and do not necessarily reflect the views of our sponsors.
PY - 2014
Y1 - 2014
N2 - This paper presents a novel approach for multi-person tracking utilizing a model motivated by the human vision system. The model predicts human motion based on modeling of perceived information. An attention map is designed to mimic human reasoning by integrating both spatial and temporal information. The spatial component addresses human attention allocation to different areas in a scene and is represented using a retinal mapping based on the log-polar transformation, while the temporal component denotes human attention allocation to subjects with different motion velocities and is modeled as a static-dynamic attention map. With the static-dynamic attention map and retinal mapping, attention-driven motion of the tracked target is estimated with a center-surround search mechanism. This perception-based motion model is integrated into a data association tracking framework with appearance and motion features. The proposed algorithm tracks a large number of subjects in complex scenes, and the evaluation on public datasets shows promising improvements over state-of-the-art methods.
AB - This paper presents a novel approach for multi-person tracking utilizing a model motivated by the human vision system. The model predicts human motion based on modeling of perceived information. An attention map is designed to mimic human reasoning by integrating both spatial and temporal information. The spatial component addresses human attention allocation to different areas in a scene and is represented using a retinal mapping based on the log-polar transformation, while the temporal component denotes human attention allocation to subjects with different motion velocities and is modeled as a static-dynamic attention map. With the static-dynamic attention map and retinal mapping, attention-driven motion of the tracked target is estimated with a center-surround search mechanism. This perception-based motion model is integrated into a data association tracking framework with appearance and motion features. The proposed algorithm tracks a large number of subjects in complex scenes, and the evaluation on public datasets shows promising improvements over state-of-the-art methods.
UR - http://www.scopus.com/inward/record.url?scp=84906512970&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84906512970&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-10605-2_21
DO - 10.1007/978-3-319-10605-2_21
M3 - Conference contribution
AN - SCOPUS:84906512970
SN - 9783319106045
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 314
EP - 329
BT - Computer Vision, ECCV 2014 - 13th European Conference, Proceedings
PB - Springer-Verlag
T2 - 13th European Conference on Computer Vision, ECCV 2014
Y2 - 6 September 2014 through 12 September 2014
ER -