Improve accurate pose alignment and action localization by dense pose estimation
File(s)fg2018_dc_paper_8.1.pdf (5.07 MB)
Accepted version
Author(s)
Zhou, Yuxiang
Deng, Jiankang
Zafeiriou, Stefanos
Type
Conference Paper
Abstract
In this work we explore the use of shape-based representations as an auxiliary source of supervision for pose estimation and action recognition. We show that shape-based representations can act as a source of `privileged information' that complements and extends the pure landmark-level annotations. We explore 2D shape-based supervision signals, such as Support Vector Shape. Our experiments show that shape-based supervision signals substantially improve pose alignment accuracy in the form of a cascade architecture. We outperform state-of-the-art methods on the MPII and LSP datasets, while using substantially shallower networks. For action localization in untrimmed videos, our method introduces additional classification signals based on the structured segment networks (SSN) and further improved the performance. To be specific, dense human pose and landmarks localization signals are involved in detection progress. We applied out network to all frames of videos alongside with output from SSN to further improve detection accuracy, especially for pose related and sparsely annotated videos. The method in general achieves state-of-the-art performance on Activity Detection Task for ActivityNet Challenge2017 test set and witnesses remarkable improvement on pose related and sparsely annotated categories e.g. sports.
Date Issued
2018-06-07
Date Acceptance
2018-05-15
Citation
Proceedings 2018 13th IEEE international conference on automatic face & gesture recognition (fg 2018), 2018, pp.480-484
ISBN
9781538623350
ISSN
2326-5396
Publisher
IEEE
Start Page
480
End Page
484
Journal / Book Title
Proceedings 2018 13th IEEE international conference on automatic face & gesture recognition (fg 2018)
Copyright Statement
© 2018 IEEE.
Sponsor
Engineering & Physical Science Research Council (E
Commission of the European Communities
Identifier
http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000454996700067&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=1ba7043ffcc86c417c072aa74d649202
Grant Number
EP/N007743/1
688520
Source
13th IEEE International Conference on Automatic Face & Gesture Recognition (FG)
Subjects
Science & Technology
Technology
Computer Science, Artificial Intelligence
Engineering, Electrical & Electronic
Computer Science
Engineering
Publication Status
Published
Start Date
2018-05-15
Finish Date
2018-05-19
Coverage Spatial
Xi an, China
Date Publish Online
2018-06-07