TY - JOUR
T1 - Histogram of Fuzzy Local Spatio-Temporal Descriptors for Video Action Recognition
AU - Zuo, Zheming
AU - Yang, Longzhi
AU - Liu, Yonghuai
AU - Chao, Fei
AU - Song, Ran
AU - Qu, Yanpeng
N1 - Funding Information:
Manuscript received August 2, 2019; revised October 16, 2019; accepted November 9, 2019. Date of publication December 3, 2019; date of current version February 28, 2020. This work was supported by the Royal Academy of Engineering under Grant IAPP1/100077. The work of Y. Liu was supported by the UKIERI-DST under Grant CHARM (DST UKIERI-2018-19-10). The work of Y. Qu was supported in part by the Innovation Support Plan for Dalian High-level Talents under Grant 2018RQ70 and in part by a Sêr Cymru II COFUND Fellowship, U.K. Paper no. TII-19-3599. (Corresponding author: Longzhi Yang.) Z. Zuo and L. Yang are with the Department of Computer and Information Sciences, Northumbria University, London E1 7HT, U.K. (e-mail: [email protected]; [email protected]).
Publisher Copyright:
© 2005-2012 IEEE.
PY - 2020/6/30
Y1 - 2020/6/30
N2 - Feature extraction plays a vital role in visual action recognition. Many existing gradient-based feature extractors, including histogram of oriented gradients, histogram of optical flow, motion boundary histograms, and histogram of motion gradients, build histograms for representing different actions over the spatio-temporal domain in a video. However, these methods require to set the number of bins for information aggregation in advance. Varying numbers of bins usually lead to inherent uncertainty within the process of pixel voting with regard to the bins in the histogram. This article proposes a novel method to handle such uncertainty by fuzzifying these feature extractors. The proposed approach has two advantages: it better represents the ambiguous boundaries between the bins and, thus, the fuzziness of the spatio-temporal visual information entailed in videos; and the contribution of each pixel is flexibly controlled by a fuzziness parameter for various scenarios. The proposed family of fuzzy descriptors and a combination of them are evaluated on two publicly available datasets, demonstrating that the proposed approach outperforms the original counterparts and other state-of-the-art methods.
AB - Feature extraction plays a vital role in visual action recognition. Many existing gradient-based feature extractors, including histogram of oriented gradients, histogram of optical flow, motion boundary histograms, and histogram of motion gradients, build histograms for representing different actions over the spatio-temporal domain in a video. However, these methods require to set the number of bins for information aggregation in advance. Varying numbers of bins usually lead to inherent uncertainty within the process of pixel voting with regard to the bins in the histogram. This article proposes a novel method to handle such uncertainty by fuzzifying these feature extractors. The proposed approach has two advantages: it better represents the ambiguous boundaries between the bins and, thus, the fuzziness of the spatio-temporal visual information entailed in videos; and the contribution of each pixel is flexibly controlled by a fuzziness parameter for various scenarios. The proposed family of fuzzy descriptors and a combination of them are evaluated on two publicly available datasets, demonstrating that the proposed approach outperforms the original counterparts and other state-of-the-art methods.
KW - Action recognition
KW - fuzziness
KW - histogram
KW - local feature descriptors
KW - video feature extraction
UR - http://www.scopus.com/inward/record.url?scp=85081995271&partnerID=8YFLogxK
U2 - 10.1109/TII.2019.2957268
DO - 10.1109/TII.2019.2957268
M3 - Article
SN - 1941-0050
VL - 16
SP - 4059
EP - 4067
JO - IEEE Transactions on Industrial Informatics
JF - IEEE Transactions on Industrial Informatics
IS - 6
M1 - 8919994
ER -