Event Oriented Dictionary Learning for Complex Event Detection

16 March 2015

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Image Processing

Vol. 24 (6), 1867-1878
https://doi.org/10.1109/tip.2015.2413294

Abstract

Complex event detection is a retrieval task whose goal is to find videos of a particular event in a large-scale, unconstrained Internet video archive, given example videos and text descriptions. Many multimodal fusion schemes that combine low-level and high-level features have been investigated and evaluated for this task. However, how to effectively select semantically meaningful high-level concepts from a large pool to assist complex event detection is rarely studied in the literature. In this paper, we propose a novel strategy to automatically select semantically meaningful concepts for the event detection task based on both the event-kit text descriptions and the concepts' high-level feature descriptions. Moreover, we introduce a novel event-oriented dictionary representation based on the selected semantic concepts. Toward this goal, we incorporate training images (frames) of the selected concepts, drawn from the semantic indexing dataset with a pool of 346 concepts, into a novel supervised multitask ℓ_p-norm dictionary learning framework. Extensive experimental results on the TRECVID multimedia event detection dataset demonstrate the efficacy of the proposed method.

Funding Information

Ministero dell’Istruzione, dell’Università e della Ricerca Cluster Project Active Ageing at Home
European Commission Project xLiMe
Australian Research Council Discovery Projects
U.S. Army Research Office (W911NF-13-1-0277)
National Science Foundation (IIS-1251187)

Cited by 101 articles