Places Clustering of Full-Length Film Key-Frames Using Latent Aspect Modeling Over SIFT Matches
- 16 March 2009
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Circuits and Systems for Video Technology
- Vol. 19 (6), 832-841
- https://doi.org/10.1109/tcsvt.2009.2017304
Abstract
An improved unsupervised classification method to extract and link places features and cluster recurrent physical locations (key-places) within a movie is presented. Our approach finds links between key frames of a common key-place based on the use of a probabilistic latent space model over the possible local matches between the key frames image set. This allows the extraction of significant groups of local matching descriptors that may represent characteristic elements of a key-place. An exhaustive evaluation of our approach was conducted on in-house and public image datasets, as well as on full-length movies. Results revealed that our method is very efficient for near-duplicate object/background detection with weak overlap. Performance measurements on full-length movies indicate a recognition rate of about 75% on the key-places clustering with a false alarm rate (FAR) of approximately 2%.This publication has 32 references indexed in Scilit:
- Hierarchical Dirichlet ProcessesJournal of the American Statistical Association, 2006
- Photo tourismACM Transactions on Graphics, 2006
- Object Level Grouping for Video ShotsInternational Journal of Computer Vision, 2006
- Extracting Scale and Illuminant Invariant Regions through ColorPublished by British Machine Vision Association and Society for Pattern Recognition ,2006
- A Bayesian Hierarchical Model for Learning Natural Scene CategoriesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object CategoriesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Discovering objects and their location in imagesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Shot Clustering Techniques for Story BrowsingIEEE Transactions on Multimedia, 2004
- Video Google: a text retrieval approach to object matching in videosPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Systematic evaluation of logical story unit segmentationIEEE Transactions on Multimedia, 2002