Tiny Videos: A Large Data Set for Nonparametric Video Retrieval and Frame Classification

17 June 2010

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Pattern Analysis and Machine Intelligence

Vol. 33 (3), 618-630
https://doi.org/10.1109/tpami.2010.118

Abstract

In this paper, we present a large database of over 50,000 user-labeled videos collected from YouTube. We develop a compact representation called “tiny videos” that achieves high video compression rates while retaining the overall visual appearance of the video as it varies over time. We show that frame sampling using affinity propagation, an exemplar-based clustering algorithm, achieves the best trade-off between compression and video recall. We use this large collection of user-labeled videos in conjunction with simple data mining techniques to perform related video retrieval, as well as classification of images and video frames. The classification results achieved by tiny videos are compared with the tiny images framework for a variety of recognition tasks. The tiny images data set consists of 80 million images collected from the Internet. These are the largest labeled research data sets of videos and images available to date. We show that tiny videos are better suited for classifying scenery and sports activities, while tiny images perform better at recognizing objects. Furthermore, we demonstrate that combining the tiny images and tiny videos data sets improves classification precision in a wider range of categories.


Cited by 33 articles