ChaLearn Looking at People RGB-D Isolated and Continuous Datasets for Gesture Recognition
- 1 June 2016
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 761-769
- https://doi.org/10.1109/cvprw.2016.100
Abstract
In this paper, we present two large video multi-modal datasets for RGB and RGB-D gesture recognition: the ChaLearn LAP RGB-D Isolated Gesture Dataset (IsoGD) and the Continuous Gesture Dataset (ConGD). Both datasets are derived from the ChaLearn Gesture Dataset (CGD) that has a total of more than 50000 gestures for the "one-shot-learning" competition. To increase the potential of the old dataset, we designed new well curated datasets composed of 249 gesture labels, and including 47933 gestures manually labeled the begin and end frames in sequences. Using these datasets we will open two competitions on the CodaLab platform so that researchers can test and compare their methods for "user independent" gesture recognition. The first challenge is designed for gesture spotting and recognition in continuous sequences of gestures while the second one is designed for gesture classification from segmented data. The baseline method based on the bag of visual words model is also presented.Keywords
This publication has 16 references indexed in Scilit:
- Guest Editors’ Introduction to the Special Issue on Multimodal Human Pose Recovery and Behavior AnalysisIEEE Transactions on Pattern Analysis and Machine Intelligence, 2016
- ChaLearn Looking at People 2015: Apparent Age and Cultural Event Recognition Datasets and ResultsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- ChaLearn looking at people 2015 new competitions: Age estimation and cultural event recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- A Fast and Accurate Unconstrained Face DetectorIEEE Transactions on Pattern Analysis and Machine Intelligence, 2015
- Fully convolutional networks for semantic segmentationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- ImageNet Large Scale Visual Recognition ChallengeInternational Journal of Computer Vision, 2015
- Spatial Pyramid Pooling in Deep Convolutional Networks for Visual RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence, 2015
- ChAirGestPublished by Association for Computing Machinery (ACM) ,2013
- ChaLearn multi-modal gesture recognition 2013Published by Association for Computing Machinery (ACM) ,2013
- Results and Analysis of the ChaLearn Gesture Challenge 2012Lecture Notes in Computer Science, 2013