Deep spatio-temporal feature fusion with compact bilinear pooling for multimodal emotion recognition
- 26 July 2018
- journal article
- research article
- Published by Elsevier BV in Computer Vision and Image Understanding
- Vol. 174, 33-42
- https://doi.org/10.1016/j.cviu.2018.06.005
Abstract
No abstract availableFunding Information
- Australian Research Council (DP140100793)
This publication has 25 references indexed in Scilit:
- Survey on audiovisual emotion recognition: databases, features, and data fusion strategiesAPSIPA Transactions on Signal and Information Processing, 2014
- LSTM-Modeling of continuous emotions in an audiovisual affect recognition frameworkImage and Vision Computing, 2013
- Human emotion recognition from videos using spatio-temporal and audio featuresThe Visual Computer, 2012
- Audiovisual emotion recognition using ANOVA feature selection method and multi-classifier neural networksNeural Computing & Applications, 2012
- Recognizing expressions from face and body gesture by temporal normalized motion and appearance featuresImage and Vision Computing, 2012
- Scott's ruleWIREs Computational Statistics, 2010
- Multimodal information fusion application to human emotion recognition from face and speechMultimedia Tools and Applications, 2009
- A Fast Learning Algorithm for Deep Belief NetsNeural Computation, 2006
- Robust Real-Time Face DetectionInternational Journal of Computer Vision, 2004
- An Algorithm for Determining the Endpoints of Isolated UtterancesBell System Technical Journal, 1975