Understanding Blooming Human Groups in Social Networks

3 September 2015

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Multimedia

Vol. 17 (11), 1980-1988
https://doi.org/10.1109/tmm.2015.2476657

Abstract

Human group, which indicates the people who share similar characteristics, is used to categorize humans into distinct populations or groups. In recent years, with the explosive growth of image, new concepts of human group are blooming in social networks . People in the same human group can be categorized by their facial and clothes appearance characteristics. In this work, we propose an approach to understanding the new concepts of human group with few positive samples. To this end, we construct visual models crossing two modalities related to human images and surrounding texts. Two convolutional neural networks based on face and upper body are constructed separately. Two different convolutional neural networks (CNNs) architectures are explored for visual pre-traing. To assist the human group recognition, we also merge global convolutional feature of the image. The surrounding texts are represented by semantical vectors and utilized as image labels. We transform words in the text into fixed length vectors by the skip-gram model. Then the texts corresponding to each image are converted into one feature vector by sparse coding and max pooling. Given a few positive samples of new concepts of human group, the visual model can be improved to understand the semantical meaning of the new label. The experimental results demonstrate the effectiveness of the proposed visual model and show the excellent learning capacity with few samples.

Keywords

This publication has 20 references indexed in Scilit:

Multi-View Intact Space Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2014
Large-Margin Multi-ViewInformation Bottleneck
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014
What Do You Do? Occupation Recognition in a Photo via Social Context
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
From Bikers to Surfers: Visual Recognition of Urban Tribes
Published by British Machine Vision Association and Society for Pattern Recognition ,2013
Web-Scale Multimedia Information Networks
Proceedings of the IEEE, 2012
Urban tribes: Analyzing group photos from a social perspective
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2012
Describing Clothing by Semantic Attributes
Lecture Notes in Computer Science, 2012
Describing people: A poselet-based approach to attribute classification
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Articulated pose estimation with flexible mixtures-of-parts
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011

Cited by 36 articles