Multitask Spectral Clustering by Exploring Intertask Correlation
- 18 September 2014
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Cybernetics
- Vol. 45 (5), 1083-1094
- https://doi.org/10.1109/tcyb.2014.2344015
Abstract
Clustering, as one of the most classical research problems in pattern recognition and data mining, has been widely explored and applied to various applications. Due to the rapid evolution of data on the Web, more emerging challenges have been posed on traditional clustering techniques: 1) correlations among related clustering tasks and/or within individual task are not well captured; 2) the problem of clustering out-of-sample data is seldom considered; and 3) the discriminative property of cluster label matrix is not well explored. In this paper, we propose a novel clustering model, namely multitask spectral clustering (MTSC), to cope with the above challenges. Specifically, two types of correlations are well considered: 1) intertask clustering correlation, which refers the relations among different clustering tasks and 2) intratask learning correlation, which enables the processes of learning cluster labels and learning mapping function to reinforce each other. We incorporate a novel l 2,p -norm regularizer to control the coherence of all the tasks based on an assumption that related tasks should share a common low-dimensional representation. Moreover, for each individual task, an explicit mapping function is simultaneously learnt for predicting cluster labels by mapping features to the cluster label matrix. Meanwhile, we show that the learning process can naturally incorporate discriminative information to further improve clustering performance. We explore and discuss the relationships between our proposed model and several representative clustering techniques, including spectral clustering, k -means and discriminative k -means. Extensive experiments on various real-world datasets illustrate the advantage of the proposed MTSC model compared to state-of-the-art clustering approaches.Funding Information
- ARC Discovery (DP130103252)
- Tianjin Key Laboratory of Cognitive Computing and Application
This publication has 31 references indexed in Scilit:
- Knowledge Adaptation with PartiallyShared Features for Event DetectionUsing Few ExemplarsIEEE Transactions on Pattern Analysis and Machine Intelligence, 2014
- Feature Selection for Multimedia Analysis by Sharing Information Among Multiple TasksIEEE Transactions on Multimedia, 2012
- Discriminative Nonnegative Spectral Clustering with Out-of-Sample ExtensionIEEE Transactions on Knowledge and Data Engineering, 2012
- Spectral Embedded Clustering: A Framework for In-Sample and Out-of-Sample Spectral ClusteringIEEE Transactions on Neural Networks, 2011
- HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human MotionInternational Journal of Computer Vision, 2009
- Clustering with Local and Global RegularizationIEEE Transactions on Knowledge and Data Engineering, 2009
- Kernel k-meansPublished by Association for Computing Machinery (ACM) ,2004
- Spectral grouping using the nystrom methodIeee Transactions On Pattern Analysis and Machine Intelligence, 2004
- Clustering for approximate similarity search in high-dimensional spacesIEEE Transactions on Knowledge and Data Engineering, 2002
- Pattern Recognition with Fuzzy Objective Function AlgorithmsPublished by Springer Science and Business Media LLC ,1981