Network Representation Learning: A Survey
Top Cited Papers
- 25 June 2018
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Big Data
- Vol. 6 (1), 3-28
- https://doi.org/10.1109/tbdata.2018.2850013
Abstract
With the widespread use of information technologies, information networks are becoming increasingly popular to capture complex relationships across various disciplines, such as social networks, citation networks, telecommunication networks, and biological networks. Analyzing these networks sheds light on different aspects of social life such as the structure of societies, information diffusion, and communication patterns. In reality, however, the large scale of information networks often makes network analytic tasks computationally expensive or intractable. Network representation learning has been recently proposed as a new learning paradigm to embed network vertices into a low-dimensional vector space, by preserving network topology structure, vertex content, and other side information. This facilitates the original network to be easily handled in the new vector space for further analysis. In this survey, we perform a comprehensive review of the current literature on network representation learning in the data mining and machine learning field. We propose new taxonomies to categorize and summarize the state-of-the-art network representation learning techniques according to the underlying learning mechanisms, the network information intended to preserve, as well as the algorithmic designs and methodologies. We summarize evaluation protocols used for validating network representation learning including published benchmark datasets, evaluation methods, and open source algorithms. We also perform empirical studies to compare the performance of representative algorithms on common datasets, and analyze their computational complexity. Finally, we suggest promising research directions to facilitate future study.Keywords
Funding Information
- National Science Foundation (IIS-1763452)
- Australian Research Council (LP160100630, DP180100966)
- China Scholarship Council (201506300082)
- Data61
- Commonwealth Scientific and Industrial Research Organisation
This publication has 88 references indexed in Scilit:
- Molecular signatures database (MSigDB) 3.0Bioinformatics, 2011
- Leveraging social media networks for classificationData Mining and Knowledge Discovery, 2011
- The BioGRID Interaction Database: 2008 updateNucleic Acids Research, 2007
- Graph evolutionACM Transactions on Knowledge Discovery From Data, 2007
- Reducing the Dimensionality of Data with Neural NetworksScience, 2006
- Modularity and community structure in networksProceedings of the National Academy of Sciences of the United States of America, 2006
- Nonlinear Dimensionality Reduction by Locally Linear EmbeddingScience, 2000
- A Global Geometric Framework for Nonlinear Dimensionality ReductionScience, 2000
- Normalized cuts and image segmentationIEEE Transactions on Pattern Analysis and Machine Intelligence, 2000
- Graph visualization and navigation in information visualization: A surveyIEEE Transactions on Visualization and Computer Graphics, 2000