Scaling knowledge graph embedding models for link prediction

Abstract
Developing scalable solutions for training Graph Neural Networks (GNNs) for link prediction is challenging due to the inherent data dependencies, which entail high computational cost and a large memory footprint. To address these challenges, we propose a new method for scaling the training of knowledge graph embedding models for link prediction. To this end, we introduce three algorithmic strategies: self-sufficient partitions, constraint-based negative sampling, and edge mini-batch training. Experimental evaluation shows that our scaling solution for GNN-based knowledge graph embedding models achieves a 16x speedup on benchmark datasets while maintaining model performance comparable to that of non-distributed methods on standard metrics.
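The three strategies are only named in the abstract. As a rough illustration of how constraint-based negative sampling over a self-sufficient partition could combine with edge mini-batch training, the minimal sketch below samples negative tails only from entities local to the partition, so no cross-partition lookups are needed; all function names, the NumPy-based setup, and the toy data are our own assumptions, not the authors' implementation.

    import numpy as np

    rng = np.random.default_rng(0)

    def corrupt_within_partition(batch, local_entities, k, rng):
        """Corrupt the tail of each (head, rel, tail) triple with k entities
        sampled only from the same partition, keeping sampling local."""
        heads = np.repeat(batch[:, 0], k)
        rels = np.repeat(batch[:, 1], k)
        tails = rng.choice(local_entities, size=len(batch) * k)
        return np.stack([heads, rels, tails], axis=1)

    def edge_minibatches(edges, batch_size, rng):
        """Yield shuffled mini-batches of edges (triples) for one epoch."""
        order = rng.permutation(len(edges))
        for start in range(0, len(edges), batch_size):
            yield edges[order[start:start + batch_size]]

    # Toy partition: three triples over local entities {0, ..., 4}.
    partition_edges = np.array([[0, 0, 1], [2, 1, 3], [4, 0, 2]])
    local_entities = np.arange(5)

    for batch in edge_minibatches(partition_edges, batch_size=2, rng=rng):
        negatives = corrupt_within_partition(batch, local_entities, k=2, rng=rng)
        # A real trainer would score `batch` and `negatives` with the
        # embedding model and take a gradient step here.

Because each partition holds every entity its edges touch, negative sampling and mini-batch construction can proceed independently per worker, which is what makes the approach amenable to distributed training.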
