Massively Parallel Graph Drawing and Representation Learning

Christian Böhm, Claudia Plant

Publications: Contribution to bookContribution to proceedingsPeer Reviewed

Abstract

To fully exploit the performance potential of modern multi-core processors, machine learning and data mining algorithms for big data must be parallelized in multiple ways. Today's CPUs consist of multiple cores, each following an independent thread of control, and each equipped with multiple arithmetic units which can perform the same operation on a vector of multiple data objects. Graph embedding, i.e. converting the vertices of a graph into numerical vectors is a data mining task of high importance and is useful for graph drawing (low-dimensional vectors) and graph representation learning (high-dimensional vectors). In this paper, we propose MulticoreGEMPE (Graph Embedding by Minimizing the Predictive Entropy), an information-theoretic method which can generate low and high-dimensional vectors. MulticoreGEMPE applies MIMD (Multiple Instructions Multiple Data, using OpenMP) and SIMD (Single Instructions Multiple Data, using AVX-512) parallelism. We propose general ideas applicable in other graph-based algorithms like \emph{vectorized hashing} and \emph{vectorized reduction}. Our experimental evaluation demonstrates the superiority of our approach.
Original languageEnglish
Title of host publication2020 IEEE International Conference on Big Data (Big Data)
EditorsXintao Wu, Chris Jermaine, Li Xiong, Xiaohua Tony Hu, Olivera Kotevska, Siyuan Lu, Weijia Xu, Srinivas Aluru, Chengxiang Zhai, Eyhab Al-Masri, Zhiyuan Chen, Jeff Saltz
PublisherIEEE
Pages609-616
Number of pages8
ISBN (Electronic)9781728162515
ISBN (Print)978-1-7281-6252-2
DOIs
Publication statusPublished - 6 Nov 2020

Austrian Fields of Science 2012

  • 102033 Data mining

Keywords

  • AVX-512
  • Graph Drawing
  • Graph Embedding
  • Graph Representation Learning
  • MIMD
  • Multi-core
  • SIMD

Cite this