Enhancing k-Means Algorithm with Tensor Processing Unit

Pranava Mummoju, Anna Wolff, Martin Perdacher, Claudia Plant, Christian Böhm

Publications: Contribution to bookContribution to proceedingsPeer Reviewed

Abstract

Clustering in Data Mining is the process of discovering groups of similar objects in data. The k-Means clustering algorithm, is designed to partition data into k distinct groups or clusters. With recent growth in data production, the need to scale up existing algorithms and computational ability has increased. Google introduced the Tensor Processing Unit (TPU), a powerful hardware, to meet the growing computational needs of modern technologies. In this paper, we aim to enhance the k-Means algorithm with the use of the Google TPU in terms of runtime while preserving the quality of the clustering results. We developed two versions that distribute training on the TPU in two different ways. The clustering results of the versions have advantages that complement each other in terms of runtime and accuracy.
Original languageEnglish
Title of host publication2022 IEEE International Conference on Big Data (IEEE BigData)
EditorsShusaku Tsumoto, Yukio Ohsawa, Lei Chen, Dirk Van den Poel, Xiaohua Hu, Yoichi Motomura, Takuya Takagi, Lingfei Wu, Ying Xie, Akihiro Abe, Vijay Raghavan
Place of PublicationPiscataway
PublisherIEEE
Pages194-200
Number of pages7
ISBN (Electronic)978-1-6654-8045-1
DOIs
Publication statusPublished - 1 Dec 2022

Austrian Fields of Science 2012

  • 102033 Data mining

Keywords

  • Clustering
  • Data Mining
  • k-mean
  • Tensor Processing Unit (TPU)

Cite this