Abstract
Clustering in Data Mining is the process of discovering groups of similar objects in data. The k-Means clustering algorithm, is designed to partition data into k distinct groups or clusters. With recent growth in data production, the need to scale up existing algorithms and computational ability has increased. Google introduced the Tensor Processing Unit (TPU), a powerful hardware, to meet the growing computational needs of modern technologies. In this paper, we aim to enhance the k-Means algorithm with the use of the Google TPU in terms of runtime while preserving the quality of the clustering results. We developed two versions that distribute training on the TPU in two different ways. The clustering results of the versions have advantages that complement each other in terms of runtime and accuracy.
Original language | English |
---|---|
Title of host publication | 2022 IEEE International Conference on Big Data (IEEE BigData) |
Editors | Shusaku Tsumoto, Yukio Ohsawa, Lei Chen, Dirk Van den Poel, Xiaohua Hu, Yoichi Motomura, Takuya Takagi, Lingfei Wu, Ying Xie, Akihiro Abe, Vijay Raghavan |
Place of Publication | Piscataway |
Publisher | IEEE |
Pages | 194-200 |
Number of pages | 7 |
ISBN (Electronic) | 978-1-6654-8045-1 |
DOIs | |
Publication status | Published - 1 Dec 2022 |
Austrian Fields of Science 2012
- 102033 Data mining
Keywords
- Clustering
- Data Mining
- k-mean
- Tensor Processing Unit (TPU)