Analyzing the Communication Clusters in Datacenters

Klaus-Tycho Foerster, Thibault Marette, Stefan Neumann, Claudia Plant, Ylli Sadikaj, Stefan Schmid, Yllka Velaj

Publications: Contribution to bookContribution to proceedingsPeer Reviewed

Abstract

Datacenter networks have become a critical infrastructure of our digital society and over the last years, great efforts have been made to better understand the communication patterns inside datacenters. In particular, existing empirical studies showed that datacenter traffic typically features much temporal and spatial structure, and that at any given time, some communication pairs interact much more frequently than others. This paper generalizes this study to communication groups and analyzes how clustered the datacenter traffic is, and how stable these clusters are over time. To this end, we propose a methodology which revolves around a biclustering approach, allowing us to identify groups of racks and servers which communicate frequently over the network. In particular, we consider communication patterns occurring in three different Facebook datacenters: a Web cluster consisting of web servers serving web traffic, a Database cluster which mainly consists of MySQL servers, and a Hadoop cluster. Interestingly, we find that in all three clusters, small groups of racks and servers can produce a large fraction of the network traffic, and we can determine these groups even when considering short snapshots of network traffic. We also show empirically that these clusters are fairly stable across time. Our insights on the size and stability of communication clusters hence uncover an interesting potential for resource optimizations in datacenter infrastructures.

Original languageEnglish
Title of host publicationProceedings of the ACM Web Conference 2023, WWW 2023, Austin, TX, USA, 30 April 2023 - 4 May 2023
EditorsYing Ding, Jie Tang, Juan F. Sequeda, Lora Aroyo, Carlos Castillo, Geert-Jan Houben
Place of PublicationNew York
PublisherACM
Pages3022-3032
Number of pages11
ISBN (Electronic)9781450394161
DOIs
Publication statusPublished - Apr 2023

Austrian Fields of Science 2012

  • 102033 Data mining

Keywords

  • Clustering
  • Data Center
  • Network Traffic

Fingerprint

Dive into the research topics of 'Analyzing the Communication Clusters in Datacenters'. Together they form a unique fingerprint.

Cite this