Skip to main navigation Skip to search Skip to main content

Parameter Selection for DBSCAN: Insights from Persistent Homology

Publications: Contribution to bookContribution to proceedingsPeer Reviewed

Abstract

Density-based clustering algorithms like DBSCAN are highly effective but sensitive to parameter selection, particularly the neighborhood radius (ϵ) and the minimum number of neighboring points to form a cluster (minPts). We analyze and investigate the influence of the parameter settings onto the clustering outcome under the lense of persistent homology, a technique from topological data analysis. Persistent homology analyzes topological features, such as connected components and loops, across multiple spatial scales, improving clustering accuracy and robustness. We use the density-connectivity distance, a recent finding in the field, to allow full automatization of our approach. In extensive experiments, we demonstrate how insights from persistent homology can help to identify optimal parameter values and introduce an approach to automate parameter selection for density-based clustering. The proposed technique allows DBSCAN and related algorithms to perform effectively on a large variety of datasets without any user input. It combines topological insights with clustering techniques to provide a foundation for robust, automated approaches to complex data analysis.

Original languageEnglish
Title of host publicationECAI 2025
Subtitle of host publication28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy – Including 14th Conference on Prestigious Applications of Intelligent Systems (PAIS 2025)
EditorsInês Lynce, Nello Murano, Mauro Vallati, Serena Villata, Federico Chesani, Michela Milano, Andrea Omicini, Mehdi Dastani
PublisherIOS Press
Pages3250-3257
Number of pages8
Volume413
ISBN (Electronic) 978-1-64368-631-8
DOIs
Publication statusPublished - 1 Oct 2025

Publication series

SeriesFrontiers in Artificial Intelligence and Applications

Austrian Fields of Science 2012

  • 102033 Data mining

Fingerprint

Dive into the research topics of 'Parameter Selection for DBSCAN: Insights from Persistent Homology'. Together they form a unique fingerprint.

Cite this