Cloud-Based Approach on Genetic Data Imputation Parameters Optimization

Pavlo Horun, Christine Strauss

Veröffentlichungen: Beitrag zu KonferenzPaperPeer Reviewed

Abstract

The imputation process for genetic data is cost and time-intensive, primarily due to the high complexity of the methods involved, and the substantial volume of data processed. A thorough performance evaluation of the imputation algorithms such as Beagle, AlphaPlantImpute, LinkImputeR, MACH and others shows that while some algorithms are highly accurate, they are often computationally expensive. Being widely used, they have multiple input parameters which impact the quality and accuracy of the imputation. Traditional machine learning techniques for parameter optimization like grid search and randomized search become inefficient in high-dimensional parameter spaces, leading to prohibitive computational costs, especially in large-scale applications. Our study proposes the cloud-based approach for input parameters optimization by using Bayesian optimization with consecutive Domain Reduction Transformer (DRT). Described algorithm and developed library allow users to find the optimal input parameters for the data imputation in a more flexible way.
OriginalspracheEnglisch
Seiten279 - 286
PublikationsstatusVeröffentlicht - 7 Jan. 2025
VeranstaltungIDDM 2024 International Conference on Informatics & Data-Driven Medicine 2024 - Birmingham, Großbritannien / Vereinigtes Königreich
Dauer: 14 Nov. 202416 Nov. 2024
https://science.lpnu.ua/iddm-2024

Konferenz

KonferenzIDDM 2024 International Conference on Informatics & Data-Driven Medicine 2024
KurztitelIDDM 2024
Land/GebietGroßbritannien / Vereinigtes Königreich
OrtBirmingham
Zeitraum14/11/2416/11/24
Internetadresse

ÖFOS 2012

  • 102004 Bioinformatik
  • 102038 Cloud Computing
  • 101016 Optimierung

Fingerprint

Untersuchen Sie die Forschungsthemen von „Cloud-Based Approach on Genetic Data Imputation Parameters Optimization“. Zusammen bilden sie einen einzigartigen Fingerprint.

Zitationsweisen