Towards Data Normalization Task for the Efficient Mining of Medical Data

Ivan Izonin, Bohdan Ilchyshyn, Roman Tkachenko, Michal Gregus, Natalya Shakhovska, Christine Strauss

Publications: Contribution to book / Contribution to conference proceedings / Peer reviewed

Abstract

The paper investigates the problem of data normalization when solving medical diagnostic tasks with machine learning algorithms. The authors describe the operation, advantages, and disadvantages of five data normalization methods. The methods were evaluated on two datasets with different imbalance ratios, which is typical of medical tasks. The modeling consisted of solving a binary classification task with three machine learning methods based on decision trees. The experiments show that the ScalerOnCircle normalization method, unlike the others, increases the effectiveness of the studied machine learning methods on medical data: the F1-score increased significantly when this normalization method was used. The reason is that ScalerOnCircle, in addition to normalizing by columns, makes it possible to take into account relationships between the attributes of each vector in a dataset. This matters particularly in the medical field, where datasets intended for intelligent analysis are characterized by many attributes and complex nonlinear relationships between them, and this must be taken into account when mining such datasets. ScalerOnCircle thus offers several benefits for the efficient mining of medical data.
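
The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code and it does not implement ScalerOnCircle, whose definition the abstract does not give. It assumes min-max scaling as a representative column-wise normalization, uses unit-length scaling of each row only as a generic stand-in for a step that looks at a whole attribute vector, and takes the imbalance ratio to be the majority-class count divided by the minority-class count (a common definition, assumed here). All function names are hypothetical.

```python
import math
from collections import Counter


def imbalance_ratio(labels):
    """Majority-class count divided by minority-class count (assumed definition)."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def min_max_by_column(rows):
    """Column-wise normalization: rescale every attribute to [0, 1]."""
    columns = list(zip(*rows))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]


def unit_length_rows(rows):
    """Row-wise step (NOT ScalerOnCircle): scale each vector to unit length."""
    scaled = []
    for row in rows:
        norm = math.sqrt(sum(v * v for v in row))
        scaled.append([v / norm if norm > 0 else 0.0 for v in row])
    return scaled
```

Column-wise scaling treats each attribute independently, whereas the row-wise step makes every value depend on the other attributes of the same record. That is the general distinction the abstract draws, shown here only in its simplest form.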

Original language: English
Title: 2022 12th International Conference on Advanced Computer Information Technologies
Place of publication: Piscataway
Publisher: IEEE
Pages: 480-484
Number of pages: 5
ISBN (electronic): 978-1-66541-050-2
ISBN (print): 978-1-66541-049-6, 978-1-66546-647-9
DOIs
Publication status: Published - 2022
Event: 12th International Conference on Advanced Computer Information Technologies, ACIT 2022 - Ruzomberok, Slovakia
Duration: 26 Sept. 2022 to 28 Sept. 2022

Publication series

Series: International Conference on Advanced Computer Information Technologies
ISSN: 2770-5218

Conference

Conference: 12th International Conference on Advanced Computer Information Technologies, ACIT 2022
Country/Territory: Slovakia
City: Ruzomberok
Period: 26/09/22 to 28/09/22

ÖFOS 2012

  • 102019 Machine Learning
  • 301103 Diagnostics in medicine
