Skip to main navigation Skip to search Skip to main content

ML Pipeline Insights Service for Rule-Based Assessment of Training Practices in Reinforcement Learning

Publications: Contribution to bookContribution to proceedingsPeer Reviewed

Abstract

As artificial intelligence continues to advance, Reinforcement Learning (RL) has established itself as a core approach for developing intelligent agents that make decisions over time. As RL systems grow in complexity, the need for standardized training practices becomes critical. This paper introduces a rule-based assessment approach to enforce best practices in RL training. We define a comprehensive set of architectural rules focused on RL pipeline practices, models versioning, multi-agents deployment and managing models in inference. Our methodology integrates Large Language Models (LLMs) and custom-based code detectors to ensure compliance with these best practices across diverse RL systems. We developed a \textit{ML pipeline insights service} to automatically validate RL training practices directly from the source code. We validate our approach by applying it in a large-scale industrial case study and sixteen open-source case studies. Our evaluation showed that custom-based detectors achieved near-perfect precision and recall F_1 = 0.98, while LLM-based detectors provided scalable validation with moderate F_1 scores (0.67--0.71), demonstrating the hybrid approach’s strength in balancing accuracy and automation. The results demonstrate our tool's accuracy in identifying and enforcing best practices with high precision and recall rates, highlighting its practical applicability and automation feasibility.
Original languageEnglish
Title of host publicationSoftware Engineering and Advanced Applications
Subtitle of host publication51st Euromicro Conference, SEAA 2025, Salerno, Italy, September 10–12, 2025, Proceedings, Part I
PublisherSpringer Nature
Pages148-163
ISBN (Electronic)978-3-032-04190-6
ISBN (Print)978-3-032-04189-0
DOIs
Publication statusPublished - 7 Sept 2025
Event51st Euromicro Conference on Software Engineering and Advanced Applications - Salerno, Italy
Duration: 10 Sept 202512 Sept 2025
https://dsd-seaa.com/seaa2025/

Publication series

SeriesLecture Notes in Computer Science
Volume16081
ISSN0302-9743

Conference

Conference51st Euromicro Conference on Software Engineering and Advanced Applications
Abbreviated titleSEAA 2025
Country/TerritoryItaly
CitySalerno
Period10/09/2512/09/25
Internet address

Funding

This work was supported by: FFG (Austrian Research Promotion Agency) project MODIS (no. FO999895431); Austrian Science Fund (FWF) project CQ4CD, Grant-DOI: 10.55776/I6510.

FundersFunder number
Österreichische Forschungsförderungsgesellschaft mbH (FFG)FO999895431
Fonds zur Förderung der wissenschaftlichen Forschung (FWF)I 6510

Austrian Fields of Science 2012

  • 102022 Software development

Fingerprint

Dive into the research topics of 'ML Pipeline Insights Service for Rule-Based Assessment of Training Practices in Reinforcement Learning'. Together they form a unique fingerprint.

Cite this