Enabling semantics-aware process mining through the automatic annotation of event logs

Adrian Rebmann, Han van der Aa

Veröffentlichungen: Beitrag in FachzeitschriftArtikelPeer Reviewed


Process mining is concerned with the analysis of organizational processes based on event data recorded during their execution. Foundational process mining techniques analyze such data in an abstract manner, without taking the meaning of these events or their payload into consideration. By contrast, other techniques may exploit specific kinds of information contained in event data, such as resources in organizational mining and business objects in object-centric analysis, to gain more specific insights into an organization's operations. However, the information required for such analyses is typically not readily available. Rather, the meaning of events is often captured in an ad hoc manner, commonly through unstructured textual attributes, such as an event's label, or in unclearly named attributes. In this work, we address this gap by proposing an approach for the automatic annotation of semantic components in event logs. To achieve this, we combine the analysis of textual attribute values, based on a state-of-the-art language model, with novel attribute classification and component categorization techniques. In this manner, our approach first identifies up to eight semantic components per event, revealing information on the actions, business objects, and resources recorded in an event log. Afterwards, our approach further categorizes the identified actions and actors, allowing for a more in-depth analysis of key process perspectives. We demonstrate our approach's efficacy through an evaluation using a broad range of event logs and highlight its usefulness through four application scenarios enabled by our approach.

FachzeitschriftInformation Systems
PublikationsstatusVeröffentlicht - Dez. 2022

ÖFOS 2012

  • 102015 Informationssysteme
  • 102035 Data Science