Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
LLM-based Process Constraints Generationwith Context: Automating Conformance Checking and Semantic Anomaly Detection withInstruction Fine-Tuned and Vanilla Large Language Models
Umeå University, Faculty of Science and Technology, Department of Computing Science.
2025 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Analyzing the data generated by complex information systems to identify undesirable behaviors in event log traces---so-called conformance checking---is a key challenge. With the rise of deep learning and, more specifically, generative AI applications, one promising line of research is the auto-generation of (symbolic) temporal reasoning queries that can then be applied in a semi-automatic manner. Recent work has demonstrated that utilizing fine-tuned open-source large language models (LLMs) for this purpose is promising and, in some aspects, superior to other approaches to automated conformance checking.

This thesis further expands this research direction by integrating additional state-of-the-art LLMs, such as GPT-4o and Llama, and supporting the provision of process-specific natural language context to evaluate their effectiveness. While previous framework is designed to work directly with event log schemata, this study explores whether incorporating human-readable text descriptions as supplementary input improves performance for non-fine-tuned models. A naïve baseline is introduced to validate that all models outperform random predictions, ensuring the robustness of the evaluation.

The results show that fine-tuning significantly enhances performance, with the xSemAD model achieving consistently higher F1-scores across most constraint types compared to state-of-the-art LLMs. However, text descriptions did not yield the expected performance improvements, highlighting the complexity of aligning contextual information with process semantics. Additionally, for some inherently ambiguous constraints, such as Choice and Alternate Succession model performance was only marginally better than the naïve baseline. These findings emphasize the importance of task-specific adaptation and the need for advanced methods to address complex constraints.

By demonstrating the potential of fine-tuned LLMs for semantic anomaly detection, this thesis contributes to advancing automated conformance checking and lays the groundwork for future research. Proposed directions include improving textual context generation, exploring alternative ground truth sources, and developing specialized techniques for handling complex constraints.

Place, publisher, year, edition, pages
2025. , p. 35
Series
UMNAD ; 1527
National Category
Other Engineering and Technologies
Identifiers
URN: urn:nbn:se:umu:diva-236090OAI: oai:DiVA.org:umu-236090DiVA, id: diva2:1942070
External cooperation
SAP Signavio
Educational program
Master of Science Programme in Computing Science and Engineering
Supervisors
Examiners
Available from: 2025-03-04 Created: 2025-03-04 Last updated: 2025-03-04Bibliographically approved

Open Access in DiVA

fulltext(568 kB)109 downloads
File information
File name FULLTEXT01.pdfFile size 568 kBChecksum SHA-512
9e9466849f802ead85c7e786a59c7385b7a44f6c892fb5e42817990a162898bc85858e8af99c30274053619ae6ac9de3e2959a535e1ae3757d706479526d1997
Type fulltextMimetype application/pdf

By organisation
Department of Computing Science
Other Engineering and Technologies

Search outside of DiVA

GoogleGoogle Scholar
Total: 109 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 436 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf