Umeå universitets logga

umu.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
LLM-based Process Constraints Generationwith Context: Automating Conformance Checking and Semantic Anomaly Detection withInstruction Fine-Tuned and Vanilla Large Language Models
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för datavetenskap.
2025 (Engelska)Självständigt arbete på avancerad nivå (masterexamen), 20 poäng / 30 hpStudentuppsats (Examensarbete)
Abstract [en]

Analyzing the data generated by complex information systems to identify undesirable behaviors in event log traces---so-called conformance checking---is a key challenge. With the rise of deep learning and, more specifically, generative AI applications, one promising line of research is the auto-generation of (symbolic) temporal reasoning queries that can then be applied in a semi-automatic manner. Recent work has demonstrated that utilizing fine-tuned open-source large language models (LLMs) for this purpose is promising and, in some aspects, superior to other approaches to automated conformance checking.

This thesis further expands this research direction by integrating additional state-of-the-art LLMs, such as GPT-4o and Llama, and supporting the provision of process-specific natural language context to evaluate their effectiveness. While previous framework is designed to work directly with event log schemata, this study explores whether incorporating human-readable text descriptions as supplementary input improves performance for non-fine-tuned models. A naïve baseline is introduced to validate that all models outperform random predictions, ensuring the robustness of the evaluation.

The results show that fine-tuning significantly enhances performance, with the xSemAD model achieving consistently higher F1-scores across most constraint types compared to state-of-the-art LLMs. However, text descriptions did not yield the expected performance improvements, highlighting the complexity of aligning contextual information with process semantics. Additionally, for some inherently ambiguous constraints, such as Choice and Alternate Succession model performance was only marginally better than the naïve baseline. These findings emphasize the importance of task-specific adaptation and the need for advanced methods to address complex constraints.

By demonstrating the potential of fine-tuned LLMs for semantic anomaly detection, this thesis contributes to advancing automated conformance checking and lays the groundwork for future research. Proposed directions include improving textual context generation, exploring alternative ground truth sources, and developing specialized techniques for handling complex constraints.

Ort, förlag, år, upplaga, sidor
2025. , s. 35
Serie
UMNAD ; 1527
Nationell ämneskategori
Annan teknik
Identifikatorer
URN: urn:nbn:se:umu:diva-236090OAI: oai:DiVA.org:umu-236090DiVA, id: diva2:1942070
Externt samarbete
SAP Signavio
Utbildningsprogram
Civilingenjörsprogrammet i Teknisk datavetenskap
Handledare
Examinatorer
Tillgänglig från: 2025-03-04 Skapad: 2025-03-04 Senast uppdaterad: 2025-03-04Bibliografiskt granskad

Open Access i DiVA

fulltext(568 kB)103 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 568 kBChecksumma SHA-512
9e9466849f802ead85c7e786a59c7385b7a44f6c892fb5e42817990a162898bc85858e8af99c30274053619ae6ac9de3e2959a535e1ae3757d706479526d1997
Typ fulltextMimetyp application/pdf

Av organisationen
Institutionen för datavetenskap
Annan teknik

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 103 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 390 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf