Learning Effective Neural Nets for Outcome Prediction from Partially Labelled Log Data

Francesco Folino; Gianluigi Folino; Massimo Guarascio; Luigi Pontieri
2019

Abstract

The problem of inducing a model that forecasts the outcome of an ongoing process instance from historical log traces has attracted notable attention in the field of Process Mining. Approaches based on deep neural networks have become popular in this context, as a more effective alternative to previous feature-based outcome-prediction methods. However, these approaches rely on a purely supervised learning scheme and are ill-suited to many real-life scenarios where the outcome of (fully unfolded) training traces must be provided by experts. Indeed, since only a small number of labelled traces is usually available in such scenarios, there is a risk of discovering an inaccurate or overfitted model. To overcome these issues, a novel outcome-discovery approach is proposed here. It leverages a fine-tuning strategy: general-enough trace representations are first learned from unlabelled log traces and are then reused (and adapted) in the discovery of the outcome predictor. Results on real-life data confirm that the proposal is a more effective and robust solution for label-scarcity scenarios than current outcome-prediction methods.
Year: 2019
Institute: Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
ISBN: 9781728137988
Keywords: Process Mining; Neural Nets; Unlabelled Data
Files in this record:
Learning_Effective_Neural_Nets_for_Outcome_Prediction_from_Partially_Labelled_Log_Data.pdf
Type: Publisher's version (PDF)
License: Not public - private/restricted access (authorised users only)
Size: 140.02 kB
Format: Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/369262