Projects per year
Abstract
Noise is a challenge for process mining algorithms, but there is no
standard definition of noise nor accepted way to quantify it. This
means it is not possible to mine with confidence from event logs
which may not record the underlying process correctly. We discuss
one way of thinking about noise in process mining. We consider mining
from a `noisy log' as learning a probability distribution over traces,
representing the true process, from a log which is a sample from
multiple distributions: the `true' process model and one or more
`noise' models. We apply this using a probabilistic analysis of the
Heuristics Miner algorithm, and demonstrate on a simple example.
We show that for a given model it is possible to predict how much
data is needed to mine the underlying model without the noise, and
identify differences in the the robustness of Heuristics Miner to
different types of noise.
standard definition of noise nor accepted way to quantify it. This
means it is not possible to mine with confidence from event logs
which may not record the underlying process correctly. We discuss
one way of thinking about noise in process mining. We consider mining
from a `noisy log' as learning a probability distribution over traces,
representing the true process, from a log which is a sample from
multiple distributions: the `true' process model and one or more
`noise' models. We apply this using a probabilistic analysis of the
Heuristics Miner algorithm, and demonstrate on a simple example.
We show that for a given model it is possible to predict how much
data is needed to mine the underlying model without the noise, and
identify differences in the the robustness of Heuristics Miner to
different types of noise.
Original language | English |
---|---|
Title of host publication | Computational Intelligence and Data Mining (CIDM), 2013 IEEE Symposium on |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 119-126 |
Number of pages | 8 |
ISBN (Electronic) | 978-1-4673-5895-8 |
DOIs | |
Publication status | Published - 2013 |
Fingerprint
Dive into the research topics of 'A Principled Approach to Mining From Noisy Logs Using Heuristics Miner'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Unified probabilistic modelleing of adaptive spatial temporal structures in the human brain
Tino, P. (Principal Investigator) & Kourtzi, Z. (Co-Investigator)
Biotechnology & Biological Sciences Research Council
1/10/10 → 30/03/14
Project: Research Councils