欢迎访问中国科学院大学学报,今天是

中国科学院大学学报 ›› 2009, Vol. 26 ›› Issue (1): 107-113.DOI: 10.7523/j.issn.2095-6134.2009.1.016

• 论文 • 上一篇    下一篇

一种利用聚类思想解决重复任务问题的处理方法

宋进亮 罗铁坚 陈 肃 刘 伟   

  1. 中国科学院研究生院, 北京100049
  • 收稿日期:1900-01-01 修回日期:1900-01-01 发布日期:2009-01-15

A clustering based method to solve duplicate tasks problem

SONG Jin-Liang, LUO Tie-Jian, CHEN Su, LIU Wei   

  1. Graduate University of the Chinese Academy of Sciences, Beijing 100049, China
  • Received:1900-01-01 Revised:1900-01-01 Published:2009-01-15

摘要: 流程挖掘是一种从实际业务执行日志中发现结构化流程信息的过程。流程挖掘技术广泛应用于业务流程的发现和辅助建模过程中,并能够通过差异分析的方法帮助改进已有业务流程。如何处理流程模型中的重复任务,是流程挖掘技术的一个关键问题。提出了一个在标准流程挖掘算法执行之前进行的重复任务处理阶段,这一重复任务处理方法可以很好地兼容目前已有的各种流程挖掘算法使之能够处理重复任务。并提出了一种能够将事件记录上下文信息的差别数值化的距离度量定义,使用这种度量能够利用聚类方法来识别输入数据中的重复任务。最后利用典型的带有重复任务的流程模型,对所提出的处理方法进行了模拟实验,并取得了良好的实验效果。

关键词: 流程挖掘, 重复任务, 聚类

Abstract: Process mining is to discover structured process description from real execution data. It helps the discovery and design of business process, and improves the existent ones through delta analysis. One of the challenging problems in process mining is how to deal with duplicate tasks. This paper provides a duplicate tasks treatment stage before the real execution of mining algorithm, which method is well compatible with existent process mining algorithms and helps them dealing with duplicate tasks. In addition, this paper designs a distance measure to transfer the difference of event context into numerical form, and take advantage of such distance to distinguish duplicate tasks through clustering technology. The method in this paper is proved by experiments on typical process model having duplicate tasks.

Key words: process mining, duplicate tasks, clustering