一种利用聚类思想解决重复任务问题的处理方法

doi:10.7523/j.issn.2095-6134.2009.1.016

中国科学院大学学报 ›› 2009, Vol. 26 ›› Issue (1): 107-113.DOI: 10.7523/j.issn.2095-6134.2009.1.016

一种利用聚类思想解决重复任务问题的处理方法

宋进亮罗铁坚陈肃刘伟

中国科学院研究生院, 北京100049

收稿日期:1900-01-01 修回日期:1900-01-01 发布日期:2009-01-15

A clustering based method to solve duplicate tasks problem

SONG Jin-Liang, LUO Tie-Jian, CHEN Su, LIU Wei

Graduate University of the Chinese Academy of Sciences, Beijing 100049, China

Received:1900-01-01 Revised:1900-01-01 Published:2009-01-15

摘要/Abstract

摘要： 流程挖掘是一种从实际业务执行日志中发现结构化流程信息的过程。流程挖掘技术广泛应用于业务流程的发现和辅助建模过程中，并能够通过差异分析的方法帮助改进已有业务流程。如何处理流程模型中的重复任务，是流程挖掘技术的一个关键问题。提出了一个在标准流程挖掘算法执行之前进行的重复任务处理阶段，这一重复任务处理方法可以很好地兼容目前已有的各种流程挖掘算法使之能够处理重复任务。并提出了一种能够将事件记录上下文信息的差别数值化的距离度量定义，使用这种度量能够利用聚类方法来识别输入数据中的重复任务。最后利用典型的带有重复任务的流程模型，对所提出的处理方法进行了模拟实验，并取得了良好的实验效果。

关键词: 流程挖掘, 重复任务, 聚类

Abstract: Process mining is to discover structured process description from real execution data. It helps the discovery and design of business process, and improves the existent ones through delta analysis. One of the challenging problems in process mining is how to deal with duplicate tasks. This paper provides a duplicate tasks treatment stage before the real execution of mining algorithm, which method is well compatible with existent process mining algorithms and helps them dealing with duplicate tasks. In addition, this paper designs a distance measure to transfer the difference of event context into numerical form, and take advantage of such distance to distinguish duplicate tasks through clustering technology. The method in this paper is proved by experiments on typical process model having duplicate tasks.

Key words: process mining, duplicate tasks, clustering

宋进亮罗铁坚陈肃刘伟. 一种利用聚类思想解决重复任务问题的处理方法[J]. 中国科学院大学学报, 2009, 26(1): 107-113.

SONG Jin-Liang, LUO Tie-Jian, CHEN Su, LIU Wei. A clustering based method to solve duplicate tasks problem[J]. , 2009, 26(1): 107-113.

[1]	杨随心, 耿修瑞, 杨炜暾, 赵永超, 卢晓军. 一种基于谱聚类算法的高光谱遥感图像分类方法[J]. 中国科学院大学学报, 2019, 36(2): 267-274.
[2]	隋小芸, 朱廷劭, 汪静莹. 基于局部特征优化的语音情感识别[J]. 中国科学院大学学报, 2017, 34(4): 431-438.
[3]	邢涛, 黄友红, 胡庆荣, 李军, 王冠勇. 基于动态K均值聚类算法的SAR图像分割[J]. 中国科学院大学学报, 2016, 33(5): 674-678.
[4]	公雪霜, 于丽君, 聂跃平, 朱建峰, 潘玉青. 辽宁西部地区先秦时期聚落遗址空间格局分析[J]. 中国科学院大学学报, 2016, 33(3): 373-379.
[5]	吴文娣, 程希骏, 刘峰. 基于K-means聚类和广义熵约束的CVaR投资组合模型[J]. 中国科学院大学学报, 2016, 33(1): 31-36.
[6]	谢小龙, 李毅. 陇西栽培蒙古黄芪生物学性状的多元统计分析[J]. 中国科学院大学学报, 2013, 30(4): 478-484.
[7]	毛万峰, 张红, 张波, 王超. 基于模糊水平集的SAR图像分割方法[J]. 中国科学院大学学报, 2013, 30(2): 238-243.
[8]	王秋明, 高慧颖, 刘科成. 基于模糊聚类及灰色关联的软件需求分析方法[J]. 中国科学院大学学报, 2010, 27(6): 859-863.
[9]	曹政, 朱明. 一种快速有效的相似视频检索方法[J]. 中国科学院大学学报, 2010, 27(3): 376-380.
[10]	夏鲁宁, 荆继武. SA-DBSCAN:一种自适应基于密度聚类算法[J]. 中国科学院大学学报, 2009, 26(4): 530-538.
[11]	王晶, 夏鲁宁, 荆继武. 一种基于密度最大值的聚类算法[J]. 中国科学院大学学报, 2009, 26(4): 539-548.
[12]	秦钰;　荆继武;　向继;　张爱华. 基于优化初始类中心点的K-means改进算法[J]. 中国科学院大学学报, 2007, 24(6): 771-777.
[13]	谢小龙胡延萍赵旭东王莉李毅. 陇西栽培蒙古黄芪酯酶同工酶数量分析[J]. 中国科学院大学学报, 2007, 24(4): 525-529.
[14]	刘务华; 罗铁坚; 王文杰. 文本聚类算法的质量评价[J]. 中国科学院大学学报, 2006, 23(5): 640-646.

一种利用聚类思想解决重复任务问题的处理方法

A clustering based method to solve duplicate tasks problem

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 14

编辑推荐

Metrics

本文评价

访问统计

联系我们