大理大学学报 ›› 2022, Vol. 7 ›› Issue (6): 9-17.

• 数学与计算机科学 • 上一篇    下一篇

高平均效用co-location模式挖掘的一种有效算法

  

  1. 大理大学数学与计算机学院,云南大理 671003
  • 收稿日期:2021-10-05 修回日期:2021-11-10 出版日期:2022-06-15 发布日期:2022-07-04
  • 通讯作者: 李晓伟,讲师,博士,E-mail:lixiaowei_xidian@163.com。
  • 作者简介:曾新,讲师,主要从事数据挖掘、计算机应用技术研究。
  • 基金资助:
    云南省地方本科高校(部分)基础研究联合专项资金项目(2018FH001-062;2018FH001-063);大理大学数据安全与应用创新团队(ZKLX2020308)

An Effective Algorithm for Mining High Average-Utility Co-Location Patterns

  1. College of Mathematics and Computer, Dali University, Dali, Yunnan 671003, China
  • Received:2021-10-05 Revised:2021-11-10 Online:2022-06-15 Published:2022-07-04

摘要:

co-location模式是空间特征集合的一个子集,模式中不同特征的实例频繁出现在邻近区域内。纵观高效用co-location模式挖掘的相关研究报道,现存的高效用co-location模式挖掘方法没有考虑模式的长度对模式效用的影响。为了探索这一问题,提出一种从空间数据集中挖掘高平均效用co-location模式的算法(HAUCP),以便更好地评价co-location模式的真实效用。首先,基于空间数据集提出高平均效用co-location模式的完整定义;其次,构建了高平均效用co-location模式挖掘的基本算法,并探索了两种有效的剪枝策略以提升算法的运行效率;最后,在真实数据集和合成数据集上对算法的有效性和可扩展性进行了大量实验,实验结果表明HAUCP算法挖掘出的高效用co-location模式更加合理。

关键词: 高平均效用, co-location模式, 模式长度, 空间数据集, 数据挖掘算法

Abstract:

Co-location pattern is a subset of spatial feature setand some instances of different features in co-location pattern frequently appear in adjacent areas. Looking at relevant research reports on high utility co-location pattern mining the existing high utility co-location patterns mining algorithms do not consider the impact of the length of pattern on the utility of pattern. To explore this issue we propose an effective algorithm of mining high average-utility co-location patterns HAUCP from spatial dataset to provide a better evaluation of the real utility of co-location pattern. First a comprehensive definition of high average-utility co-location patterns based on spatial dataset is proposed. Secondly the basic algorithm of high average-utility co-location pattern mining is constructed. For improving the efficiency of the basic algorithm two pruning strategies are developed to reduce the computing costs. Finally extensive experiments on real dataset and synthetic dataset are carried out to prove the effectiveness of the proposed algorithm. Experimental results show that high-utility co-location patterns of HAUCP are more reasonable.

Key words:

high average-utility, co-location patterns, pattern length, spatial dataset, data mining algorithm

中图分类号: