Alternative Title: Accelerate K-means for multi-center clustering of big datasets
张顺龙; 库涛; 周浩
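Source Publication: 计算机应用研究 (Application Research of Computers)
Indexed By: CSCD
Contribution Rank: 1
Funding Organization: National Science and Technology Support Program (2012BAH15F05); Jilin Province Technology Innovation Fund for Science and Technology-based Small and Medium Enterprises (12C26212201399); National Natural Science Foundation of China (612033161, 51205389)
Keyword: Diack; accelerated k-means; clustering; triangle inequality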
Other Abstract: The k-means algorithm is the most popular clustering algorithm, but for big datasets with many clusters it takes a long time to find all the clusters. This paper proposes a new acceleration method for k-means that combines dynamic, immediate adjustment of the centers with the triangle inequality. The triangle inequality is used to avoid redundant distance computations. Unlike Elkan's algorithm, however, the centers are first divided into outer-centers and inner-centers for each data point, and only the lower bounds to the inner-centers are tracked. In addition, the data points are adjusted cluster by cluster, and each cluster center is updated immediately after that cluster's adjustment finishes, which effectively reduces the number of iterations. The experimental results show that the algorithm runs much faster than Elkan's algorithm, with much less memory consumption, when the number of cluster centers is larger than 20 and the dataset contains more than 10 million records, and that the speedup improves as k increases.
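As a rough illustration of how the triangle inequality avoids redundant distance computations, the sketch below shows the basic center-to-center pruning test that Elkan-style accelerations build on. It is not the algorithm of this paper: the inner-center/outer-center split, the per-point lower bounds, and the immediate center update are not reproduced here, and the function names `assign_with_pruning` and `assign_brute_force` are hypothetical.

```python
import math


def dist(a, b):
    """Euclidean distance between two equal-length coordinate sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def assign_with_pruning(points, centers):
    """Assign each point to its nearest center, skipping provably useless distances.

    Returns (labels, skipped), where skipped counts the point-to-center
    distances that were never computed. Hypothetical helper for illustration.
    """
    k = len(centers)
    # Center-to-center distances are computed once per assignment pass.
    cc = [[dist(centers[i], centers[j]) for j in range(k)] for i in range(k)]
    labels, skipped = [], 0
    for x in points:
        best, best_d = 0, dist(x, centers[0])
        for j in range(1, k):
            # Triangle inequality: d(x, c_j) >= d(c_best, c_j) - d(x, c_best).
            # So if d(c_best, c_j) >= 2 * d(x, c_best), c_j cannot be closer.
            if cc[best][j] >= 2.0 * best_d:
                skipped += 1
                continue
            d = dist(x, centers[j])
            if d < best_d:
                best, best_d = j, d
        labels.append(best)
    return labels, skipped


def assign_brute_force(points, centers):
    """Reference assignment that computes every point-to-center distance."""
    return [min(range(len(centers)), key=lambda j: dist(x, centers[j]))
            for x in points]


if __name__ == "__main__":
    centers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
    points = [(0.5, 0.5), (9.0, 1.0), (1.0, 9.5), (5.0, 5.0)]
    labels, skipped = assign_with_pruning(points, centers)
    print(labels, skipped)
```

The pruned assignment returns the same labels as the brute-force version; the saving comes only from the skipped distance evaluations, which become more frequent when points lie close to their current center relative to the spacing between centers.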
Citation statistics
Cited Times: 7 [CSCD]
Document Type: Journal article
Corresponding Author: 张顺龙
Recommended Citation
GB/T 7714 张顺龙,库涛,周浩. 针对多聚类中心大数据集的加速K-means聚类算法[J]. 计算机应用研究,2016,33(2):413-416.
APA 张顺龙,库涛,&周浩.(2016).针对多聚类中心大数据集的加速K-means聚类算法.计算机应用研究,33(2),413-416.
MLA 张顺龙,et al."针对多聚类中心大数据集的加速K-means聚类算法".计算机应用研究 33.2(2016):413-416.
Files in This Item:
File Name/Size | DocType | Version | Access | License
针对多聚类中心大数据集的加速K_mean (340KB) | Journal article | Author's accepted manuscript | Open access | ODC PDDL
File name: 针对多聚类中心大数据集的加速K_means聚类算法.pdf
Format: Adobe PDF
