Cluster Optimization for Boundary Points using Distributive Progressive Feature Selection Algorithm

Main Article Content

Ch. Raja Ramesh, et. al.

Abstract

A group of different data objects is classified as similar objects is known as clusters. It is the process of finding homogeneous data items like patterns, documents etc. and then group the homogenous data items togetherothers groupsmay have dissimilar data items. Most of the clustering methods are either crisp or fuzzy and moreover member allocation to the respective clusters is strictly based on similarity measures and membership functions.Both of the methods have limitations in terms of membership. One strictly decides a sample must belong to single cluster and other anyway fuzzy i.e probability. Finally, Quality and Purity like measure are applied to understand how well clusters are created. But there is a grey area in between i.e. ‘Boundary Points’ and ‘Moderately Far’ points from the cluster centre. We considered the cluster quality [18], processing time and relevant features identification as basis for our problem statement and implemented Zone based clustering by using map reducer concept. I have implemented the process to find far points from different clusters and generate a new cluster, repeat the above process until cluster quantity is stabilized. By using this processwe can improve the cluster quality and processing time also.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Article Details

How to Cite
et. al., C. R. R. . (2021). Cluster Optimization for Boundary Points using Distributive Progressive Feature Selection Algorithm. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(2), 3320–3328. Retrieved from https://turcomat.org/index.php/turkbilmat/article/view/2391
Section
Articles