CSIR Central

K-Means Algorithm: An Unsupervised Clustering Approach using Various Similarity/Dissimilarity Measures

IR@CEERI: CSIR-Central Electronics Engineering Research Institute, Pilani

View Archive Info
 
 
Field Value
 
Title K-Means Algorithm: An Unsupervised Clustering Approach using Various Similarity/Dissimilarity Measures
 
Creator Patel, SS
Kumar, N
Aswathy, J
Vaddadi, SK
Akbar, SA
Panchariya, PC
 
Subject Digital Systems
 
Description Clustering is an unsupervised method of classifying data objects into similar groups based on some features or properties usually known as similarity or dissimilarity measures. K-Means is one of the most popular method of clustering falls under the category of hard clustering. In this clustering method, any data object can belong to a single cluster. On the other hand, in soft clustering methods (e.g. fuzzy c-means clustering), the data object can be clustered in more than one cluster with some degree which is specified by the membership value with limitation imposed as the summation of these membership values should he equal to one. Although K-Means clustering technique is fairly old approach but still enjoy immense popularity in terms of being used in data grouping applications and machine learning. In this paper K-Means approach with five different distance measures e.g. Euclidean, Squared Euclidean, Half Squared Euclidean. Cosine and City Block distance has been explored and a comparative study is made based on the performance of these similarity criterions on real time Edible oil dataset acquired using MIR spectroscopy. Furthermore, it is also tried to investigate which similarity measure performs well for a particular set of data carrying unique pattern. The K-Means algorithm with various similarity-dissimilarity measures have been formulated and implemented in MATLAB R2015b environment provided by Mathworks.
 
Date 2021
 
Type Conference or Workshop Item
PeerReviewed
 
Format application/pdf
 
Identifier http://ceeri.csircentral.net/571/1/232020.pdf
Patel, SS and Kumar, N and Aswathy, J and Vaddadi, SK and Akbar, SA and Panchariya, PC (2021) K-Means Algorithm: An Unsupervised Clustering Approach using Various Similarity/Dissimilarity Measures. In: 4th International Conference on Intelligent Sustainable Systems (ICISS-2021), February 26-27, 2021, SCAD College of Engineering and Technology, Tirunelveli, India.
 
Relation http://ceeri.csircentral.net/571/