DEV Community

Does Anyone Know of Any *Discrete* Clustering Algorithms?

Andrew (he/him) on June 07, 2019

I'm doing some research into clustering algorithms and every source I seem to find discusses 2D (or higher-dimensional) clustering of continuous da...
 
Bernhard Wittmann • Edited

How about single-linkage or complete-linkage clustering? Both are hierarchical clustering methods; you just have to choose a distance metric that works on the grid, such as Manhattan distance.
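
To make the single-linkage suggestion concrete, here is a minimal sketch using only the standard library. It relies on the fact that cutting a single-linkage dendrogram at a given height yields the connected components of the graph that links points no farther apart than that height. The function names, the sample points, and the threshold are all hypothetical, chosen for illustration.

```python
from itertools import combinations


def manhattan(p, q):
    """Manhattan (L1) distance between two integer grid points."""
    return sum(abs(a - b) for a, b in zip(p, q))


def single_linkage_clusters(points, threshold):
    """Flat single-linkage clusters at a distance cutoff.

    Two points end up together if a chain of points connects them in
    which every consecutive pair is at most `threshold` grid steps apart.
    """
    parent = list(range(len(points)))  # union-find forest

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # Merge every pair of points that is close enough on the grid
    for i, j in combinations(range(len(points)), 2):
        if manhattan(points[i], points[j]) <= threshold:
            parent[find(i)] = find(j)

    # Collect points by the root of their component
    clusters = {}
    for i, p in enumerate(points):
        clusters.setdefault(find(i), []).append(p)
    return list(clusters.values())


# Hypothetical grid data: two small blobs far apart
grid_points = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
clusters = single_linkage_clusters(grid_points, threshold=1)
```

This pairwise loop is quadratic in the number of points, which is fine for a sketch but may be too slow for large grids. Complete linkage does not reduce to connected components like this, so it would need a proper agglomerative implementation.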

Actually, shouldn't it be possible to adapt any clustering algorithm? Take k-means as an example: first, choose an appropriate distance metric as above; second, adjust the calculation of the prototypes so that each prototype is a point on the grid.
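
One possible reading of that k-means adaptation, as a rough sketch: assign points by Manhattan distance, and update each prototype with the coordinate-wise lower median instead of the mean. Strictly speaking this is closer to k-medians than to k-means, and the median update is an assumption of this sketch, not something stated above; rounding the mean to the nearest grid point would be another option. The helper names, sample points, and starting prototypes are hypothetical.

```python
from statistics import median_low


def manhattan(p, q):
    """Manhattan (L1) distance between two integer grid points."""
    return sum(abs(a - b) for a, b in zip(p, q))


def assign(points, centers):
    """Put each point in the group of its nearest prototype."""
    groups = [[] for _ in centers]
    for p in points:
        nearest = min(range(len(centers)), key=lambda k: manhattan(p, centers[k]))
        groups[nearest].append(p)
    return groups


def grid_kmeans(points, centers, max_iter=100):
    """Lloyd-style iteration whose prototypes always lie on the integer grid."""
    centers = [tuple(c) for c in centers]
    for _ in range(max_iter):
        groups = assign(points, centers)
        # Lower median per coordinate: each coordinate is a value that occurs
        # in the data, so the prototype is an integer grid point. An empty
        # group keeps its previous prototype.
        new_centers = [
            tuple(median_low(axis) for axis in zip(*g)) if g else c
            for g, c in zip(groups, centers)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, assign(points, centers)


# Hypothetical grid data and starting prototypes
grid_points = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
centers, groups = grid_kmeans(grid_points, centers=[(0, 0), (6, 5)])
```

The starting prototypes are passed in explicitly to keep the sketch deterministic; in practice they would usually be sampled from the data.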

 
Dylan • Edited

Can you be a bit more specific about what your data looks like? Are x and y categorical features and z continuous? At some point I had an SO thread about combining data-specific distance functions in a nearest-neighbor search. I can't find it anymore, but it would be something like `def custom_distance(a, b): return dice(a_categorical, b_categorical) + euclidean(a_continuous, b_continuous)`, with `dice` and `euclidean` coming from `scipy.spatial.distance`.

It looks sort of like: members.cbio.mines-paristech.fr/~j...

Found it! Hopefully something in this thread is helpful.
datascience.stackexchange.com/ques...
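
For what it's worth, the combined distance described above could be sketched roughly like this, using only the standard library. It assumes each row stores its categorical features first, as one-hot (0/1) flags, followed by the continuous features; that layout, the `n_categorical` parameter, and the sample rows are all hypothetical. The Dice formula follows the usual boolean-vector definition, (mismatches) / (2 * shared ones + mismatches).

```python
import math


def dice_dissimilarity(u, v):
    """Dice dissimilarity between two equal-length boolean (0/1) vectors."""
    both = sum(1 for a, b in zip(u, v) if a and b)
    differ = sum(1 for a, b in zip(u, v) if bool(a) != bool(b))
    denom = 2 * both + differ
    # Two all-zero vectors are treated as identical (an assumption of this sketch)
    return differ / denom if denom else 0.0


def custom_distance(a, b, n_categorical):
    """Dice on the first `n_categorical` entries plus Euclidean on the rest."""
    categorical_part = dice_dissimilarity(a[:n_categorical], b[:n_categorical])
    continuous_part = math.dist(a[n_categorical:], b[n_categorical:])
    return categorical_part + continuous_part


# Hypothetical rows: three one-hot flags followed by two continuous values
row_a = [1, 0, 1, 0.0, 0.0]
row_b = [1, 1, 0, 3.0, 4.0]
d = custom_distance(row_a, row_b, n_categorical=3)
```

The Dice term is bounded by 1 while the Euclidean term is not, so unless the continuous features are scaled first (or the two terms are weighted), the continuous part will tend to dominate the sum.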