
Recent Posts
Recent Comments
Akhil Kumar on Gaussian kernels Think Big, a Teradat… on Kmodes A Gentle Introductio… on Rolling and Unrolling RNN… santaraxita on Neural Networks 1: The ne… Jon B. on PageRank Archives
 December 2016
 November 2016
 October 2016
 June 2016
 April 2016
 January 2016
 November 2015
 October 2015
 July 2015
 June 2015
 May 2015
 January 2015
 September 2014
 June 2014
 May 2014
 March 2014
 February 2014
 January 2014
 December 2013
 October 2013
 September 2013
 August 2013
 July 2013
 June 2013
 May 2013
 April 2013
 March 2013
Categories
Meta
Category Archives: Clustering
Kmodes
I recently read an interesting Wired story about Chris McKinlay (a fellow alum of Middlebury College), who used a clustering algorithm to understand the pool of users on the dating site OkCupid (and successfully used this information to improve his … Continue reading
Posted in Clustering, Feature extraction
18 Comments
Modularity – Measuring cluster separation
We’ve now seen a number of different clustering algorithms, each of which will divide a data set into a number of subsets. This week, I want to ask the question: How do we know if answer that a clustering algorithm … Continue reading
Posted in Clustering, Unsupervised learning
Leave a comment
Spectral clustering
In the last few posts, we’ve been studying clustering, i.e. algorithms that try to cut a given data set into a number of smaller, more tightly packed subsets, each of which might represent a different phenomenon or a different type … Continue reading
Posted in Clustering, Unsupervised learning
9 Comments
Mapper and the choice of scale
In last week’s post, I described the DBSCAN clustering algorithm, which uses the notion of density to determine which data points in a data set form tightly packed groups called clusters. This algorithm relies on two parameters – a distance … Continue reading
Posted in Clustering, Unsupervised learning
4 Comments
Clusters and DBScan
A few weeks ago, I mentioned the idea of a clustering algorithm, but here’s a recap of the idea: Often, a single data set will be made up of different groups of data points, each of which corresponds to a … Continue reading
Posted in Clustering, Unsupervised learning
7 Comments