Monthly Archives: October 2013

Case study 3: Free form text

In the past two posts, I’ve been looking at ways to turn “unstructured” data into vector data that can be analyzed with techniques like the ones that I’ve described elsewhere on this blog. One of the most common types of … Continue reading

Posted in Feature extraction | 5 Comments

Case study 2: Tokens in census data

In last week’s post, I presented a brief tutorial on loading and analyzing the classic IRIS data set, which consists of four length measurements from each of 150 iris flowers. That data set was relatively easy to deal with because … Continue reading

Posted in Feature extraction | 9 Comments

Case study 1: Iris

Since the start of this blog, we’ve covered a lot of different algorithms that attempt to discover and summarize the geometric structure in a given data set. But as it turns out, this part of the data analysis is the … Continue reading

Posted in Feature extraction | 3 Comments