me on twitter

brain of mat kelcey


simple text search in ruby using ferret

September 12, 2010 at 09:28 PM | categories: search, ruby, ferret | View Comments

ferret is a lightweight text search engine for ruby, a bit like lucene but with less (ie no) java.i've been looking at it today as part of my named entity extraction prototype which needs to be able to fuzzily match one short string against a list of other short strings.let's go through an example, it's the only way my brain works sorry.moremaking a ferret index is simple; we'll just make a memory based index for this demo.12require 'ferret'index = Ferret::Index::Index.new()next we'll add a handful of places in africa and europe to our index.each document we add is simply a hash...
Read and Post Comments

latent semantic analysis via the singular value decomposition (for dummies)

April 19, 2010 at 08:50 PM | categories: search, semantic, classification, svd, e14, analysis, latent | View Comments

i've been trying to get a deeper understanding of latent semantic analysis for awhile now.last week i came to the conclusion the other way to truly understand would be to start from the ground upso here goes; mat's guide to latent semantic analysis via the singular value decomposition (for dummies)...
Read and Post Comments

old projects...