brain of mat kelcey...

an intro to semi supervised document classification

January 31, 2010 at 02:02 PM | categories: Uncategorized

here's a great lecture from tom mitchell about document classification using a semi supervised version of naive bayes.

semi supervised algorithms only require some of the training examples to be labelled and are able to make use of the unlabelled ones as well. this is a very common situation when we have a huge corpus and can only label a small part of it.
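
to make the idea concrete, here's a minimal sketch in python. it assumes the scheme is the expectation maximisation style one that is commonly paired with naive bayes: train on the labelled docs, use that model to give each unlabelled doc a soft (probabilistic) label, retrain on everything, and repeat. the function names, the toy documents and the fixed iteration count are all made up for illustration and aren't taken from the lecture.

```python
import math
from collections import defaultdict

def train(docs, weights, vocab, classes):
    # multinomial naive bayes where each doc counts towards class c
    # in proportion to weights[i][c] (1.0 / 0.0 for labelled docs)
    class_mass = {c: 0.0 for c in classes}
    word_mass = {c: defaultdict(float) for c in classes}
    for tokens, w in zip(docs, weights):
        for c in classes:
            class_mass[c] += w[c]
            for t in tokens:
                word_mass[c][t] += w[c]
    total = sum(class_mass.values())
    # add-one (laplace) smoothing on both priors and word likelihoods
    log_prior = {c: math.log((class_mass[c] + 1) / (total + len(classes)))
                 for c in classes}
    log_like = {}
    for c in classes:
        denom = sum(word_mass[c].values()) + len(vocab)
        log_like[c] = {t: math.log((word_mass[c][t] + 1) / denom) for t in vocab}
    return log_prior, log_like

def posterior(tokens, model, classes):
    # P(class | doc), computed in log space; unseen tokens are skipped
    log_prior, log_like = model
    scores = {c: log_prior[c] + sum(log_like[c][t] for t in tokens
                                    if t in log_like[c])
              for c in classes}
    top = max(scores.values())
    exp = {c: math.exp(s - top) for c, s in scores.items()}
    z = sum(exp.values())
    return {c: v / z for c, v in exp.items()}

def semi_supervised_nb(labelled, unlabelled, iterations=10):
    classes = sorted({label for _, label in labelled})
    docs_l = [tokens for tokens, _ in labelled]
    vocab = {t for d in docs_l + unlabelled for t in d}
    hard = [{c: 1.0 if c == label else 0.0 for c in classes}
            for _, label in labelled]
    model = train(docs_l, hard, vocab, classes)  # labelled docs only
    for _ in range(iterations):
        # e step: soft labels for the unlabelled docs
        soft = [posterior(d, model, classes) for d in unlabelled]
        # m step: retrain on labelled (hard) + unlabelled (soft)
        model = train(docs_l + unlabelled, hard + soft, vocab, classes)
    return model, classes

def classify(tokens, model, classes):
    p = posterior(tokens, model, classes)
    return max(p, key=p.get)

# toy data, purely hypothetical
labelled = [("the team won the match".split(), "sport"),
            ("the election vote result".split(), "politics")]
unlabelled = ["team match goal striker".split(),
              "vote election senate minister".split(),
              "striker goal".split(),
              "senate minister".split()]

model, classes = semi_supervised_nb(labelled, unlabelled)
print(classify("striker goal".split(), model, classes))
```

note that "striker" and "goal" never appear in a labelled doc; the only evidence about them comes from the unlabelled doc where they sit next to "team" and "match", which is exactly the kind of information a purely supervised model can't use.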

i've got an experiment brewing to test this out by porting some previous naive bayes work i did to use this semi supervised scheme, and will publish it when it's done.

cool stuff!!