[BangPypers] Text/Mail filtering/classification using python

Vijay Ramachandran vijay750 at gmail.com
Mon Dec 26 10:52:22 CET 2011


Use scikit-learn <http://scikit-learn.org/stable/> - we've found that it works
better than the classifiers in NLTK. For the spam/ham problem, I've heard
(circa 2006!) that naive Bayes works about as well as any other classifier. For
the label classification problem, since there are dependencies between words,
some other classifier will probably outperform naive Bayes.
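
To make the "dependency between words" point concrete, here is a rough
plain-Python sketch of a multinomial naive Bayes spam/ham classifier. It is
only an illustration: the function names (train, predict) and the toy
messages are made up for this example, and in practice you would more likely
use scikit-learn's MultinomialNB with a bag-of-words vectorizer. Note how each
word contributes to the score on its own, which is exactly the independence
assumption that can hurt when words depend on each other.

```python
import math
from collections import Counter, defaultdict


def train(docs):
    """Collect per-label document and word counts from (text, label) pairs."""
    label_counts = Counter()            # number of documents per label
    word_counts = defaultdict(Counter)  # word frequencies per label
    vocab = set()
    for text, label in docs:
        label_counts[label] += 1
        for word in text.lower().split():
            word_counts[label][word] += 1
            vocab.add(word)
    return label_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest log posterior score for text."""
    label_counts, word_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label in label_counts:
        # Log prior: how common this label is in the training data.
        score = math.log(label_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocab:
                continue  # ignore words never seen in training
            # Each word is scored independently of its neighbours
            # (the "naive" assumption), with add-one smoothing.
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy data, invented purely for illustration.
training = [
    ("win cash prize now", "spam"),
    ("cheap pills win big", "spam"),
    ("meeting agenda for monday", "ham"),
    ("python user group meeting", "ham"),
]
model = train(training)

print(predict(model, "win a cash prize"))
print(predict(model, "python meeting on monday"))
```

A classifier that can weigh combinations of features (for example a linear
SVM or logistic regression over word n-grams) is the kind of alternative
worth trying for the label classification case.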

hth,
Vijay

-- 
Performance marketing on Twitter - http://www.wisdomtap.com/