A very good point! (Although augmented and log-average tf both do some kind of normalisation of the tf distribution before IDF weighting.) -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20180131/16cfc8be/attachment.html>