[scikit-learn] Memory efficient TfidfVectorizer

Peng Yu pengyu.ut at gmail.com
Tue Jan 28 02:00:20 EST 2020


Hi,

To use TfidfVectorizer, the whole corpus must be loaded into memory.
This can be a problem for machines without a lot of memory. Is there a
way to use only a small amount of memory by saving most intermediate
results on disk? Thanks.
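
To make the question concrete, here is a minimal sketch of one possible
direction, offered as an assumption and not as a confirmed answer. It
streams documents from disk one at a time and replaces the in-memory
vocabulary with feature hashing (HashingVectorizer followed by
TfidfTransformer). The helper names iter_documents and tfidf_from_files
and the one-document-per-file layout are hypothetical. Note that the
sparse count matrix still lives in memory; only the raw text and the
vocabulary are avoided.

```python
from sklearn.feature_extraction.text import HashingVectorizer, TfidfTransformer


def iter_documents(paths):
    """Yield one document at a time so the raw corpus is never held in memory.

    Hypothetical helper: assumes one UTF-8 text document per file.
    """
    for path in paths:
        with open(path, encoding="utf-8") as fh:
            yield fh.read()


def tfidf_from_files(paths, n_features=2**18):
    """Build a TF-IDF matrix from files on disk without storing a vocabulary.

    Hypothetical helper. The hashing step is stateless, so no token-to-index
    dictionary is kept; the resulting sparse count matrix is still in memory.
    """
    # alternate_sign=False keeps the hashed counts non-negative;
    # norm=None leaves raw counts so that TfidfTransformer does the weighting.
    hasher = HashingVectorizer(
        n_features=n_features, alternate_sign=False, norm=None
    )
    counts = hasher.transform(iter_documents(paths))

    # Learn IDF weights from the hashed counts and apply them.
    return TfidfTransformer().fit_transform(counts)
```

The trade-off in this sketch is that hashed features cannot be mapped
back to the original tokens, and distinct tokens may collide in the same
column when n_features is small.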

-- 
Regards,
Peng

