[scikit-learn] Memory efficient TfidfVectorizer

Peng Yu pengyu.ut at gmail.com
Tue Jan 28 02:00:20 EST 2020


To use TfidfVectorizer, the whole corpus must be used into memory.
This can be a problem for machines without a lot of memory. Is there a
way to use only a small amount of memory by saving most intermediate
results in the disk? Thanks.


More information about the scikit-learn mailing list