[scikit-learn] Why the default max_samples of Random Forest is X.shape[0]?
Fernando Marcos Wittmann
fernando.wittmann at gmail.com
Sun May 10 18:42:41 EDT 2020
Okay, so it's sampling with replacement with same size of the original
dataset. That mean that some of the samples would be repeated for each tree
On Sun, May 10, 2020, 19:40 Fernando Marcos Wittmann <
fernando.wittmann at gmail.com> wrote:
> My question is why the full dataset is being used as default when building
> each tree. That's not random forest. The main point of RF is to build each
> tree with a subsample of the full dataset
>
> On Sun, May 10, 2020, 09:50 Joel Nothman <joel.nothman at gmail.com> wrote:
>
>> A bootstrap is very commonly a random draw with replacement of equal size
>> to the original sample.
>> _______________________________________________
>> scikit-learn mailing list
>> scikit-learn at python.org
>> https://mail.python.org/mailman/listinfo/scikit-learn
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20200510/06c46d51/attachment.html>
More information about the scikit-learn
mailing list