[IPython-dev] RandomForestClassifier w/ IPython.parallel

Javier Martínez-López javi.martinez.lopez at gmail.com
Tue Feb 11 03:59:21 EST 2014


Hi,

maybe you could also use the "bigrf" package in R through "rpy2":
http://cran.r-project.org/web/packages/bigrf/index.html

Cheers,

Javier

On Fri, Feb 7, 2014 at 9:36 PM, Alessandro Gagliardi
<alessandro.gagliardi at glassdoor.com> wrote:
> Not sure if I'm addressing the best list for this question, so if there's a
> more appropriate list, please direct me to it.
>
> I want to run a large sklearn.ensemble.RandomForestClassifier (with maybe a
> dozens or maybe hundreds of trees and 100,000 samples). My desktop won't
> handle this so I want to try using StarCluster. RandomForestClassifier seems
> to parallelize easily, but I don't know how I would split it across many
> IPython.parallel engines (if that's even possible). (Or maybe I should be
> foregoing IPython.parallel and using MPI?)
>
> Any help would be greatly appreciated.
>
> Thanks,
>
> Alessandro Gagliardi| Glassdoor| alessandro at glassdoor.com
>
> We're hiring! Check out our open jobs.
>
> Twitter | Facebook  | Glassdoor Blog
>
> 2012 Webby Award Winner: Best Employment Site
>
> 2013 Webby Award Winner: Best Guides/Ratings/Review Site
>
>
> _______________________________________________
> IPython-dev mailing list
> IPython-dev at scipy.org
> http://mail.scipy.org/mailman/listinfo/ipython-dev
>



More information about the IPython-dev mailing list