[IPython-dev] RandomForestClassifier w/ IPython.parallel

Javier Martínez-López javi.martinez.lopez at gmail.com
Tue Feb 11 03:59:21 EST 2014


maybe you could also use the "bigrf" package in R through "rpy2":



On Fri, Feb 7, 2014 at 9:36 PM, Alessandro Gagliardi
<alessandro.gagliardi at glassdoor.com> wrote:
> Not sure if I'm addressing the best list for this question, so if there's a
> more appropriate list, please direct me to it.
> I want to run a large sklearn.ensemble.RandomForestClassifier (with maybe a
> dozens or maybe hundreds of trees and 100,000 samples). My desktop won't
> handle this so I want to try using StarCluster. RandomForestClassifier seems
> to parallelize easily, but I don't know how I would split it across many
> IPython.parallel engines (if that's even possible). (Or maybe I should be
> foregoing IPython.parallel and using MPI?)
> Any help would be greatly appreciated.
> Thanks,
> Alessandro Gagliardi| Glassdoor| alessandro at glassdoor.com
> We're hiring! Check out our open jobs.
> Twitter | Facebook  | Glassdoor Blog
> 2012 Webby Award Winner: Best Employment Site
> 2013 Webby Award Winner: Best Guides/Ratings/Review Site
> _______________________________________________
> IPython-dev mailing list
> IPython-dev at scipy.org
> http://mail.scipy.org/mailman/listinfo/ipython-dev

More information about the IPython-dev mailing list