[SciPy-dev] Presentation of pymachine, a python package for machine learning
Petr Šimon
sim at klubko.net
Sun May 13 20:40:19 EDT 2007
On Monday 14 May 2007 08:24:52 David Cournapeau wrote:
> Dear scipy developers and users,
>
> As some of you may know already, my proposal for pymachine, a python
> toolbox for machine learning in python, has been accepted for the Summer
> of Code 2007. The detailed proposal is online [1], and wikified [2]. The
> proposal timeline consists of two main steps:
> - first improving existing tools related to machine learning in
> scipy, such as they become part of "official scipy" (eg all tools in
> toolbox going into main scipy namespace). This includes scipy.cluster,
> scipy.sandbox.pyem and scipy.sandbox.svm.
> - Then building from this set of toolboxes a more high level package,
> in the spirit of similar softwares, such as orange or weka [3],
> including some visualization tools for data exploration. This part of
> the code would be put in scikits (because it will require extra
> dependencies).
> All development will happen in the scipy and scikits subversion
> repositories.
>
> Now, before starting working on it, I would like to get some feedback
> about what other people think is necessary with respect to those goals:
> - What are the requirements for a toolbox to go from the sandbox into
> the scipy namespace ?
> - For people willing to use machine learning related software in
> python/scipy, what are the main requirements/concern ? (eg Data
> exploration GUI, efficiency, readability of the algorithms, etc...)
>
> cheers,
>
> David
>
> [1] http://www.ar.media.kyoto-u.ac.jp/members/david/fullproposal.html
>
> [2] http://projects.scipy.org/scipy/scipy/wiki/MachineLearning
>
> [3] orange http://magix.fri.uni-lj.si/orange/, weka:
> http://www.cs.waikato.ac.nz/ml/weka/
> _______________________________________________
> Scipy-dev mailing list
> Scipy-dev at scipy.org
> http://projects.scipy.org/mailman/listinfo/scipy-dev
Hello,
one concern might be data handling. I usually work with rather large datasets
that can't fit into memory and most of the machine learning packages
naturally do this (weka, orange).
Best
Petr
--
Petr Šimon
http://www.klubko.net
PhD student, TIGP-CLCLP Academia Sinica
http://clclp.ling.sinica.edu.tw
"... what the Buddhist call 'right livelyhood', I didn't have that,
I didn't have any way of making a living, and to make a living
is to be doing something that you love, something that was creative,
something that made sense..."
Mark Bittner, parrot caretaker, Telegraph Hill
More information about the SciPy-Dev
mailing list