[Chicago] hadoop and python?

Daniel Griffin dgriff1 at gmail.com
Mon Jun 28 17:29:05 CEST 2010


Hadoop lets you essentially do map reduce queries large datasets across
large numbers of systems. Its a way to process huge amounts of data. Since
Map/Reduce can be kind of difficult to write there are other libraries and
projects around to let you generate the Map/Reduce functions from stuff like
a SQL-like language and custom languages like pig.

There are a bunch of projects built on top of Hadoop also, but none are
really similar to EC2.



On Mon, Jun 28, 2010 at 10:21 AM, Brian Ray <brianhray at gmail.com> wrote:

> I was talking with someone about Cloud computing and was giving my
> testimony of having some fairly positive experiences deploying a
> Python powered solution under Amazon Elastic Compute Cloud (EC2).
> They started talking about how they were going to use hadoop. It was
> not until I googled it did I realize we are talking about two
> different things entirely.  It seems like "Cloud" computing needs some
> disambiguation.  Then I also recall that this part deploys their
> technology under .NET.  I do not even see how they could use hadoop.
> It seems more likely a Java or Jython thing, correct?  I really have
> no clue about hadoop but would like to learn about it. On the other
> hand, I do know a growing amount about EC2 pros and cons.  Hopefully,
> someone can shed some light on the distinction.
>
> -- Brian
> _______________________________________________
> Chicago mailing list
> Chicago at python.org
> http://mail.python.org/mailman/listinfo/chicago
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/chicago/attachments/20100628/8a49fea2/attachment.html>


More information about the Chicago mailing list