From jeffrey.fischer at gmail.com Tue Apr 19 10:29:01 2022 From: jeffrey.fischer at gmail.com (Jeff Fischer) Date: Tue, 19 Apr 2022 07:29:01 -0700 Subject: [Baypiggies] This month's BayPiggies: Faster Pandas and Command Line Tools Message-ID: *Thursday April 28, 2022 7:00 - 8:30 pm* (online) This month, we'll have a lightning talk about command line tools by Karen Dalton and a full talk about Modin by its creator, Devin Petersohn. Come join us! Lightning Talk: Tools to Improve You Command Line Experience Karen Dalton is a Principal Software Engineer at Stanford University School of Medicine where she leads a team of research software engineers working on on the Clinical Genome Resource consortia's ClinGen Data Platform. She has been an active member of the Python community, including helping with BayPiggies and PyBay, for several years. She will be talking about rich (https://github.com/Textualize/rich) and typer (https://github.com/tiangolo/typer). They are two Python tools, both in very active development, that can make your command line experience better. Main Talk: Scaling Up Your Pandas Workflows With Modin Pandas is one of the most commonly used data science libraries in Python, with a convenient set of APIs to help data scientists prepare, analyze, and explore their data. However, despite its widespread adoption, pandas suffers from severe memory and performance issues on moderately large datasets. We present Modin (https://github.com/modin-project/modin), a fast, scalable drop-in replacement for pandas. By changing just a single line of code, Modin seamlessly speeds up pandas workflow on a laptop or in a cluster. Modin has over 6.6k GitHub stars, 2.8 million downloads, and is deployed at many data-centric organizations to accelerate dataframe workflows. Speaker Bio: Devin Petersohn Devin is the lead developer of Modin and the co-founder and CTO of Ponder. Devin recently completed his Ph.D. from UC Berkeley RISE Lab, where he did research on distributed systems for data science. As a part of this work, he created Modin, a system for enabling scalable interactive data science. Code of Conduct https://baypiggies.net/pages/code_of_conduct.html Interactions online have less nuance than in-person interactions. Please be Open, Considerate and Respectful. Also, please refrain from discussing topics unrelated to the Python community or the technical content of the meeting. RSVP We will conduct the meeting via Zoom. Please register in advance. To do so, go to the Meetup page for this event: https://www.meetup.com/BAyPIGgies/events/284835486/. If you RSVP "Yes" to this event on MeetUp, the link to the Zoom meeting will be displayed. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jeffrey.fischer at gmail.com Thu Apr 28 13:04:08 2022 From: jeffrey.fischer at gmail.com (Jeff Fischer) Date: Thu, 28 Apr 2022 10:04:08 -0700 Subject: [Baypiggies] Reminder: Talk on Scaling Pandas with Modin tonight Message-ID: At 7 pm tonight, we'll have a talk about Modin by its creator, Devin Petersohn. Come join us! We also have a last minute opening for a lightning talk (5 to 15 minutes). If you are interested, contact me. Main Talk: Scaling Up Your Pandas Workflows With Modin Pandas is one of the most commonly used data science libraries in Python, with a convenient set of APIs to help data scientists prepare, analyze, and explore their data. However, despite its widespread adoption, pandas suffers from severe memory and performance issues on moderately large datasets. We present Modin (https://github.com/modin-project/modin), a fast, scalable drop-in replacement for pandas. By changing just a single line of code, Modin seamlessly speeds up pandas workflow on a laptop or in a cluster. Modin has over 6.6k GitHub stars, 2.8 million downloads, and is deployed at many data-centric organizations to accelerate dataframe workflows. Speaker Bio: Devin Petersohn Devin is the lead developer of Modin and the co-founder and CTO of Ponder. Devin recently completed his Ph.D. from UC Berkeley RISE Lab, where he did research on distributed systems for data science. As a part of this work, he created Modin, a system for enabling scalable interactive data science. Code of Conduct https://baypiggies.net/pages/code_of_conduct.html Interactions online have less nuance than in-person interactions. Please be Open, Considerate and Respectful. Also, please refrain from discussing topics unrelated to the Python community or the technical content of the meeting. RSVP We will conduct the meeting via Zoom. Please register in advance. To do so, go to the Meetup page for this event: https://www.meetup.com/BAyPIGgies/events/284835486/. If you RSVP "Yes" to this event on MeetUp, the link to the Zoom meeting will be displayed. -------------- next part -------------- An HTML attachment was scrubbed... URL: