[Baypiggies] BayPiggies Jan. 23, 2025 - Packages, Peaks, Massive Graphs, Resilient Execution
Karen Dalton
kd at karend.net
Tue Jan 7 18:55:47 EST 2025
*BayPiggies Jan. 23, 2025 meeting!
Packages, Peaks, Massive Graphs, Resilient Execution*
* *https://www.meetup.com/baypiggies/events/305328742 (in-person at
SAP Labs in Palo Alto)*
* IMPORTANT: SAP requires all registrations be completed /in advance/
- and will email you prior to the event -- so no last minute
registrations will be honored, and no walk-ins will be allowed.
**Please RSVP on Meetup and also fill out the Google form here /*by
January 20*/ so you can receive the SAP form before the event
(REQUIRED): https://forms.gle/pCkGsCrzr3txS2VS6**
*Schedule:*
- 6:30 Register and Refreshments
- 7:00 Welcome + Announcements
- 7:05 Lightning Talk - Popular Python Packages Series - "uv"
- 7:10 Short Talk - Peaks, Valleys, and Python: Pipelining Biochemical
Data Processing with hplc-py
- 7:30 Short Talk - Cosmograph for Python — a new library for
visualizing massive network graphs and machine learning embeddings
- 7:55 Main Talk - DBOS Transact: Ultra-Lightweight Durable Execution
for Python
- 8:30 Wrap-up
***** Lightning talk ******
*Popular Python Packages Series - "uv"*
Karen starts off a new series of short talks in 2025 about trending
Python tools and packages, starting with "uv", an extremely fast Python
package and project manager.
***** Short talks ******
*"Peaks, Valleys, and Python: Pipelining Biochemical Data Processing
with hplc-py" presented by Griffin Chure
*
A common task in biological data processing is taking a complex signal
and breaking it down into its constituent parts. However, biological
data is noisy and evaluating the accuracy of an analysis pipeline often
requires manual intervention and assessment, hampering the throughput
that scientific or industrial problems often demand.
In this talk, I will highlight `hplc-py`, an open-source Python library
I developed to tackle this problem in the context of chromatography — an
analytical technique for quantifying the components of chemical
mixtures. This package leverages diverse functionality in the
scipy-stack — from peak detection to parameter optimization — to
deconvolve complex chemical spectra into signals from individual
molecules. While state-of-the-art software addressing this problem
requires extensive manual intervention, `hplc-py` significantly
decreases this intervention and is designed to be easily integrated into
data analysis pipelines. I will highlight how this software was
developed to be robust across different experiments while allowing the
end users to rapidly interact with the analysis and validate the results.
While the processing of real scientific data will be demonstrated in the
talk, no scientific background will be assumed. The open-source library
is available on GitHub (github.com/cremerlab/hplc-py) with detailed
documentation (cremerlab.github.io/hplc-py).
Griffin is a computational biologist with broad experience leveraging
mathematical modeling, Bayesian statistical inference, and scientific
software engineering to decode the complex mechanisms governing cellular
function and behavior. He is passionate about building performant and
robust software that employs quantitative methods to dissect biological
data, and building strong collaborations with scientists and engineers
across disciplines.
*"Cosmograph for Python — a new library for visualizing massive network
graphs and machine learning embeddings" presented by Nikita Rokotyan
*
Visualization of large amounts of data has always been a struggle and
required having sophisticated workflows. Cosmograph, a JavaScript
library for creating stunning, interactive data visualizations, is now
accessible to Pythonistas, data scientists, and AI engineers from the
comfort of their notebooks.
Cosmograph can handle several millions of nodes with GPU-accelerated,
force-directed layouts, enabling real-time exploration of
multidimensional data and complex networks. It’s the fastest JavaScript
library for Network Graph visualizations and its interactive tools—like
zooming, panning, filtering—help to transform data chaos into clarity.
This talk introduces the audience to Cosmograph’s capabilities and
demonstrates how it integrates seamlessly into Python workflows. No
specialized expertise is required, though those involved in data,
machine learning, or AI are likely to find it particularly engaging.
Cosmograph is available on PyPi (https://pypi.org/project/cosmograph/)
with a documentation at
https://cosmograph.app/docs/cosmograph/Cosmograph%20Python/get-started-widget.
Nikita (https://rokotyan.com) is the founder of Interacta, an
award-winning team of scientists, designers, and engineers dedicated to
building beautiful and functional tools for data exploration. Beyond his
passion for data, he has a deep appreciation for new media art and music
and enjoys discovering the stunning landscapes of California.
***** Main talk ******
*"DBOS Transact: Ultra-Lightweight Durable Execution for Python"
presented by Peter Kraft
*
We present DBOS Transact, an open source Python library providing
ultra-lightweight durable execution. Durable execution means your
software is resilient to failures. If ever interrupted or crashed, a
DBOS Transact application can resume from the last completed step,
automatically.
Under the hood DBOS Transact works by storing your program execution
state in a Postgres database. There's no need for an "orchestration
server": all you need is a Postgres database and adding some lightweight
Python decorators to your code. This approach is incredibly
cost-efficient and performant.
We will present some cool features of the framework such as scheduled
jobs and exactly-once events processing.
Peter is a co-founder of DBOS, Inc., building a new serverless platform
for backend developers that radically simplifies backend development. He
co-founded DBOS based on his PhD work at Stanford, where he was advised
by Peter Bailis and Matei Zaharia and worked closely with Michael
Stonebraker. He is interested in databases and distributed systems.
Important attendance note: SAP requires everyone to sign a
confidentiality and security disclosure to maintain confidentiality and
adhere to SAP's physical security protocols during your visit to SAP's
facility. This applies to all guests. All attendees must show a
Government-issued ID, and sign the SAP Security form to enter the event.
Register by Monday (midnight) January 20!
*/Thank you, SAP Labs, for sponsoring and hosting this month's meeting!/*
----
*Personal Donations to BayPiggies via PSF: *Please consider supporting
future BayPiggies events and Python in the Bay Area at the link below
via the Bay Area Python Association and the Python Software Foundation:
https://psfmember.org/civicrm/contribute/transact/?reset=1&id=43
----
See you soon!
-Karen and the BayPiggies organizers
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.python.org/pipermail/baypiggies/attachments/20250107/925ef502/attachment.html>
More information about the Baypiggies
mailing list