Programmatically finding "significant" data points

Jeremy Sanders jeremy+complangpython at jeremysanders.net
Tue Nov 14 09:08:50 EST 2006


erikcw wrote:

> I have a collection of ordered numerical data in a list.  The numbers
> when plotted on a line chart make a low-high-low-high-high-low (random)
> pattern.  I need an algorithm to extract the "significant" high and low
> points from this data.
> 
...
> 
> How do I sort through this data and pull out these points of
> significance?

Get a book on statistics. One idea is as follows. If you expect the points
to be centred around a single value, you can calculate the median or mean
of the points, calculate their standard deviation (aka spread), and remove
points which are more than N-times the standard deviation from the median.

Jeremy

-- 
Jeremy Sanders
http://www.jeremysanders.net/



More information about the Python-list mailing list