[SciPy-User] Mann-Whitney U Test

Tue Dec 16 08:32:55 EST 2014

Hello,

I am wondering why the Mann-Whiney U Test always returns the smallu value in the source code, it seems like it should return the U statistic for x or y consistently. https://github.com/scipy/scipy/blob/v0.14.0/scipy/stats/stats.py#L3943 . 

I might be missing something, but it seems impossible to determine which distribution in the lower, Wikipedia gives an example of where the lower median is not correct, reproduced below:

#!/usr/bin/env python
import scipy.stats as stats
import scipy.stats.mstats as mstats

Hare =     [ 1 ,  2,  3,  4,  5,  6,  7,  8,  9, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]
Tortoise = [ 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 30, 31, 32, 33, 34, 35, 36, 37, 38]

print "Hare Median: %d, Tortoise Median: %d, U1: %d, U2: %d"%(mstats.mquantiles(Hare,[0.5])[0],mstats.mquantiles(Tortoise,[0.5])[0], stats.mannwhitneyu(Hare, Tortoise)[0], stats.mannwhitneyu(Tortoise,Hare)[0])

Hare Median: 20, Tortoise Median: 19, U1: 100, U2: 100