np.linalg.norm(a, axis=None) where a.ndim > 2 effectively calls np.linalg.norm(np.ravel(a)). PR 14719 proposes deprecating this unexpected behavior, with the suggestion that people who need it should call ravel themselves.

Thoughts?

Matti
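
For reference, a minimal sketch of the behavior in question, assuming ord is left at its default of None. The 3-D array `a` is made up for illustration and is not taken from the PR:

```python
import numpy as np

# Hypothetical 3-D input (ndim > 2), chosen only for illustration.
a = np.arange(24, dtype=float).reshape(2, 3, 4)

# axis=None and ord=None: the array is flattened and the 2-norm is taken.
implicit = np.linalg.norm(a, axis=None)

# The explicit spelling suggested for callers who want this result.
explicit = np.linalg.norm(np.ravel(a))

# Same quantity written out by hand: sqrt of the sum of squares.
manual = np.sqrt(np.sum(a ** 2))

print(np.isclose(implicit, explicit), np.isclose(explicit, manual))
```

Writing the ravel explicitly gives the same number and makes it obvious that the array's shape is being ignored.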