That's a cool trick but I am worried it would render our API too "frameworky" for my taste.

I think the FunctionTransformer is enough:

http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.FunctionTransformer.html