I'm looking to sample a large dataset for a subset that best covers the space. One way of doing this would be maximum dissimilarity, say as implemented in R as part of
caret::maxDissim. Is anyone are of similar functionality available as part of a common Python package, perhaps in scikit-learn?