If you have such a small number of observations (with a much higher feature space) then why do you think you can accurately train not just a single MLP, but an ensemble of them without overfitting dramatically?
======================================================================
Thomas Evangelidis
Research Specialist