Whoops @Ben, me too: I got that solution as well, but I'm looking for a neater answer. Say I'm using it on new data: 1 million red, blue and green items would not need to be stored if I just work with their probabilities ^^

Abdur-Rahmaan Janhangeer
http://www.pythonmembers.club | https://github.com/Abdur-rahmaanJ
Mauritius