[scikit-learn] is Sci_kiet-Learn the right choice for my project

Mike Oliver mo at globalsaassol.com
Sun Oct 9 18:43:24 EDT 2022


Granted, you are correct the examples I gave are simple enough for a set of rules to process.  The problem as I see it is that with the plethora of potential relationships, the rules may not be adequate.  Inconsistent data is not transactional for example I can store data on a pair of objects and they pass all the rules.  Then later another relationship is stored, and that passes all the rules…yet just because A is consistent with B and B is consistent with C does NOT mean A is consistent with C.  Anomaly detection can be simple or it can be complex.  But if 10,000 relationships have been stored and one changes I want an algorithm to emerge that we can see and turn into an action to fix it.

We are also hoping that things may emerge that we did not anticipate.  That may require deep learning sub-symbolic neural networks, but that is yet to be determined.

And yes, with literally thousands of records flowing through the system, the delay in processing each record against every other record in a read before write model is not going to perform well.  If we can use machine learning we can take that evaluation and even corrections out of that processing flow.


If scikit-learn is not a good fit for my goals, let me know.  If you know a better fit, please let me know as well.


From: Bill Ross <bross_phobrain at sonic.net>
Sent: Sunday, October 9, 2022 12:46 AM
To: Scikit-learn mailing list <scikit-learn at python.org>
Cc: scikit-learn <scikit-learn-bounces+bross_phobrain=sonic.net at python.org>; Mike Oliver <mo at globalsaassol.com>
Subject: Re: [scikit-learn] is Sci_kiet-Learn the right choice for my project

[Warning]CAUTION: This email originated from outside of the organization. Do not click on links or open attachments unless you recognize the sender and know that the contents are safe.

>   My hope is that with machine learning we can detect when an object is missing, or configured in error, or duplicates.

These look like simple correctness issues that I'd address with programming.
Why do you want to use a learned approach? Do you think it will be faster to develop, or have a faster runtime?




On 2022-10-08 01:57, Mike Oliver wrote:
Dear Sirs,

I am evaluating SciKit-Learn for a new project.  I am hoping to find a AI Machine Learning package that can take a large dataset of objects that have various object types and attributes.  These objects are typically related to other objects, such as a server to a Wifi device, or two network routers to each other, etc.  When these objects are setup data is gathered about where they are located, what settings there are, the device type, etc.

With large organizations there can be thousands of these objects and tens of thousands of relationships, descriptions, settings, etc.  My hope is that with machine learning we can detect when an object is missing, or configured in error, or duplicates.

The question is, will SciKit-Learn help with this problem? I understand that we will have to train it to identify what to look for and then act on what was found and predicted to be the solution algorithm. Or instructions.

Thanks for your help,

Great looking product and already have the tutorial up and running and have installed it in my Django platform.


scikit-learn mailing list
scikit-learn at python.org<mailto:scikit-learn at python.org>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.python.org/pipermail/scikit-learn/attachments/20221009/b5a0c3a6/attachment-0001.html>

More information about the scikit-learn mailing list