difflib appears to be very well laid out and utilizes well-known algorithms.
The only difficulty is that it only works when everything is in memory. Is there any ongoing effort to allow it to work on larger files? Would it be better to integrate larger dataset sizes into difflib or to create something new such as bigdifflib?
============================================ *-*-*- PRESBYTERIAN_HEALTHCARE_SERVICES_DISCLAIMER -*-*-*
This message originates from Presbyterian Healthcare Services or one of its affiliated organizations. It contains information, which may be confidential or privileged, and is intended only for the individual or entity named above. It is prohibited for anyone else to disclose, copy, distribute or use the contents of this message. All personal messages express views solely of the sender, which are not to be attributed to Presbyterian Healthcare Services or any of its affiliated organizations, and may not be distributed without this disclaimer. If you received this message in error, please notify us immediately at postmaster@phs.org.
If you would like more information about Presbyterian Healthcare Services please visit our web site
============================================
On Thu, Sep 26, 2013 at 12:11 PM, Watson, Paul pwatson@phs.org wrote:
difflib appears to be very well laid out and utilizes well-known algorithms.
The only difficulty is that it only works when everything is in memory. Is there any ongoing effort to allow it to work on larger files?
Not that I know of; you can try searching bugs.python.org, though, to verify.
Would it be better to integrate larger dataset sizes into difflib or to create something new such as bigdifflib?
This would be overkill for a new module. Patches are welcome if you would like to propose a solution for the pre-existing difflib module.
-Brett
============================================ *-*-*- PRESBYTERIAN_HEALTHCARE_SERVICES_DISCLAIMER -*-*-*
This message originates from Presbyterian Healthcare Services or one of its affiliated organizations. It contains information, which may be confidential or privileged, and is intended only for the individual or entity named above. It is prohibited for anyone else to disclose, copy, distribute or use the contents of this message. All personal messages express views solely of the sender, which are not to be attributed to Presbyterian Healthcare Services or any of its affiliated organizations, and may not be distributed without this disclaimer. If you received this message in error, please notify us immediately at postmaster@phs.org.
If you would like more information about Presbyterian Healthcare Services please visit our web site
============================================ _______________________________________________ stdlib-sig mailing list stdlib-sig@python.org https://mail.python.org/mailman/listinfo/stdlib-sig
Not that I know of; you can try searching bugs.python.org, though, to verify. Would it be better to integrate larger dataset sizes into difflib or to create something new such as bigdifflib?
This would be overkill for a new module. Patches are welcome if you would like to propose a solution for the pre-existing difflib module.
-Brett
=== Rather than modify difflib.py, I modified the diff.py command line interface that appears in the Python doc. I am sure that some of my clumsy code could be improved. ============================================ *-*-*- PRESBYTERIAN_HEALTHCARE_SERVICES_DISCLAIMER -*-*-*
This message originates from Presbyterian Healthcare Services or one of its affiliated organizations. It contains information, which may be confidential or privileged, and is intended only for the individual or entity named above. It is prohibited for anyone else to disclose, copy, distribute or use the contents of this message. All personal messages express views solely of the sender, which are not to be attributed to Presbyterian Healthcare Services or any of its affiliated organizations, and may not be distributed without this disclaimer. If you received this message in error, please notify us immediately at postmaster@phs.org.
If you would like more information about Presbyterian Healthcare Services please visit our web site
============================================
If you would like your patch to be considered for inclusion with Python you will need to open an issue at bugs.python.org and attach the patch to the issue.
On Wed, Oct 2, 2013 at 12:36 PM, Watson, Paul pwatson@phs.org wrote:
Not that I know of; you can try searching bugs.python.org, though, to verify.
Would it be better to integrate larger dataset sizes into difflib or to create something new such as bigdifflib?
This would be overkill for a new module. Patches are welcome if you would like to propose a solution for the pre-existing difflib module.
-Brett
=== Rather than modify difflib.py, I modified the diff.py command line interface that appears in the Python doc. I am sure that some of my clumsy code could be improved. ============================================ *-*-*- PRESBYTERIAN_HEALTHCARE_SERVICES_DISCLAIMER -*-*-*
This message originates from Presbyterian Healthcare Services or one of its affiliated organizations. It contains information, which may be confidential or privileged, and is intended only for the individual or entity named above. It is prohibited for anyone else to disclose, copy, distribute or use the contents of this message. All personal messages express views solely of the sender, which are not to be attributed to Presbyterian Healthcare Services or any of its affiliated organizations, and may not be distributed without this disclaimer. If you received this message in error, please notify us immediately at postmaster@phs.org.
If you would like more information about Presbyterian Healthcare Services please visit our web site
============================================