From pwatson at phs.org Thu Sep 26 18:11:43 2013 From: pwatson at phs.org (Watson, Paul) Date: Thu, 26 Sep 2013 16:11:43 +0000 Subject: [stdlib-sig] difflib on large files Message-ID: <61EFF35B4133924E97252606068940341C5AB449@EXCHSV1.phs.org> difflib appears to be very well laid out and utilizes well-known algorithms. The only difficulty is that it only works when everything is in memory. Is there any ongoing effort to allow it to work on larger files? Would it be better to integrate larger dataset sizes into difflib or to create something new such as bigdifflib? ============================================ *-*-*- PRESBYTERIAN_HEALTHCARE_SERVICES_DISCLAIMER -*-*-* This message originates from Presbyterian Healthcare Services or one of its affiliated organizations. It contains information, which may be confidential or privileged, and is intended only for the individual or entity named above. It is prohibited for anyone else to disclose, copy, distribute or use the contents of this message. All personal messages express views solely of the sender, which are not to be attributed to Presbyterian Healthcare Services or any of its affiliated organizations, and may not be distributed without this disclaimer. If you received this message in error, please notify us immediately at postmaster at phs.org. If you would like more information about Presbyterian Healthcare Services please visit our web site http://www.phs.org ============================================ From brett at python.org Fri Sep 27 14:58:13 2013 From: brett at python.org (Brett Cannon) Date: Fri, 27 Sep 2013 08:58:13 -0400 Subject: [stdlib-sig] difflib on large files In-Reply-To: <61EFF35B4133924E97252606068940341C5AB449@EXCHSV1.phs.org> References: <61EFF35B4133924E97252606068940341C5AB449@EXCHSV1.phs.org> Message-ID: On Thu, Sep 26, 2013 at 12:11 PM, Watson, Paul wrote: > difflib appears to be very well laid out and utilizes well-known > algorithms. > > The only difficulty is that it only works when everything is in memory. Is > there any ongoing effort to allow it to work on larger files? Not that I know of; you can try searching bugs.python.org, though, to verify. > Would it be better to integrate larger dataset sizes into difflib or to > create something new such as bigdifflib? > This would be overkill for a new module. Patches are welcome if you would like to propose a solution for the pre-existing difflib module. -Brett > > ============================================ > *-*-*- PRESBYTERIAN_HEALTHCARE_SERVICES_DISCLAIMER -*-*-* > > This message originates from Presbyterian Healthcare Services or one of > its affiliated organizations. > It contains information, which may be confidential or privileged, and is > intended only for the > individual or entity named above. It is prohibited for anyone else to > disclose, copy, distribute > or use the contents of this message. All personal messages express views > solely of the sender, which are > not to be attributed to Presbyterian Healthcare Services or any of its > affiliated organizations, and may not > be distributed without this disclaimer. If you received this message in > error, please notify us immediately > at postmaster at phs.org. > > If you would like more information about Presbyterian Healthcare Services > please visit our web site > > http://www.phs.org > > ============================================ > _______________________________________________ > stdlib-sig mailing list > stdlib-sig at python.org > https://mail.python.org/mailman/listinfo/stdlib-sig > -------------- next part -------------- An HTML attachment was scrubbed... URL: