<div dir="ltr"><br><br><div class="gmail_quote">On Mon Feb 23 2015 at 3:51:18 PM Paul Moore <<a href="mailto:p.f.moore@gmail.com">p.f.moore@gmail.com</a>> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On 23 February 2015 at 18:40, Brett Cannon <<a href="mailto:brett@python.org" target="_blank">brett@python.org</a>> wrote:<br>
> Couldn't you just keep it in memory as bytes and then write directly over<br>
> the file? I realize that's a bit wasteful memory-wise but it is possible.<br>
> The docs could mention the memory cost is something to watch out for when<br>
> doing an in-place replacement. Heck the code could even make it an<br>
> io.BytesIO instance so the rest of the code doesn't have to care about this<br>
> special case.<br>
<br>
The real problem with overwriting is if there's a failure during the<br>
overwrite you lose the original file. My original API had overwrite as<br>
the default, but I think the risk makes that a bad idea.<br></blockquote><div><br></div><div>Couldn't you catch the exception, write the original file back out, and then re-raise the exception?</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
One option would be to allow outputs (TARGET in pack() and NEW_ARCHIVE<br>
in set_interpreter()) to be open files (open for write in bytes mode)<br>
as well as filenames[1]. Then the caller has the choice of how to<br>
manage the output. The docs could include an example of overwriting<br>
via a BytesIO object, and point out the risk.<br></blockquote><div><br></div><div>That sounds like a good idea. No reason to do the file opening on someone's behalf when opening files is so easy and keeps layering abstractions at a good level. Would this extend also to the archive being read to be consistent?</div><div><br></div><div>I should mention I originally thought of extending this to pack() for 'main', but realized that passing in the function to set would require tools to import the code they are simply trying to pack and that was the wrong thing to do.</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
BTW, while I was looking at the API, I realised I don't like the order<br>
of arguments in pack(). I'm tempted to make it pack(directory,<br>
target=None, interpreter=None, main=None) where a target of None means<br>
"use the name of the source directory with .pyz tacked on", exactly as<br>
for the command line API.<br>
<br>
What do you think? The change would be no more than a few minutes'<br>
work if it's acceptable.<br></blockquote><div><br></div><div>+1 from me.<br></div><div><br></div><div>-Brett</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Paul<br>
<br>
[1] What's the standard practice for such dual-mode arguments? ZipFile<br>
tests if the argument is a str instance and assumes a file if not. I'd<br>
be inclined to follow that practice here.<br>
</blockquote></div></div>