<div dir="ltr">On Sat, Aug 24, 2013 at 7:07 AM, Stefan Behnel <span dir="ltr"><<a href="mailto:stefan_ml@behnel.de" target="_blank">stefan_ml@behnel.de</a>></span> wrote:<br><div class="gmail_extra"><div class="gmail_quote">
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">PEP 3121 would no longer be necessary. Extension types can do all we need.<br>
No more special casing of modules, that was the idea.<br></blockquote><div></div></div><br></div><div class="gmail_extra">One nice thing about PEP 3121 is the addition of md_state to module objects to store internal module state. Wouldn't we be better served by improving the related API rather than abandoning it? If md_state were the home for all mutable internal state then load/reload could focus directly on just md_state and md_dict and not worry about other internal state, since all remaining state would be immutable (refcounts notwithstanding). If the API made this easier then we could leverage the strengths of PEP 3121 to make loading safer and more independent. Of course, we could certainly go the other way and actively discourage mutable internal state...</div>
<div class="gmail_extra"><br></div><div class="gmail_extra">This, coupled with the PEP 451-compatible API and with a proxying wrapper, would go a long way to various "reloading" issues that extension modules have.</div>
<div class="gmail_extra"><br><div class="gmail_quote">On Sun, Aug 25, 2013 at 5:54 AM, Stefan Behnel <span dir="ltr"><<a href="mailto:stefan_ml@behnel.de" target="_blank">stefan_ml@behnel.de</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">
<div class="im">(regarding reloading into the existing module's namespace)</div><br>I'm not sure this can be done in general. What if the module has threads<br>running that access the global state? In that case, reinitialising the<br>
module object itself would almost certainly lead to a crash.<br><br>And what if you do "from extmodule import some_function" in a Python<br>module? Then reloading couldn't replace that reference, just as for normal<br>
Python modules. Meaning that you'd still have to keep both modules properly<br>alive in order to prevent crashes due to lost global state of the imported<br>function.<br><br>The difference to Python modules here is that in Python code, you'll get<br>
some kind of exception if state is lost during a reload. In C code, you'll<br>most likely get a crash.<br><br>How would you even make sure global state is properly cleaned up? Would you<br>call tp_clear() on the module object before re-running the init code? Or<br>
how else would you enable the init code to do the right thing during both<br>the first run (where global state is uninitialised) and subsequent runs<br>(where global state may hold valid state and owned Python references)?<br>
<br>Even tp_clear() may not be enough, because it's only meant to clean up<br>Python references, not C-level state. Basically, for reloading to be<br>correct without changing the object reference, it would have to go all the<br>
way through tp_dealloc(), catch the object at the very end, right before it<br>gets freed, and then re-initialise it.<br></blockquote><div><br></div><div>Right. It would probably require a separate `PyImportInitializeState_<module>(PyObject *mod)` and/or some API that helps make it easier to manage mutable internal module state (on md_state).</div>
<div><br></div><div><br></div></div></div><div class="gmail_extra"><div class="gmail_quote">On Sun, Aug 25, 2013 at 6:36 AM, Stefan Behnel <span dir="ltr"><<a href="mailto:stefan_ml@behnel.de" target="_blank">stefan_ml@behnel.de</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">PJ Eby, <a href="tel:25.08.2013%2006" value="+12508201306">25.08.2013 06</a>:12:<br>
<div class="im">> My "Importing" package offers lazy imports by creating module objects<br>
> in sys.modules that are a subtype of ModuleType, and use a<br>
> __getattribute__ hook so that trying to use them fires off a reload()<br>
> of the module.<br>
<br>
</div>I wonder if this wouldn't be an approach to fix the reloading problem in<br>
general. What if extension module loading, at least with the new scheme,<br>
didn't return the module object itself and put it into sys.modules but<br>
created a wrapper that redirects its __getattr__ and __setattr__ to the<br>
actual module object? That would have a tiny performance impact on<br>
attribute access, but I'd expect that to be negligible given that the usual<br>
reason for the extension module to exist is that it does non-trivial stuff<br>
in whatever its API provides. Reloading could then really create a<br>
completely new module object and replace the reference inside of the wrapper.<br>
<br>
That way, code that currently uses "from extmodule import xyz" would<br>
continue to see the original version of the module as of the time of its<br>
import, and code that just did "import extmodule" and then used attribute<br>
access at need would always see the current content of the module as it was<br>
last loaded. I think that, together with keeping module global state in the<br>
module object itself, would nicely fix both cases.<br>
</blockquote><div><br></div><div>At first blush I like this.</div><div><br></div><div class="gmail_extra"><div class="gmail_quote"><div>-eric</div><div><br></div><div>p.s. Bear with me if I've missed something in the thread. I'm slogging through a backlog of email</div>
</div></div></div></div></div>