[replying to both Ping and Michael in the same email]<br><br><div><span class="gmail_quote">On 7/6/06, <b class="gmail_sendername">Michael Chermside</b> &lt;<a href="mailto:mcherm@mcherm.com">mcherm@mcherm.com</a>&gt; wrote:

</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Ka-Ping Yee writes:<br>&gt; i'm starting to think<br>&gt; that it would be good to clarify what kinds of threats we are

<br>&gt; trying to defend against, and specify what invariants we are<br>&gt; intending to preserve.<br><br>Yes!<br><br>&gt; So here are a couple of questions for clarification (some with my<br>&gt; guesses as to their answers):

<br><br>Okay, I'll throw in my thoughts also.<br><br>&gt; 1.&nbsp;&nbsp;When we say &quot;restricted/untrusted/&lt;whatever&gt; interpreter&quot; we<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp; don't really mean that the *interpreter* is untrusted, right?<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp; We mean that the Python code that runs in that interpreter is

<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp; untrusted (i.e. to be prevented from doing harm), right?<br><br>Agreed. My interpretation of the proposal was that interpreters<br>were either &quot;sandboxed&quot; or &quot;trusted&quot;. &quot;Sandboxed&quot; means that there

are security restrictions imposed at some level (perhaps even NO restrictions). &quot;Trusted&quot; means that the interpreter implements no security restrictions (beyond what CPython already implements, which isn't much) and thus runs faster.

</blockquote><div><br>Yep. <br></div><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">&gt; 2.&nbsp;&nbsp;I'm assuming that the implementation of the Python interpreter

<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp; is always trusted<br><br>Sure... it's got to be.</blockquote><div><br>Yep. <br></div><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

&gt; What do<br>&gt; we take the Trusted Computing Base to include?&nbsp;&nbsp;The Python VM<br>&gt; implementation -- plus all the builtin objects and C modules?<br>&gt; Plus the whole standard library?<br><br>My interpretation of Brett's proposal is that the CPython developers

<br>would try to ensure that the Python VM had no &quot;security holes&quot; when<br>running in sandboxed mode. Of course, we also &quot;try&quot; to ensure no<br>crashes are possible, and while we're quite good, we're not

<br>perfect.<br><br>Beyond that, all pure-python modules with source available (whether<br>in the stdlib or not) can be &quot;trusted&quot; because they run in a<br>sandboxed VM. All C modules are *up to the user*. Brett proposes

<br>to provide a default list of useful-but-believed-to-be-safe modules<br>in the stdlib, but the user can configure the C-module whitelist<br>to whatever she desires.</blockquote><div><br>Michael has it on the money.<br>

</div><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">&gt; 3.&nbsp;&nbsp;Is it part of the plan that we want to protect Python code from<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp; other Python code?&nbsp;&nbsp;For example, should a Python program/function

<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp; X be able to say &quot;i want to launch/call program/function Y with<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp; *these* parameters and have it run under *these* limitations?&quot;<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp; This has a big impact on the model.<br><br>Now *that* is a good question. I would say the answer is a partial

<br>&quot;no&quot;, because there are pieces of Brett's security model that are<br>tied to the interpreter instance. Python code cannot launch another<br>interpreter (but perhaps it *should* be able to?), so it cannot<br>

modify those restrictions for new Python code it launches.<br><br>However, I would rather like to allow Python code to execute other<br>code with greater restrictions, although I would accept all kinds<br>of limitations and performance penalties to do so. I would be

<br>satisfied if the caller could restrict certain things (like web<br>and file access) but not others (like memory limits or use of<br>stdout). I would be satisfied if the caller paid huge overhead costs<br>of launching a separate interpreter -- heck, even a separate

process. And if it is willing to launch a separate process, then Brett's model works just fine: allow the calling code to start a new (restricted) Python VM.</blockquote><div><br>The plan is that there will be no sandboxed eval() that runs unsafe code inside a trusted interpreter's own namespace.&nbsp; I hope to give Python code a way to run a sandboxed interpreter that you can pass a string to execute, but the namespace for that sandboxed interpreter will be fresh and will not carry over in any way from the trusted interpreter.

<br></div><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">&gt; We want to be able to guarantee that...<br>&gt;<br>&gt;&nbsp;&nbsp; A.&nbsp;&nbsp;The interpreter will not crash no matter what Python code

<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; it is given to execute.<br><br>Agreed. We already want to guarantee that, with the caveat that the guarantee doesn't apply to a few special modules (like ctypes).</blockquote><div><br>Right, which is why I have been trying to plug the various known crashers that do not rely on a specific extension module being imported.

</div><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">&gt;&nbsp;&nbsp; B.&nbsp;&nbsp;Python programs running in different interpreters embedded<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; in the same process cannot communicate with each other.

<br><br>I don't want to guarantee this; does someone else? It's astonishingly hard... there are all kinds of clever &quot;knock on the walls&quot; tricks. For instance, communicate by varying your CPU utilization up and down in regular patterns.

<br><br>I'd be satisfied if they could pass information (perhaps we could even someday provide a library making it *easy* to do so), but could not pass unforgeable items like Python object references, open file descriptors, and so forth.

</blockquote><div><br>Or at least cannot communicate without explicit allowances to do so.<br><br>As for knocking on the walls, if you protect access to that kind of information well, it shouldn't be a problem.<br></div><br>

<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">&gt;&nbsp;&nbsp; C.&nbsp;&nbsp;Python programs running in different interpreters embedded<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; in the same process cannot access each other's Python objects.

<br><br>I'd strengthen that slightly to all &quot;unforgeable&quot; items, not just object references.</blockquote><div><br>I would add the caveat that whatever is exposed as an attribute of a C extension module will be shared.&nbsp; That is an implementation detail of multiple interpreters.

</div><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">&gt;&nbsp;&nbsp; D.&nbsp;&nbsp;A given piece of Python code cannot access or communicate<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; with certain Python objects in the same interpreter.

<br>&gt;<br>&gt;&nbsp;&nbsp; E.&nbsp;&nbsp;A given piece of Python code can access only a limited set<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; of Python objects in the same interpreter.<br><br>Hmmm. I'm not sure.</blockquote><div><br>Not quite sure what you are getting at here, Ping.&nbsp; Are you asking whether code running within an interpreter (sandboxed or not) could be restricted even further, beyond what the interpreter has been given by its security settings?

<br><br>These emails have convinced me to add a &quot;Threat Model&quot; section to the next draft of the design doc.<br></div><br>-Brett<br><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

-- Michael Chermside<br></blockquote></div><br>