[Python-Dev] What does a double coding cookie mean?
Serhiy Storchaka
storchaka at gmail.com
Wed Mar 16 03:59:55 EDT 2016
On 16.03.16 09:46, Glenn Linderman wrote:
> On 3/16/2016 12:09 AM, Serhiy Storchaka wrote:
>> On 16.03.16 08:34, Glenn Linderman wrote:
>>> From the PEP 263:
>>>
>>>> More precisely, the first or second line must match the regular
>>>> expression "coding[:=]\s*([-\w.]+)". The first group of this
>>>> expression is then interpreted as encoding name. If the encoding
>>>> is unknown to Python, an error is raised during compilation. There
>>>> must not be any Python statement on the line that contains the
>>>> encoding declaration.
>>>
>>> Clearly the regular expression would only match the first of multiple
>>> cookies on the same line, so the first one should always win... but
>>> there should only be one, from the first PEP quote "a magic comment".
>>
>> "The first group of this expression" means the first regular
>> expression group. Only the part between parenthesis "([-\w.]+)" is
>> interpreted as encoding name, not all expression.
>
> Sure. But there is no mention anywhere in the PEP of more than one
> being legal: just more than one position for it, EITHER line 1 or line
> 2. So while the regular expression mentioned is not anchored, to allow
> variation in syntax between emacs and vim, "must match the regular
> expression" doesn't imply "several times", and when searching for a
> regular expression that might not be anchored, one typically expects to
> find the first.
Actually "must match the regular expression" is not correct, because
re.match() implies anchoring at the start. I have proposed more correct
regular expression in other branch of this thread.
More information about the Python-Dev
mailing list