Mailman 3 Exposing Tools/parser/unparse.py in the stdlib? - Python-Dev

Exposing Tools/parser/unparse.py in the stdlib?

older
[RELEASE] Python 3.9.0a1 available...

Pablo Galindo Salgado

19 Nov 2019 19 Nov '19

6:10 a.m.

Hi, What do people feel about exposing Tools/parser/unparse.py in the standard library? Here is my initial rationale: * The tool already needs to be maintained and updated as is tested as part of the test suite. * I have used the tool almost all the time I needed to deal with AST transformations. * The public interface will have a very low surface API, keeping maintaining it (the public interface) a very small burden IMHO. We could add the public interface to the ast.py module or a new one if people feel strongly about it. Does anyone feel strongly against this or have any objection that I am not contemplating? Regards from rainy London, Pablo Galindo Salgado

Attachments:

attachment.htm (text/html — 2.0 KB)

Show replies by date

Mahmoud Hashemi

19 Nov 19 Nov

6:35 a.m.

From the first time I found it years ago, I've often wondered why it wasn't exposed. I presumed due to certain API shifts I wasn't paying close attention to, that maybe the ast module was buffering. But I've definitely wanted to import it in the past. If the API is as stable as you say, then I guess the only question I'd have is: will the (potential) upcoming shift to PEG [1] change that? :P [1]: https://www.youtube.com/watch?v=QppWTvh7_sI On Mon, Nov 18, 2019 at 4:48 PM Pablo Galindo Salgado wrote:

...

Hi,

What do people feel about exposing Tools/parser/unparse.py in the standard library? Here is my initial rationale:

* The tool already needs to be maintained and updated as is tested as part of the test suite. * I have used the tool almost all the time I needed to deal with AST transformations. * The public interface will have a very low surface API, keeping maintaining it (the public interface) a very small burden IMHO.

We could add the public interface to the ast.py module or a new one if people feel strongly about it.

Does anyone feel strongly against this or have any objection that I am not contemplating?

Regards from rainy London, Pablo Galindo Salgado _______________________________________________ Python-Dev mailing list -- python-dev@python.org To unsubscribe send an email to python-dev-leave@python.org https://mail.python.org/mailman3/lists/python-dev.python.org/ Message archived at https://mail.python.org/archives/list/python-dev@python.org/message/JAQDBMC2... Code of Conduct: http://python.org/psf/codeofconduct/

Guido van Rossum

1:21 p.m.

I don't think the PEG parser switch would change anything -- it produces the same AST as the old parser and ast.parse(). It looks a reasonable thing to move into the stdlib (maybe as ast.unparse()?), assuming it's complete. Tests and doc need to be written. On Tue, Nov 19, 2019 at 1:08 AM Mahmoud Hashemi wrote:

...

From the first time I found it years ago, I've often wondered why it wasn't exposed. I presumed due to certain API shifts I wasn't paying close attention to, that maybe the ast module was buffering. But I've definitely wanted to import it in the past.

If the API is as stable as you say, then I guess the only question I'd have is: will the (potential) upcoming shift to PEG [1] change that? :P

[1]: https://www.youtube.com/watch?v=QppWTvh7_sI

On Mon, Nov 18, 2019 at 4:48 PM Pablo Galindo Salgado wrote:

...
Hi,

What do people feel about exposing Tools/parser/unparse.py in the standard library? Here is my initial rationale:

* The tool already needs to be maintained and updated as is tested as part of the test suite. * I have used the tool almost all the time I needed to deal with AST transformations. * The public interface will have a very low surface API, keeping maintaining it (the public interface) a very small burden IMHO.

We could add the public interface to the ast.py module or a new one if people feel strongly about it.

Does anyone feel strongly against this or have any objection that I am not contemplating?

Regards from rainy London, Pablo Galindo Salgado _______________________________________________ Python-Dev mailing list -- python-dev@python.org To unsubscribe send an email to python-dev-leave@python.org https://mail.python.org/mailman3/lists/python-dev.python.org/ Message archived at https://mail.python.org/archives/list/python-dev@python.org/message/JAQDBMC2... Code of Conduct: http://python.org/psf/codeofconduct/

_______________________________________________ Python-Dev mailing list -- python-dev@python.org To unsubscribe send an email to python-dev-leave@python.org https://mail.python.org/mailman3/lists/python-dev.python.org/ Message archived at https://mail.python.org/archives/list/python-dev@python.org/message/WYVOBWFZ... Code of Conduct: http://python.org/psf/codeofconduct/

-- --Guido van Rossum (python.org/~guido) *Pronouns: he/him **(why is my pronoun here?)* http://feministing.com/2015/02/03/how-using-they-as-a-singular-pronoun-can-c...

Victor Stinner

20 Nov 20 Nov

2:26 a.m.

Hi, Le mar. 19 nov. 2019 à 01:42, Pablo Galindo Salgado a écrit :

...

What do people feel about exposing Tools/parser/unparse.py in the standard library?

I like the idea. I compared Lib/os.py to the unparse output and now I have many questions :-) (*) unparse loses all formatting and comments. The output is not really "pretty". If you want to put unparse in the stdlib, maybe we should design an API to be able to plug an external formatter to get "PEP 8"-like coding style or Black coding style for example. (*) unparse adds many useless parenthesis. The algorithm seems naive. For example, it adds "()" to "class _AddedDllDirectory():". It also adds parenthesis around yield, like "(yield (top, dirs, nondirs))", whereas the AST node was at "top level": it isn't a sub-expression. Maybe this algortihm should be made smarter. (*) the main() function is only used for testing. I would prefer to move this code to Lib/test/test_tools/test_unparse.py and simply remove it. For example, to avoid "import os" in unparse. (*) newlines in docstring are rendered as two characters: "\\" + "n" (escaped newline: \n), rather than a newline character. I would expect a newline, it's more common that \n... But it may "break" inline doctests rendering... Maybe it should be an option (render newlines as newline character or escape them?), or maybe even let the user choose how to render them using a callback (that's related to the "pluggable formatter" question). (*) Do you plan to provide any warranty about the output stability? Like Black formatter which tries to avoid changing output between minor versions. Or should we expect to get a different output per Python version? If we provide a warranty, new tests should be written. Currently, test_unparse only compares AST, not the text output. (*) Indentation is hardcoded to 4 spaces. What if I want 2 spaces or something different? Should it become an option? (*) Float infinity is replaces with 1e309. Again, maybe someone wants to render this differently? It sounds like an arbitrary choice (which "works" as expected). (*) Do we talk about 'string' vs "string" quotes? :-) Maybe it would help to have a written down PEP to answer to these questions and maybe specify the module API.

...

We could add the public interface to the ast.py module or a new one if people feel strongly about it.

I would prefer to keep a separated module, like "import ast.unparse" or "import unparse". Victor -- Night gathers, and now my watch begins. It shall not end until my death.

Pablo Galindo Salgado

2:42 a.m.

...

(*) unparse loses all formatting and comments. The output is not really "pretty". If you want to put unparse in the stdlib, maybe we should design an API to be able to plug an external formatter to get "PEP 8"-like coding style or Black coding style for example.

The ast does not have comment or formatting information so those will be lost as soon as you have an ast object. Formatting code with a formatter after generating it should be straightforward enough that I am not sure that we should expose an API to hook that into the unparser. Especially because it will likely imply still an adaptor over the autoformatted API.

...

unparse adds many useless parenthesis. The algorithm seems naive. For example, it adds "()" to "class _AddedDllDirectory():". It also adds parenthesis around yield, like "(yield (top, dirs, nondirs))", whereas the AST node was at "top level": it isn't a sub-expression. Maybe this algortihm should be made smarter.

I agree as this hurts visibility a bit, we can solve this issue separately.

...

the main() function is only used for testing. I would prefer to move this code to Lib/test/test_tools/test_unparse.py and simply remove it. For example, to avoid "import os" in unparse.

Totally agreed.

...

Do you plan to provide any warranty about the output stability? Like Black formatter which tries to avoid changing output between minor versions. Or should we expect to get a different output per Python version? If we provide a warranty, new tests should be written. Currently, test_unparse only compares AST, not the text output.

...

newlines in docstring are rendered as two characters: "\\" + "n" (escaped newline: \n), rather than a newline character. I would expect a newline, it's more common that \n... But it may "break" inline doctests rendering... Maybe it should be an option (render newlines as newline character or escape them?), or maybe even let the user choose how to render them using a callback (that's related to the "pluggable

I would say that it provides the same stability as the ast module does. In particular, I think the contract is that ast.parse(ast.unparse(the_ast)) should produce the same ast as the_ast. Any other guaranteed over the generated code can bite us very soon as is very limiting. Maybe we can think about stabilizing it if many users ask for it. formatter" question). Hummmm, i still prefer that the unparse() function does something very straighfoward and anything else can be done via post-processing (or maybe inheriting from the Unparser class if we would like to go that route). This is based on my opinion that providing guarantees too soon over the generated code other than the ast generated by parsing is the same can limit us very soon.

...

Indentation is hardcoded to 4 spaces. What if I want 2 spaces or something different? Should it become an option?

No in my opinion (at least initially).

...

Float infinity is replaced with 1e309. Again, maybe someone wants to render this differently? It sounds like an arbitrary choice (which "works" as expected).

That is not true. float('inf') is rendered as float('inf´):

...

...
...
Unparser(ast.parse("float('inf')")) float('inf')

...

Do we talk about 'string' vs "string" quotes?

Sounds like a painful discussion. But I would say that it does not matter for the time being (the resulting code is not something that should be maintained). And if you want to stabilize you can use an autoformatted.

...

Maybe it would help to have a written down PEP to answer to these questions and maybe specify the module API.

I think a PEP may be too much but I can write if people think it makes sense. Being the API as simple as 'ast.unparse' I think is unnecessary as my view is that we should not guarantee anything other than roundtrip over the generated source initially.

...

I would prefer to keep a separated module, like "import ast.unparse" or "import unparse".

Why? I think ast.unparse is a natural fit. It will likely be only one function exposed. On Tue, 19 Nov 2019 at 20:57, Victor Stinner wrote:

...

Hi,

Le mar. 19 nov. 2019 à 01:42, Pablo Galindo Salgado a écrit :

...
What do people feel about exposing Tools/parser/unparse.py in the standard library?

I like the idea. I compared Lib/os.py to the unparse output and now I have many questions :-)

(*) unparse loses all formatting and comments. The output is not really "pretty". If you want to put unparse in the stdlib, maybe we should design an API to be able to plug an external formatter to get "PEP 8"-like coding style or Black coding style for example.

(*) unparse adds many useless parenthesis. The algorithm seems naive. For example, it adds "()" to "class _AddedDllDirectory():". It also adds parenthesis around yield, like "(yield (top, dirs, nondirs))", whereas the AST node was at "top level": it isn't a sub-expression. Maybe this algortihm should be made smarter.

(*) the main() function is only used for testing. I would prefer to move this code to Lib/test/test_tools/test_unparse.py and simply remove it. For example, to avoid "import os" in unparse.

(*) newlines in docstring are rendered as two characters: "\\" + "n" (escaped newline: \n), rather than a newline character. I would expect a newline, it's more common that \n... But it may "break" inline doctests rendering... Maybe it should be an option (render newlines as newline character or escape them?), or maybe even let the user choose how to render them using a callback (that's related to the "pluggable formatter" question).

(*) Do you plan to provide any warranty about the output stability? Like Black formatter which tries to avoid changing output between minor versions. Or should we expect to get a different output per Python version? If we provide a warranty, new tests should be written. Currently, test_unparse only compares AST, not the text output.

(*) Indentation is hardcoded to 4 spaces. What if I want 2 spaces or something different? Should it become an option?

(*) Float infinity is replaces with 1e309. Again, maybe someone wants to render this differently? It sounds like an arbitrary choice (which "works" as expected).

(*) Do we talk about 'string' vs "string" quotes? :-)

Maybe it would help to have a written down PEP to answer to these questions and maybe specify the module API.

...
We could add the public interface to the ast.py module or a new one if people feel strongly about it.

I would prefer to keep a separated module, like "import ast.unparse" or "import unparse".

Victor -- Night gathers, and now my watch begins. It shall not end until my death.

Victor Stinner

3:36 a.m.

Le mar. 19 nov. 2019 à 22:12, Pablo Galindo Salgado a écrit :

...

...
Float infinity is replaced with 1e309. Again, maybe someone wants to render this differently? It sounds like an arbitrary choice (which "works" as expected).

That is not true. float('inf') is rendered as float('inf´):

...
...
...
Unparser(ast.parse("float('inf')")) float('inf')

I was thinking at:

...

...
...
Unparser(ast.parse("1e999")) 1e309

Maybe just move the constant as a class attribute, so it can be overriden in a subclass?

...

...
I would prefer to keep a separated module, like "import ast.unparse" or "import unparse".

Why? I think ast.unparse is a natural fit. It will likely be only one function exposed.

It's mostly to minimize the number of imports on "import ast". unparse requires extra imports like tokenize which has also tons of dependencies. Victor -- Night gathers, and now my watch begins. It shall not end until my death.

Brett Cannon

10:55 p.m.

Victor Stinner wrote:

...

Le mar. 19 nov. 2019 à 22:12, Pablo Galindo Salgado pablogsal@gmail.com a écrit :

...
Float infinity is replaced with 1e309. Again, maybe someone wants to render this differently? It sounds like an arbitrary choice (which "works" as expected). That is not true. float('inf') is rendered as float('inf´): Unparser(ast.parse("float('inf')")) float('inf') I was thinking at: Unparser(ast.parse("1e999")) 1e309 Maybe just move the constant as a class attribute, so it can be overriden in a subclass? I would prefer to keep a separated module, like "import ast.unparse" or "import unparse". Why? I think ast.unparse is a natural fit. It will likely be only one function exposed. It's mostly to minimize the number of imports on "import ast". unparse requires extra imports like tokenize which has also tons of dependencies.

Why are you specifically worried about that? If you're doing AST transformations you're probably either doing it offline/AOT or you're already doing extra processing which means you probably aren't stressing over how quickly an initial import will take if this is all worries surrounding doing live AST transformations on the way to writing out bytecode or loading a module on the fly.

...

Victor Night gathers, and now my watch begins. It shall not end until my death.

Nick Coghlan

21 Nov 21 Nov

3:17 a.m.

On Thu., 21 Nov. 2019, 3:30 am Brett Cannon, wrote:

...

Victor Stinner wrote:

...
...
It's mostly to minimize the number of imports on "import ast". unparse requires extra imports like tokenize which has also tons of dependencies.

Why are you specifically worried about that? If you're doing AST transformations you're probably either doing it offline/AOT or you're already doing extra processing which means you probably aren't stressing over how quickly an initial import will take if this is all worries surrounding doing live AST transformations on the way to writing out bytecode or loading a module on the fly.

The AST module is also the home of "ast.literal_eval", and that is going to be used in a much wider variety of situations. Cheers, Nick.

...

Nathaniel Smith

20 Nov 20 Nov

2:42 a.m.

On Mon, Nov 18, 2019 at 4:41 PM Pablo Galindo Salgado wrote:

...

Hi,

What do people feel about exposing Tools/parser/unparse.py in the standard library? Here is my initial rationale:

* The tool already needs to be maintained and updated as is tested as part of the test suite. * I have used the tool almost all the time I needed to deal with AST transformations. * The public interface will have a very low surface API, keeping maintaining it (the public interface) a very small burden IMHO.

We could add the public interface to the ast.py module or a new one if people feel strongly about it.

How does it compare to Berker's popular and well-maintained PyPI package for this? https://github.com/berkerpeksag/astor -n -- Nathaniel J. Smith -- https://vorpus.org

Guido van Rossum

11:26 a.m.

On Tue, Nov 19, 2019 at 10:24 PM Nathaniel Smith wrote:

...

On Mon, Nov 18, 2019 at 4:41 PM Pablo Galindo Salgado wrote:

...
Hi,

What do people feel about exposing Tools/parser/unparse.py in the

standard library? Here is my initial rationale:

...
* The tool already needs to be maintained and updated as is tested as

...
* I have used the tool almost all the time I needed to deal with AST

...
* The public interface will have a very low surface API, keeping

part of the test suite. transformations. maintaining it (the public interface) a very small burden IMHO.

...
We could add the public interface to the ast.py module or a new one if

people feel strongly about it.

How does it compare to Berker's popular and well-maintained PyPI package for this? https://github.com/berkerpeksag/astor

Does that even have unparse() functionality? From the README it seems to focus on a nicer ast.dump(), which is quite different (in behavior and how it's used) from unparse(). -- --Guido van Rossum (python.org/~guido) *Pronouns: he/him **(why is my pronoun here?)* http://feministing.com/2015/02/03/how-using-they-as-a-singular-pronoun-can-c...

Batuhan Taskaya

12:32 p.m.

Yes, there is a to_source function which allows AST to source conversation. On Wed, Nov 20, 2019, 9:02 AM Guido van Rossum wrote:

...

On Tue, Nov 19, 2019 at 10:24 PM Nathaniel Smith wrote:

...
On Mon, Nov 18, 2019 at 4:41 PM Pablo Galindo Salgado wrote:

...
Hi,

What do people feel about exposing Tools/parser/unparse.py in the

standard library? Here is my initial rationale:

...
* The tool already needs to be maintained and updated as is tested as

...
* I have used the tool almost all the time I needed to deal with AST

...
* The public interface will have a very low surface API, keeping

part of the test suite. transformations. maintaining it (the public interface) a very small burden IMHO.

...
We could add the public interface to the ast.py module or a new one if

people feel strongly about it.

How does it compare to Berker's popular and well-maintained PyPI package for this? https://github.com/berkerpeksag/astor

Does that even have unparse() functionality? From the README it seems to focus on a nicer ast.dump(), which is quite different (in behavior and how it's used) from unparse().

-- --Guido van Rossum (python.org/~guido) *Pronouns: he/him **(why is my pronoun here?)* http://feministing.com/2015/02/03/how-using-they-as-a-singular-pronoun-can-c... _______________________________________________ Python-Dev mailing list -- python-dev@python.org To unsubscribe send an email to python-dev-leave@python.org https://mail.python.org/mailman3/lists/python-dev.python.org/ Message archived at https://mail.python.org/archives/list/python-dev@python.org/message/3Z34GG2B... Code of Conduct: http://python.org/psf/codeofconduct/

Batuhan Taskaya

3:01 a.m.

This is a really great idea, after pretty ast.dump it will be great addition for people who works with AST code transformations. I think with the new end_lineno and end_col_offset information output can be little bit close the original. I want to volunteer the work if there is an open opportunity after consensus about adding this. On Tue, Nov 19, 2019, 3:45 AM Pablo Galindo Salgado wrote:

...

Hi,

What do people feel about exposing Tools/parser/unparse.py in the standard library? Here is my initial rationale:

* The tool already needs to be maintained and updated as is tested as part of the test suite. * I have used the tool almost all the time I needed to deal with AST transformations. * The public interface will have a very low surface API, keeping maintaining it (the public interface) a very small burden IMHO.

We could add the public interface to the ast.py module or a new one if people feel strongly about it.

Does anyone feel strongly against this or have any objection that I am not contemplating?

Regards from rainy London, Pablo Galindo Salgado _______________________________________________ Python-Dev mailing list -- python-dev@python.org To unsubscribe send an email to python-dev-leave@python.org https://mail.python.org/mailman3/lists/python-dev.python.org/ Message archived at https://mail.python.org/archives/list/python-dev@python.org/message/JAQDBMC2... Code of Conduct: http://python.org/psf/codeofconduct/

Ivan Levkivskyi

5:48 a.m.

Just wanted to add my +1 to this idea. Moving it to ast module will add little maintenance costs, but will make it easier to use. -- Ivan On Tue, 19 Nov 2019 at 00:46, Pablo Galindo Salgado wrote:

...

Hi,

What do people feel about exposing Tools/parser/unparse.py in the standard library? Here is my initial rationale:

* The tool already needs to be maintained and updated as is tested as part of the test suite. * I have used the tool almost all the time I needed to deal with AST transformations. * The public interface will have a very low surface API, keeping maintaining it (the public interface) a very small burden IMHO.

We could add the public interface to the ast.py module or a new one if people feel strongly about it.

Does anyone feel strongly against this or have any objection that I am not contemplating?

Regards from rainy London, Pablo Galindo Salgado _______________________________________________ Python-Dev mailing list -- python-dev@python.org To unsubscribe send an email to python-dev-leave@python.org https://mail.python.org/mailman3/lists/python-dev.python.org/ Message archived at https://mail.python.org/archives/list/python-dev@python.org/message/JAQDBMC2... Code of Conduct: http://python.org/psf/codeofconduct/

Pablo Galindo Salgado

21 Nov 21 Nov

4:21 a.m.

Opened https://bugs.python.org/issue38870 to track this. On Tue, 19 Nov 2019 at 00:40, Pablo Galindo Salgado wrote:

...

Hi,

What do people feel about exposing Tools/parser/unparse.py in the standard library? Here is my initial rationale:

* The tool already needs to be maintained and updated as is tested as part of the test suite. * I have used the tool almost all the time I needed to deal with AST transformations. * The public interface will have a very low surface API, keeping maintaining it (the public interface) a very small burden IMHO.

We could add the public interface to the ast.py module or a new one if people feel strongly about it.

Does anyone feel strongly against this or have any objection that I am not contemplating?

Regards from rainy London, Pablo Galindo Salgado

1617

Age (days ago)

1618

Last active (days ago)

List overview

Download

13 comments

9 participants

participants (9)

Batuhan Taskaya
Brett Cannon
Guido van Rossum
Ivan Levkivskyi
Mahmoud Hashemi
Nathaniel Smith
Nick Coghlan
Pablo Galindo Salgado
Victor Stinner

Exposing Tools/parser/unparse.py in the stdlib?

tags

participants (9)