[Python-ideas] pathlib suggestions

Philipp A. flying-sheep at web.de
Thu Jan 26 05:01:02 EST 2017


How about adding a new argument to with_suffix?

Path.with_suffix(suffix: str,
                 stripped: Union[int, str, Iterable[str]]=1)

stripped would receive either an int (in which case up to that many
suffixes are stripped greedily), an (optionally compound) suffix that is
stripped only if present verbatim, or an iterable of suffix strings, in
which case every trailing suffix found in the iterable is stripped, as many
times as it occurs. Examples:

# current behavior
Path('flop.pkg.tar.gz').with_suffix('') → Path('flop.pkg.tar')

# you have to know what you’re doing; 3 would have stripped '.pkg' too
Path('flop.pkg.tar.gz').with_suffix('', 2) → Path('flop.pkg')

Path('flop.pkg.tar.gz').with_suffix('', '.tar.gz') → Path('flop.pkg')

# not stripped, the suffix doesn’t appear verbatim
Path('flop.pkg.tar.gz').with_suffix('', '.gz.tar') → Path('flop.pkg.tar.gz')

# all instances stripped; probably useless
Path('flop.pkg.tar.gz.tar').with_suffix('', ['.gz', '.tar']) → Path('flop.pkg')
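
To make the three cases concrete, here is a rough sketch written as a free function rather than an actual pathlib method. The name with_suffix_stripped and the helper _strip_last are made up for illustration, and the handling of edge cases (an empty stripped string, names without any suffix) is my own assumption, not part of the proposal:

```python
from pathlib import PurePath
from typing import Iterable, Tuple, Union


def _strip_last(name: str) -> Tuple[str, str]:
    """Split off the last suffix; returns (rest, suffix), suffix may be ''."""
    ext = PurePath(name).suffix
    return (name[:-len(ext)], ext) if ext else (name, '')


def with_suffix_stripped(path: PurePath, suffix: str,
                         stripped: Union[int, str, Iterable[str]] = 1) -> PurePath:
    """Hypothetical stand-in for the proposed Path.with_suffix(suffix, stripped)."""
    name = path.name
    if isinstance(stripped, int):
        # Greedily strip up to `stripped` suffixes.
        for _ in range(stripped):
            name, ext = _strip_last(name)
            if not ext:
                break
    elif isinstance(stripped, str):
        # Strip a (possibly compound) suffix only if it appears verbatim.
        if stripped and name.endswith(stripped) and len(name) > len(stripped):
            name = name[:-len(stripped)]
    else:
        # Strip trailing suffixes for as long as they are in the iterable.
        allowed = set(stripped)
        while True:
            rest, ext = _strip_last(name)
            if not ext or ext not in allowed:
                break
            name = rest
    return path.with_name(name + suffix)


p = PurePath('flop.pkg.tar.gz')
print(with_suffix_stripped(p, ''))             # flop.pkg.tar
print(with_suffix_stripped(p, '', 2))          # flop.pkg
print(with_suffix_stripped(p, '', '.tar.gz'))  # flop.pkg
print(with_suffix_stripped(p, '', '.gz.tar'))  # flop.pkg.tar.gz
```

The str check has to come before the iterable branch, since a str is itself an iterable of characters.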


Franklin? Lee <leewangzhong+python at gmail.com> wrote on Wed, 25 Jan 2017
at 21:44:

> > A ".tar.gz" is not the same as a ".svg.gz".  The fact that they are both
> > gzip-compressed is an implementation detail as far as most software I
> > deal with is concerned.  My unarchiver will extract a ".tar.gz" into a
> > directory as if it was just a ".tar", while my image viewer will view a
> > ".svg.gz" as a vector image as if it was just a ".svg".  From a
> > user-interaction standpoint, the ".gz" part is ignored.
>
> Just to be sure we're on the same page:
> - A .tar file is an uncompressed bundle of files.
> - A .gz file is a compressed version of a single file.
> - Technically, there's no such thing as a .tar.gz file. "x.tar.gz"
> means that if you unwrap it with gunzip, you'll get a file called
> "x.tar", which you can then unpack with tar.
>
> "x.tar.gz" is not a tar file using the gzip compression. It's a gz
> file which unpacks to a tar file. Conceptually, your unarchiver does
> it in two separate steps.
>
> Similarly, "x.svg.gz" is a gz file which unpacks to an svg file. Your
> viewer just knows to unzip it before use.
>
> I don't wanna appear as a naysayer, so here's an alternative
> suggestion: A parameter for a collection of "extension suffixes". The
> function will try to eat extensions from the end until it finds one
> NOT on the list (or it runs out). The docs can recommend `('gz', 'xz',
> 'bz', 'bz2', ...)`. Maybe a later Python version can use that
> recommendation as the default.
>
> IMO, ".part1" is not a part of the extension. You'd usually have
> "x.part1.rar" and "x.part2.rar" in the same folder, and it makes more
> sense that there are two files with base names "x.part1" and "x.part2"
> than to have two different files with the same base name and an
> extension which just keeps them ordered.