[New-bugs-announce] [issue40510] [regression] ZipFile fails to round trip on some files

David Naylor report at bugs.python.org
Tue May 5 07:31:57 EDT 2020


New submission from David Naylor <naylor.b.david at gmail.com>:

With commit 18ee29d0b8 [1] a change was introduced that prevents a round-trip of some zip files (i.e. files generated by Microsoft Excel) due to the clobbering of `ZipInfo.flag_bits`[2] and `external_attr`[3].  

For example:
```python[name=zip-round-trip.py]
#!/usr/bin/env python3
import io
import sys
import zipfile

compression = zipfile.ZIP_STORED

source = sys.stdin
dest = sys.stdout

with io.BytesIO(source.buffer.read()) as source, io.BytesIO() as buffer:
    with zipfile.ZipFile(source, "r") as source_zip, zipfile.ZipFile(buffer, "w") as dest_zip:
        dest_zip.comment = source_zip.comment
        for info in source_zip.infolist():
            content = source_zip.read(info)
            dest_zip.writestr(info, content, compression)

    buffer.seek(0)
    dest.buffer.write(buffer.read())
```

```shell
> python3.5 zip-round-trip.py < Book1.zip > Python.zip
> diff Book1.zip Python.zip
> python3.6 zip-round-trip.py < Book1.zip > Python.zip
> diff Book1.zip Python.zip
Binary files Book1.zip and Python.zip differ
```

[1] https://github.com/python/cpython/commit/18ee29d0b870caddc0806916ca2c823254f1a1f9
[2] https://github.com/python/cpython/blob/f01d1be97d740ea0369379ca305646a26694236e/Lib/zipfile.py#L1579
[3] https://github.com/python/cpython/blob/f01d1be97d740ea0369379ca305646a26694236e/Lib/zipfile.py#L1586

----------
components: Library (Lib)
files: Book1.zip
messages: 368130
nosy: DragonSA
priority: normal
severity: normal
status: open
title: [regression] ZipFile fails to round trip on some files
type: behavior
versions: Python 3.6
Added file: https://bugs.python.org/file49121/Book1.zip

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue40510>
_______________________________________


More information about the New-bugs-announce mailing list