[New-bugs-announce] [issue11211] gzip.open() fails for gzipped file

Toni Mueller report at bugs.python.org
Mon Feb 14 16:08:19 CET 2011


New submission from Toni Mueller <tonimueller at users.sourceforge.net>:

I have files that I would like to read with Python, but can't:

$ python jmlreader.py woerter-allg.jml
Traceback (most recent call last):
  File "jmlreader.py", line 14, in <module>
    readFile(sys.argv[1])
  File "jmlreader.py", line 10, in readFile
    for line in f:
  File "/usr/lib/python2.6/gzip.py", line 438, in next
    line = self.readline()
  File "/usr/lib/python2.6/gzip.py", line 393, in readline
    c = self.read(readsize)
  File "/usr/lib/python2.6/gzip.py", line 219, in read
    self._read(readsize)
  File "/usr/lib/python2.6/gzip.py", line 255, in _read
    self._read_gzip_header()
  File "/usr/lib/python2.6/gzip.py", line 156, in _read_gzip_header
    raise IOError, 'Not a gzipped file'
IOError: Not a gzipped file

The file itself is a gzipped file, though: 

$ gzip -cd woerter-allg.jml|head
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<Lesson>
<Category name="All">
<Deck>
<Card AmountLearnedBack="0" AmountLearnedFront="0" Backside="yu3 yan2  (zhong3)&#10;&#10;Sprache" DateCreated="02-Mar-2009 16:05:15" DateModified="02-Mar-2009 16:05:15" DateTouched="02-Mar-2009 16:05:15" Frontside="语言(种)" TestsHit="0" TestsTotal="0">
<Side/>
<Side/>
</Card>
<Card AmountLearnedBack="0" AmountLearnedFront="0" Backside="shu4&#10;&#10;Baum, pflanzen, hervorbringen" DateCreated="02-Mar-2009 16:06:26" DateModified="02-Mar-2009 16:06:49" DateTouched="02-Mar-2009 16:06:26" Frontside=" 树" TestsHit="0" TestsTotal="0">
<Side/>

Unfortunately, the file is a bit weird:

$ file woerter-allg.jml
woerter-allg.jml: Zip archive data, at least v2.0 to extract

Rewriting the program with zlib, instead of gzip, didn't help a bit:

$ python jmlreader.py woerter-allg.jml.gz 
Traceback (most recent call last):
  File "jmlreader.py", line 15, in <module>
    readFile(sys.argv[1])
  File "jmlreader.py", line 11, in readFile
    unc = zlib.decompress(s)
zlib.error: Error -3 while decompressing data: incorrect header check


IOW, the file was actually compressed with gzip, not zip.

It would be nice if the gzip module could read the file. I've attached a file that reproduces the problem.

----------
components: Extension Modules
files: test.jml
messages: 128550
nosy: tonimueller
priority: normal
severity: normal
status: open
title: gzip.open() fails for gzipped file
type: behavior
versions: Python 2.6
Added file: http://bugs.python.org/file20760/test.jml

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue11211>
_______________________________________


More information about the New-bugs-announce mailing list