New submission from Toni Mueller <tonimuel...@users.sourceforge.net>:

I have files that I would like to read with Python, but can't:

$ python jmlreader.py woerter-allg.jml
Traceback (most recent call last):
  File "jmlreader.py", line 14, in <module>
    readFile(sys.argv[1])
  File "jmlreader.py", line 10, in readFile
    for line in f:
  File "/usr/lib/python2.6/gzip.py", line 438, in next
    line = self.readline()
  File "/usr/lib/python2.6/gzip.py", line 393, in readline
    c = self.read(readsize)
  File "/usr/lib/python2.6/gzip.py", line 219, in read
    self._read(readsize)
  File "/usr/lib/python2.6/gzip.py", line 255, in _read
    self._read_gzip_header()
  File "/usr/lib/python2.6/gzip.py", line 156, in _read_gzip_header
    raise IOError, 'Not a gzipped file'
IOError: Not a gzipped file

The file itself is a gzipped file, though: 

$ gzip -cd woerter-allg.jml|head
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<Lesson>
<Category name="All">
<Deck>
<Card AmountLearnedBack="0" AmountLearnedFront="0" Backside="yu3 yan2  
(zhong3)&#10;&#10;Sprache" DateCreated="02-Mar-2009 16:05:15" 
DateModified="02-Mar-2009 16:05:15" DateTouched="02-Mar-2009 16:05:15" 
Frontside="语言(种)" TestsHit="0" TestsTotal="0">
<Side/>
<Side/>
</Card>
<Card AmountLearnedBack="0" AmountLearnedFront="0" 
Backside="shu4&#10;&#10;Baum, pflanzen, hervorbringen" DateCreated="02-Mar-2009 
16:06:26" DateModified="02-Mar-2009 16:06:49" DateTouched="02-Mar-2009 
16:06:26" Frontside=" 树" TestsHit="0" TestsTotal="0">
<Side/>

Unfortunately, the file is a bit weird:

$ file woerter-allg.jml
woerter-allg.jml: Zip archive data, at least v2.0 to extract

Rewriting the program with zlib, instead of gzip, didn't help a bit:

$ python jmlreader.py woerter-allg.jml.gz 
Traceback (most recent call last):
  File "jmlreader.py", line 15, in <module>
    readFile(sys.argv[1])
  File "jmlreader.py", line 11, in readFile
    unc = zlib.decompress(s)
zlib.error: Error -3 while decompressing data: incorrect header check


IOW, the file was actually compressed with gzip, not zip.

It would be nice if the gzip module could read the file. I've attached a file 
that reproduces the problem.

----------
components: Extension Modules
files: test.jml
messages: 128550
nosy: tonimueller
priority: normal
severity: normal
status: open
title: gzip.open() fails for gzipped file
type: behavior
versions: Python 2.6
Added file: http://bugs.python.org/file20760/test.jml

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue11211>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to