On Wed, Sep 05, 2018 at 06:58:04PM +0200, Hilmar Preuße wrote: > On 04.09.2018 22:49, Adrian Bunk wrote: > > Hi Adrian,
Hi Hilmar, > thanks for the report! > > > cd doc; rubber --warn all --pdf manual.tex > > Traceback (most recent call last): > > File "/usr/bin/rubber", line 17, in <module> > > sys.exit (cmdline (args)) > <snip> > > return codecs.ascii_decode(input, self.errors)[0] > > UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 10: > > ordinal not in range(128) > > make[2]: *** [Makefile:1917: doc/manual.pdf] Error 1 > > > > This is a common problem when running Python3 in C locale. > > > I.e. is this a problem, which needs to be fixed in python or do /we/ > need to fix it? If the first: is there a bug report in Debian? what defines the encoding of input files for rubber? Currently the encoding of the locale is used, on the buildds this is C which means ASCII. For why3 the relevant difference is between the nonworking LANG=C rubber --warn all --pdf manual.tex and the working LANG=C.UTF-8 rubber --warn all --pdf manual.tex How is the input encoding of the files opened by rubber defined/set? Similar to LaTeX, Python has become less tolerant in handling/discarding illegal characters and treats such cases as error. If input files are supposed to be in some fixed encoding (usually UTF-8), then this has to be passed as parameter when opening the file. > Hilmar cu Adrian -- "Is there not promise of rain?" Ling Tan asked suddenly out of the darkness. There had been need of rain for many days. "Only a promise," Lao Er said. Pearl S. Buck - Dragon Seed