Package: xmlstarlet Version: 1.6.1-1 Severity: normal -----cutting here may damage your screen surface----- tglase@tglase:~ $ xmlstarlet sel -t -c //name -n <test.gpx; echo $?
0 tglase@tglase:~ $ xmlstarlet sel -t -c //name -n <test2.gpx; echo $? <name>Test © file</name> 0 tglase@tglase:~ $ cat test.gpx; echo <?xml version="1.0" encoding="utf-8"?> <gpx xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" version="1.0" creator="Groundspeak GeoTour" xsi:schemaLocation="http://www.topografix.com/GPX/1/0 http://www.topografix.com/GPX/1/0/gpx.xsd http://www.groundspeak.com/cache/1/0 http://www.groundspeak.com/cache/1/0/cache.xsd" xmlns="http://www.topografix.com/GPX/1/0"> <name>Test © file</name> </gpx> tglase@tglase:~ $ cat test2.gpx; echo <?xml version="1.0" encoding="utf-8"?> <gpx> <name>Test © file</name> </gpx> -----cutting here may damage your screen surface----- I expect the command to show the same output for both files. This also affects parsing XHTML files, much to my dismay. (There was another bug that a © sign was extended into an endless stream of © but I cannot reproduce it any more, maybe that has gone away with the new libxml2…?) -- System Information: Debian Release: stretch/sid APT prefers unreleased APT policy: (500, 'unreleased'), (500, 'buildd-unstable'), (500, 'unstable') Architecture: x32 (x86_64) Foreign Architectures: i386, amd64 Kernel: Linux 4.5.0-2-amd64 (SMP w/4 CPU cores) Locale: LANG=C, LC_CTYPE=en_GB.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/lksh Init: sysvinit (via /sbin/init) Versions of packages xmlstarlet depends on: ii libc6 2.22-10 ii libxml2 2.9.3+dfsg1-1.2 ii libxslt1.1 1.1.28-4 xmlstarlet recommends no packages. xmlstarlet suggests no packages. -- no debconf information

