Bug#750946: libhtml-html5-parser-perl: UTF-8 character breaks parse_file

2017-08-07 Thread Gregory Williams
On Aug 7, 2017, at 8:26 AM, gregor herrmann wrote:
>
> This looks indeed much better than my crude workarounds, thanks for
> that!
>
> Do you think you can take this up with upstream?

Yes, I think Kjetil and I can work on getting this merged upstream.

Thanks,
Greg

Bug#750946: libhtml-html5-parser-perl: UTF-8 character breaks parse_file

2017-08-06 Thread Gregory Williams
On Sat, 5 Aug 2017 12:16:04 -0400 gregor herrmann wrote:
> What helps is:
> - replace in lib/HTML/HTML5/Parser.pm
>   $response->{decoded_content} with $response->{content}
>   which feels a bit dangerous
> - or in lib/HTML/HTML5/Parser/UA.pm's get:
>   move the
>   if ($uri =~