#1185: HTML meta tag charset detection never matches strings without quotation
marks
--------------------+-------------------------------------------------------
Reporter: david | Owner: david
Type: defect | Status: new
Priority: normal | Milestone: 1.0.2
Component: filter | Version: 1.0.1
Severity: normal | Keywords:
Has_patch: 0 |
--------------------+-------------------------------------------------------
There is a problem in
[source:tags/1.0.2RC1/src/filter/agaviformpopulationfilter.class....@4299#l215]:
* {{{text/html; charset="UTF-8"}}} matches this pattern
* {{{text/html; charset=UTF-8}}} never does, because the branch with the
lookahead assertion {{{(?=[;\s])}}} doesn't match the end of subject;
should probably be changed to something like {{{($|(?=[;\s]))}}}
It seems however that current versions of libxml always produce a document
with the {{{encoding}}} property, likely by looking at an HTML document's
{{{<meta http-equiv="Content-Type" ... />}}} header (which the above code
reads) even in XML parsing mode.
This ticket is related to #1183 (I discovered this issue while working on
a fix for that one)
--
Ticket URL: <http://trac.agavi.org/ticket/1185>
Agavi <http://www.agavi.org/>
An MVC Framework for PHP5
_______________________________________________
Agavi Tickets Mailing List
[email protected]
http://lists.agavi.org/mailman/listinfo/tickets