#1185: HTML meta tag charset detection never matches strings without quotation
marks
--------------------+-------------------------------------------------------
 Reporter:  david   |       Owner:  david
     Type:  defect  |      Status:  new  
 Priority:  normal  |   Milestone:  1.0.2
Component:  filter  |     Version:  1.0.1
 Severity:  normal  |    Keywords:       
Has_patch:  0       |  
--------------------+-------------------------------------------------------
 There is a problem in
 
[source:tags/1.0.2RC1/src/filter/agaviformpopulationfilter.class....@4299#l215]:

  * {{{text/html; charset="UTF-8"}}} matches this pattern
  * {{{text/html; charset=UTF-8}}} never does, because the branch with the
 lookahead assertion {{{(?=[;\s])}}} doesn't match the end of subject;
 should probably be changed to something like {{{($|(?=[;\s]))}}}

 It seems however that current versions of libxml always produce a document
 with the {{{encoding}}} property, likely by looking at an HTML document's
 {{{<meta http-equiv="Content-Type" ... />}}} header (which the above code
 reads) even in XML parsing mode.

 This ticket is related to #1183 (I discovered this issue while working on
 a fix for that one)

-- 
Ticket URL: <http://trac.agavi.org/ticket/1185>
Agavi <http://www.agavi.org/>
An MVC Framework for PHP5



_______________________________________________
Agavi Tickets Mailing List
[email protected]
http://lists.agavi.org/mailman/listinfo/tickets

Reply via email to