Hi!

I have a result field that contains a simple XML document. The field gets mangled on its way into the datamodel. It looks like the XML tag matcher in "StringChopper" is a bit too aggressive.

It treats the XML declaration (<?xml ...?>) and CDATA sections (<![CDATA[ ... ]]>) as ordinary tags, and this throws the tag matching off.

I'm working with sesat 2.17, and this change seems to fix it.
Can you verify and commit a fix?

Index: result-spi/src/main/java/no/sesat/search/result/StringChopper.java
===================================================================
--- result-spi/src/main/java/no/sesat/search/result/StringChopper.java (revision 6756)
+++ result-spi/src/main/java/no/sesat/search/result/StringChopper.java (working copy)
@@ -43,7 +43,7 @@

-    private static final Pattern openTag = Pattern.compile("<[^<]+>");
+    private static final Pattern openTag = Pattern.compile("<[^\\?!][^<]+>");
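
To illustrate the difference, here is a small standalone sketch. It is not the actual StringChopper code: the class name OpenTagDemo, the matches() helper and the sample document are made up for this mail, and only the two regular expressions come from the patch above.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical demo class, not part of sesat.
final class OpenTagDemo {

    // Pattern before the patch: anything between '<' and '>'.
    static final Pattern OLD_OPEN_TAG = Pattern.compile("<[^<]+>");

    // Pattern after the patch: the first character may not be '?' or '!'.
    static final Pattern NEW_OPEN_TAG = Pattern.compile("<[^\\?!][^<]+>");

    // Made-up sample: XML declaration, one element, one CDATA section.
    static final String SAMPLE =
            "<?xml version=\"1.0\"?><doc><![CDATA[some text]]></doc>";

    // Collects every substring of the input that the pattern matches.
    static List<String> matches(Pattern pattern, String input) {
        List<String> found = new ArrayList<>();
        Matcher m = pattern.matcher(input);
        while (m.find()) {
            found.add(m.group());
        }
        return found;
    }

    public static void main(String[] args) {
        System.out.println("old: " + matches(OLD_OPEN_TAG, SAMPLE));
        System.out.println("new: " + matches(NEW_OPEN_TAG, SAMPLE));
    }
}
```

With the old pattern the declaration and the CDATA section come back as matches together with <doc> and </doc>; with the new pattern only <doc> and </doc> remain. One caveat I noticed while testing: because [^<]+ still requires at least one character after the first one, a one-letter tag such as <b> would no longer match, so [^<]* might be the safer choice if such tags can occur.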

Thanks!

--
ENDRE MIDTGÅRD MECKELBORG
Seniorutvikler
Schibsted Søk AS
sesam.no

+47 930 14 504



_______________________________________________
Kernel-development mailing list
[email protected]
http://sesat.no/mailman/listinfo/kernel-development
