Re: HTML Stripping slower in Solr 1.4?
Yonik Seeley wrote: Is BaseCharFilter required for the html strip filter? -Yonik http://www.lucidimagination.com It could be if HTMLStripCharFilter is reverted to first version. The first version of HTMLStripCharFilter, for example, if we have paaa, it producesaaa (3 space chars prior to aaa). But after committed SOLR-1394, it produces aaa (1 space) and now it uses correct() method of BaseCharFilter to correct offsets. Koji -- http://www.rondhuit.com/en/
Re: HTML Stripping slower in Solr 1.4?
Thanks Koji for logging the ticket. I noticed its priority is set to minor. Is there any work around? I feel like I am being half as productive as every iteration is taking twice as much time. Thanks Robin On Tue, Dec 1, 2009 at 11:47 AM, Koji Sekiguchi k...@r.email.ne.jp wrote: Robin, Thank you for reporting this. Performance degradation of HTML Stripper could be in 1.4. I opened a ticket in Lucene: https://issues.apache.org/jira/browse/LUCENE-2098 Koji -- http://www.rondhuit.com/en/
Re: HTML Stripping slower in Solr 1.4?
Is BaseCharFilter required for the html strip filter? -Yonik http://www.lucidimagination.com On Tue, Dec 1, 2009 at 1:17 AM, Koji Sekiguchi k...@r.email.ne.jp wrote: Robin, Thank you for reporting this. Performance degradation of HTML Stripper could be in 1.4. I opened a ticket in Lucene: https://issues.apache.org/jira/browse/LUCENE-2098 Koji -- http://www.rondhuit.com/en/