Re: HTML Stripping slower in Solr 1.4?

2009-12-05 Thread Koji Sekiguchi

Yonik Seeley wrote:

Is BaseCharFilter required for the html strip filter?

-Yonik
http://www.lucidimagination.com

  

It could be unnecessary if HTMLStripCharFilter were reverted to its
first version. With the first version of HTMLStripCharFilter, for
example, the input <p>aaa produced "   aaa" (3 space chars before
aaa). But since SOLR-1394 was committed, it produces " aaa"
(1 space), and it now uses the correct() method of BaseCharFilter
to correct offsets.
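
To make the difference between the two behaviours concrete, here is a minimal, self-contained sketch. It is not the HTMLStripCharFilter or BaseCharFilter source; the class OffsetSketch, its methods, and the simplistic tag detection (anything between < and >) are assumptions made purely for illustration. The first strategy pads each tag with spaces so output offsets equal input offsets. The second collapses each tag to one space and records how far later offsets have shifted, so an output offset can be mapped back to the input.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch only; not the actual Solr/Lucene implementation. */
final class OffsetSketch {

    /** Strategy 1: replace every character of a tag with a space.
     *  Output length equals input length, so no offset correction is needed. */
    static String stripPadded(String in) {
        StringBuilder out = new StringBuilder();
        boolean inTag = false;
        for (char c : in.toCharArray()) {
            if (c == '<') inTag = true;
            out.append(inTag ? ' ' : c);
            if (c == '>') inTag = false;
        }
        return out.toString();
    }

    /** Result of strategy 2: the shortened text plus a list of offset corrections. */
    static final class Collapsed {
        final StringBuilder text = new StringBuilder();
        // Each entry is {outputOffset, cumulativeDiff}: from outputOffset onward,
        // add cumulativeDiff to get the corresponding input offset.
        final List<int[]> corrections = new ArrayList<>();

        /** Hypothetical stand-in for an offset-correcting method: output offset to input offset. */
        int correct(int outputOffset) {
            int diff = 0;
            for (int[] c : corrections) {
                if (c[0] <= outputOffset) diff = c[1]; else break;
            }
            return outputOffset + diff;
        }
    }

    /** Strategy 2: replace each whole tag with a single space and record the shift.
     *  An unclosed tag is simply dropped in this sketch. */
    static Collapsed stripCollapsed(String in) {
        Collapsed r = new Collapsed();
        int cumulative = 0;
        int tagStart = -1;
        for (int i = 0; i < in.length(); i++) {
            char c = in.charAt(i);
            if (tagStart < 0 && c == '<') {
                tagStart = i;
            } else if (tagStart >= 0) {
                if (c == '>') {
                    int tagLen = i - tagStart + 1;
                    r.text.append(' ');
                    cumulative += tagLen - 1; // characters removed by this tag
                    r.corrections.add(new int[] {r.text.length(), cumulative});
                    tagStart = -1;
                }
            } else {
                r.text.append(c);
            }
        }
        return r;
    }

    public static void main(String[] args) {
        System.out.println("[" + stripPadded("<p>aaa") + "]");  // prints "[   aaa]"
        Collapsed c = stripCollapsed("<p>aaa");
        System.out.println("[" + c.text + "]");                 // prints "[ aaa]"
        System.out.println(c.correct(1));                       // prints "3"
    }
}
```

In the second strategy, the first "a" sits at offset 1 in the output but at offset 3 in the input, which is why a lookup of recorded corrections is needed on every offset query. The padded strategy avoids that lookup at the cost of emitting extra space characters.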

Koji

--
http://www.rondhuit.com/en/



Re: HTML Stripping slower in Solr 1.4?

2009-12-04 Thread Robin Wojciki
Thanks, Koji, for logging the ticket. I noticed its priority is set to
minor. Is there a workaround? I feel like I am only half as
productive, since every iteration takes twice as long.

Thanks
Robin

On Tue, Dec 1, 2009 at 11:47 AM, Koji Sekiguchi k...@r.email.ne.jp wrote:
 Robin,

 Thank you for reporting this. There may be a performance degradation
 in the HTML stripper in 1.4. I opened a ticket in Lucene:

 https://issues.apache.org/jira/browse/LUCENE-2098

 Koji

 --
 http://www.rondhuit.com/en/




Re: HTML Stripping slower in Solr 1.4?

2009-12-04 Thread Yonik Seeley
Is BaseCharFilter required for the html strip filter?

-Yonik
http://www.lucidimagination.com


On Tue, Dec 1, 2009 at 1:17 AM, Koji Sekiguchi k...@r.email.ne.jp wrote:
 Robin,

 Thank you for reporting this. There may be a performance degradation
 in the HTML stripper in 1.4. I opened a ticket in Lucene:

 https://issues.apache.org/jira/browse/LUCENE-2098

 Koji

 --
 http://www.rondhuit.com/en/