Bill Moseley wrote:
At 04:55 PM 03/26/02 +0800, Stas Bekman wrote:

OK, how about this:
When indexing, strip all the HTML and replace \n with whatever swish-e can store.


You mean replace \n with some flag (e.g. '#$%$#') only in <pre> sections?
\n in HTML shouldn't matter.

You suggested %0A in the previous email, but I was actually thinking of having all sentences separated with <br> rather than thrown together, no matter whether it's HTML or <pre>.
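
To make the indexing step concrete, here is a rough sketch of "strip the HTML, keep the line breaks as a marker". It is written in Python purely for illustration, not in Perl and not against any swish-e API. The marker value "%0A" is taken from the suggestion above; the names NEWLINE_MARKER and to_indexable_text are hypothetical, and whether swish-e would store such a marker unchanged is an assumption.

```python
from html.parser import HTMLParser

# Assumption: a token the indexer stores verbatim in place of "\n".
NEWLINE_MARKER = "%0A"


class _TextExtractor(HTMLParser):
    """Collects only the text nodes of a document, dropping every tag."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def to_indexable_text(html):
    """Strip tags and join the non-empty lines with NEWLINE_MARKER."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    text = "".join(parser.chunks)
    # Trailing whitespace and blank lines are dropped; leading indentation
    # (which matters inside <pre>) is kept.
    lines = [line.rstrip() for line in text.splitlines()]
    return NEWLINE_MARKER.join(line for line in lines if line.strip())
```

This treats plain HTML and <pre> content the same way, which matches the "no matter whether it's HTML or <pre>" idea; treating <pre> specially would need extra state in the parser.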


When presenting results display:

- N "short sentences" per hit
- enclosed in <pre></pre>
- and \n restored

This should make it perfect, both speed- and usability-wise, if it is doable. It will also improve the speed of highlighting, since it'll be plain text.
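
The display side could look roughly like the sketch below (Python, for illustration only). It assumes the stored description uses "%0A" as the newline marker and treats each stored line as one "short sentence"; render_hit and max_lines are hypothetical names, and the matching is a naive case-insensitive substring test rather than anything swish-e provides.

```python
from html import escape

# Assumption: the same marker that was written at indexing time.
NEWLINE_MARKER = "%0A"


def render_hit(stored_text, query, max_lines=3):
    """Return up to max_lines matching lines, newlines restored, in <pre>."""
    lines = stored_text.split(NEWLINE_MARKER)
    needle = query.lower()
    matching = [line for line in lines if needle in line.lower()]
    snippet = "\n".join(matching[:max_lines])
    # The stored text is plain text, so it must be escaped before it is
    # placed back into an HTML page.
    return "<pre>" + escape(snippet) + "</pre>"
```

Highlighting the matched word would then be one more pass over the escaped snippet.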


It won't gain any speed, since highlighting currently only works with plain
text anyway.  The trade-off may be better readability, but a loss of compactness.

It might be fun to try -- wouldn't be that hard...thanks to Perl.

cool ;)

__________________________________________________________________
Stas Bekman            JAm_pH ------> Just Another mod_perl Hacker
http://stason.org/     mod_perl Guide ---> http://perl.apache.org
mailto:[EMAIL PROTECTED] http://use.perl.org http://apacheweek.com
http://modperlbook.org http://apache.org   http://ticketmaster.com

