On Fri, 23 Feb 2001, Mike wrote:
> >If you have specific concerns, it's helpful to know what they are.
> >Certainly the length of the excerpts shown can be varied easily:
> >
> ><http://www.htdig.org/attrs.html#excerpt_length>
>
> Geoff, I've tried these and they didn't work for me. Unless I am supposed
> to fully rebuild the indexes again using those options? I've only tested
> these after the results were completed with the htsearch tool.
If an attribute applies to htdig, you will usually need to rebuild the
database to get it to "take effect." If an attribute applies to htsearch,
then you will not. An example of "didn't work" is usually helpful--do you
see that the excerpts shown are larger than specified by the
excerpt_length attribute?
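
As a rough sketch of the distinction, here is how the two kinds of
attributes might sit in a config file. The values are purely illustrative,
and max_head_length is just my pick as an example of an htdig-side
attribute; the attribute reference says which program uses each one.

```
# Used by htsearch at query time: a change shows up on the next search,
# with no need to rebuild anything.
excerpt_length: 200

# Used by htdig while indexing: a change only takes effect after the
# database is rebuilt.
max_head_length: 10000
```
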
> The problem is that some of the public sites that I index are just
> nuts. They have HUGE amounts of text in their <head> sections and
> <meta> sections.
This should not be a "problem" with how htsearch displays results.
> The second problem is the number of similar finds, there are simply
> too many of them. Often, there might be page after page after page of
> the exact same site in the results. I've not been able to fix that
> yet.
The exact same *site* or the exact same *URL*? If it's the former, then
you'll probably want to tinker with the scoring factors. (There's also a
patch to adjust scores based on URL patterns.) If it's the latter, then
it's a bug.
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/