On Mon, 04 Nov 2002 at 02:54:42 +0300, Kir Kolyshkin wrote:

> Francesco Caccavella wrote:
> >At 19.46 02/11/2002 -0500, you wrote:
> >
> >>Is there a way I can ask searchd to ignore the HTML title and the META
> >>keywords and description when performing a search?
> >>
> >>I've configured my system to use tl=off, ds=off, and kw=off but that
> >>doesn't seem to be working.
> >>
> >>Thanks!
> >
> >
> >As I suggested between the lines a few messages ago
> >(http://forum.aspseek.org/index.php?t=msg&goto=1472), in the next
> >release (when? ;-) it would be useful to disable (or simply reduce) the
> >weight of the meta keywords and description in the results (directly in
> >aspseek.conf or searchd.conf). Google ignores these meta tags too.
> 
> Probably a better idea would be to implement two "custom"
> meta-tags that could be used instead of "Keywords" and
> "Description" by users who want them, and have an additional
> option in either searchd.conf or s.htm's "variables" section
> to specify the defaults for which fields are to be used.
> Matt, what's your opinion?

Yes, I think there would be some benefit in this.

As I see it, this comes back to the "href text" modifications.  I had a
working version of this some months back, but it was put on hold while I
rewrote libaspseek and the PHP module, and I haven't come back to it
yet.  I think that was at version 1.2.6, so I have some work to do to bring
the patch up to date.

I had also started to implement domain name words using a slightly
modified ternary tree of dictionary words to decompose the domain string
into words.
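
To make the domain decomposition idea concrete, here is a minimal Python
sketch. It is not the actual patch: it uses a plain (unmodified) ternary
search tree, and the names TernaryTree, split_label and domain_words, the
fewest-words rule for picking a split, and the tiny word list are all
assumptions made for illustration.

```python
class Node:
    __slots__ = ("ch", "lo", "eq", "hi", "is_word")

    def __init__(self, ch):
        self.ch = ch
        self.lo = self.eq = self.hi = None
        self.is_word = False


class TernaryTree:
    """Plain ternary search tree holding the dictionary words."""

    def __init__(self, words=()):
        self.root = None
        for word in words:
            self.insert(word)

    def insert(self, word):
        if word:
            self.root = self._insert(self.root, word, 0)

    def _insert(self, node, word, i):
        ch = word[i]
        if node is None:
            node = Node(ch)
        if ch < node.ch:
            node.lo = self._insert(node.lo, word, i)
        elif ch > node.ch:
            node.hi = self._insert(node.hi, word, i)
        elif i + 1 < len(word):
            node.eq = self._insert(node.eq, word, i + 1)
        else:
            node.is_word = True
        return node

    def prefix_ends(self, text, start):
        """Yield each end index e where text[start:e] is a dictionary word."""
        node, i = self.root, start
        while node is not None and i < len(text):
            ch = text[i]
            if ch < node.ch:
                node = node.lo
            elif ch > node.ch:
                node = node.hi
            else:
                i += 1
                if node.is_word:
                    yield i
                node = node.eq


def split_label(tree, label):
    """Return the fewest-word split of label, or None if it cannot be split."""
    n = len(label)
    best = [None] * (n + 1)  # best[i] = fewest-word split of label[:i]
    best[0] = []
    for i in range(n):
        if best[i] is None:
            continue
        for end in tree.prefix_ends(label, i):
            candidate = best[i] + [label[i:end]]
            if best[end] is None or len(candidate) < len(best[end]):
                best[end] = candidate
    return best[n]


def domain_words(tree, host):
    """Collect dictionary words from every label of a host name."""
    words = []
    for label in host.lower().split("."):
        parts = split_label(tree, label)
        if parts:  # skip labels that are not fully covered by the dictionary
            words.extend(parts)
    return words
```

With a dictionary containing "search" and "engine", calling
domain_words(tree, "www.searchengine.com") would give those two words; labels
such as "www" and "com" are dropped unless they are in the dictionary.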

Currently there is no space to support custom tags within the word indexes
(all combinations of the 2 bits assigned for this are used).  However, with
the href text modifications this will be extended to 4 bits, of which the
msb will indicate "Body", leaving 3 usable bits (8 slots).  Of these, 5 will
be assigned to "Title", "Description", "Keywords", "HREF Text" and "Domain
Words", leaving 3 unused slots.
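
The slot arithmetic can be checked with a small Python sketch of such a 4-bit
field. The numeric codes, the Section enum and the encode/decode helpers are
assumptions for illustration only; the real index format may assign the values
differently.

```python
from enum import IntEnum

BODY_FLAG = 0b1000     # msb: the word occurred in the body text
SECTION_MASK = 0b0111  # low 3 bits: 8 possible section slots


class Section(IntEnum):
    # Numeric values are illustrative assumptions, not ASPseek's real codes.
    TITLE = 0
    DESCRIPTION = 1
    KEYWORDS = 2
    HREF_TEXT = 3
    DOMAIN_WORDS = 4
    # Codes 5, 6 and 7 stay free, e.g. for custom meta tags.


def encode(section=None):
    """Return the 4-bit field: BODY_FLAG for body text, else the section code."""
    if section is None:
        return BODY_FLAG
    return int(Section(section))


def decode(field):
    """Return "BODY", a Section member, or None for an unassigned slot."""
    if field & BODY_FLAG:
        return "BODY"
    try:
        return Section(field & SECTION_MASK)
    except ValueError:
        return None


def unused_slots():
    """List the section codes that have no assignment yet."""
    used = {int(s) for s in Section}
    return [code for code in range(SECTION_MASK + 1) if code not in used]
```

Under these assumed codes, unused_slots() lists the three codes still
available, which is where two custom meta-tags could fit.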

I agree that the order of relevance and the overall weight of these should
be configurable.

I'd like to get the href text additions into the devel tree but I'm not
sure of a timeframe for this yet.

I'm cross-posting this to aseek-devel since I think the thread should be
continued there.


Matt.
