Hi guys,

I've been banging my head on a wall with this for the last 6 hours.

Time to ask for help!

I have an intranet web site (static) built by a script from a database.

I'm using htdig to allow user searches from each page on a hierarchical
structure.

As structure gets deeper (nearer the end page) the search options are
restricted by using <input type=hidden name=keywords value=""> in the
search form (Clients request!) where value is a list of categories from
the structure e.g. heading1, heading2, heading3, etc.  At the top of the
tree search results must include heading 1 AND whatever the user enters,
at level 2 results must include heading1 AND heading2 AND whatever the
user enters etc.

The search is further restricted by using exclude and restrict values to
return search results from within the search origin page's directory.

(I hope this is making sense)

Now, what's happening is this:

Searching for known pages at a *deep* level (i.e. large number of
*keywords* added to search string)
sometimes works and sometimes doesn't.

I've checked the number of documents against the number of documents
given by -s in htmerge and they match, so I know the dig is working.

The only thing I can see that might be at the root of the problem is
that, since the *keyword* list is built by the scripting program from
field data, punctuation and *bad words* are getting in. ( ' , - the, in,
etc.)  Could this be the answer?  If so why? since when searching
without restrictions adding punctuation to my keyed in words does not
fail.


Here's an example:

<input type=hidden name=keywords value="Sandwell Cradley Heath Everyday
Life Houses and Homes"> at the search form works fine.

<input type=hidden name=keywords value="Sandwell Cradley Heath Everyday
Life Leisure, Sport and Pastimes"> at the search form doesn't

and this:

<form method="post" action="/cgi-bin/htsearch">
<input type=hidden name=method value=All>
<input type=hidden name=format value=builtin-long>
<input type=hidden name=sort value=score>
<input type=hidden name=config value=htdig>
<input type=hidden name=restrict value="/exhib1/">
<input type=hidden name=exclude value="">
<input type=hidden name=keywords
value="Made In The Black Country - Cars">
<input type="text" size="22" name="words" value="">
<input type="submit" value="Search">
</form>

doesn't work at all!

I'm hoping someone will understand what I'm on about and have a sensible
answer.

Am I right in thinking that keywords added in this way follow different
rules to words entered directly into a search box, if so how do I
configure htdig to ignore punctuation and bad words and allow the search
to proceed normally?

The important thing is that all the words added to keywords appear in
the searched for document - is htdig or htmerge slinging them out or is
it htsearch that refuses to allow them?



Thanks in advance.

Kind regards

Tony







_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to