Searcher's bits of Fri, 26 Jul 2002 translated to:

>Another item I can't seem to find information on is multi word search results.
>I know we can use 'any' and 'all' but I'm wondering what is the command line
>method of getting the best result on multiple words.

If you use the boolean match method, you can use 'and', 'or', and
'not' to refine your query.

>For example, if I search for 'puppy dogs', how can I form my keyword statement
>in order to get only results that include both words as well as the normal

A query of   puppy and dogs   should only return pages in which
both words occur (assuming you are using the boolean match method).
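
As a rough sketch of what that looks like in practice, the snippet below
builds a search URL that asks htsearch for the boolean method. It assumes
the usual htsearch CGI parameters "words" and "method"; the host and CGI
path are made up for illustration, so substitute your own.

```python
from urllib.parse import urlencode

# Hypothetical location of the htsearch CGI; replace with your own.
HTSEARCH_URL = "http://www.example.com/cgi-bin/htsearch"


def boolean_query_url(expression, base=HTSEARCH_URL):
    """Build a URL that submits `expression` as a boolean query."""
    # "method" selects the match method, "words" carries the query text.
    params = {"method": "boolean", "words": expression}
    return base + "?" + urlencode(params)


print(boolean_query_url("puppy and dogs"))
# http://www.example.com/cgi-bin/htsearch?method=boolean&words=puppy+and+dogs
```

The same helper would take   puppy not dogs   or   puppy or dogs   if you
want to experiment with the other operators.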

>Also, is there some way of ensuring that if a result comes back from a certain
>site, that it does not come back many times? I see that very often, we get back

I am not sure if I understand the question. Do you have pages that
are showing up more than once in the results? Or are you trying to
arbitrarily limit the number of hits from a particular server? If
the problem is one of duplicates, see
http://www.htdig.org/FAQ.html#q4.24  If not, then you might take a
look at the restrict and exclude attributes (assuming there is
something in the URLs on which you can filter). If you need to
remove hits in a more or less arbitrary manner, I think you would
need to write some sort of wrapper.
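
To give an idea of what such a wrapper might do, here is a minimal sketch
that caps the number of hits kept per server. It is not part of ht://Dig;
it assumes you have already pulled the result URLs, in ranked order, out of
the htsearch output, and the function name and the limit of two per host
are just illustrative choices.

```python
from collections import Counter
from urllib.parse import urlsplit


def cap_hits_per_host(urls, max_per_host=2):
    """Keep ranked order, dropping hits beyond max_per_host for any host."""
    seen = Counter()  # number of hits kept so far, per host name
    kept = []
    for url in urls:
        host = urlsplit(url).hostname or ""
        if seen[host] < max_per_host:
            kept.append(url)
            seen[host] += 1
    return kept


hits = [
    "http://a.example.com/1",
    "http://a.example.com/2",
    "http://a.example.com/3",
    "http://b.example.com/1",
]
print(cap_hits_per_host(hits))
# ['http://a.example.com/1', 'http://a.example.com/2', 'http://b.example.com/1']
```

Because the filtering happens after the search, the total match count that
htsearch reports would no longer agree with what the wrapper displays.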

Jim



_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
