On Friday 14 February 2003 11:14, Adam Brown wrote:
> Hi,
>
> I am indexing this page using Htdig 3.1.6:
> http://wire.org.au/information/violence/domestic/womens_stories/one_rural_womans_story.html
> The page contains the words "woman's" and "womans" but not "woman".
>
> The search page is located at: http://wire.org.au/public_search.html
>
> When I search for "rural woman's" or "rural womans" I get no hits. However
> when I search for "woman" the page is returned.
>
> My understanding is that, with the default Htdig settings, "woman's"
> gets indexed as "womans". So surely a search for "womans" should be
> successful.
>
> Can anyone shed any light on this problem?
>
> thanks
>
> Adam
>
>

Researching further:

Output from htdig -vvvv indicates that the word "woman" is indexed, not
"womans".

A search for "women's" (note the e) returns a hit. I looked in the ispell 
dictionary file english.0 and the listings for the two words are:
woman/MY
women/MS

Is it the case that Htdig reduces the search word "woman's" to "womans" which 
doesn't register a hit because "woman" is recorded in the database and 
"womans" is not a valid extension of "woman"?

I use the setting:
valid_punctuation: .-_/!#$%^&'()
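
To make the suspected mismatch concrete, here is a minimal Python sketch of
the hypothesis above. It is not Htdig's actual code: the function names and
the one-word index are made up for illustration, and it assumes only that
characters listed in valid_punctuation are deleted from a search word before
lookup.

```python
# Hypothetical model of the suspected behaviour, not Htdig's implementation.
# Assumption: characters in valid_punctuation are deleted from a word.
VALID_PUNCTUATION = ".-_/!#$%^&'()"

# Assumption: the index holds "woman", as the htdig -vvvv output suggests.
indexed_words = {"woman"}


def strip_valid_punctuation(word, punctuation=VALID_PUNCTUATION):
    """Lowercase the word and drop every character found in punctuation."""
    return "".join(ch for ch in word.lower() if ch not in punctuation)


def matches(query_word):
    """Return True if the normalised query word is present in the index."""
    return strip_valid_punctuation(query_word) in indexed_words


for query in ("woman", "womans", "woman's"):
    print(query, "->", strip_valid_punctuation(query), matches(query))
```

Under these assumptions both "womans" and "woman's" normalise to "womans",
which is not in the index, while "woman" matches. That is consistent with
what I am seeing, but it does not explain why "woman" was stored in the
first place.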

Need help with this.

thanks

Ad



_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
