According to Russell Howe:
> I was looking for a solution to a problem I suppose is semi-specialist.
>
> I have been asked about producing a search front-end for an archive of
> html files and media clips - nothing too complex there. The problem lies
> in the searching - it needs to accept keywords in several languages,
> translate the keywords to matching English words and then perform the
> search using those words.
>
> Is this possible with htdig? It would be fairly simple I suppose to
> wrap something around the actual submitting of the search request,
> give it the terms and a language and have it look up the words in a
> database, but was wondering if it had already been done.
>
> I'm not sure if the languages being looked at stray outside of the
> ISO-8859-1 character set, or if this would even be a problem.
>
> Could any replies be CC'ed to me, since I'm not on the list and may
> possibly miss replies when I look through the list archives.
If it's been done, I haven't heard about it. I imagine you could use the
synonyms fuzzy match algorithm to implement something like this. You'd
need to define sysnonyms dictionaries that define all the equivalences
you want between English words and those of other languages. The user's
language selection would chose which config file is used, and the config
file for a particular language would select the synonyms dictionary you
defined for that language. Either that or you could combine all your
dictionaries into one and not bother with language selection.
As for non-ISO-8859-1 character sets, I don't know if this would be
a problem or not. I assume all the indexed documents are English, so
the accented characters would only come up in user input. I'm not
sure how those would end up being encoded by the browser and passed
to the CGI program. htsearch expects 8-bit characters in the character
set defined for its locale setting.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html