I'm working on the same project too...

I studied the Source And I have the follovig idea:

Let the user choose if he wants or not to use the ISO latin support - so
let us put a 2 new variables in the ~/htdig/htcommon/defaults.cc (h)
file.

The one shoud be a boolean var: USE_ISO_LATIN the other the file name of
the caharacter extensions:

That file shoud be similar to htdig.conf and should contain lines like
this:

a:    a'|e'|

Well - the latin characters separated by a | sign.

And then let us generate a nev config type variable in
~/htdig/htdig/htdig.cc which shoud read that char_equiv file and let us
build a table with name Spec_Char with this config variables (Read the
Configuration.h in htlib dir).

The names will be standard english characters and the values will be the
latin and :qother char_equiv with the respective english char.

And after the building this table, using something like this:

external  Configuration    Char_Equiv

Char_Equiv.Defaults();
Char_Equiv.Read(CHAR_EQUIV_FILE)

we should use the the followings:

StringMatch.Findfirst(Char_Equiv[a])

If the StringMatch returns with a found character we can figure out
using the "which" variable defined in StringMatch.h which character or
characters shoud we change to std english chars. in the respective word.

In this way all the special characters should be changed to the std
english chars - and more. We can make our own rules - lets say if the
user starts a searching containing somthing like this  "va'lue" we shoud
find the results for "value" too (We change a' to a)

For making this - as I realised from the source - we need to use the
Configuration.h (respective cc) to add a new config type object, to
standard htdig config objects.

We need to to use the followings:

Configuration.h (cc)
StringMatch.h (cc)
defaults.h (cc)
htdig.h (cc)

I coudn't figure out if this kind of replacing the characters or
character sets will affect the words parsed with external parser too...

We should keep in touch about this problem :)

Best reagrds:

Marton Lorand


------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.

Reply via email to