I have pages that I am trying to index and search, which contain accented
characters written as HTML entities. One example word would be
Schrödinger
That is, "Schrodinger" with an umlaut on the o.

My htdig.conf file contains the settings:
locale: en_US
search_algorithm:       exact:1 accents:1 synonyms:0.5 endings:0.1
and I run "htfuzzy accents" at the end of the rundig script. 

I was hoping that this would let a search for "Schrodinger"
(i.e. without the o-umlaut) match all occurrences of the above word.
However, it returns no results. If I search for "dinger" instead,
I do get the results I wanted (and more). It looks as though the indexer
has broken the word at the HTML entity.
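
To make the difference concrete, here is a minimal Python sketch of the behaviour I am seeing versus the behaviour I expected. This is not ht://Dig code. The function names are made up for illustration, and the `&ouml;` spelling of the entity is only an assumption about how the page source might look.

```python
import html
import re
import unicodedata

# Runs of letters only (Unicode-aware); digits and underscores are excluded.
WORD_RE = re.compile(r"[^\W\d_]+")

def tokenize_raw(source):
    """Split the page source into words WITHOUT decoding entities first.
    The '&' and ';' act as word separators, so the word is broken up."""
    return WORD_RE.findall(source)

def fold_accents(word):
    """Strip diacritics: decompose, then drop the combining marks."""
    decomposed = unicodedata.normalize("NFKD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def tokenize_decoded(source):
    """Decode entities first, then split into words and fold accents."""
    text = html.unescape(source)
    return [fold_accents(word).lower() for word in WORD_RE.findall(text)]

source = "Schr&ouml;dinger"  # assumed entity spelling, for illustration only
print(tokenize_raw(source))      # ['Schr', 'ouml', 'dinger']
print(tokenize_decoded(source))  # ['schrodinger']
```

The first result would explain why "dinger" matches and "Schrodinger" does not; the second is what I hoped the accents algorithm would give me.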

Does anyone have any ideas as to how I can get this working?
Thanks
Andy


_______________________________________________
ht://Dig general mailing list: <htdig-general@lists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general
