On 03/17/2011 02:51 PM, Peter Desjardins wrote:
Hi. I am fielding some questions about the search behavior in the
webhelp output. Is there an explanation of the behavior available
somewhere?

Specifically, I need to understand:

* How substrings are handled. Why does "locale" match "localeString"
but "crea" doesn't match "create"?

* Is there a way to search for strings that contain special characters
like periods. Can I search for "foo.bar" by escaping the period? Can I
remove the period from the list of special characters?

Thanks for your help. I have turned off stemming in case that matters.

Peter Desjardins

Hi Peter,
Good point. That had occurred to me before, but then I forgot to put it in the feature list, so I've just added the following to the feature list[1]:

"Cause indexer not to tokenize punctuation if the punctuation isn't followed by a space. So don't tokenize "build.xml" into "build" and "xml". Instead just enter it as "build.xml". Evaluation how to handle things that look like paths "foo/bar/baz.xml" Should we tokenize into foo, bar, and baz.xml or not or put both in the index? "

So we should tokenize "build. xml" but not "build.xml". Unfortunately, I can't suggest any workaround at the moment.

Thanks,
David

[1] http://docbook.xmlpress.net/tiki-index.php?page=WebHelp2011


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to