On 29 July 2010 14:29, Mike Sokolov <[email protected]> wrote:
> Yes it's a character issue; unfortunately these really aren't combining
> marks: they are separate characters (that look like apostrophes) intended to
> indicate stress in pronunciation (eg: re'bate), but we want to ignore them
> for the purposes of search.
>
> The best idea I have now is to mark up as:
>
> <w display="re'bate">rebate</w>
>
> so we search the text and display the attribute, but I was hoping to find a
> solution that didn't rely on changing the documents, if possible.  A long
> shot, but you never know :)
>
> -Mike

Any good writing your own collation Mike?
I'm wondering how it might be placed to make it 'low' visibility,
but your problem seems to be you want it 'out of the way' for earch,
as your markup shows.

Yuk. Use one version for search, another for presentation?
Would remove the extra markup? Just as dirty though.

Can Lucene do it might be a fair question?

Sorry, out of further ideas.


-- 
Dave Pawson
XSLT XSL-FO FAQ.
Docbook FAQ.
http://www.dpawson.co.uk
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to