So, as you say, special characters are not supported directly by Xindice. Might Xindice change this requirement in the future to allow their use?
I'll start working on your solutions, thanks.
Regards
There are a few ways to do this, but none is truly easy. You can (1) take the entity files from XHTML and muck with them in vi to turn them into a sed script*; (2) normalize the files with James Clark's SP tools (www.jclark.com); or (3) perhaps use the Xerces 2 parser feature "setExpandEntityReferenceNodes" when you parse your documents (in validation mode, with the correct DTD available). That last option also means your documents must be valid XHTML, which is a good thing but might be a pain to fix if they're not.
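
For option (3), here is a rough sketch of the idea, not a tested recipe for Xindice. It makes several assumptions: it uses the standard JAXP call DocumentBuilderFactory.setExpandEntityReferences rather than the Xerces-specific setting named above, it declares a single entity in an internal DTD subset instead of loading the real XHTML DTD, and it does not turn on validation. The class name EntityExpansionSketch and the helper expand are made up for illustration.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical helper: parse a document and let the parser replace
// entity references (such as &eacute;) with the characters they stand for.
public class EntityExpansionSketch {

    static String expand(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // Standard JAXP switch: expand entity references while building the DOM
        // instead of leaving EntityReference nodes in the tree.
        factory.setExpandEntityReferences(true);
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));
        // Text content of the root element, with entities already resolved.
        return doc.getDocumentElement().getTextContent();
    }

    public static void main(String[] args) throws Exception {
        // Assumption: the entity is declared inline so the example needs no
        // external DTD; real XHTML files would reference the XHTML DTD instead.
        String xml = "<!DOCTYPE p [<!ENTITY eacute \"&#233;\">]>"
                   + "<p>caf&eacute;</p>";
        System.out.println(expand(xml));
    }
}
```

Once the entities are resolved like this, the document could be serialized again and stored, since the result contains the actual characters rather than named references.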
