Hello all,

As I am new to the list I will briefly explain my situation. I work for small university project that is creating a fully bilingual Canadian English and French dictionary. The project started in the 1980`s and is currently still done in SGML. I have completed the conversion to XML and I have now planning on using the Xindice and Cocoon combination to search the XML documents and make them available on the Internet.
As I am sure most of you know the 1.0 release totally garbled all special chars including the letters with french accents. The CVS (last friday May 24) version didn`t solve the problem but the patch from
http://lambiek.amplexor.be/downloads/xindice-utf8-patches
outputs the special chars in UTF-8. Thank you very much for your work.
Some additional observations.
1) The last command in the patch failed. Probably not important.
2) The Xindice-HTTP-0.8 package does not like UTF-8 characters.
I also have a question. Xindice resolves all entities when storing document. Would it not be possible to store the unresolved entity? XPath queries would resolve the entity during a search but I could retrieve the document with the entities unresolved and then let the browser or whatever worry about the display.
Again thank you for your time.


Konrad Scherer



Reply via email to