Marshall Schor wrote:
Ahmed Abdeen(Home) wrote:
Hello UIMA Developers,I am getting the following error when I run the
UIMA
Document Analyzer.
However, If I use the interactive mode it works fine. I can't specify
what
is the source file of this issue. I would appreciate any help.
Thanks,
Ahmed
Please see
http://incubator.apache.org/uima/downloads/releaseDocs/2.2.2-incubating/docs/html/tutorials_and_users_guides/tutorials_and_users_guides.html#ugr.tug.xmi_emf.xml_character_issues
It appears that some String data which is being serialized has invalid
character codes in it (from an XML viewpoint) - namely a x'00'. There
are several things you can do.
Please note that 0x0 is not just invalid XML, it's also invalid Unicode.
This sort of thing often happens when you're reading in a file with the
wrong endcoding, say a utf-16 file read in as utf-8. Or maybe what you're
reading in isn't a text file at all.
--Thilo