vijay vijay wrote:
Hi,

I have added a Word file as input to my analysis engine, and I am
getting an error like this:
"org.xml.sax.SAXParseException: Character reference "&#17" is an invalid
XML character".

How can I solve this error? The input contains both a TXT file and a DOC
file. The TXT file gives a result, but the DOC file throws the error
shown above.

Can anyone guide me here?

vijay

You first have to parse the Word document and extract its text content before you can analyze it. UIMA doesn't provide a Word file parser out of the box, so you have to write one yourself.
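
For context on the exception: "&#17" is the control character U+0011, which XML 1.0 does not allow, and raw bytes from a binary .doc file can easily contain such characters. The sketch below is not a Word parser and does not replace the extraction step. It only shows, as a defensive measure, how already-extracted text could be stripped of characters that are illegal in XML 1.0 before it is handed to the analysis engine. The class and method names (XmlTextSanitizer, stripInvalidXmlChars) are made up for this illustration and are not part of UIMA. For the extraction itself, a library such as Apache POI is one option, which is not shown here.

```java
// Hypothetical helper, not part of UIMA: removes characters that the
// XML 1.0 specification does not allow in a document.
final class XmlTextSanitizer {

    // True if the code point matches the XML 1.0 "Char" production:
    // #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    static boolean isValidXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Returns a copy of the text with all invalid XML 1.0 characters dropped.
    static String stripInvalidXmlChars(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (isValidXmlChar(cp)) {
                out.appendCodePoint(cp);
            }
            // Advance by one or two chars, depending on the code point.
            i += Character.charCount(cp);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // U+0011 is the character behind the "&#17" reference in the error.
        String extracted = "Hello\u0011 World";
        System.out.println(stripInvalidXmlChars(extracted)); // prints "Hello World"
    }
}
```

Dropping the characters is only one possible policy. Replacing them with a space would keep character offsets stable, which may matter if annotations are later mapped back to the original text.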

-- Michael
