Sorry, I have no pointers to a word doc parser.
-- Michael
vijay vijay wrote:
thnak u
is there any references from ur side.i think i might me
troubleing you. i have been stuck with this almost for 2 weeks .thats why i
have posted this to u.can u tell me what kind of parser we need.
actaulluy i have tried with
resp.setContentType("application/binary");
resp.setHeader("Content-Disposition", "attachment; filename=\"" +
fname.getName() + "\";");
and
response.setContentType("application/vnd.ms-word");
still i got the same result.
vijay
On 10/17/07, Michael Baessler <[EMAIL PROTECTED]> wrote:
No, you need a special parser to parse the content of the word file to
extract the plain text.
When you only replace the files you will get confused results.
-- Michael
vijay vijay wrote:
Hi michael
i have seen so many postings from u. my problem is can i
keep
word files in place of text files in input directory.
On 10/17/07, Michael Baessler <[EMAIL PROTECTED]> wrote:
I don't understand your problem, so I can't help you. I'm also not sure
if you talk about UIMA problems.
-- Michael
vijay vijay wrote:
HI Michael,
can u help me on my topic if u give me some urls also
no
problem.i have not been posting because i don't recive any
replies.todayout
of curiosity i have posted.
i have sucessfully getting the results for uima as web
application. i am able to look for strings dynamically(text).here in
place
of text i have given word doc then problem started coming. it is
reading
only test from it and if u have table and figures which are not
recognized.ihave used poi concept here and converted the word doc into
text file then i
done the search same thing is repeted.
so can u help me here.......michael
vijay
On 10/17/07, Michael Baessler <[EMAIL PROTECTED]> wrote:
vijay vijay wrote:
Hi
i have done one sample example UIMA as Web Application.here i
have
taken reference from ExampleApplication.it is working fine with text
files.
i am able to see the result dynamically.so i have taken one step
further
by
taking word in place of text .here the problem is it is recognizing
the
word
doc and giving result but not able to get the tables and screens.
here insted of taking word directley i have used poi and conveted in
to
text
passed the out put to my annotation.here also i am getting the same
problem.
so if u want to look for word docs do we need to use other
techniques.canany one help me here
vijay
Again, do not post the same question to both UIMA lists!
-- Michael