Sorry, I have no pointers to a word doc parser.

-- Michael

vijay vijay wrote:
thnak u
                is there any references from ur side.i think i might me
troubleing you. i have been stuck with this almost for 2 weeks .thats why i
have posted this to u.can u tell me what kind of parser we need.

actaulluy i have tried with
resp.setContentType("application/binary");
    resp.setHeader("Content-Disposition", "attachment; filename=\"" +
fname.getName() + "\";");
   and

response.setContentType("application/vnd.ms-word");

still i got the same result.

vijay
On 10/17/07, Michael Baessler <[EMAIL PROTECTED]> wrote:
No, you need a special parser to parse the content of the word file to
extract the plain text.
When you only replace the files you will get confused results.

-- Michael

vijay vijay wrote:
Hi michael
               i have seen so many postings from u. my problem is can i
keep
word files in place of text files in input directory.


On 10/17/07, Michael Baessler <[EMAIL PROTECTED]> wrote:

I don't understand your problem, so I can't help you. I'm also not sure
if you talk about UIMA problems.

-- Michael
vijay vijay wrote:

HI Michael,
                can u help me on my topic if u give me some urls also
no
problem.i have not been posting because i don't recive any
replies.todayout
of curiosity i have posted.

             i have sucessfully getting the results for uima as web
application. i am able to look for strings dynamically(text).here in

place

of text i have given word doc then problem started coming. it is
reading
only test from it and if u have table and figures which are not
recognized.ihave used poi concept here and converted the word doc into
text file then i
done the search same thing is repeted.

so can u help me here.......michael
vijay


On 10/17/07, Michael Baessler <[EMAIL PROTECTED]> wrote:


vijay vijay wrote:


Hi
      i have done one sample example UIMA as Web Application.here i

have

taken reference from ExampleApplication.it is working fine with text


files.


i am able to see the result dynamically.so i have taken one step

further

by


taking word in place of text .here the problem is it is recognizing

the

word


doc and giving result  but not able to get the tables and screens.

here insted of taking word directley i have used poi and conveted in

to

text


passed the out put to my annotation.here also i am getting the same


problem.


so if u want to look for word docs do we need to use other
techniques.canany one help me here

vijay




Again, do not post the same question to both UIMA lists!

-- Michael





Reply via email to