You're right! That's why we used the helper application.


----- Original Message -----
From: "Vince P" <[EMAIL PROTECTED]>
To: <ADVANCED-DOTNET@DISCUSS.DEVELOP.COM>
Sent: Monday, December 11, 2006 10:22 PM
Subject: Re: [ADVANCED-DOTNET] Does anyone know how to read a Word document
in .Net 2003?


Microsoft does not recommend nor support using the Office COM objects for
server operations.

From http://support.microsoft.com/kb/257757/

MORE INFORMATION
All current versions of Microsoft Office were designed, tested, and
configured to run as end-user products on a client workstation. They assume
an interactive desktop and user profile, and do not provide the level of
reentrancy or security that is necessary to meet the needs of server-side
components that are designed to run unattended.

Microsoft does not currently recommend, and does not support, Automation of
Microsoft Office applications from any unattended, non-interactive client
application or component (including ASP, DCOM, and NT Services), because
Office may exhibit unstable behavior and/or deadlock when run in this
environment.

If you are building a solution that runs in a server-side context, you
should attempt wherever possible to use components that have been made safe
for unattended execution, or find alternatives that allow at least a part of
the code to run client-side. If you choose to use an Office application from
a server-side solution, you will find that it lacks many of the necessary
capabilities to run successfully, and you will be taking risks with the
stability of your overall solution.

-----Original Message-----
From: Discussion of advanced .NET topics.
[mailto:[EMAIL PROTECTED] On Behalf Of
Peter van der Weerd
Sent: Monday, December 11, 2006 3:03 PM
To: ADVANCED-DOTNET@DISCUSS.DEVELOP.COM
Subject: Re: [ADVANCED-DOTNET] Does anyone know how to read a
Word document in .Net 2003?

We used Automation with MS Word succesfully in a server condition.
However, we had to make a few design decisions:

1) Due to the fact that Word is written as a UI-application
you must be prepared that messageboxes might occur, stopping
your application. We wrote code to auto-answer some of those
boxes, and cancelled the conversion if we didn't know what to answer.

2) If you want to load many documents in a loop you will end
with a crashed application...
We used Windows to do the cleanup by writing a
helper-application that does the Word automation. The
server-app then starts this helper-app for each document that
must be converted.
The server is able to set a timeout for the conversion and
cancel helper-app if it lasts too long. The helper-app itself
can listen to an Win32-event to kill itself.

Succes,
Peter v/d Weerd


----- Original Message -----
From: "Steve Welborn" <[EMAIL PROTECTED]>
To: <ADVANCED-DOTNET@DISCUSS.DEVELOP.COM>
Sent: Monday, December 11, 2006 2:55 PM
Subject: Re: [ADVANCED-DOTNET] Does anyone know how to read a
Word document in .Net 2003?


> You could try the Automation/Server idea, MS makes it easy
to use, but
> like most here I've had nothing but nightmares with it.
> Automation with Word is a
> memory hog, majority of the time the instances still remain
in memory
> despite whatever measure you take to close it and not to
mention the
> crash's that have or could occur.
>
> But from what you described your use to be I would probably
go with a
> Service as well. I would just be sure to double check to
get it out of
> memory when done.
>
>
> Good luck.
> Steve
>
> -----Original Message-----
> From: Discussion of advanced .NET topics.
> [mailto:[EMAIL PROTECTED] On Behalf
> Of Marc Brooks
> Sent: Monday, December 11, 2006 12:22 AM
> To: ADVANCED-DOTNET@DISCUSS.DEVELOP.COM
> Subject: Re: [ADVANCED-DOTNET] Does anyone know how to
> read a Word document
> in .Net 2003?
>
> On 12/10/06, Jon Rothlander <[EMAIL PROTECTED]>
> wrote:
>> I think that is what I want to do.  I just want
> something that will
> convert
>> it to text.  I was just thinking that if in a .Net
> app you can easily open
>> the Word doc and the save it back out as a Text
> file...
>
> Having been there, done that, and regretted it, let me
> share.  I
> worked on a project[1] that used to extract resumes in
> Word/Word
> Perfect/etc. documents via automation so we could pass
> them through an
> expert system to extract the information. The WinWord
> process
> constantly crashed and locked the service.
>
> Eventually, after trying several commercial conversion
> tools
> (including several supposed to be used in batch
> conversion or
> server-based setups), nothing was working.
>
> Then I hit on the radical idea that "if it's good
> enough for
> index-server[2], it's good enough for me" and used the
> installed
> IFilter drivers to suck out the text of any file we
> had an IFilter
> driver (and dude, are there tons of them available for
> free). I wrote
> a little COM component in C++ that simply defers to
> the shell to load
> the correct driver and then ignored all the
> "formatting" information
> and kept the text, which is returned as a BSTR.
> Optionally, you can
> ask it to "clean the text" to normalize the Unicode
> encodings and
> morphing digits-like characters to actual digits
>
> If you are interested, I can post the source for
> this... it is still
> in service to  this day and it really works well.
>
> [1] http://www.sendouts.com
> [2]
>
http://msdn.microsoft.com/library/default.asp?url=/library/en-
us/indexsrv/hh
> /indexsrv/ixufilt_94fm.asp
>
> IFilters:
> http://www.adobe.com/support/downloads/8122.htm
>
http://www.corel.com/support/ftpsite/pub/wordperfect/wpwin/8/c
wps8.htm#
> http://www.adobe.com/support/downloads/8126.htm
> http://www.cad-company.nl/ifilter/
> http://www.microsoft.com/sharepoint/techinfo/reskit/RTF_Filter.asp
> http://www.microsoft.com/sharepoint/techinfo/reskit/XML_Filter.asp
> http://www.naa.gov.au/Search/srchadm/help/default.htm#Top
> http://www.mp3machine.com/software/MP3_Ifilter/=
>
> --
> "I am Dyslexic of Borg. Resistors are fertile. Prepare
> to have your
> ass laminated." -- Dan Nitschke
>
> Marc C. Brooks
> http://musingmarc.blogspot.com
>
> ===================================
> This list is hosted by DevelopMentorR
> http://www.develop.com
>
> View archives and manage your subscription(s) at
> http://discuss.develop.com
>
>
>
>
>
______________________________________________________________
______________________
> Do you Yahoo!?
> Everyone is raving about the all-new Yahoo! Mail beta.
> http://new.mail.yahoo.com
>
> ===================================
> This list is hosted by DevelopMentor®  http://www.develop.com
>
> View archives and manage your subscription(s) at
> http://discuss.develop.com
>

===================================
This list is hosted by DevelopMentor®  http://www.develop.com

View archives and manage your subscription(s) at
http://discuss.develop.com

===================================
This list is hosted by DevelopMentor®  http://www.develop.com

View archives and manage your subscription(s) at http://discuss.develop.com

===================================
This list is hosted by DevelopMentor®  http://www.develop.com

View archives and manage your subscription(s) at http://discuss.develop.com

Reply via email to