Yeah, I was just collaborating what you said :)
> -----Original Message----- > From: Discussion of advanced .NET topics. > [mailto:[EMAIL PROTECTED] On Behalf Of > Peter van der Weerd > Sent: Monday, December 11, 2006 3:57 PM > To: ADVANCED-DOTNET@DISCUSS.DEVELOP.COM > Subject: Re: [ADVANCED-DOTNET] Does anyone know how to read a > Word document in .Net 2003? > > You're right! That's why we used the helper application. > > > ----- Original Message ----- > From: "Vince P" <[EMAIL PROTECTED]> > To: <ADVANCED-DOTNET@DISCUSS.DEVELOP.COM> > Sent: Monday, December 11, 2006 10:22 PM > Subject: Re: [ADVANCED-DOTNET] Does anyone know how to read a > Word document in .Net 2003? > > > Microsoft does not recommend nor support using the Office COM > objects for server operations. > > From http://support.microsoft.com/kb/257757/ > > MORE INFORMATION > All current versions of Microsoft Office were designed, > tested, and configured to run as end-user products on a > client workstation. They assume an interactive desktop and > user profile, and do not provide the level of reentrancy or > security that is necessary to meet the needs of server-side > components that are designed to run unattended. > > Microsoft does not currently recommend, and does not support, > Automation of Microsoft Office applications from any > unattended, non-interactive client application or component > (including ASP, DCOM, and NT Services), because Office may > exhibit unstable behavior and/or deadlock when run in this > environment. > > If you are building a solution that runs in a server-side > context, you should attempt wherever possible to use > components that have been made safe for unattended execution, > or find alternatives that allow at least a part of the code > to run client-side. If you choose to use an Office > application from a server-side solution, you will find that > it lacks many of the necessary capabilities to run > successfully, and you will be taking risks with the stability > of your overall solution. > > > -----Original Message----- > > From: Discussion of advanced .NET topics. > > [mailto:[EMAIL PROTECTED] On Behalf Of Peter van > > der Weerd > > Sent: Monday, December 11, 2006 3:03 PM > > To: ADVANCED-DOTNET@DISCUSS.DEVELOP.COM > > Subject: Re: [ADVANCED-DOTNET] Does anyone know how to read a Word > > document in .Net 2003? > > > > We used Automation with MS Word succesfully in a server condition. > > However, we had to make a few design decisions: > > > > 1) Due to the fact that Word is written as a UI-application > you must > > be prepared that messageboxes might occur, stopping your > application. > > We wrote code to auto-answer some of those boxes, and cancelled the > > conversion if we didn't know what to answer. > > > > 2) If you want to load many documents in a loop you will end with a > > crashed application... > > We used Windows to do the cleanup by writing a > helper-application that > > does the Word automation. The server-app then starts this > helper-app > > for each document that must be converted. > > The server is able to set a timeout for the conversion and cancel > > helper-app if it lasts too long. The helper-app itself can > listen to > > an Win32-event to kill itself. > > > > Succes, > > Peter v/d Weerd > > > > > > ----- Original Message ----- > > From: "Steve Welborn" <[EMAIL PROTECTED]> > > To: <ADVANCED-DOTNET@DISCUSS.DEVELOP.COM> > > Sent: Monday, December 11, 2006 2:55 PM > > Subject: Re: [ADVANCED-DOTNET] Does anyone know how to read a Word > > document in .Net 2003? > > > > > > > You could try the Automation/Server idea, MS makes it easy > > to use, but > > > like most here I've had nothing but nightmares with it. > > > Automation with Word is a > > > memory hog, majority of the time the instances still remain > > in memory > > > despite whatever measure you take to close it and not to > > mention the > > > crash's that have or could occur. > > > > > > But from what you described your use to be I would probably > > go with a > > > Service as well. I would just be sure to double check to > > get it out of > > > memory when done. > > > > > > > > > Good luck. > > > Steve > > > > > > -----Original Message----- > > > From: Discussion of advanced .NET topics. > > > [mailto:[EMAIL PROTECTED] On Behalf Of Marc > > > Brooks > > > Sent: Monday, December 11, 2006 12:22 AM > > > To: ADVANCED-DOTNET@DISCUSS.DEVELOP.COM > > > Subject: Re: [ADVANCED-DOTNET] Does anyone know how to > read a Word > > > document in .Net 2003? > > > > > > On 12/10/06, Jon Rothlander <[EMAIL PROTECTED]> > > > wrote: > > >> I think that is what I want to do. I just want > > > something that will > > > convert > > >> it to text. I was just thinking that if in a .Net > > > app you can easily open > > >> the Word doc and the save it back out as a Text > > > file... > > > > > > Having been there, done that, and regretted it, let me share. I > > > worked on a project[1] that used to extract resumes in Word/Word > > > Perfect/etc. documents via automation so we could pass > them through > > > an expert system to extract the information. The WinWord process > > > constantly crashed and locked the service. > > > > > > Eventually, after trying several commercial conversion tools > > > (including several supposed to be used in batch conversion or > > > server-based setups), nothing was working. > > > > > > Then I hit on the radical idea that "if it's good enough for > > > index-server[2], it's good enough for me" and used the installed > > > IFilter drivers to suck out the text of any file we had > an IFilter > > > driver (and dude, are there tons of them available for free). I > > > wrote a little COM component in C++ that simply defers to > the shell > > > to load the correct driver and then ignored all the "formatting" > > > information and kept the text, which is returned as a BSTR. > > > Optionally, you can > > > ask it to "clean the text" to normalize the Unicode encodings and > > > morphing digits-like characters to actual digits > > > > > > If you are interested, I can post the source for this... > it is still > > > in service to this day and it really works well. > > > > > > [1] http://www.sendouts.com > > > [2] > > > > > http://msdn.microsoft.com/library/default.asp?url=/library/en- > > us/indexsrv/hh > > > /indexsrv/ixufilt_94fm.asp > > > > > > IFilters: > > > http://www.adobe.com/support/downloads/8122.htm > > > > > http://www.corel.com/support/ftpsite/pub/wordperfect/wpwin/8/c > > wps8.htm# > > > http://www.adobe.com/support/downloads/8126.htm > > > http://www.cad-company.nl/ifilter/ > > > http://www.microsoft.com/sharepoint/techinfo/reskit/RTF_Filter.asp > > > http://www.microsoft.com/sharepoint/techinfo/reskit/XML_Filter.asp > > > http://www.naa.gov.au/Search/srchadm/help/default.htm#Top > > > http://www.mp3machine.com/software/MP3_Ifilter/= > > > > > > -- > > > "I am Dyslexic of Borg. Resistors are fertile. Prepare to > have your > > > ass laminated." -- Dan Nitschke > > > > > > Marc C. Brooks > > > http://musingmarc.blogspot.com > > > > > > =================================== > > > This list is hosted by DevelopMentorR http://www.develop.com > > > > > > View archives and manage your subscription(s) at > > > http://discuss.develop.com > > > > > > > > > > > > > > > > > ______________________________________________________________ > > ______________________ > > > Do you Yahoo!? > > > Everyone is raving about the all-new Yahoo! Mail beta. > > > http://new.mail.yahoo.com > > > > > > =================================== > > > This list is hosted by DevelopMentor® http://www.develop.com > > > > > > View archives and manage your subscription(s) at > > > http://discuss.develop.com > > > > > > > =================================== > > This list is hosted by DevelopMentor® http://www.develop.com > > > > View archives and manage your subscription(s) at > > http://discuss.develop.com > > =================================== > This list is hosted by DevelopMentor® http://www.develop.com > > View archives and manage your subscription(s) at > http://discuss.develop.com > > =================================== > This list is hosted by DevelopMentor® http://www.develop.com > > View archives and manage your subscription(s) at > http://discuss.develop.com =================================== This list is hosted by DevelopMentor® http://www.develop.com View archives and manage your subscription(s) at http://discuss.develop.com