> Hallo Spezialists, > > as you already know I am Debian GNU/Linux Consultant in Strasbourg and > I have a customer in France who has over 68 millionen Winword 6.0 > Dokuments archived whcih can not more be opened by MSO 2007 and previos > version. > > This is not realy funny, since this enterprise is woking for the french > administration and they need all metadata inside the documents including > "last printed", "modified by", 2last modified", ... > > The problem is, that I have not found a commandline converter under > GNU/Linux, which can convert those documents to XML or ODT. > > Currently, the customer convert document by document manualy as requestd > under OpenOffice.org 2.2 which require additional emplyees (at least 3) > which then convert the rest of the 68 million in there "free time". > > I think, I do not must you tell you the costs for it!!!!!!!!! > > Does anyone KNOW a "WORKING SOLUTION" who to convert those bunch of > documents on the commandline? Since the FileServer has resources enough > I can code some stuff and let it running overt he first to the last > document but since we are poking arround with it since last year already > ist is now very urgent... > > So, Micr0$oft has, for the french administration workin enterprise, sold > a "SOLUTION" (1994) which can not have success over the time > > Does anyone has ideas how to solv it? > > Byside this costomer will su Micr0$fot France for > damages in the hight of several million Euro. >
That is a fair number of doc's.... I've just done a test and the built in 'wizard converter' for OOo will convert .doc to .odt just fine. It also maintains the meta data as far as I can see (eg, create time, modified time, etc...). This is not a command line however and it was fairly slow (ie, 25 minutes for 63 documents). Maybe with a bit of hacking and coding a command line tool could be built (the source for the wizard is in OOo Basic). /paul -- For those in NZ that want cheaper petrol - have a look here http://www.thecheappetrolproject.co.nz/index.php
