A few years back I used the standalone 'antiword' binary to convert
.doc files to plaintext. It seemed to work pretty well.

-Shaneal

On Mon, Feb 28, 2011 at 5:20 PM, Daniel Herring <[email protected]> wrote:
> On Mon, 28 Feb 2011, Mark H. David wrote:
>
>> Does anyone know of any CL libraries for dealing with Microsoft Word files?
>> Tools for creating them, reading from them, parsing them, converting
>> them to plain text or other formats, things like that?
>
> I suspect that RDNZL might provide the best results.  You can use it to
> hook into the beast itself.
>
> Your other approach is to hook into the code for another office suite such
> as Open/LibreOffice, AbiWord, or KWord.
>
> In addition to Apache POI, there is also wvWare, but it doesn't support
> the new XML formats...
>
> Right when the libraries were becoming good at doc, MS went and changed
> formats.  Funny coincidence, that.
>
> Later,
> Daniel
>
> _______________________________________________
> pro mailing list
> [email protected]
> http://common-lisp.net/cgi-bin/mailman/listinfo/pro
>

_______________________________________________
pro mailing list
[email protected]
http://common-lisp.net/cgi-bin/mailman/listinfo/pro

Reply via email to