On Mon, Feb 20, 2012 at 10:40:58AM +0200, Nadav Har'El wrote:
> On Sun, Feb 19, 2012, Dotan Cohen wrote about "Re: Preparing to convince to
> shift to non-propriety documents formats":
> > Undocumented? Which file format is that? All the .doc and .docx
> > formats are documented, even the older binary formats.
>
> Where is the ".doc" format documented?
>
> I once wrote a tool to extract the text in MS Office files (for a search
> engine). It was a really annoying reverse-engineering-like
> trial-and-error process, and I could hardly find any documentation.
> The PowerPoint format (.ppt) was particularly odd.
>
> What documentation do you refer to?

According to Wikipedia, it's partially documented. I did not follow the
links inside:

http://en.wikipedia.org/wiki/DOC_(computing)#Specification
--
Didi

_______________________________________________
Linux-il mailing list
[email protected]
http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il
