Yegor, The first 8 bytes contain the standard MS Office magic number stuff - d0 cf 11 e0 a1 b1 1a e1.
Seems like they compress data in a proprietary way. I've read one post where someone recommended the .NET Packaging API to crack these ... Not a good option ... - Dmitry -----Original Message----- From: Yegor Kozlov [mailto:[EMAIL PROTECTED] Sent: Thursday, August 28, 2008 2:18 PM To: POI Users List Subject: Re: How to extract embedded files from Office 07 The first 4 bytes may contain the length of the uncompressed data. That's how OLE data is stored in the binary formats. Yegor > Tried reading the .bin file with 7-Zip-Jbindings, no go. It wasn't > recognized as any of > > ARJ BZIP_2 CAB CHM CPIO CDEB ISO LZH NSIS RAR SPLIT TAR Z ZIP > > Egads. > > > -----Original Message----- > From: Dmitry Goldenberg > Sent: Thursday, August 28, 2008 1:15 PM > To: POI Users List > Subject: RE: How to extract embedded files from Office 07 > > 1. I got two .bin files, oleObject1 and oleObject2. > 2. the UNIX file utility spits out "Microsoft Office Document" > 3. the magic number on .bin is d0 cf 11 e0 a1 b1 1a e1 which explains #2. > 4. this seems to be the BIN/ISO format. Would I be able to read it with > something like 7-Zip-Jbinding perhaps? Is there an easier way to decompress > the file/extract contents? > > Thanks. > - Dmitry > > -----Original Message----- > From: Nick Burch [mailto:[EMAIL PROTECTED] > Sent: Thursday, August 28, 2008 1:03 PM > To: POI Users List > Subject: Re: How to extract embedded files from Office 07 > > On Thu, 28 Aug 2008, Dmitry Goldenberg wrote: >> I've noticed that some embeddings in '07 are represented as .bin files >> which they store in the /embeddings subdirectory within the doc >> structure. For example, I embedded a zip file and it showed up as >> oleObject1.bin. >> >> Does anyone have any idea as to how to read this type of file or convert >> it back to the original zip? > > Two things I'd suggest trying: > * if you embed two zip files, do you get oleObject1 and oleObject2, or > just a bigger oleObject1 ? > * if you unzip the parent ooxml file, and run the unix "file" utility > against oleObject1, what does it say the file is? > > Nick > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
