Yegor,

The first 8 bytes contain the standard MS Office magic number stuff - d0 cf 11 
e0 a1 b1 1a e1.

Seems like they compress data in a proprietary way. I've read one post where 
someone recommended the .NET Packaging API to crack these ...  Not a good 
option ...

- Dmitry

-----Original Message-----
From: Yegor Kozlov [mailto:[EMAIL PROTECTED]
Sent: Thursday, August 28, 2008 2:18 PM
To: POI Users List
Subject: Re: How to extract embedded files from Office 07

The first 4 bytes may contain the length of the uncompressed data. That's how 
OLE data is stored in the binary formats.

Yegor

> Tried reading the .bin file with 7-Zip-Jbindings, no go.  It wasn't 
> recognized as any of
>
> ARJ BZIP_2 CAB CHM CPIO CDEB ISO LZH NSIS RAR SPLIT TAR Z ZIP
>
> Egads.
>
>
> -----Original Message-----
> From: Dmitry Goldenberg
> Sent: Thursday, August 28, 2008 1:15 PM
> To: POI Users List
> Subject: RE: How to extract embedded files from Office 07
>
> 1. I got two .bin files, oleObject1 and oleObject2.
> 2. the UNIX file utility spits out "Microsoft Office Document"
> 3. the magic number on .bin is d0 cf 11 e0 a1 b1 1a e1 which explains #2.
> 4. this seems to be the BIN/ISO format. Would I be able to read it with 
> something like 7-Zip-Jbinding perhaps? Is there an easier way to decompress 
> the file/extract contents?
>
> Thanks.
> - Dmitry
>
> -----Original Message-----
> From: Nick Burch [mailto:[EMAIL PROTECTED]
> Sent: Thursday, August 28, 2008 1:03 PM
> To: POI Users List
> Subject: Re: How to extract embedded files from Office 07
>
> On Thu, 28 Aug 2008, Dmitry Goldenberg wrote:
>> I've noticed that some embeddings in '07 are represented as .bin files
>> which they store in the /embeddings subdirectory within the doc
>> structure. For example, I embedded a zip file and it showed up as
>> oleObject1.bin.
>>
>> Does anyone have any idea as to how to read this type of file or convert
>> it back to the original zip?
>
> Two things I'd suggest trying:
> * if you embed two zip files, do you get oleObject1 and oleObject2, or
>    just a bigger oleObject1 ?
> * if you unzip the parent ooxml file, and run the unix "file" utility
>    against oleObject1, what does it say the file is?
>
> Nick
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to