> But what about getAllEmbeddedObjects()? Does it parse object streams
> or does it rely on OLE stream names? Is it a big job to write
> functions to dump these objects to disk?
Embedded OLE data is stored in a host-defined format. This means that
the structure of the directory entries depends on the type of the
embedded data.
For example:
XLS:
    CompObj
    DocumentSummaryInformation
    SummaryInformation
    Workbook
DOC:
    CompObj
    DocumentSummaryInformation
    SummaryInformation
    WordDocument
ZIP:
    CompObj
    ObjInfo
    Ole
    Ole10Native
TEXT:
    ???
VISIO:
    ???
PDF:
    ???
.....
If the embedded entry is an xls, ppt or doc, you can save it with the
appropriate extension and it will be a "real" Office document. You
should be able to open it in MS Office or with POI.
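As a rough illustration of picking the extension from the entry names, here is a minimal sketch. It assumes the names of the streams inside the embedded directory have already been collected into a set. The class and method names are made up for this example and are not part of POI. The "PowerPoint Document" stream name for ppt is an assumption that does not appear in the listings above.

```java
import java.util.Set;

/** Hypothetical helper, not a POI class: guesses a file extension from stream names. */
final class EmbeddedTypeGuesser {
    private EmbeddedTypeGuesser() {}

    /**
     * Returns "xls", "doc" or "ppt" when a well-known main stream is present,
     * or null when the entry names give no clue (for example ZIP or text).
     */
    static String guessExtension(Set<String> entryNames) {
        if (entryNames.contains("Workbook")) {
            return "xls";
        }
        if (entryNames.contains("WordDocument")) {
            return "doc";
        }
        // Assumption: PowerPoint keeps its main stream under this name.
        if (entryNames.contains("PowerPoint Document")) {
            return "ppt";
        }
        return null;
    }
}
```

A null result means the entry is one of the "???" cases and needs some other kind of inspection.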
I'm not sure about ZIP, text or other formats. I searched the spec
but didn't find a clue. My advice is to look at CompObj: every embedded
entry seems to have it. Try parsing this data; maybe you will figure out the
pattern.
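Until the CompObj pattern is clear, the byte-level checks Nick suggests in the quoted message below could look roughly like this. It is only a sketch: the class and method names are invented for the example, the 95% threshold for "looks like text" is an arbitrary assumption, and it assumes the payload starts at the first byte of the stream, which may not hold if a stream carries its own header in front of the data.

```java
/** Hypothetical helper, not a POI class: crude content sniffing on raw stream bytes. */
final class ContentSniffer {
    private ContentSniffer() {}

    /** True if the data begins with the ZIP local file header signature "PK\3\4". */
    static boolean startsWithZipHeader(byte[] data) {
        return data.length >= 4
                && data[0] == 'P' && data[1] == 'K'
                && data[2] == 3 && data[3] == 4;
    }

    /**
     * True if at least 95% of the bytes are tab, CR, LF or printable ASCII.
     * The threshold is an assumption; non-ASCII encodings are not handled.
     */
    static boolean looksLikeText(byte[] data) {
        if (data.length == 0) {
            return false;
        }
        long printable = 0;
        for (byte b : data) {
            int c = b & 0xFF; // treat the byte as unsigned
            if (c == '\t' || c == '\n' || c == '\r' || (c >= 0x20 && c < 0x7F)) {
                printable++;
            }
        }
        return printable * 100 >= (long) data.length * 95;
    }
}
```

You would read each stream of the embedded entry into a byte array, run both checks on it, and save the first stream that matches.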
Yegor
> On 5/7/08, Nick Burch <[EMAIL PROTECTED]> wrote:
>> On Wed, 7 May 2008, Yury Batrakov wrote:
>>
>> > It's quite obvious and I can already save office files, but the problem is
>> > to save zip and other files: I don't know which of the streams in
>> > _1271662200 I should open and save. Word, Excel and others have
>> > predefined stream names, but all others don't.
>> >
>>
>> For zip, it should be easy. Open each one in turn, check the first few bytes
>> and see if they are the zip header. If so, save that.
>>
>> For text, I guess just open it and see if it looks text-like (mostly just
>> bytes in the right ranges).
>>
>>
>> Nick
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [EMAIL PROTECTED]
>> For additional commands, e-mail: [EMAIL PROTECTED]
>>
>>