On Thu, 22 Dec 2011, Jan Stette wrote:
I'm trying to extract some information from Visio files and I don't seem to
be able to access all the data that's in them.

The main use case for HDGF is text extraction. Some other bits got done too, but mostly just the core and the parts needed to get the text out.

Looking into this using the VSDDumper tool, I notice that the files contain StringsStream, but that the API doesn't really provide any way to access the contents of such streams.

Currently, the stream type we have the most support for is the chunks stream.

I think you should find most of the basics for the stream in place, but there's nothing yet to decode it. If you do get something working, we'd love a patch! (Unfortunately it's been a few years since I did most of the HDGF work, so I may not be able to provide loads of help on the file format.)

Nick

