Thanks for letting us know you found a solution to the problem. Also, I realised that I did not answer one of your questions. The reason that the shared strings table grew in size once you had opened the file and then re-saved it using Excel is because Excel would have populated the shared strings table. Typically, Excel will seek to minimise the file size and one of the ways it does this is to remove any duplicated strings by utilising the shared strings table.
It should be quite possible to use POI to get at in-line strings. As I said, I have no direct experience with the streaming API but have used parsers in the past. Typically, there is a class that handles the markup via a callback mechanism. Methods will be called when the opening and closing tags of an element are called. It would be possible to create logic within that class to watch for the inlie notification and then act accordingly; i.e. not look in the shared strings table but read the value directly from the relevant element. -- View this message in context: http://apache-poi.1045710.n5.nabble.com/sax-get-nothing-but-numbers-from-a-large-excel2007-and-sharedStrings-xml-is-empty-tp5719797p5719814.html Sent from the POI - User mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
