POI Colleagues,
On TIKA-2157 and TIKA-2130, Seva Alekseyev attached files that trigger a
ZipException on an object embedded within a PPT file. We've seen these in our
regression corpus as well. For now, we're swallowing these exceptions in Tika. If anyone
has a chance to look into those triggering files to figure out if the embedded
files are truly corrupt or if this is something we can fix in POI, I'd
appreciate it. I investigated a bit with TIKA-2130's file, and it _looks_ to
me like the zip stream is truly corrupt, but this area of the code base is not
one of my strengths.
Thank you.
Cheers,
Tim