tika-user  

Re: Exception threw when filtering the attached Excel using tika-app-0.4.jar

Li Leon
Fri, 04 Dec 2009 01:05:09 -0800

Thanks for the reply.

So this is a known issue?

Is it possible to workaround it programmatically in some way?

Thanks,

2009/12/4 Alex Ott <alex...@gmail.com>

> Hello
>
> Jukka Zitting  at "Fri, 4 Dec 2009 09:30:45 +0100" wrote:
>  JZ> Hi,
>
>  JZ> On Fri, Dec 4, 2009 at 3:37 AM, Li Leon <leon800...@gmail.com> wrote:
>  >> I got an exception when filtering the attached Excel file using "type
>  >> bugs.xls | java -jar tika-app -0.4.jar -".
>  >>
>  >> Any ideas? The embedded object seemed to cause the problem.
>
>  JZ> Yep, I can see the problem too. The exception is coming from the
>  JZ> Apache POI library that Tika uses for parsing Microsoft file formats.
>  JZ> Can you file a bug report about this in the POI issue tracker at [1]?
>  JZ> The problem might be related to the already reported bug #47685 [2].
>
>  JZ> [1] https://issues.apache.org/bugzilla/buglist.cgi?product=POI
>  JZ> [2] https://issues.apache.org/bugzilla/show_bug.cgi?id=47685
>
> The problem is, that /LNK01612D14/Ole10Native entry is broken - it contains
> wrong size of object name, stored in it....
>
>  JZ> PS. For the record, the exception stack trace is included below.
>
> --
> With best wishes, Alex Ott, MBA
> http://alexott.blogspot.com/           http://xtalk.msk.su/~ott/
> http://alexott-ru.blogspot.com/
>