I have a similar problem, (Java memory exception is what I get), how do I
use the 1.1 RC? Same repo, as below?
<dependency>
<groupId>org.apache.tika</groupId>
<artifactId>tika-core</artifactId>
<version>1.0-SNAPSHOT</version>
</dependency>
<dependency>
<groupId>org.apache.tika</groupId>
<artifactId>tika-parsers</artifactId>
<version>1.0-SNAPSHOT</version>
</dependency>
On Thu, Mar 8, 2012 at 5:13 AM, Nick Burch <[email protected]> wrote:
> On Thu, 8 Mar 2012, Harry Simons wrote:
>
>> I tried the BFF Validator, and it is indeed failing!
>>
>
> If you're able to share the error log, that could be helpful
>
> However, the file got created by MS Word only, and I doubt if it's
>> 'corrupt'... since both MS Word and LibreOffice can load it fine without
>> any errors or even warnings of any kind -- everything seems to be normal
>> with these apps. I can even use LibreOffice 3.5 to convert it to pdf or to
>> a .zip of xml's.
>>
>
> If you load it in word, and do a save-as, does the new .doc file show the
> same problem?
>
> Do you/others still feel it could be addressed by a POI upgrade?
>>
>
> You could try with the Tika 1.1 release candidate, that has the latest POI
> release in it. You could also try dropping in a recent POI nightly build to
> see if that helps - Tika will upgrade shortly to POI 3.8 beta 6 once that's
> out
>
>
> Also, I thought Tika uses POI and would be using POI as a .jar. But
>> looking in Tika sources, I could find only *POI*.java files but no
>> *POI*.jar or *poi*.jar file(s).
>>
>
> Depends how you use Tika. The Tika-App inlines all the dependencies, the
> Tika OSGi Bundle has them individually as jars in the bundle, or Maven will
> download them for you
>
> Nick
>
--
Mark Kerzner, CEO, SHMsoft <http://shmsoft.com/>