OK, I fixed it. Not sure if I did it right, but it at least compiles and
passes the tests ☺

That’s a start:

https://github.com/apache/tika/pull/169#issuecomment-299017261

So, check out the update I made there.

Thanks and sorry for the false alarms!

Cheers,
Chris




On 5/3/17, 11:54 AM, "Chris Mattmann" <[email protected]> wrote:

    Hey Team,
    
    I’m trying to get TIKA-2016 sentiment analysis integrated and having a heck
    of a time fighting tika-bundle and OSGi, in which I am not an expert.
    
    See: https://github.com/apache/tika/pull/169/files
    
    Basically what I’m saying:
    
    1. The USC IRDS sentiment analysis parser has a bunch of Maven 
    exclusions in the pom.xml updates to tika-parsers that Thamme made.
    This compiles fine but fails at tika-bundle.
    2. Usually my tika-bundle updates:
    a. Include the jar artifactId ref
    b. Add a ;resolution:=optional for the package includes
    3. Doing #2 usually fixes it. In this case there are a ton of weird
    exclusions. I tried to reflect them in the OSGi tika-bundle/pom.xml as
    best I could: I tried excluding Solr, handling the tika-serialization
    inclusion needed, etc. I can get it to the point (if I add the
    sentiment-analysis-parser artifactId back in) where it gets to the
    tests, but it fails the tests with:
    
    Running org.apache.tika.bundle.BundleIT
    [main] INFO org.ops4j.pax.exam.spi.DefaultExamSystem - Pax Exam System (Version: 4.10.0) created.
    [main] INFO org.ops4j.pax.exam.junit.impl.ProbeRunner - creating PaxExam runner for class org.apache.tika.bundle.BundleIT
    [main] INFO org.ops4j.pax.exam.junit.impl.ProbeRunner - running test class org.apache.tika.bundle.BundleIT
    INFO  running testBundleSimpleText in reactor
    INFO  running testManifestNoJUnit in reactor
    INFO  running testTesseractParser in reactor
    INFO  running testTikaBundle in reactor
    INFO  running testBundleDetection in reactor
    INFO  running testBundleDetectors in reactor
    INFO  running testBundleLoaded in reactor
    INFO  running testForkParser in reactor
    INFO  running testBundleParsers in reactor
    [main] INFO org.ops4j.pax.exam.spi.reactors.ReactorManager - suite finished
    Tests run: 9, Failures: 2, Errors: 5, Skipped: 0, Time elapsed: 7.155 sec <<< FAILURE!
    
    Results :
    
    Failed tests:   testBundleDetectors(org.apache.tika.bundle.BundleIT): Should have several Detector names, found 2
      testBundleParsers(org.apache.tika.bundle.BundleIT): Should have lots Parser names, found 2
    
    Tests in error: 
      testBundleSimpleText(org.apache.tika.bundle.BundleIT): org.apache.tika.mime.MediaType not found by org.apache.tika.bundle [13]
      testTesseractParser(org.apache.tika.bundle.BundleIT): Could not initialize class org.apache.tika.parser.ocr.TesseractOCRParser
      testTikaBundle(org.apache.tika.bundle.BundleIT): Could not initialize class org.apache.tika.parser.pkg.PackageParser
      testBundleDetection(org.apache.tika.bundle.BundleIT): org.apache.tika.mime.MediaType not found by org.apache.tika.bundle [13]
      testForkParser(org.apache.tika.bundle.BundleIT): org.apache.tika.mime.MediaType not found by org.apache.tika.bundle [13]
    
    Tests run: 9, Failures: 2, Errors: 5, Skipped: 0
    
    I have no clue why. I’ve messed with including/excluding tika-core, etc.,
    but when I got stuck on something similar before, I just had to step away
    from it, and the fix was not something I was thinking of ☺
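    
    For anyone not familiar with it, the usual update from step 2 above
    looks roughly like the fragment below. This is only a sketch of a
    maven-bundle-plugin configuration, not the actual change in the PR:
    the artifactId "example-parser-artifact" and the package pattern
    "org.example.parser.deps.*" are made-up placeholders.
    
    ```xml
    <!-- Sketch only: placeholder names, not the real tika-bundle/pom.xml. -->
    <plugin>
      <groupId>org.apache.felix</groupId>
      <artifactId>maven-bundle-plugin</artifactId>
      <configuration>
        <instructions>
          <!-- 2a: embed the new jar by its artifactId (hypothetical name) -->
          <Embed-Dependency>
            example-parser-artifact;inline=true
          </Embed-Dependency>
          <!-- 2b: mark the packages it drags in as optional imports.
               The specific pattern goes before the catch-all "*" because
               the first matching pattern wins. -->
          <Import-Package>
            org.example.parser.deps.*;resolution:=optional,
            *
          </Import-Package>
        </instructions>
      </configuration>
    </plugin>
    ```
    
    With resolution:=optional the bundle can still resolve when nothing
    exports those packages, which is why it normally gets past this kind
    of failure.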
    
    Any help from Bob, Nick or other OSGI gurus is appreciated.
    
    Cheers,
    Chris
    
    
    
    
    

