Adrian Bird created TIKA-4750:
---------------------------------
Summary: tika-4.0.0-alpha1 - tess4j-parser not available
Key: TIKA-4750
URL: https://issues.apache.org/jira/browse/TIKA-4750
Project: Tika
Issue Type: Bug
Affects Versions: 4.0.0
Reporter: Adrian Bird
I've tried to use the 'tess4j-parser' but get the following error:
{noformat}
DEBUG [main] 09:09:06,858 org.apache.tika.config.loader.TikaObjectMapperFactory Loaded component registry: parse-context
Exception in thread "main" org.apache.tika.exception.TikaConfigException: Unknown component type: 'tess4j-parser'
    at org.apache.tika.config.loader.ComponentInstantiator.instantiate(ComponentInstantiator.java:179)
    at org.apache.tika.config.loader.LoaderContext.instantiate(LoaderContext.java:110)
    at org.apache.tika.config.loader.ParserLoader.loadComponent(ParserLoader.java:61)
    at org.apache.tika.config.loader.ParserLoader.loadComponent(ParserLoader.java:46)
    at org.apache.tika.config.loader.AbstractSpiComponentLoader.load(AbstractSpiComponentLoader.java:107)
    at org.apache.tika.config.loader.TikaLoader.loadComponent(TikaLoader.java:683)
    at org.apache.tika.config.loader.TikaLoader.get(TikaLoader.java:647)
    at org.apache.tika.config.loader.TikaLoader.loadParsers(TikaLoader.java:247)
    at org.apache.tika.config.loader.TikaLoader.loadAutoDetectParser(TikaLoader.java:379)
    at org.apache.tika.cli.TikaCLI.configure(TikaCLI.java:901)
    at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:532)
    at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:267)
Caused by: java.lang.ClassNotFoundException: Component 'tess4j-parser' is not registered. Components must be registered via @TikaComponent annotation or .idx file. Arbitrary class names are not allowed for security reasons.
    at org.apache.tika.serialization.ComponentNameResolver.resolveClass(ComponentNameResolver.java:116)
    at org.apache.tika.config.loader.ComponentInstantiator.instantiate(ComponentInstantiator.java:176)
    ... 11 more
{noformat}
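For context on the "Caused by" message: it describes an allowlist-style lookup, where a short component name resolves only if it was registered beforehand (the message names the @TikaComponent annotation or an .idx file as the registration routes). The sketch below is a minimal illustration of that idea in plain Java. The class ComponentRegistrySketch, its register and resolve methods, and the "string-holder" name are all hypothetical and are not Tika's actual code; the real resolution happens in ComponentNameResolver, whose internals are not shown in this report.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative only: a hypothetical allowlist registry, NOT Tika's implementation.
public class ComponentRegistrySketch {
    // Maps a friendly component name to the class registered under it.
    private final Map<String, Class<?>> registered = new HashMap<>();

    // Register a component name for a known class.
    public void register(String name, Class<?> type) {
        registered.put(name, type);
    }

    // Resolve strictly against the allowlist; there is deliberately no
    // fallback to Class.forName(name), so arbitrary class names are rejected.
    public Class<?> resolve(String name) throws ClassNotFoundException {
        Class<?> type = registered.get(name);
        if (type == null) {
            throw new ClassNotFoundException("Component '" + name + "' is not registered");
        }
        return type;
    }

    public static void main(String[] args) {
        ComponentRegistrySketch registry = new ComponentRegistrySketch();
        // String.class is a stand-in for a real parser class.
        registry.register("string-holder", String.class);
        try {
            System.out.println(registry.resolve("string-holder").getName());
            // Never registered, so this lookup throws.
            registry.resolve("tess4j-parser");
        } catch (ClassNotFoundException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Under this model, the error above would mean that no registration for the name 'tess4j-parser' was visible at load time, for example because the module providing it was not on the classpath or did not ship a registration entry. Which of these applies here is not established by the trace.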
FYI I've probably done all the testing I'm going to with this version.