Hi, folks.

It seems that we have too much dependencies in `tika-parsers` and many of
them could actually be not used. As Tim found in TIKA-2007 [1]
`jackson-core` wasn't necessary for `tika-parsers` at all.

When I looked into current parser deps I found a lot of strange deps like
`quartz` with `c3p0` (jdbc connection pool impl) and `ehcache-core` via
`cdm`, lucene parts (via `ctakes-core`), spring framework 3.x (also via
`ctakes-core`) et cetera. Latter could even break app if you have another
spring version in transitive deps.

Also, there seems to be no tests for ctakes parser on the first glance and
I have no easy way to check what I can exclude from deps without breaking
things.

What do you think about shrinking some of such deps? With at least minimal
test coverage to ensure common usecases won't be broken, of course.

[1]:
https://issues.apache.org/jira/browse/TIKA-2007?focusedCommentId=15435206&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15435206
-- 

Best regards,
Konstantin Gribov

Reply via email to