Hi,
Currently the Apache POI dependency is pulled into several modules, and
it's sort of a beast (> 2 MB in size). Many of those modules appear to
use only POI's IOUtils class; the big exception is the office module,
which is responsible for parsing documents. The same IOUtils methods
appear to exist in commons-io as well, which is only ~180 KB. Any
concerns with replacing the POI IOUtils usage with commons-io? Does
POI's IOUtils offer anything beyond what commons-io's IOUtils provides?
If not, I think it would be great to isolate the POI dependency to the
office module only.
- Bob
- Tika 2.0 - Replace POI IOUtils with commons-io IOUtils Bob Paulin