On 5/23/22 8:16 PM, Michael Slusarz wrote:
Unfortunately, Tika has not yet been converted to events/categories with the ability to more granularly enable debugging just for this component.
Aha! Thx.
It's probably easier to just look at tika's debugging logs. The default log level (at least in Tika 2.3) will output an INFO line for every attachment indexed:

> INFO [qtp235162442-22] 16:15:19,905 org.apache.tika.server.core.resource.TikaResource /tika (text/calendar)
I've been poking at slf4j; thought that's what tika 2x uses. So far, haven't been able to get a peep out of it. :-/ Need to go a'googlin'

OTOH, with `mail_debug = on`, I see these,

...
2022-05-23 20:49:37 indexer-worker([email protected])<dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw>: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Submitted (requests left=1)
2022-05-23 20:49:37 indexer-worker([email protected])<dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw>: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Waiting for request to finish
2022-05-23 20:49:37 indexer-worker([email protected])<dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw>: Debug: http-client: queue http://127.0.0.1:9998: Connection to peer 127.0.0.1:9998 claimed request [Req1: PUT http://127.0.0.1:9998/tika/]
2022-05-23 20:49:37 indexer-worker([email protected])<dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw>: Debug: http-client: conn 127.0.0.1:9998 [1]: Claimed request [Req1: PUT http://127.0.0.1:9998/tika/]
2022-05-23 20:49:37 indexer-worker([email protected])<dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw>: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Sent header
2022-05-23 20:49:37 indexer-worker([email protected])<dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw>: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Send more (sent 5562, buffered=5570)
2022-05-23 20:49:37 indexer-worker([email protected])<dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw>: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Waiting for request to finish
2022-05-23 20:49:37 indexer-worker([email protected])<dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw>: ...

which looks promising. But, so far, body Search exec, from within TBird, is not returning anything that I know is in that PDF. Which is the 'problem' I'm trying to log in order to debug ...
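One way to narrow this down might be to take Dovecot out of the picture and send the PDF to Tika by hand, mirroring the PUT seen in the debug lines above. What follows is only a rough sketch, not anything Dovecot itself does: the URL is copied from the log, while the function names (`build_request`, `extract_text`, `contains_term`) and the `Accept: text/plain` header are my own choices for illustration. It assumes the Tika server answers a PUT to `/tika` with the extracted text.

```python
import urllib.request

# Endpoint copied from the http-client debug lines; adjust if yours differs.
TIKA_URL = "http://127.0.0.1:9998/tika/"


def build_request(data: bytes, url: str = TIKA_URL) -> urllib.request.Request:
    """Build a PUT request carrying the raw attachment bytes (hypothetical helper)."""
    return urllib.request.Request(
        url,
        data=data,
        method="PUT",
        # Ask Tika for plain text; this header is an assumption, not taken from the log.
        headers={"Accept": "text/plain"},
    )


def extract_text(path: str, url: str = TIKA_URL, timeout: float = 30.0) -> str:
    """Send the file at `path` to the Tika server and return whatever text comes back."""
    with open(path, "rb") as f:
        data = f.read()
    with urllib.request.urlopen(build_request(data, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def contains_term(text: str, term: str) -> bool:
    """Case-insensitive check for a search term in the extracted text."""
    return term.casefold() in text.casefold()
```

Calling `contains_term(extract_text("some.pdf"), "a word you expect")` (file name and term are placeholders) should show whether Tika returns the expected text at all. If it does, the gap is more likely on the indexing or search side; if it comes back empty, the extraction itself is the place to look.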
