[ https://issues.apache.org/jira/browse/JAMES-3719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Benoit Tellier closed JAMES-3719. --------------------------------- Resolution: Fixed > Reactify Tika calls > ------------------- > > Key: JAMES-3719 > URL: https://issues.apache.org/jira/browse/JAMES-3719 > Project: James Server > Issue Type: Improvement > Components: elasticsearch > Affects Versions: 3.7.0 > Reporter: Benoit Tellier > Assignee: Benoit Tellier > Priority: Major > Fix For: 3.8.0 > > Time Spent: 40m > Remaining Estimate: 0h > > We rely on blocking HTTP calls to extract textual content with Tika. > This means: > - Threads hangs around why we do the requests... > - We are blocking in a parrallel reactor thread (cassandra-app) which is > dramatic performance wise. > We can improve this matter of fact by using reactor-netty to query tika. > Caching layers need to be adapted to - guava is blocking. Caffeine library > can be a good candidate as a reactive caching library. > Also, we need to uncouple MIME parsing and content extraction: both are > currently tightly coupled; I suggest extracting a POJO representation of the > mail first, then extract content if need be, not do both at the same time. -- This message was sent by Atlassian Jira (v8.20.7#820007) --------------------------------------------------------------------- To unsubscribe, e-mail: server-dev-unsubscr...@james.apache.org For additional commands, e-mail: server-dev-h...@james.apache.org