[
https://issues.apache.org/jira/browse/SOLR-13973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17190728#comment-17190728
]
Alexandre Rafalovitch commented on SOLR-13973:
----------------------------------------------
Sanity check on the links and numbers (I use both Drupal 8/9 and Solr, though
not currently together):
1) [https://www.drupal.org/project/apachesolr] is for Drupal 7 and before.
Those people will also continue using Solr 4 or whatever was the configuration
last updated to (2 years ago at the latest)
2) [https://www.drupal.org/project/search_api_solr] starting from v4 release
does support Drupal 8.8+/9 (only) and Solr versions more recent than Solr 6.4.
It was released in May 2020 and updated since. The current adoption is probably
still fairly low, but will accelerate next year, once previous release version
is no longer supported (notice said December).
Drupal was never known for chasing latest Solr version as they have their own
configuration that is designed for field definitions with wildcards and maybe
only recently (if at all) with managed-schema API manipulation. They can also
can keep using Solr 8 for another 5-6 years with Tika built in.
If Tika is removed from Solr (in version 9 the earliest), this will only affect
the choices of those setting up new Drupal installation and wanting new
features of Solr 9. At that point (say in 4 years), we can figure something out
for Solr 11. Most likely a variation on preconfigured Solr and Tika colocated
in a Docker container.
On the other hand, I honestly don't know much about solarium library directly.
Perhaps it is a serious issue there, though we have to look again at number of
active installations*percentage of those using /extract handler*percentage of
people able to run _latest_ Solr process but not a second (also Java) process.
So, to me, this sounds less like a -1, then as an awareness for a bit of an
extra education around that edge case. And, yes, awareness of the greater
community; something we really need to pay more attention to in general.
Of course, any improvement of workflow we can do between Solr and Tika, both
standalone, would be very good regardless of this particular use case.
> Deprecate Tika
> --------------
>
> Key: SOLR-13973
> URL: https://issues.apache.org/jira/browse/SOLR-13973
> Project: Solr
> Issue Type: Improvement
> Reporter: Ishan Chattopadhyaya
> Assignee: Ishan Chattopadhyaya
> Priority: Blocker
> Fix For: 8.7
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Solr's primary responsibility should be to focus on search and scalability.
> Having to deal with the problems (CVEs) of Velocity, Tika etc. can slow us
> down. I propose that we deprecate it going forward.
> Tika can be run outside Solr. Going forward, if someone wants to use these,
> it should be possible to bring them into third party packages and installed
> via package manager.
> Plan is to just to throw warnings in logs and add deprecation notes in
> reference guide for now. Removal can be done in 9.0.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]