[ 
https://issues.apache.org/jira/browse/TIKA-1053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13545906#comment-13545906
 ] 

Uwe Schindler commented on TIKA-1053:
-------------------------------------

Apache Solr also disabled the CLASS file parser in the Tika extraction plugin
for the same reasons. Lucene uses ASM 4.1 for some checks in the build process
(the so-called "forbidden-apis" checker,
http://blog.thetaphi.de/2012/07/default-locales-default-charsets-and.html),
and we cannot separate the classpaths while building Solr, so we removed the
ASM dependency. Anyone who wants to index CLASS files can later add the
dependency to the isolated Solr classloader (the solr/lib folder for plugins).

The same applies to other, more specialized parsers such as NetCDF and MP3:
they were left out so as not to bloat Solr's WAR file with generally useless
parsers (Solr should index documents, not binary files with little metadata).
One can always add those parsers later.
                
> Upgrade Tika Parsers to use ASM 4.x
> -----------------------------------
>
>                 Key: TIKA-1053
>                 URL: https://issues.apache.org/jira/browse/TIKA-1053
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.2
>            Reporter: Vincent Massol
>
> Right now Tika 1.2 uses ASM 3.1.
> However, this is causing some issues for us on the XWiki project since we
> also bundle other frameworks that use a more recent version of ASM (we use
> pegdown, which uses parboiled, which pulls in ASM 4.0).
> The problem is that ASM 3.x and 4.0 are not compatible...
> See http://jira.xwiki.org/browse/XE-1269 for more details about the issue 
> we're facing.
> Thanks for considering upgrading to ASM 4.x :)

