[ 
https://issues.apache.org/jira/browse/SOLR-3623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419680#comment-13419680
 ] 

Robert Muir commented on SOLR-3623:
-----------------------------------

Sorry... lemme try to explain in more detail:

{quote}
can you be specific as to what you mean by "the morfologik jars" ? because 
evidently i don't have a clue what that means and i missunderstood what you 
were including in that list originally (i thought you ment all of the 
anlysis-extra libs that weren't ICU).
{quote}

{noformat}
WEB-INF/lib/lucene-analyzers-morfologik-4.0-SNAPSHOT.jar <-- this is the lucene 
integration code (analyzer, tokenfilter)
WEB-INF/lib/morfologik-fsa-1.5.3.jar <-- these 3 jars are dependencies of the 
above
WEB-INF/lib/morfologik-polish-1.5.3.jar
WEB-INF/lib/morfologik-stemming-1.5.3.jar
{noformat}

But this does not good for solr users: because the factory 
(MorfologikFilterFactory.java) is in apache-solr-analysis-extras.jar. 
Furthermore, I think having this situation (where these files are in the war, 
but the factory as a plugin) causes classloader hell.

So with the factory in this contrib module, the jar files should really be 
going in contrib/analysis-extras/lucene-libs as part of the packaging process 
just like the other dependencies this contrib module has, otherwise we should 
move the factory to core (see below)

{quote}
are you saying it is good or bad that lucene-analyzers-smartcn is not currently 
in solr core? (ie: in your opinion, should SmartChinese*Factory move into 
solr/core ?)
{quote}

In my opinion, it would be nice because we could have a text_zh configured in 
the example that indexes chinese as words. Currently to do this, you have to 
deal with this huge hassle that is this crazy analysis-extras contrib which is 
a big barrier for indexing Chinese text.

But thats just my opinion, i hate the contrib in general because I think its a 
pain to use. The reason it exists was because I initially wanted to integrate 
smartchinese with solr but there were concerns about it increasing the size of 
the .war file since the smart chinese jar is 3MB. So I created this contrib and 
added factories for any analyzers that didnt have factories just as a way of at 
least providing some help to make them usable. Just FYI: the solr.war is near 
20MB now.

Still, as it is, at least its some way to provide factories for these analyzers 
versus having none before.

{quote}
again: what exactly do you mean by "wrong place" ?
am i correct in understanding that you feel they should be in the war file, but 
that it is a mistake they are not included in solr.base.classpath at compile 
time?
{quote}

Under the current setup, the factory is in contrib/analysis-extras. the 
contrib/analysis-extras build logic puts these dependencies into 
contrib/analysis-extras' classpath so the tests will pass.

If we want to move the factories to core, then we have to adjust the solr core 
classpath to then include the jar files instead.

{quote}
a quick glance at that reveals other things in that classpath that shouldnt be 
in there like analyzers-uima.jar (which, should instead be configured in the 
uima contrib's classpath only)
{quote}

Here is contrib/uima/build.xml:
{noformat}
  <path id="classpath">
    <pathelement path="${analyzers-uima.jar}"/>
    <path refid="solr.base.classpath"/>
  </path>
{noformat}

So its useless to have analyzers-uima in the solr core classpath, because in 
the current packaging solr core code should not be depending on this jar. And 
contrib/uima already adds this itself. 

                
> analysis-extras lucene libraries are redundenly packaged (in war and in 
> lucene-libs)
> ------------------------------------------------------------------------------------
>
>                 Key: SOLR-3623
>                 URL: https://issues.apache.org/jira/browse/SOLR-3623
>             Project: Solr
>          Issue Type: Bug
>          Components: Build
>            Reporter: Lance Norskog
>            Assignee: Hoss Man
>            Priority: Minor
>             Fix For: 4.0, 5.0
>
>
> Various dependencies for contrib/analysis-extras are packaged 
> contrib/analysis-extras/lucene-libs (along with instructions in 
> contrib/analysis-extras/README.txt that users need to include them 
> explicitly) even though these jars are already hardcoded into the solr war 
> file.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to