[jira] [Commented] (LUCENE-4956) the korean analyzer that has a korean morphological analyzer and dictionaries

Christian Moen (JIRA) Tue, 14 May 2013 02:15:20 -0700

    [ 
https://issues.apache.org/jira/browse/LUCENE-4956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13656919#comment-13656919
 ]


Christian Moen commented on LUCENE-4956:
----------------------------------------

Hello SooMyung,

Thanks for the above regarding field type.  The general approach we have taken 
in Lucene is to do the same analysis on both the index and query side.  For 
example, the Japanese analyzer also has functionality to do compound splitting, 
and we've discussed doing this on the index side only by default for field 
type {{text_ja}}, but we decided against it.

I've included your field type in the latest code I've checked in just now, but 
it's likely that we will change this in the future.

I'm wondering if you could help me with a few sample sentences that illustrate 
the various options {{KoreanFilter}} has.  I'd like to add some test cases for 
these to better understand the differences between them and to verify correct 
behaviour.  Test cases like these are also a useful way to document 
functionality in general.  Thanks for any help with this!
                
> the korean analyzer that has a korean morphological analyzer and dictionaries
> -----------------------------------------------------------------------------
>
>                 Key: LUCENE-4956
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4956
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>    Affects Versions: 4.2
>            Reporter: SooMyung Lee
>            Assignee: Christian Moen
>              Labels: newbie
>         Attachments: kr.analyzer.4x.tar
>
>
> The Korean language has specific characteristics. When developing a search 
> service with Lucene & Solr in Korean, there are some problems in searching 
> and indexing. The Korean analyzer solves these problems with a Korean 
> morphological analyzer. It consists of a Korean morphological analyzer, 
> dictionaries, a Korean tokenizer and a Korean filter. The Korean analyzer is 
> made for Lucene and Solr. If you develop a search service with Lucene in 
> Korean, it is the best idea to choose the Korean analyzer.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
