[ 
https://issues.apache.org/jira/browse/LUCENE-4956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13807918#comment-13807918
 ] 

Benson Margulies edited comment on LUCENE-4956 at 10/29/13 12:34 PM:
---------------------------------------------------------------------

Something's funny here. On this page 
(http://www.kristalinfo.com/TestCollections/), the zip file has directories like

HANTEC-2.0/relevance_file/과학기술분야/
HANTEC-2.0/relevance_file/전체/

The first translates as 'Science and Technology' and the second as 'All'.

The code in the patch expects the word 'full' in latin-alphabet, no funny 
full-width, in the that intermediate directory. So I don't see how a code-page 
option to unzip got there. I'm suspecting that an 'mv' is in order.


was (Author: bmargulies):
Something's funny here. On this page 
(http://www.kristalinfo.com/TestCollections/), the zip file has directories like

HANTEC-2.0/relevance_file/과학기술분야/
HANTEC-2.0/relevance_file/전체/

The code in the patch expects the word 'full' in latin-alphabet, no funny 
full-width, in the that intermediate directory. So I don't see how a code-page 
option to unzip got there. I'm suspecting that an 'mv' is in order.

> the korean analyzer that has a korean morphological analyzer and dictionaries
> -----------------------------------------------------------------------------
>
>                 Key: LUCENE-4956
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4956
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>    Affects Versions: 4.2
>            Reporter: SooMyung Lee
>            Assignee: Christian Moen
>              Labels: newbie
>         Attachments: eval.patch, kr.analyzer.4x.tar, lucene-4956.patch, 
> lucene4956.patch, LUCENE-4956.patch
>
>
> Korean language has specific characteristic. When developing search service 
> with lucene & solr in korean, there are some problems in searching and 
> indexing. The korean analyer solved the problems with a korean morphological 
> anlyzer. It consists of a korean morphological analyzer, dictionaries, a 
> korean tokenizer and a korean filter. The korean anlyzer is made for lucene 
> and solr. If you develop a search service with lucene in korean, It is the 
> best idea to choose the korean analyzer.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to