[jira] [Created] (LUCENE-8772) [nori] A word that is registered in advance, but the words are not separated and recognized as 'UNKNOWN'
YOO JEONGIN created LUCENE-8772: --- Summary: [nori] A word that is registered in advance, but the words are not separated and recognized as 'UNKNOWN' Key: LUCENE-8772 URL: https://issues.apache.org/jira/browse/LUCENE-8772 Project: Lucene - Core Issue Type: Bug Components: modules/analysis Affects Versions: 8.0, 7.7.1, 7.7, 7.6, 7.5 Reporter: YOO JEONGIN Attachments: image-2019-04-19-11-32-56-310.png In case of 'nori', if there is no word starting from the left, 'UNKNOWN' is analyzed even if there is a word already registered in the middle. So here is the question. Does nori analyze only on the left side and do not analyze from the right side? Could this be solved? ex) input => 갊수학 Condition dictionary registered : 수학 dictionary Unregistered : 갊 result => 갊수학 !image-2019-04-19-11-32-56-310.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8772) [nori] A word that is registered in advance, but the words are not separated and recognized as 'UNKNOWN'
[ https://issues.apache.org/jira/browse/LUCENE-8772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8772: Description: hello, In case of 'nori', if there is no word starting from the left, 'UNKNOWN' is analyzed even if there is a word already registered in the middle. So here is the question. Does nori analyze only on the left side and do not analyze from the right side? Could this be solved? ex) input => 갊수학 Condition dictionary registered : 수학 dictionary Unregistered : 갊 result => 갊수학 !image-2019-04-19-11-32-56-310.png! was: In case of 'nori', if there is no word starting from the left, 'UNKNOWN' is analyzed even if there is a word already registered in the middle. So here is the question. Does nori analyze only on the left side and do not analyze from the right side? Could this be solved? ex) input => 갊수학 Condition dictionary registered : 수학 dictionary Unregistered : 갊 result => 갊수학 !image-2019-04-19-11-32-56-310.png! > [nori] A word that is registered in advance, but the words are not separated > and recognized as 'UNKNOWN' > - > > Key: LUCENE-8772 > URL: https://issues.apache.org/jira/browse/LUCENE-8772 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5, 7.6, 7.7, 7.7.1, 8.0 >Reporter: YOO JEONGIN >Priority: Major > Attachments: image-2019-04-19-11-32-56-310.png > > > hello, > In case of 'nori', if there is no word starting from the left, 'UNKNOWN' is > analyzed even if there is a word already registered in the middle. > So here is the question. > Does nori analyze only on the left side and do not analyze from the right > side? > Could this be solved? > > ex) > input => 갊수학 > Condition > dictionary registered : 수학 > dictionary Unregistered : 갊 > result => 갊수학 > !image-2019-04-19-11-32-56-310.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Created] (LUCENE-8603) There seems to be a typo in the user dictionary constant definition part
YOO JEONGIN created LUCENE-8603: --- Summary: There seems to be a typo in the user dictionary constant definition part Key: LUCENE-8603 URL: https://issues.apache.org/jira/browse/LUCENE-8603 Project: Lucene - Core Issue Type: Bug Components: modules/analysis Affects Versions: 7.5 Reporter: YOO JEONGIN Attachments: image-2018-12-12-13-51-48-067.png Hello, It seems that you found a typo while changing the dictionary version of nori to the latest version. It should be changed as shown below. Of course, I think. Please review. Thank you. !image-2018-12-12-13-51-48-067.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8603) There seems to be a typo in the user dictionary constant definition part
[ https://issues.apache.org/jira/browse/LUCENE-8603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8603: Description: Hello, It seems that you found a typo while changing the dictionary version of nori to the latest version. I think it should be changed as shown below. Please review. Thank you. !image-2018-12-12-13-51-48-067.png! was: Hello, It seems that you found a typo while changing the dictionary version of nori to the latest version. It should be changed as shown below. Of course, I think. Please review. Thank you. !image-2018-12-12-13-51-48-067.png! > There seems to be a typo in the user dictionary constant definition part > > > Key: LUCENE-8603 > URL: https://issues.apache.org/jira/browse/LUCENE-8603 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Attachments: image-2018-12-12-13-51-48-067.png > > > Hello, > It seems that you found a typo while changing the dictionary version of nori > to the latest version. > I think it should be changed as shown below. > Please review. > Thank you. > > !image-2018-12-12-13-51-48-067.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8570) Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori'
[ https://issues.apache.org/jira/browse/LUCENE-8570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8570: Affects Version/s: (was: 7.5) 7.6 > Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori' > --- > > Key: LUCENE-8570 > URL: https://issues.apache.org/jira/browse/LUCENE-8570 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Fix For: 7.6, master (8.0) > > Attachments: LUCENE-8570.patch, image-2018-11-21-15-45-57-644.png, > image-2018-11-21-15-46-28-114.png, image-2018-11-21-15-46-45-081.png, > image-2018-11-22-11-36-09-190.png, image-2018-11-22-11-36-27-916.png > > > 안녕하세요 > 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. > 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. > "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. > 고맙습니다. > !image-2018-11-21-15-45-57-644.png! > !image-2018-11-21-15-46-28-114.png! > !image-2018-11-21-15-46-45-081.png! > !image-2018-11-22-11-36-27-916.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8603) There seems to be a typo in the user dictionary constant definition part
[ https://issues.apache.org/jira/browse/LUCENE-8603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8603: Affects Version/s: (was: 7.5) 7.6 > There seems to be a typo in the user dictionary constant definition part > > > Key: LUCENE-8603 > URL: https://issues.apache.org/jira/browse/LUCENE-8603 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.6 >Reporter: YOO JEONGIN >Priority: Major > Attachments: image-2018-12-12-13-51-48-067.png > > > Hello, > It seems that you found a typo while changing the dictionary version of nori > to the latest version. > I think it should be changed as shown below. > Please review. > Thank you. > > !image-2018-12-12-13-51-48-067.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8603) There seems to be a typo in the user dictionary constant definition part
[ https://issues.apache.org/jira/browse/LUCENE-8603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8603: Affects Version/s: (was: 7.6) 7.5 > There seems to be a typo in the user dictionary constant definition part > > > Key: LUCENE-8603 > URL: https://issues.apache.org/jira/browse/LUCENE-8603 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Attachments: image-2018-12-12-13-51-48-067.png > > > Hello, > It seems that you found a typo while changing the dictionary version of nori > to the latest version. > I think it should be changed as shown below. > Please review. > Thank you. > > !image-2018-12-12-13-51-48-067.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8570) Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori'
[ https://issues.apache.org/jira/browse/LUCENE-8570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8570: Affects Version/s: (was: 7.6) 7.5 > Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori' > --- > > Key: LUCENE-8570 > URL: https://issues.apache.org/jira/browse/LUCENE-8570 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Fix For: 7.6, master (8.0) > > Attachments: LUCENE-8570.patch, image-2018-11-21-15-45-57-644.png, > image-2018-11-21-15-46-28-114.png, image-2018-11-21-15-46-45-081.png, > image-2018-11-22-11-36-09-190.png, image-2018-11-22-11-36-27-916.png > > > 안녕하세요 > 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. > 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. > "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. > 고맙습니다. > !image-2018-11-21-15-45-57-644.png! > !image-2018-11-21-15-46-28-114.png! > !image-2018-11-21-15-46-45-081.png! > !image-2018-11-22-11-36-27-916.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-8603) There seems to be a typo in the user dictionary constant definition part
[ https://issues.apache.org/jira/browse/LUCENE-8603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16722636#comment-16722636 ] YOO JEONGIN commented on LUCENE-8603: - thanks you, [~jim.ferenczi] > There seems to be a typo in the user dictionary constant definition part > > > Key: LUCENE-8603 > URL: https://issues.apache.org/jira/browse/LUCENE-8603 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Fix For: master (8.0), 7.7 > > Attachments: image-2018-12-12-13-51-48-067.png > > > Hello, > It seems that you found a typo while changing the dictionary version of nori > to the latest version. > I think it should be changed as shown below. > Please review. > Thank you. > > !image-2018-12-12-13-51-48-067.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Created] (LUCENE-8570) Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori'
YOO JEONGIN created LUCENE-8570: --- Summary: Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori' Key: LUCENE-8570 URL: https://issues.apache.org/jira/browse/LUCENE-8570 Project: Lucene - Core Issue Type: Bug Components: modules/analysis Affects Versions: 7.5 Reporter: YOO JEONGIN Attachments: image-2018-11-21-15-45-57-644.png, image-2018-11-21-15-46-28-114.png, image-2018-11-21-15-46-45-081.png Hello I have a question about applying synonyms to korean morphological analyzer 'Nori'. If you perform a synonym filter after tokenizing with 'nori', there is no response from the analyzer as shown below. I want to know if this is a bug, not in the original function, or if I made the wrong setting. If you know the correct way to apply a synonym filter in "Nori," I'd like to ask you for your advice. Thank you. !image-2018-11-21-15-45-57-644.png! !image-2018-11-21-15-46-28-114.png! !image-2018-11-21-15-46-45-081.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8570) Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori'
[ https://issues.apache.org/jira/browse/LUCENE-8570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8570: Description: 안녕하세요 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. 고맙습니다. !image-2018-11-21-15-45-57-644.png! !image-2018-11-21-15-46-28-114.png! !image-2018-11-21-15-46-45-081.png! !image-2018-11-22-11-36-27-916.png! was: 안녕하세요 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. 고맙습니다. !image-2018-11-21-15-45-57-644.png! !image-2018-11-21-15-46-28-114.png! !image-2018-11-21-15-46-45-081.png! !image-2018-11-22-11-36-09-190.png! > Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori' > --- > > Key: LUCENE-8570 > URL: https://issues.apache.org/jira/browse/LUCENE-8570 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Attachments: image-2018-11-21-15-45-57-644.png, > image-2018-11-21-15-46-28-114.png, image-2018-11-21-15-46-45-081.png, > image-2018-11-22-11-36-09-190.png, image-2018-11-22-11-36-27-916.png > > > 안녕하세요 > 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. > 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. > "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. > 고맙습니다. > !image-2018-11-21-15-45-57-644.png! > !image-2018-11-21-15-46-28-114.png! > !image-2018-11-21-15-46-45-081.png! > !image-2018-11-22-11-36-27-916.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8570) Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori'
[ https://issues.apache.org/jira/browse/LUCENE-8570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8570: Attachment: image-2018-11-22-11-36-27-916.png > Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori' > --- > > Key: LUCENE-8570 > URL: https://issues.apache.org/jira/browse/LUCENE-8570 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Attachments: image-2018-11-21-15-45-57-644.png, > image-2018-11-21-15-46-28-114.png, image-2018-11-21-15-46-45-081.png, > image-2018-11-22-11-36-09-190.png, image-2018-11-22-11-36-27-916.png > > > 안녕하세요 > 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. > 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. > "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. > 고맙습니다. > !image-2018-11-21-15-45-57-644.png! > !image-2018-11-21-15-46-28-114.png! > !image-2018-11-21-15-46-45-081.png! > !image-2018-11-22-11-36-27-916.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8570) Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori'
[ https://issues.apache.org/jira/browse/LUCENE-8570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8570: Description: 안녕하세요 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. 고맙습니다. !image-2018-11-21-15-45-57-644.png! !image-2018-11-21-15-46-28-114.png! !image-2018-11-21-15-46-45-081.png! !image-2018-11-22-11-36-09-190.png! was: Hello I have a question about applying synonyms to korean morphological analyzer 'Nori'. If you perform a synonym filter after tokenizing with 'nori', there is no response from the analyzer as shown below. I want to know if this is a bug, not in the original function, or if I made the wrong setting. If you know the correct way to apply a synonym filter in "Nori," I'd like to ask you for your advice. Thank you. !image-2018-11-21-15-45-57-644.png! !image-2018-11-21-15-46-28-114.png! !image-2018-11-21-15-46-45-081.png! > Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori' > --- > > Key: LUCENE-8570 > URL: https://issues.apache.org/jira/browse/LUCENE-8570 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Attachments: image-2018-11-21-15-45-57-644.png, > image-2018-11-21-15-46-28-114.png, image-2018-11-21-15-46-45-081.png, > image-2018-11-22-11-36-09-190.png, image-2018-11-22-11-36-27-916.png > > > 안녕하세요 > 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. > 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. > "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. > 고맙습니다. > !image-2018-11-21-15-45-57-644.png! > !image-2018-11-21-15-46-28-114.png! > !image-2018-11-21-15-46-45-081.png! > !image-2018-11-22-11-36-09-190.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-8570) Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori'
[ https://issues.apache.org/jira/browse/LUCENE-8570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated LUCENE-8570: Attachment: image-2018-11-22-11-36-09-190.png > Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori' > --- > > Key: LUCENE-8570 > URL: https://issues.apache.org/jira/browse/LUCENE-8570 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Attachments: image-2018-11-21-15-45-57-644.png, > image-2018-11-21-15-46-28-114.png, image-2018-11-21-15-46-45-081.png, > image-2018-11-22-11-36-09-190.png, image-2018-11-22-11-36-27-916.png > > > 안녕하세요 > 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. > 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. > "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. > 고맙습니다. > !image-2018-11-21-15-45-57-644.png! > !image-2018-11-21-15-46-28-114.png! > !image-2018-11-21-15-46-45-081.png! > !image-2018-11-22-11-36-09-190.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-8570) Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori'
[ https://issues.apache.org/jira/browse/LUCENE-8570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16696302#comment-16696302 ] YOO JEONGIN commented on LUCENE-8570: - Thank you [~jim.ferenczi] I applied it as a patch and it was done well. Thank you very much for your quick patch. > Issue that the synonym filter is not executed in the KOREAN analyzer 'Nori' > --- > > Key: LUCENE-8570 > URL: https://issues.apache.org/jira/browse/LUCENE-8570 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5 >Reporter: YOO JEONGIN >Priority: Major > Attachments: LUCENE-8570.patch, image-2018-11-21-15-45-57-644.png, > image-2018-11-21-15-46-28-114.png, image-2018-11-21-15-46-45-081.png, > image-2018-11-22-11-36-09-190.png, image-2018-11-22-11-36-27-916.png > > > 안녕하세요 > 저는 한국형 형태 분석입니다. 'Nori'를 사용하여 적용 가능한 질문에 답하십시오. > 아직까지는 필터가 작동하지 않아 응답이 없습니다. 이 버그는 본래의 기능과 다를 수 있습니다. > "Nori"를 적용 할 수있는 적절한 방법을 선택하십시오. > 고맙습니다. > !image-2018-11-21-15-45-57-644.png! > !image-2018-11-21-15-46-28-114.png! > !image-2018-11-21-15-46-45-081.png! > !image-2018-11-22-11-36-27-916.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Created] (LUCENE-8578) Can I do a lot of analysis on one field at the time of indexing?
YOO JEONGIN created LUCENE-8578: --- Summary: Can I do a lot of analysis on one field at the time of indexing? Key: LUCENE-8578 URL: https://issues.apache.org/jira/browse/LUCENE-8578 Project: Lucene - Core Issue Type: Improvement Reporter: YOO JEONGIN Hello I have a question about index schemas. 1) Can I do various analysis on one field? For example, you can analyze the 'title' field with multiple tokenizers, and merge the analysis into a single field. 2) You can collect multiple fields in one field using 'copyField' function. However, several fields have different data attributes (eg, category fields, text fields, etc.) _) At this time, I would like to analyze each field differently. Do you have these features in version 7.5? Is there any kind of shortcut to do these similar functions? Thank you for your advice. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (SOLR-12655) Add Korean analyzer JAR file (NORI) and schema.xml example to Solr
[ https://issues.apache.org/jira/browse/SOLR-12655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated SOLR-12655: --- Attachment: image-2018-09-05-17-42-09-983.png > Add Korean analyzer JAR file (NORI) and schema.xml example to Solr > -- > > Key: SOLR-12655 > URL: https://issues.apache.org/jira/browse/SOLR-12655 > Project: Solr > Issue Type: New Feature > Security Level: Public(Default Security Level. Issues are Public) > Components: Build, Schema and Analysis >Affects Versions: 7.4 >Reporter: Uwe Schindler >Assignee: Uwe Schindler >Priority: Major > Fix For: master (8.0), 7.5 > > Attachments: SOLR-12655.patch, image-2018-09-05-17-42-09-983.png, > screenshot-1.png > > > In Lucene 7.4 we added the NORI analyzer for Korean. In contrast to Kuromoji, > the JAR file is missing in the distribution (the analyzers-kuromoji is part > of main solr distribution). We should also add an updated/new "text_ko" field > in the default schema. > See also SOLR-12255 about the documentation. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-12655) Add Korean analyzer JAR file (NORI) and schema.xml example to Solr
[ https://issues.apache.org/jira/browse/SOLR-12655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16604130#comment-16604130 ] YOO JEONGIN commented on SOLR-12655: hello, I do not know if I can post a question here, but I will post it carefully to get help. I am using the mecab-ko-dic-2.0.1-20150920 version of the dictionaries. I applied this dictionary to nori. Unfortunately, an error has occurred. As you read this question, if you have a simple understanding of the issues and fixes, I would appreciate it if you let me know. Thank you. error !image-2018-09-05-17-42-09-983.png! > Add Korean analyzer JAR file (NORI) and schema.xml example to Solr > -- > > Key: SOLR-12655 > URL: https://issues.apache.org/jira/browse/SOLR-12655 > Project: Solr > Issue Type: New Feature > Security Level: Public(Default Security Level. Issues are Public) > Components: Build, Schema and Analysis >Affects Versions: 7.4 >Reporter: Uwe Schindler >Assignee: Uwe Schindler >Priority: Major > Fix For: master (8.0), 7.5 > > Attachments: SOLR-12655.patch, screenshot-1.png > > > In Lucene 7.4 we added the NORI analyzer for Korean. In contrast to Kuromoji, > the JAR file is missing in the distribution (the analyzers-kuromoji is part > of main solr distribution). We should also add an updated/new "text_ko" field > in the default schema. > See also SOLR-12255 about the documentation. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-12655) Add Korean analyzer JAR file (NORI) and schema.xml example to Solr
[ https://issues.apache.org/jira/browse/SOLR-12655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16604314#comment-16604314 ] YOO JEONGIN commented on SOLR-12655: [~jim.ferenczi] Answer Thank you very much. We have added many additional keywords for our domain at mecab-ko-dic-2.0-1-20150920. We also adjusted the leftid, rightid, and cost values to match our domain. So, if we change to the 2.0.3 dictionary, we have to recalibrate our adjusted values back to 2.0.3. So we asked to use nori using 2.0.1 that we are using. And during the test, mecab-ko-dic-2.1.1-20180720 also caused the same error. Let's go through the guide again. Thank you so much for your quick reply. > Add Korean analyzer JAR file (NORI) and schema.xml example to Solr > -- > > Key: SOLR-12655 > URL: https://issues.apache.org/jira/browse/SOLR-12655 > Project: Solr > Issue Type: New Feature > Security Level: Public(Default Security Level. Issues are Public) > Components: Build, Schema and Analysis >Affects Versions: 7.4 >Reporter: Uwe Schindler >Assignee: Uwe Schindler >Priority: Major > Fix For: master (8.0), 7.5 > > Attachments: SOLR-12655.patch, image-2018-09-05-17-42-09-983.png, > screenshot-1.png > > > In Lucene 7.4 we added the NORI analyzer for Korean. In contrast to Kuromoji, > the JAR file is missing in the distribution (the analyzers-kuromoji is part > of main solr distribution). We should also add an updated/new "text_ko" field > in the default schema. > See also SOLR-12255 about the documentation. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (SOLR-12655) Add Korean analyzer JAR file (NORI) and schema.xml example to Solr
[ https://issues.apache.org/jira/browse/SOLR-12655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated SOLR-12655: --- Attachment: image-2018-09-06-09-42-10-381.png > Add Korean analyzer JAR file (NORI) and schema.xml example to Solr > -- > > Key: SOLR-12655 > URL: https://issues.apache.org/jira/browse/SOLR-12655 > Project: Solr > Issue Type: New Feature > Security Level: Public(Default Security Level. Issues are Public) > Components: Build, Schema and Analysis >Affects Versions: 7.4 >Reporter: Uwe Schindler >Assignee: Uwe Schindler >Priority: Major > Fix For: master (8.0), 7.5 > > Attachments: SOLR-12655.patch, image-2018-09-05-17-42-09-983.png, > image-2018-09-06-09-42-10-381.png, screenshot-1.png > > > In Lucene 7.4 we added the NORI analyzer for Korean. In contrast to Kuromoji, > the JAR file is missing in the distribution (the analyzers-kuromoji is part > of main solr distribution). We should also add an updated/new "text_ko" field > in the default schema. > See also SOLR-12255 about the documentation. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-12655) Add Korean analyzer JAR file (NORI) and schema.xml example to Solr
[ https://issues.apache.org/jira/browse/SOLR-12655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16605109#comment-16605109 ] YOO JEONGIN commented on SOLR-12655: [~jim.ferenczi], hello we proceeded according to the guide, but another error occurred and we ask for help. the following error has occurred. Which part should I fix? If it is time, I would like you to help. !image-2018-09-06-09-42-10-381.png! > Add Korean analyzer JAR file (NORI) and schema.xml example to Solr > -- > > Key: SOLR-12655 > URL: https://issues.apache.org/jira/browse/SOLR-12655 > Project: Solr > Issue Type: New Feature > Security Level: Public(Default Security Level. Issues are Public) > Components: Build, Schema and Analysis >Affects Versions: 7.4 >Reporter: Uwe Schindler >Assignee: Uwe Schindler >Priority: Major > Fix For: master (8.0), 7.5 > > Attachments: SOLR-12655.patch, image-2018-09-05-17-42-09-983.png, > image-2018-09-06-09-42-10-381.png, screenshot-1.png > > > In Lucene 7.4 we added the NORI analyzer for Korean. In contrast to Kuromoji, > the JAR file is missing in the distribution (the analyzers-kuromoji is part > of main solr distribution). We should also add an updated/new "text_ko" field > in the default schema. > See also SOLR-12255 about the documentation. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Issue Comment Deleted] (SOLR-12655) Add Korean analyzer JAR file (NORI) and schema.xml example to Solr
[ https://issues.apache.org/jira/browse/SOLR-12655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] YOO JEONGIN updated SOLR-12655: --- Comment: was deleted (was: [~jim.ferenczi], hello we proceeded according to the guide, but another error occurred and we ask for help. the following error has occurred. Which part should I fix? If it is time, I would like you to help. !image-2018-09-06-09-42-10-381.png!) > Add Korean analyzer JAR file (NORI) and schema.xml example to Solr > -- > > Key: SOLR-12655 > URL: https://issues.apache.org/jira/browse/SOLR-12655 > Project: Solr > Issue Type: New Feature > Security Level: Public(Default Security Level. Issues are Public) > Components: Build, Schema and Analysis >Affects Versions: 7.4 >Reporter: Uwe Schindler >Assignee: Uwe Schindler >Priority: Major > Fix For: master (8.0), 7.5 > > Attachments: SOLR-12655.patch, image-2018-09-05-17-42-09-983.png, > image-2018-09-06-09-42-10-381.png, screenshot-1.png > > > In Lucene 7.4 we added the NORI analyzer for Korean. In contrast to Kuromoji, > the JAR file is missing in the distribution (the analyzers-kuromoji is part > of main solr distribution). We should also add an updated/new "text_ko" field > in the default schema. > See also SOLR-12255 about the documentation. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-8578) Can I do a lot of analysis on one field at the time of indexing?
[ https://issues.apache.org/jira/browse/LUCENE-8578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16704065#comment-16704065 ] YOO JEONGIN commented on LUCENE-8578: - [~jpountz] , Thank you. I will ask you through the channel you provided. Please acknowledge. Thank you. > Can I do a lot of analysis on one field at the time of indexing? > > > Key: LUCENE-8578 > URL: https://issues.apache.org/jira/browse/LUCENE-8578 > Project: Lucene - Core > Issue Type: Improvement >Reporter: YOO JEONGIN >Priority: Major > > Hello > I have a question about index schemas. > 1) Can I do various analysis on one field? > For example, you can analyze the 'title' field with multiple tokenizers, and > merge the analysis into a single field. > 2) You can collect multiple fields in one field using 'copyField' function. > However, several fields have different data attributes (eg, category fields, > text fields, etc.) _) > At this time, I would like to analyze each field differently. > Do you have these features in version 7.5? Is there any kind of shortcut to > do these similar functions? > Thank you for your advice. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-8772) [nori] A word that is registered in advance, but the words are not separated and recognized as 'UNKNOWN'
[ https://issues.apache.org/jira/browse/LUCENE-8772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16822831#comment-16822831 ] YOO JEONGIN commented on LUCENE-8772: - hello, [~jim.ferenczi] thank you for the reply. Even if the cost increases, I think that the words in advance should be changed to be recognizable. I know which part I need to fix, but I do not know how to fix it. Could you tell me the revision code? > [nori] A word that is registered in advance, but the words are not separated > and recognized as 'UNKNOWN' > - > > Key: LUCENE-8772 > URL: https://issues.apache.org/jira/browse/LUCENE-8772 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.5, 7.6, 7.7, 7.7.1, 8.0 >Reporter: YOO JEONGIN >Priority: Major > Attachments: image-2019-04-19-11-32-56-310.png > > > hello, > In case of 'nori', if there is no word starting from the left, 'UNKNOWN' is > analyzed even if there is a word already registered in the middle. > So here is the question. > Does nori analyze only on the left side and do not analyze from the right > side? > Could this be solved? > > ex) > input => 갊수학 > Condition > dictionary registered : 수학 > dictionary Unregistered : 갊 > result => 갊수학 > !image-2019-04-19-11-32-56-310.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Created] (LUCENE-8693) nori > special(symbol) characters issue
YOO JEONGIN created LUCENE-8693: --- Summary: nori > special(symbol) characters issue Key: LUCENE-8693 URL: https://issues.apache.org/jira/browse/LUCENE-8693 Project: Lucene - Core Issue Type: Bug Components: modules/analysis Affects Versions: 7.6 Reporter: YOO JEONGIN Hi I'm using the "nori" analyzer. Whether it's an error or an intentional question. All special characters are filtered. Special characters stored in the dictionary are also filtered. How do I print special characters? thanks. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-8693) nori > special(symbol) characters issue
[ https://issues.apache.org/jira/browse/LUCENE-8693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16767877#comment-16767877 ] YOO JEONGIN commented on LUCENE-8693: - hello, [~jim.ferenczi] For example 1.1 0.01% CND0-4CZ12-D324F-FEW1F i...@gmail.com 1,000,000 4·3 ... This is for accurate retrieval of data. thanks. > nori > special(symbol) characters issue > --- > > Key: LUCENE-8693 > URL: https://issues.apache.org/jira/browse/LUCENE-8693 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis >Affects Versions: 7.6 >Reporter: YOO JEONGIN >Priority: Major > > Hi > I'm using the "nori" analyzer. > Whether it's an error or an intentional question. > All special characters are filtered. > Special characters stored in the dictionary are also filtered. > How do I print special characters? > thanks. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org