[
https://issues.apache.org/jira/browse/SOLR-9250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15349678#comment-15349678
]
Tim Nolan commented on SOLR-9250:
---------------------------------
The field configuration:
{code:XML|title=sg/conf/schema.xml}
<field name="contact-name" type="phrase" indexed="true" stored="false"/>
{code}
If I make the field stored, the returned result is shown below:
{code}
{
  "numFound": 1,
  "start": 0,
  "docs": [
    {
      "id": "CO_912054",
      "type": "CONTACT",
      "contact-id": 912054,
      "contact-name": "Tùûüÿ€àâæçéèêëïîôœm UTF8"
    }
  ]
}
{code}
One thing that might be a contributing factor is that € is a 3-byte UTF-8
character. Maybe that affects the parser?
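A quick way to check the byte widths is sketched below in Python. It is unrelated to Solr's own code; the helper names {{utf8_widths}} and {{widest}} are made up for illustration, and the sample is normalized to NFC on the assumption that the accented letters are precomposed. It only confirms the encoding width of each character and says nothing about whether the query parser is actually affected.

```python
import unicodedata

# Sample value taken from the report; "\u20ac" is the euro sign.
SAMPLE = "Tùûüÿ\u20acàâæçéèêëïîôœm"


def utf8_widths(text):
    """Return (character, code point, UTF-8 byte count) per character.

    The text is normalized to NFC first so that accented letters are
    counted as single precomposed characters (an assumption about the data).
    """
    normalized = unicodedata.normalize("NFC", text)
    return [
        (ch, f"U+{ord(ch):04X}", len(ch.encode("utf-8")))
        for ch in normalized
    ]


def widest(text):
    """Return the (character, code point) pairs with the largest byte count."""
    widths = utf8_widths(text)
    top = max(n for _, _, n in widths)
    return [(ch, cp) for ch, cp, n in widths if n == top]


if __name__ == "__main__":
    for ch, cp, n in utf8_widths(SAMPLE):
        print(ch, cp, n)
    print("widest:", widest(SAMPLE))
```

In this sample the accented Latin letters each take 2 bytes, so the euro sign is the only character that needs 3.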
> Search breaks with EU symbol € and wildcard *
> ---------------------------------------------
>
> Key: SOLR-9250
> URL: https://issues.apache.org/jira/browse/SOLR-9250
> Project: Solr
> Issue Type: Bug
> Security Level: Public (Default Security Level. Issues are Public)
> Components: Server
> Affects Versions: 5.3.1
> Reporter: Tim Nolan
>
> While testing searches containing non-ASCII (UTF-8) characters, which
> generally worked, we noticed one combination that fails. With the data
> {{Tùûüÿ€àâæçéèêëïîôœm}}, searching for the exact term worked, but appending
> a wildcard (e.g. {{Tùûüÿ€àâæçéèêëïîôœm*}}) returned no hits. Placing the
> wildcard before the {{€}} symbol worked (i.e. {{Tùûüÿ*}}).
> The logs for these three queries:
> {noformat:title=Full text without wildcard, hit=1}
> 2016-06-25 13:16:34.361 [qtp237852351-21] INFO
> org.apache.solr.core.SolrCore.Request – [core-name] webapp=/solr
> path=/select
> params={q=Tùûüÿ€àâæçéèêëïîôœm&indent=true&fq=type:CONTACT&rows=12&wt=json&_=1466860594348}
> hits=1 status=0 QTime=0
> {noformat}
> {noformat:title=Full text with wildcard, hit=0}
> 2016-06-25 13:16:41.172 [qtp237852351-16] INFO
> org.apache.solr.core.SolrCore.Request – [core-name] webapp=/solr
> path=/select
> params={q=Tùûüÿ€àâæçéèêëïîôœm*&indent=true&fq=type:CONTACT&rows=12&wt=json&_=1466860601160}
> hits=0 status=0 QTime=0
> {noformat}
> {noformat:title=Partial text before € with wildcard, hit=1}
> 2016-06-25 13:16:52.135 [qtp237852351-18] INFO
> org.apache.solr.core.SolrCore.Request – [core-name] webapp=/solr
> path=/select
> params={q=Tùûüÿ*&indent=true&fq=type:CONTACT&rows=12&wt=json&_=1466860612125}
> hits=1 status=0 QTime=2
> {noformat}