[jira] [Commented] (LUCENE-7836) Multiple Token Regex Search Not working

Michael McCandless (JIRA) Thu, 18 May 2017 03:44:48 -0700

    [ 
https://issues.apache.org/jira/browse/LUCENE-7836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16015561#comment-16015561
 ]


Michael McCandless commented on LUCENE-7836:
--------------------------------------------

I suspect the problem is that "Hello World" was indexed as two tokens, since 
you're indexing with whitespace tokenization. The regexp is matched against 
one token at a time, so a pattern that contains a space will never match any 
single token.

If you indexed "Hello World" as a single token then the regexp should match.
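
To make the point above concrete, here is a minimal sketch in plain Java. It 
does not use Lucene: java.util.regex stands in for Lucene's regexp matching, 
and the class and method names (TokenRegexSketch, whitespaceTokens, 
anyTokenMatches) are made up for illustration. The only assumption carried 
over from Lucene is that a regexp has to match an entire token.

```java
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Illustration only: java.util.regex is a stand-in for Lucene's own regexp
// syntax, and these helpers are hypothetical, not part of Lucene or OpenGrok.
public class TokenRegexSketch {

    // Split on runs of whitespace, roughly what whitespace tokenization does.
    static List<String> whitespaceTokens(String text) {
        return Arrays.asList(text.trim().split("\\s+"));
    }

    // True if the regex matches at least one token in full.
    static boolean anyTokenMatches(List<String> tokens, String regex) {
        Pattern pattern = Pattern.compile(regex);
        for (String token : tokens) {
            if (pattern.matcher(token).matches()) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        List<String> split = whitespaceTokens("Hello World");  // 2 tokens
        List<String> whole = Arrays.asList("Hello World");     // 1 token

        System.out.println(anyTokenMatches(split, "H[a-z]llo"));
        System.out.println(anyTokenMatches(split, "H[a-z]llo Wor[a-z]d"));
        System.out.println(anyTokenMatches(whole, "H[a-z]llo Wor[a-z]d"));
    }
}
```

With the text split into "Hello" and "World", the single-word pattern finds a 
token to match but the two-word pattern does not, because no token contains a 
space. With "Hello World" kept as one token, the two-word pattern matches.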

> Multiple Token Regex Search Not working
> ---------------------------------------
>
>                 Key: LUCENE-7836
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7836
>             Project: Lucene - Core
>          Issue Type: Bug
>         Environment: Linux(Centos)
>            Reporter: Parmeet Singh Sachdeva
>            Priority: Blocker
>
> I am able to search for a regex query like H[a-z]llo but I am not able to 
> search for a regex query like H[a-z]llo Wor[a-z]d even though I have "Hello 
> World" in my source tree. I am not able to search multi-word regex queries.
> I am using OpenGrok which is, in turn, using 
> org.apache.lucene.search.RegexpQuery class of Lucene 6.5.0 which extends 
> AutomatonQuery.
> I have indexed the data based on whitespace tokenization.



