As mentioned, document is a source code. As you know all below statments
are equal:
A = foo() {
A=foo(){
A= foo(){
...With standard whitespace analyzer in action statements wanted to match can be on one to five terms in this case. If spacing is definite, I could go either a phrase search or regexep. Any suggestions for this case? On Sat, Feb 13, 2016 at 1:34 PM Jack Krupansky <[email protected]> wrote: > Obviously you wouldn't need to do a regex for simply terms like foo and bar > - just use simple terms and quoted phrase to match "foo bar". If you really > do need to do complex pattern regexes and match across adjacent terms, your > best bet is to keep a copy of the source text in a separate string (not > tokenized text) field and then you can do a complex regex that spans terms > (and only do that if normal span queries don't do what you need.) > > What does your typical cross-term regex actually look like? > > > -- Jack Krupansky > > On Sat, Feb 13, 2016 at 1:25 PM, Uwe Schindler <[email protected]> wrote: > > > Hi, > > > > That's very easy to explain: Regexp queries only work on terms, you > > already said it in your introduction. There is no phrase query in Lucene > > that accepts regular expressions. > > > > Uwe > > > > ----- > > Uwe Schindler > > H.-H.-Meier-Allee 63, D-28213 Bremen > > http://www.thetaphi.de > > eMail: [email protected] > > > > > -----Original Message----- > > > From: Kudrettin Güleryüz [mailto:[email protected]] > > > Sent: Saturday, February 13, 2016 7:14 PM > > > To: [email protected] > > > Subject: Spaces in regular expressions > > > > > > Hello, > > > > > > I am using standard whitespace analyzer to index a source code document > > > using Lucene 5. > > > > > > I understand that a document with content foo bar would have only two > > > terms: foo and bar. When I search for "foo bar" it normally matches > the > > > document. Similarly a regexp query /foo/ or /bar/ also matches the > > > document. > > > > > > Can you help me understand why doesn't a regexp query like /foo bar/ > > > doesn't match the document? > > > > > > Thank you, > > > Kudret > > > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: [email protected] > > For additional commands, e-mail: [email protected] > > > > >
