Basic sentence parsing with the regex highlighter fragmenter

2010-01-05 Thread Caleb Land
this to work, and I always end up with this problem. For example, I've tried: \w[^.!?]{0,200}[.!?] Which seems like it should include the ending punctuation, but it doesn't, so I think I'm missing something. Does anybody know a regex that works? -- Caleb Land

Re: Basic sentence parsing with the regex highlighter fragmenter

2010-01-05 Thread Caleb Land
PM, Caleb Land caleb.l...@gmail.com wrote: Hello, I'm using Solr 1.4, and I'm trying to get the regex fragmenter to parse basic sentences, and I'm running into a problem. I'm using the default regex specified in the example solr configuration: [-\w ,/\n\']{20,200} But I am using a larger

Re: Basic sentence parsing with the regex highlighter fragmenter

2010-01-06 Thread Caleb Land
shot (which is what WordDelimiterFactory gives you, options permitting). So I think you probably want one of the other analyzers Have you tried any other analyzers? StandardAnalyzer might be more friendly HTH Erick On Tue, Jan 5, 2010 at 5:18 PM, Caleb Land caleb.l...@gmail.com

Re: Basic sentence parsing with the regex highlighter fragmenter

2010-01-07 Thread Caleb Land
+ a space off on the client side, and leaving the sentence terminator-less. -- Caleb Land