Re: LUCENE-5791 and LUCENE-6672 (BasicOperations#determinize() performance)

2015-12-01 Thread Irfan Hamid
Hi Michael,

Is the functionality you're mentioning the same as the one pointed out by
David Causse in LUCENE-6672? If so, he is claiming that
maxDeterminizedStates is not respected by UTF32ToUTF8 and can thus cause a
problem. I'm looking at Lucene trunk to try and figure it out. However,
input from you would be much appreciated.

TIA,
Irfan.

On Tue, Dec 1, 2015 at 3:19 AM, Dawid Weiss wrote:

>
>> I think it would be interesting to explore an NFA implementation for
>> Lucene!
>>
>>
> It would be interesting and valuable to have an optimized non-DFA graph
> engine in general. I'm thinking of something like re2.
> https://github.com/google/re2
>
> Dawid
>
>


LUCENE-5791 and LUCENE-6672 (BasicOperations#determinize() performance)

2015-11-30 Thread Irfan Hamid
Lucene devs,

We are hitting performance problems when our customers issue pathological
wildcard queries. Searching the Lucene JIRA, I came across these two work
items (LUCENE-5791 and LUCENE-6672), and unfortunately it seems like there's
no easy way out. However, in LUCENE-6672 David Causse has a couple of
proposed solutions. I was wondering if either of those, or something
similar, was integrated into the codebase down the line?

If not, would the community be interested in a pull request if/when we fix
this in our fork and bake it in production for a while?

TIA,
Irfan.