Yes, please open a JIRA issue and submit your patches.

I wonder if there is anyway to deprecate functionality in a JFlex grammar? That is, is there anyway we can communicate to people that both will be supported through 2.9 and then the correct way will be supported in 3.x?

-Grant

On Nov 27, 2007, at 2:18 AM, Shai Erera wrote:

I understand it would change the behavior of existing search solutions, however the current behavior is just wrong. An ACRONYM cannot be ABC.DEF. If you look up acronym in Wikipedia, you find only examples of I.B.M. / U.S.A. like, or NATO, IBM, USA, but nothing of the form StandardAnalyzer currently
recognizes.

There are several ways to solve this change:
1. Create a new analyzer that fixes the problem - that way, applications that don't want to use it will not have to, if they feel ok with the current
behavior. However, for those who would like to get a correct behavior,
they'll be able to. This is not my favorite solution, but I think it would
be preferable than simply not fixing it.
2. Fix it in the new version (2.3) and specifically mention that in the release notes. Aren't there releases where applications need to re- build the
index because of fundamental changes?

Am I the only one who thinks that?

BTW, I changed the definition in the jflex file and recompiled using jflex
and it indeed solved the problem. It now recognizes www.abc.com. and
www.abc.com as hosts. I can attach the 'patch' files if you'd like to
compare.

On Nov 27, 2007 9:07 AM, Chris Hostetter <[EMAIL PROTECTED]> wrote:


: If you pass "www.abc.com", the output is (www.abc.com, 0,11,type=<HOST>)
: (which is correct in my opinion).
: However, if you pass "www.abc.com." (notice the extra '.' at the end),
the
: output is (wwwabccom,0,12,type=<ACRONYM>).

see also...

http://www.nabble.com/Inconsistent-StandardTokenizer-behaviour-tf596059.html#a1593383

http://www.nabble.com/Standard-Analyzer---Host-and-Acronym-tf3620533.html#a10109926

one hitch which potentially changing this now is that it would break
some searches in applications that have existing indexes built using
previous versions.



-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




--
Regards,

Shai Erera

--------------------------
Grant Ingersoll
http://lucene.grantingersoll.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to