[ 
https://issues.apache.org/jira/browse/LUCENE-1528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12667538#action_12667538
 ] 

Michael Busch commented on LUCENE-1528:
---------------------------------------

Looks good, Luis!

I was just wondering if you can do something like the following to avoid 
defining the whitespace chars in two places:
{noformat}
| <#_WHITESPACE: ( " " | "\t" | "\n" | "\r") >
| <#_TERM_START_CHAR: ( ~( <_WHITESPACE> | [ "+", "-", "!", "(", ")", ":", "^",
                           "[", "]", "\"", "{", "}", "~", "*", "?", "\\" ])
                       | <_ESCAPED_CHAR> ) >
{noformat}

This does not compile... is there another way to achieve this in javacc?
If not, it's not a big deal and I can commit this patch as is.

> Add support for Ideographic Space to the queryparser - also know as fullwith 
> space and wide-space
> -------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-1528
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1528
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: QueryParser
>    Affects Versions: 2.4.1
>            Reporter: Luis Alves
>            Assignee: Michael Busch
>            Priority: Minor
>             Fix For: 2.4.1
>
>         Attachments: lucene_wide_space_v1_src.patch
>
>   Original Estimate: 4h
>  Remaining Estimate: 4h
>
> The Ideographic Space is a space character that is as wide as a normal CJK 
> character cell.
> It is also known as wide-space or fullwith space.This type of space is used 
> in CJK languages.
> This patch adds support for the wide space, making the queryparser component 
> more friendly
> to queries that contain CJK text.
> Reference:
> 'http://en.wikipedia.org/wiki/Space_(punctuation)' - see Table of spaces, 
> char U+3000.
> I also added a new testcase that fails before the patch.
> After the patch is applied all junits pass.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to