I am not a JavaCC expert either. Maybe it's a precedence problem.
Could you try:

| <EMAIL: <ALPHANUM> (("."|"-"|"_") <ALPHANUM>)+ "@" <ALPHANUM> (("."|"-") <ALPHANUM>)+ >
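
To illustrate the suspected precedence issue: in JavaCC regular expressions, as in java.util.regex, concatenation binds tighter than "|", so ("."|"-" <ALPHANUM>) groups as "." | ("-" <ALPHANUM>). The sketch below is only an analogy using java.util.regex, not the tokenizer itself. The class name, the [A-Za-z0-9]+ stand-in for <ALPHANUM>, and the sample address are assumptions for illustration, and the competing token rules of the real grammar are not modeled.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class EmailRuleDemo {
    // Assumed stand-in for <ALPHANUM>; the real definition is broader.
    static final String A = "[A-Za-z0-9]+";

    // Original rule: the alternatives are ".", "-", and "_" followed by ALPHANUM.
    static final Pattern ORIGINAL = Pattern.compile(
        A + "(\\.|-|_" + A + ")+@" + A + "(\\.|-" + A + ")+");

    // Suggested rule: the separators are grouped first, then followed by ALPHANUM.
    static final Pattern FIXED = Pattern.compile(
        A + "((\\.|-|_)" + A + ")+@" + A + "((\\.|-)" + A + ")+");

    // Returns the prefix of the input matched by the pattern, or "" if none.
    static String matchedPrefix(Pattern p, String input) {
        Matcher m = p.matcher(input);
        return m.lookingAt() ? m.group() : "";
    }

    public static void main(String[] args) {
        String input = "erik_h@example.com"; // hypothetical address
        System.out.println(matchedPrefix(ORIGINAL, input)); // erik_h@example.
        System.out.println(matchedPrefix(FIXED, input));    // erik_h@example.com
    }
}
```

With the original grouping the match stops right after the dot, which would leave "com" over as a separate token. With the extra parentheses the whole address is matched.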


Christoph



Erik Hatcher wrote:
I'm not JavaCC-savvy enough (yet), but it seems there is a flaw in the StandardTokenizer and its determination of e-mail addresses.

If I analyze "[EMAIL PROTECTED]", it splits into two tokens: "[EMAIL PROTECTED]" and "com". Shouldn't this rule:

// email addresses
| <EMAIL: <ALPHANUM> ("."|"-"|"_" <ALPHANUM>)+ "@" <ALPHANUM> ("."|"-" <ALPHANUM>)+ >


Be clever enough to keep the .com with it? Perhaps some other parsing is taking precedence?

Thanks,
    Erik


--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]



--
Dr. Christoph Goller       Tel.:   +49 89 203 45734
Detego Software GmbH       Mobile: +49 179 1128469
Keuslinstr. 13             Fax.:   +49 721 151516176
80798 München, Germany     Email:  [EMAIL PROTECTED]

