Re: DO NOT REPLY [Bug 27901] - TextCharIterator.remove() does not work properly

2004-03-25 Thread Christian Z.
Am Do, den 25.03.2004 schrieb Glen Mazza um 22:38:
> Ich bin confused--ist chz ([EMAIL PROTECTED])--Christian
> Geisert oder anderer Christian?  The bugzilla entry
> lists chz as being "Christian Z", so I'm not sure whom
> I'm speaking with!  

Christian Ziesemer
chz

I adjusted that.

> If the language has no space (Japanese, or Thai I
> guess) then this particular portion, of whitespace
> removal is not relevant I would guess--I think
> dictionaries are needed for hyphenation, not space
> removal, no?

I don't know the whole context the FOText class is used, so if it's only
used for space removal -- then of course you've nothing to do in Thai
and therefore limited use for changing. :-)

Christian Z.



Re: DO NOT REPLY [Bug 27901] - TextCharIterator.remove() does not work properly

2004-03-25 Thread John Austin
On Thu, 2004-03-25 at 19:08, Glen Mazza wrote:
> Ich bin confused--ist chz ([EMAIL PROTECTED])--Christian
> Geisert oder anderer Christian?  The bugzilla entry
> lists chz as being "Christian Z", so I'm not sure whom
> I'm speaking with!  

So we shouldn't all be running around with multiple e-mail
identities ?

My excuse is, I used that e-mail address years ago when I opened
my first Bugzilla account.
 

-- 
John Austin <[EMAIL PROTECTED]>


Re: DO NOT REPLY [Bug 27901] - TextCharIterator.remove() does not work properly

2004-03-25 Thread Glen Mazza
Ich bin confused--ist chz ([EMAIL PROTECTED])--Christian
Geisert oder anderer Christian?  The bugzilla entry
lists chz as being "Christian Z", so I'm not sure whom
I'm speaking with!  

Glen

--- chz <[EMAIL PROTECTED]> wrote:
> 
> The BreakIterator origins from the ICU project and
> they state that they
> even handle Thai correctly, which has no word
> separator like the space
> character and needs therefore a dictionary based
> solution.
> 

That business logic is I believe already handled in
another class (FOBlock? I believe--the one calling the
current iterators), so these iterators are for actual
space removal in the Western sense.  (Of course, what
you mention may need tuning/fixing in FOBlock as
well.)

If the language has no space (Japanese, or Thai I
guess) then this particular portion, of whitespace
removal is not relevant I would guess--I think
dictionaries are needed for hyphenation, not space
removal, no?

Glen



Re: DO NOT REPLY [Bug 27901] - TextCharIterator.remove() does not work properly

2004-03-25 Thread chz

When I look into FOText class again, the word boundary analysis seems
not very elegant to me. Perhaps Java's java.text.BreakIterator could do
the same thing?
The BreakIterator origins from the ICU project and they state that they
even handle Thai correctly, which has no word separator like the space
character and needs therefore a dictionary based solution.

Regards,
Christian