Re: [l10n-dev] Breakiterator override probem

2008-11-14 Thread Karl Hong
Hi Alan, I have been assigned the issue 51772, Quotes in Hebrew workbreaking don't work during spellcheck, I will work with Writer team to fix it in next release. Thanks, Karl. On 2008年11月07日 02:48, Alan Yaniger wrote: Hi Karl, Thanks for you help and your patience. Your solution helps

Re: [l10n-dev] Breakiterator override probem

2008-11-07 Thread Alan Yaniger
Hi Karl, Thanks for you help and your patience. Your solution helps regarding work-breaking when I travel between words with Ctrl-Arrow Left and Ctrl-Arrow Right. The word LHZ is treated as one word in this respect. However, regarding spellchecking it is still split into two words at the

Re: [l10n-dev] Breakiterator override probem

2008-11-06 Thread Alan Yaniger
Hi Karl, Thanks for your help, but I'm still having difficulty. I've added the following at the end of data/dict_word.txt: $HebrewLetter = [\u05d0-\u05ea]; $DoubleQuote = \u0022; $HebrewLetter+ $DoubleQuote $HebrewLetter+; but it doesn't have any effect. If I have a word like LHZ,

Re: [l10n-dev] Breakiterator override probem

2008-11-06 Thread Karl Hong
Hi Alan, There is a dict_word_he.txt in the directory, it will be used for Hebrew text, you need to add the rule in this file. Thanks, Karl. On 2008年11月06日 11:15, Alan Yaniger wrote: Hi Karl, Thanks for your help, but I'm still having difficulty. I've added the following at the end of

Re: [l10n-dev] Breakiterator override probem

2008-11-06 Thread Alan Yaniger
Hi Karl, I've added mt rule to dict_word_he.txt, and rebuilt, but still no change. Is the syntax of my rule OK? Alan Karl Hong wrote: Hi Alan, There is a dict_word_he.txt in the directory, it will be used for Hebrew text, you need to add the rule in this file. Thanks, Karl. On

Re: [l10n-dev] Breakiterator override probem

2008-11-06 Thread Karl Hong
Hi Alan, You also need to remove Hebrew script from $ALetter, otherwise another general rule may take charge on Hebrew word. $Hebrew = [:Script = Hebrew:]; $DoubleQuote = \u0022; $Hebrew+ $DoubleQuote $Hebrew+; $ALetter = [\u0002 [:Alphabetic:] [:name= COMMERCIAL AT:] [:name= HEBREW

[l10n-dev] Breakiterator override probem

2008-11-05 Thread Alan Yaniger
Hi list-members, For Hebrew text, I would like to override the BreakIteratorImpl::endOfScript() function. I tried: - writing a Breakiterator_he class (with hxx and cxx files) , - I added it to the SLOFILES section of makefile.mk, - I added it to the instances array in registerservices.cxx -

Re: [l10n-dev] Breakiterator override probem

2008-11-05 Thread Eike Rathke
Hi Alan, On Wednesday, 2008-11-05 11:03:52 +0200, Alan Yaniger wrote: For Hebrew text, I would like to override the BreakIteratorImpl::endOfScript() function. I tried: - writing a Breakiterator_he class (with hxx and cxx files) , - I added it to the SLOFILES section of makefile.mk, - I

Re: [l10n-dev] Breakiterator override probem

2008-11-05 Thread Mathias Bauer
Hi Alan, Alan Yaniger wrote: Hi list-members, For Hebrew text, I would like to override the BreakIteratorImpl::endOfScript() function. I tried: - writing a Breakiterator_he class (with hxx and cxx files) , - I added it to the SLOFILES section of makefile.mk, - I added it to the

Re: [l10n-dev] Breakiterator override probem

2008-11-05 Thread Karl Hong
Hi Alan, ScriptType breakiterator is not controlled by language, but Unicode script type definition. It does not like character/word/sentence/line breakiterators, which can be customized by language, only one script type breakiterator for all languages. What would you like to do with

Re: [l10n-dev] Breakiterator override probem

2008-11-05 Thread Alan Yaniger
Hi Karl, I'm trying to address issue 51772. Single or double-quotes are used in Hebrew within a word to specify the sound j or acronyms, respectively. At present, they are considered as word breaks during spellchecking, because their script type is not COMPLEX, but LATIN. endOfScript()

Re: [l10n-dev] Breakiterator override probem

2008-11-05 Thread Karl Hong
Hi Eike, There is only one published breakiterator for application to call, it is implemented in breakiteratorImp.cxx, in which only mentioned 4 breaktierators call getLocaleSpecificBreakIterator to get language specific breakiterator. Logically ScriptType breakiterator could not be

Re: [l10n-dev] Breakiterator override probem

2008-11-05 Thread Karl Hong
Hi Alan, I would suggest you write a rule in data/dict_word.txt, something like hebrew_letter+quotation_markhebrew_letter+; it means a Hebrew word is one or more Hebrew letters, following by a quotation mark, and following by one or more Hebrew letters. for rule syntax, check ICU user guide