Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-24 Thread Arthur Reutenauer
On Sat, Nov 24, 2018 at 10:59:49AM +, Philip Taylor wrote: > i. e., break between any “ng” in the middle of a word. > >Not convinced, Arthur.  Some would write (e.g.,) 北京人 (Běijīngrén) as Ah, you’re right, I knew I should have looked more closely. >

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-24 Thread Philip Taylor
Arthur Reutenauer wrote: i. e., break between any “ng” in the middle of a word. Not convinced, Arthur.  Some would write (e.g.,) 北京人 (Běijīngrén) as one word, not two (cf. recent discussions on Pinyin in Language Log).  I would view the

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-24 Thread Arthur Reutenauer
On Sat, Nov 24, 2018 at 11:02:48AM +0100, Werner LEMBERG wrote: >To be > serious: If patgen produces those patterns, I think they are *really* > necessary. Do you still have the source file somewhere? Best,

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-24 Thread Werner LEMBERG
>> Please elaborate. > > I think the patterns of the form 1nV and 1rV, for V any vowel, are > useless because all the breaks they specify are already covered by > the V1n and V1r patterns, and that all the 1gV could be replaced by > just > > n1g > n2g. > > i. e., break between

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-24 Thread Arthur Reutenauer
On Fri, Nov 23, 2018 at 11:32:11PM +0100, Mojca Miklavec wrote: > I still find it useful to have a level of abstraction, *in particular* > when the rules are really simple. I agree, and I think they should be expressed in terms of context-free grammar, for example for Turkish > vowels = %w{a â

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-24 Thread Arthur Reutenauer
On Fri, Nov 23, 2018 at 08:57:21PM +0100, Werner LEMBERG wrote: > Please elaborate. I think the patterns of the form 1nV and 1rV, for V any vowel, are useless because all the breaks they specify are already covered by the V1n and V1r patterns, and that all the 1gV could be replaced by just

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-23 Thread Werner LEMBERG
>> > But I would create a simple script in any programming language >> > (lua, ruby, python, ...) and generate two pattern files out of >> > it. >> >> Not necessary – the stuff is so simple, and the number of syllables >> is closed which means there won't be any changes except bug fixes. >> A

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-23 Thread Mojca Miklavec
On Thu, 22 Nov 2018 at 09:44, Werner LEMBERG wrote: > > > But I would create a simple script in any programming language (lua, > > ruby, python, ...) and generate two pattern files out of it. > > Not necessary – the stuff is so simple, and the number of syllables is > closed which means there

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-23 Thread Werner LEMBERG
> Speaking of bugfixes, I am actually wondering about some patterns in > the file. Please elaborate. Werner

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-22 Thread Arthur Reutenauer
On Thu, Nov 22, 2018 at 09:43:57AM +0100, Werner LEMBERG wrote: > Not necessary – the stuff is so simple, and the number of syllables is > closed which means there won't be any changes except bug fixes. A > simple search and replace did the job in a few minutes; see attached > file. Thanks,

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-22 Thread Werner LEMBERG
>> (1) Another file. >> >> This solution I rather dislike. > > This is what I would go for. OK. > But I would create a simple script in any programming language (lua, > ruby, python, ...) and generate two pattern files out of it. Not necessary – the stuff is so simple, and the number

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-21 Thread Arthur Reutenauer
>> (1) Another file. >> >> This solution I rather dislike. > > This is what I would go for. > > But I would create a simple script in any programming language (lua, > ruby, python, ...) and generate two pattern files out of it. As Mojca said, without the shadow of a doubt. Just use a

Re: [tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-21 Thread Mojca Miklavec
Dear Werner, On Wed, 21 Nov 2018 at 20:32, Werner LEMBERG wrote: > > Folks, > > I think it would be nice to extend `hyph-zh-latn-pinyin.tex' by > covering Chinese syllables with tone marks – it's trivial to extend, > say, pattern > > a1b > > with > > ā1b > á1b > ǎ1b > à1b > > and ditto

[tex-hyphen] extending `hyph-zh-latn-pinyin.tex'

2018-11-21 Thread Werner LEMBERG
Folks, I think it would be nice to extend `hyph-zh-latn-pinyin.tex' by covering Chinese syllables with tone marks – it's trivial to extend, say, pattern a1b with ā1b á1b ǎ1b à1b and ditto for all other patterns. However, such an extended file would only be usable by XeTeX and