[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-16 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #17 from Mike Kaganski  ---
(In reply to Eyal Rozenberg from comment #15)
> For now, LO should be able to let us force the use of any of fonts in the
> three language-groups which contain our glyphs of interest (e.g. the digits
> of a number). I agree that the language group is an artificial construct,
> but it is what LO associates a font right now, so either we allow setting a
> language group, or allow setting a language and have that auto-mapped to one
> of the groups (and thus also group fonts).

Assigning a *language* (not "language group"!) with any run of the text must be
enough to associate that text with the "language group", and thus to force
picking of glyphs from the respective font associated with that group.

What I would agree would be "assigning a language does not pick the right font"
bug.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-15 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #16 from Mike Kaganski  ---
(In reply to Eyal Rozenberg from comment #15)
> > This one should be WF
> 
> Do you mean you're suggesting this issue be marked WFM?

WFM stands for "WORKSFORME", a resolution that means "there was a reproducible
problem that OP described, in some version; later, it was obviously fixed, so
it is not reproducible anymore in newer versions, but we don't know which
commit was that - so instead of marking FIXED (reserved for cases where we know
exact commit), we use WORKSFORME".

WF stands for WONTFIX, which is for "there is an acknowledged problem around
this issue, but this proposal should not be implemented" (there may be
different reasons why).

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-15 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #15 from Eyal Rozenberg  ---
(In reply to Mike Kaganski from comment #14)
> setting the language, ... should be enough.

I would be fine with the ability to set the language independently of the
direction, but - what will map this setting to a change of font?

> or explicitly setting direction and font, 

Neither of these are relevant in themselves. The entanglement with the
direction setting is part of my problem here - I need to change the language
regardless of the direction.

> The problem that the "language group" tries to solve comes from not
> all fonts having all glyphs etc.

Yes, I assumed as much in comment #10.

> This one should be WF

Do you mean you're suggesting this issue be marked WFM?

> and overall, we definitely need better *concept* for
> handling of this complex issue - but the current state is based on (1) state
> of the art - fonts have imperfect coverage;

I wouldn't characterize it as "state of the art", because it's not something
that is expected to change, or progress, in a different direction. It's
perfectly ok for fonts to have partial coverage. (More on that below)


>  IMO, this could
> only improve *much later*, when (1) is improved greatly

You are proposing a paradigm change for distribution and design of fonts in
general, that is way beyond the scope of LO. Its merits can be debated - I
personally don't agree with your paradigm change - but it cannot be a factor in
short-to-medium-term engineering decisions. If there was 100% consensus that
this is where the world of fonts were going, then maybe you could argue against
addressing issues such as this one. But - with due respect - it's just your
opinion, or the opinion of some people (I've not heard this from others). So I
don't believe it should have bearing on this issue.

For now, LO should be able to let us force the use of any of fonts in the three
language-groups which contain our glyphs of interest (e.g. the digits of a
number). I agree that the language group is an artificial construct, but it is
what LO associates a font right now, so either we allow setting a language
group, or allow setting a language and have that auto-mapped to one of the
groups (and thus also group fonts).


---

Sidenote:

My idea for a long term alternative to the "language group" is one of two, both
involving multiple fonts:

1. A font preference list: To decide which font is used for a glyph, one
searches a list of decreasing preference, and the first font with that glyph
available is chosen.
2. A glyph set map: The Unicode plane / set of all characters is subdivided
into sets (e.g. represented as a list of ranges), each mapped to a font. Simple
division templates would be "all from font F1", "strongly-language-L glyhps
from F1, the rest from F2" and a few others.

In both of these, advanced users would be offered a dialog (pane) for
arbitrarily modifying the list/map.

This does not cover the issue of skewing the preferences for characters which
may be part of different Unicode text runs, and thus perceived as part of
sequences in different languages - like numbers, or punctuation marks. But the
specifics would take more thinking and are even farther beyond the scope of
this bug.

---

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-15 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #14 from Mike Kaganski  ---
There is nothing like "language group" property of character or paragraph. The
"language group" is just some artificial construct, very problematic itself,
which should be dropped completely at some point if possible, and not increased
in presence by all means. No, there's no need to add it here; setting the
language, or explicitly setting direction and font, should be enough. The
problem that the "language group" tries to solve comes from not all fonts
having all glyphs; and even though there already are *some* fonts with ~wide
coverage (of questionable quality for different scripts), they are still not
predominant, hence the styles include three fonts for respective groups,
allowing each style choose respective font with (supposedly) existing glyphs
whenever the script/language is matching that group (so you have a chimera
three-piece metafont, with specific glyphs coming from one of the three fonts).

This one should be WF; and overall, we definitely need better *concept* for
handling of this complex issue - but the current state is based on (1) state of
the art - fonts have imperfect coverage; (2) legacy (we must support existing
documents having those synthetic metafonts, both in our native formats, and in
external formats using the same concept). IMO, this could only improve *much
later*, when (1) is improved greatly (not something we can change).

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-08 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #13 from Eyal Rozenberg  ---
(In reply to Heiko Tietze from comment #12)

So now I'm confused about your opinion. You said that "using the language group
for layouting" bothers you, but then you wrote that "The language group is a
mechanism to bundle languages so that you don't have to assign RTL to both
Hebrew and Arab" <- but setting text to be RTL is part of "layouting" as you
described it. So does the very mechanism of language groups bother you?

Anyway, language grouping doesn't actually save you from marking anything as
RTL. I think. I always though it was mostly a simplifying mechanism for font
combination.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-08 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #12 from Heiko Tietze  ---
(In reply to Eyal Rozenberg from comment #11)
> Can you clarify what you meant by the word "layouting"?
Text runs either from left to right or the other way (and anything else that
belongs to the visible layout of the text). The language group is a mechanism
to bundle languages so that you don't have to assign RTL to both Hebrew and
Arab, for example.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-07 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #11 from Eyal Rozenberg  ---
I'll just make a clarification w.r.t. my tone.

When I said "is not tolerable", I didn't mean to suggest I am accusing
developers of having acted in an intolerable way. I meant to say that reason
would not tolerate an affirmation of such a behavior as the appropriate one.


On another note: Heiko, you said:

> What bothers me in general on the ticket is using the language group for 
> layouting.

Can you clarify what you meant by the word "layouting"?


The way things stand right now seem to be the result of a choice of
convenience, which often, or usually, works: If your document has, say, content
in both Arabic and English, then the LTR runs are assumed to be Spanish, with
the user wanting the font family they've chosen for 'Western' languages, as it
likely has full coverage of the Latin-1 character set with the glyphs the user
wants. And similarly, the RTL runs are assumed to be in Arabic, and the font to
use for these would be the one covering the Unicode range for Arabic, which the
user has likely chosen as its Complex scripts font.

We are seeing a "corner case" of these assumptions not holding: Characters
which are common to text in different language groups, which already undermines
the assumptions somewhat, and are direction neutral, which makes them
susceptible to be switched back and forth.

But more generally - the assumptions don't hold:

* A user may want/need, a more complex covering of the set of Unicode
characters by different fonts - even if all text is in the same language (e.g.
for characters like arrows, or numbers, or dingbats, or emojis). And the
different fonts the user has may have complex intersections requiring a more
involved logic for preferences.
* There may be multiple languages used within the same language group, with the
user needing different fonts for them. Obvious example: Hebrew and Arabic. I'm
guessing that maybe even CJK authors may want a different font for Japanese and
for Chinese, for example, even if many glyphs are shared between them.


So, the question is (or one of the questions is): Should this specific issue be
resolved by some kind of localized action, retaining the
assumption-of-convenience from above, or should it upending that assumption?

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-07 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

Heiko Tietze  changed:

   What|Removed |Added

 CC|libreoffice-ux-advise@lists |heiko.tietze@documentfounda
   |.freedesktop.org|tion.org
   Keywords|needsUXEval |needsDevAdvice

--- Comment #10 from Heiko Tietze  ---
(In reply to Eyal Rozenberg from comment #6)
> It's more than about a use-case, it's a matter of principle: 
> 
> * It does not make sense that setting a direction also sets the language.
> * It does not make sense, and is not tolerable, that changing the direction
> of a run of text changes its font.

What bothers me in general on the ticket is using the language group for
layouting. 
But I cannot judge on this topic since it affects basic development aspects.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-06 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

JO3EMC  changed:

   What|Removed |Added

   See Also|https://bugs.documentfounda |
   |tion.org/show_bug.cgi?id=14 |
   |4003|

--- Comment #9 from JO3EMC  ---
I'm sorry.
In Bug 144003, the automatic recognition of language groups does not seem to be
wrong.
It seemed that the handling of punctuation marks was just strange.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-06 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

JO3EMC  changed:

   What|Removed |Added

   See Also||https://bugs.documentfounda
   ||tion.org/show_bug.cgi?id=14
   ||4003

--- Comment #8 from JO3EMC  ---
After that, I noticed ...
The Bug 144003 issue in Japanese may be related to this issue.
In vertical writing (RTL), Japanese characters are automatically recognized in
the wrong language group when Western characters are followed by punctuation
marks.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-04-05 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #7 from JO3EMC  ---
As you may know, in the current Japanese language, LTR is the basis for
horizontal writing.
So, no matter what the automatic language group selection works, you usually
don't have to worry about mixed character directions.

Of course, sentences with a mixture of Western characters (mainly English ASCII
characters) and Japanese characters are common.
In such cases, it is natural that the Western characters are treated as Western
instead of Japanese.
The current automatic recognition of LibreOffice language groups seems to work
well in the Japanese environment.
So I don't think we'll often encounter cases where we have to manually change
language groups individually.

ASCII numerical characters are also automatically recognized as Western.
In Japanese, that is OK.

As discussed in Bug 146910 etc., there is some need to apply the same font to
both Western and CJK language groups, but that is not the same as wanting to
treat them all as the same language group.
It is a need to recognize them as different language groups and to be able to
easily apply the same font.
There are also many needs to apply different fonts to each.

So far, I've talked about the situation in Japanese that seems to be related to
this issue, but I haven't fully understood this issue.
I'm unfamiliar with CTL and RTL and can't figure out what's wrong and how you
want it to work.
Please pardon.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-03-30 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #6 from Eyal Rozenberg  ---
(In reply to Heiko Tietze from comment #5)
> What exactly is the use case / scenario then? Besides convenience.

I thought the title of the bug made it clear...

It's more than about a use-case, it's a matter of principle: 

* It does not make sense that setting a direction also sets the language.
* It does not make sense, and is not tolerable, that changing the direction of
a run of text changes its font.

In the attached document, the 12:35 should not appear in the CTL font. And at
the very least, it should be easy to prevent that from happening, and easy to
indicate it's in English rather than Hebrew (which would make it use the
Western language group font).

At the moment, it just can't be done: You can't say it's in English, and you
can't set its font to the Western languages group font. (You could change the
CTL font to the Western language font but that's a hack, not a solution.)


(I'll also say that it's not obvious what the font selection logic for
"None"-language text should be, but that also would be another bug.)

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-03-30 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #5 from Heiko Tietze  ---
(In reply to Eyal Rozenberg from comment #3)
> ...this bug is only about language+font selection

What exactly is the use case / scenario then? Besides convenience.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-03-30 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #4 from Eyal Rozenberg  ---
Moreover, even if manually changing the language group to "none" helped, that
wouldn't resolve the bug, because:

* People would not easily figure out that's what they need to do.
* No right-click menu UI for this.
* There are parity issues with MS Office for .doc and .docx document
importation.
* Autocorrect cannot be assumed to be applied by default, and is anyway
something optional, not to be relied on.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-03-30 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #3 from Eyal Rozenberg  ---
(In reply to Heiko Tietze from comment #2)
> The only issue for me is when I select the number/date - as it is RTL I
> cannot mark from left.

... but that would be a whole different bug page, about selection. That is
annoying, actually; would you open a separate bug about it? Anyway, to be
clear, this bug is only about language+font selection, and especially the font.

> But changing the language to None (we have a section
> in the status bar to quickly reach language options) does the trick.

I don't think so. When I did this, it set the language in all groups to "none",
but the font didn't change. Which means it probably didn't change the
language-group selection either.

> The language group is maybe only a virtual thing meaning just at the UI,
> haven't check the ODF. Although the idea to get rid of it was rejected in
> bug 146910 it was at least worth to discuss.

Ok. But - I'm not taking a position on that matter here.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-03-30 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

Heiko Tietze  changed:

   What|Removed |Added

   See Also||https://bugs.documentfounda
   ||tion.org/show_bug.cgi?id=14
   ||6928
   Severity|normal  |enhancement
 CC||frank...@goodhorse.idv.tw,
   ||jo3...@jarl.com,
   ||libreoffice-ux-advise@lists
   ||.freedesktop.org,
   ||naru...@gmail.com,
   ||shinji.en...@gmail.com

--- Comment #2 from Heiko Tietze  ---
The only issue for me is when I select the number/date - as it is RTL I cannot
mark from left. But changing the language to None (we have a section in the
status bar to quickly reach language options) does the trick.

The language group is maybe only a virtual thing meaning just at the UI,
haven't check the ODF. Although the idea to get rid of it was rejected in bug
146910 it was at least worth to discuss.

Possible solution to the number problem might be to add this to the AutoCorrect
options as "[ ] Use 'None' for language in case of numbers".

Wonder how CJK people deal with the problem.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 148257] Missing/unexposed ability to explicitly set the "language group" of a piece of text

2022-03-29 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=148257

--- Comment #1 from Eyal Rozenberg  ---
Created attachment 179195
  --> https://bugs.documentfoundation.org/attachment.cgi?id=179195=edit
Document illustrating the issue

This document merely illustrates the direction -> language group (-> language)
-> font automatic choice by LO.

-- 
You are receiving this mail because:
You are the assignee for the bug.