https://bugs.documentfoundation.org/show_bug.cgi?id=151148

            Bug ID: 151148
           Summary: Finding KATAKANA which has voice consonant mark
                    returns incorrect results.
           Product: LibreOffice
           Version: 7.4.1.2 release
          Hardware: x86-64 (AMD64)
                OS: All
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: LibreOffice
          Assignee: [email protected]
          Reporter: [email protected]

Description:
The "voice consonant mark" means little dashes or circle which is put on some
KATAKANA characters: e.g. カ (KA) with the little dashes becomes ガ (GA), ハ (HA)
with a little circle becomes パ (PA).
Half-width KATAKANA treats such little dashes or circle ― U+FF9E and U+FF9F ―
as a single character. for example, "ガ" is a combination of two characters
(U+FF76, U+FF9E). In Full-width KATAKANA, KATAKANA character with a voice
consonant mark counts to 1 character, e.g. "ガ" (U+30AC).
Japanese human usually recognizes KATAKANA with such marks as one character,
even if a combination of 2 half-width characters.

If a finding string includes such KATAKANA with voice consonant mark, the
searching result is incorrect. This problem occurs, at least in Calc, Writer,
Draw and Impress.
The issue has reproduced since the commit
d6336e0b21eeece0e678a8768938c04fa120043f, and didn't before that commit.

Steps to Reproduce:
1. open the attachment with Writer.
2. open Find and Replace dialog and Uncheck "Match Character Width"
3. enter a KATAKANA string which contains voice consonant mark:
  Examination 1: enter "ガギグゲゴ" (U+30AC + U+30AE + U+3030B0 + U+30B2 + U+30B4)
or "ガギグゲゴ" (U+FF76 + U+FF9E + U+FF77 + U+FF9E + U+FF78 + U+FF9E + U+FF79 +
U+FF9E + U+FF7A+ U+FF9E)
  Examination 2: enter "ギグゲ" (U+30AE, U+3030B0, U+30B2) or "ギグゲ" (U+FF77 +
U+FF9E + U+FF78 + U+FF9E + U+FF79 + U+FF9E)
4. click Find Next.


Actual Results:
Examination 1: "ガギグゲゴ" (U+FF76 + U+FF9E + U+FF77 + U+FF9E + U+FF78 + 
U+FF9E +
U+FF79 + U+FF9E + U+FF7A+ U+FF9E) or "ガギグゲゴ01234" (U+30AC + U+30AE + U+3030B0 +
U+30B2 + U+30B4 + U+0030 + U+0031 + U+0032 + U+0033 + U+0034)
Examination 2: "ギグゲ" (U+FF77 + U+FF9E + U+FF78 + U+FF9E+ U+FF79 + U+FF9E),
"グゲ0123" (U+30B0 + U+30B2 + U+0030 + U+0031 + U+0032 + U+0033) or "グゲゴ012"
(U+30B0 + U+30B2 + U+30B4 + U+0030 + U+0031 + U+0032)


Expected Results:
Examnation 1: "ガギグゲゴ" (U+30AC + U+30AE + U+3030B0 + U+30B2 + U+30B4) or 
"ガギグゲゴ"
(U+FF76 + U+FF9E + U+FF77 + U+FF9E + U+FF78 + U+FF9E + U+FF79 + U+FF9E +
U+FF7A+ U+FF9E)
Examination 2: "ギグゲ" (U+30AE + U+30B0 + U+30B2) or "ギグゲ" (U+FF77 + U+FF9E +
U+FF78 + U+FF9E+ U+FF79 + U+FF9E)



Reproducible: Always


User Profile Reset: No



Additional Info:
Version: 7.4.1.2 / LibreOffice Community
Build ID: 40(Build:2)
CPU threads: 8; OS: Linux 5.19; UI render: default; VCL: gtk3
Locale: ja-JP (ja_JP.UTF-8); UI: en-US
7.4.1-2
Calc: threaded

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to