https://bugs.documentfoundation.org/show_bug.cgi?id=103528

--- Comment #8 from V Stuart Foote <[email protected]> ---
Created attachment 128382
  --> https://bugs.documentfoundation.org/attachment.cgi?id=128382&action=edit
OOo era design document for the Chinese language conversion tool

(In reply to Kumāra from comment #6)
> (In reply to V Stuart Foote from comment #4)
> > Also, what OS and Desktop are you using--and which build of LibreOffice?
> 
> I believe the issue is unrelated to these. I thought "unspecified" is the
> right thing to specify, but come to think of it "Inherited from OOo" is more
> accurate. Sorry about that.
>

Because the project's code is adapted to function on each OS, knowing the OS
and Desktop environment (where applicable) helps us to confirm and reproduce
issues and, importantly, to identify where and when in the source things changed.

> 
> I'm not a coder, but I wonder if having a look at Bug 46182 might help to
> find a solution to this.
>

Yes, it looks like "Inherited from OOo" is correct. The word lists provided for
Chinese in stc_char.dic and stc_word.dic have received little adjustment since
the OOo era. One of the original design documents from 2004, describing how the
tool functions, is attached.

Line 1176 of stc_char.dic holds the pán entries for 盘 (U+76D8) and 槃
(U+69C3).
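
For anyone checking the characters against the dictionary source, the code
points quoted here can be confirmed with a few lines of Python. This is only a
convenience sketch; the helper name code_point is made up for illustration and
is not part of LibreOffice.

```python
def code_point(ch: str) -> str:
    """Return the Unicode code point of a single character as 'U+XXXX'."""
    return f"U+{ord(ch):04X}"


# Characters discussed in this comment.
for ch in "盘槃涅":
    print(ch, code_point(ch))
```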

What is unclear in the source is the relation between the single-character
conversion and the bound form/word listing. The syntax of the character and
word lists is also a little unclear.

niè 涅 (U+6D85), or more correctly the word nièpán 涅槃, has no entry in either
the character list or the word list--so IIUC only the single-character
replacement occurs. The fix may be as simple as adding the literary and
liturgical terms to stc_word.dic with matching simplified and traditional
values, as "place holders", to ensure that single-character substitution does
not occur. Of course, there would be an upper limit to what the word table
could hold.
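
The "place holder" idea can be sketched in Python as follows. This is only an
illustration under stated assumptions, not the actual i18npool implementation:
it assumes the converter tries the word table first (longest match) and falls
back to the character table, and it uses small in-memory tables rather than
the real .dic syntax. The 槃 -> 盘 mapping and the names CHAR_TABLE, WORD_TABLE
and convert are hypothetical.

```python
# Hypothetical in-memory tables; the real .dic file syntax is NOT modelled here.
CHAR_TABLE = {"槃": "盘"}        # assumed single-character mapping (illustrative)
WORD_TABLE = {"涅槃": "涅槃"}    # "place holder": identical value on both sides


def convert(text, word_table, char_table):
    """Convert text: longest word-table match first, else per-character lookup."""
    max_len = max((len(w) for w in word_table), default=1)
    out = []
    i = 0
    while i < len(text):
        # Try the longest candidate word starting at position i.
        for n in range(min(max_len, len(text) - i), 0, -1):
            chunk = text[i:i + n]
            if chunk in word_table:
                out.append(word_table[chunk])
                i += n
                break
        else:
            # No word matched: fall back to single-character substitution.
            out.append(char_table.get(text[i], text[i]))
            i += 1
    return "".join(out)


print(convert("涅槃", {}, CHAR_TABLE))          # no word entry
print(convert("涅槃", WORD_TABLE, CHAR_TABLE))  # with place-holder entry
```

Under these assumptions the word entry shields 涅槃 from the character-level
substitution, while a standalone 槃 is still converted as before.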

> Can we mark this as NEW?

Certainly.

=-ref-=
http://opengrok.libreoffice.org/xref/core/i18npool/source/textconversion/genconv_dict.cxx
http://opengrok.libreoffice.org/xref/core/i18npool/inc/textconversion.hxx
http://opengrok.libreoffice.org/xref/core/i18npool/source/textconversion/data/stc_char.dic
http://opengrok.libreoffice.org/xref/core/i18npool/source/textconversion/data/stc_word.dic
