Package: unicode-data
Version: 10.0.0-3
Severity: wishlist

http://www.unicode.org/Public/ has a lot of data.

Data under emoji is included in the unicode-data package but data under
cldr is not.  It would be nice if the cldr data were also available.

Background:

I actually needed the emoji data and some parts of the cldr data under
the /usr/share/unicode directory to update the ibus program.

I understand that emoji is mentioned, but cldr is not, in
http://www.unicode.org/versions/Unicode10.0.0/

But http://www.unicode.org/ has a prominent link to
http://cldr.unicode.org/index so I would expect this data to be included
in, or coordinated with, this package.

Fedora seems to create another rpm
 https://github.com/fujiwarat/cldr-emoji-annotation
which appears to come from the Unicode cldr data, but only part of it.
That package installs part of the data from the zip file under
 /usr/share/unicode/cldr/common/annotations
 /usr/share/unicode/cldr/common/annotationsDerived

The data used is 
 http://www.unicode.org/Public/cldr/31.0.1/cldr-common-31.0.1.zip
 http://www.unicode.org/Public/cldr/31.0.1/core.zip
 (These seem to be the same file)
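
To illustrate the partial install described above, here is a minimal
sketch in Python.  It assumes the archive stores these files under
common/annotations/ and common/annotationsDerived/ relative to its root,
which I have not verified for every release.  The function name and the
destination argument are hypothetical, not taken from the Fedora package.

```python
import zipfile
from pathlib import Path

# Assumption: the archive keeps the annotation files under these two
# prefixes relative to the zip root.
WANTED_PREFIXES = ("common/annotations/", "common/annotationsDerived/")


def extract_annotations(zip_path, dest_root):
    """Extract only the annotation directories from a CLDR zip archive.

    Returns the sorted list of member names that were extracted.
    """
    dest_root = Path(dest_root)
    extracted = []
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            # Skip directory entries and anything outside the wanted trees.
            if name.endswith("/") or not name.startswith(WANTED_PREFIXES):
                continue
            archive.extract(name, dest_root)
            extracted.append(name)
    return sorted(extracted)
```

Calling extract_annotations("core.zip", "some/staging/dir/cldr") would
then leave only the two annotation trees under the staging directory,
which avoids shipping the rest of the archive.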

It would be nice if you also included or packaged these zip files from
 http://www.unicode.org/Public/cldr/31.0.1/
or a newer version.

Anyway, this data archive is huge.  Unless careful coordination is done,
we may end up with lots of duplicated data.

I am not quite sure how these should be packaged for Debian.  I guess
the unicode-data maintainer has a better idea, so I am filing this bug
report.

Osamu
-- System Information:
Debian Release: buster/sid
  APT prefers testing
  APT policy: (500, 'testing'), (500, 'stable'), (90, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 4.12.0-2-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), 
LANGUAGE=en_US:en (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

-- no debconf information
