Hi,
Thanks for the hint.
Yesterday, late in the evening I have also found IBM's ftp server with the
former ICU releases. But, I had not got the time to search for the original
source files.
Now, I found them - also consulting markmail to get hints for the ICU version.
There are part of the ICU version 2.2, released 2002-08-15 found at [3].
This ICU release is completely under ICU license.
Best regards, Oliver.
[3] ftp://ftp.software.ibm.com/software/globalization/icu/2.2/
On 02.12.2011 01:42, Rob Weir wrote:
On Thu, Dec 1, 2011 at 12:03 PM, Oliver-Rainer Wittmann
<[email protected]> wrote:
Hi,
I need some help here.
It is about the following data files in folder
i18npool/source/breakiterator/data/
-- char_in.txt
-- count_word*.txt
-- dict_word*.txt
-- edit_word*.txt
-- line.txt
-- sent.txt
(A) I did not find the original sources of these data files on [2].
Does somebody know the original source for these data files?
Maybe try searching the old list archives:
http://openoffice.markmail.org/
When I typed in some file names, like dict_word.txt I see activity
going back to 2002 in the ancient CVS. At that point it looks like it
was in the ICU component, or at least its placement in the tree
suggests that. ICU came from IBM, as you know.
Perhaps it would line up more with an earlier ICU version, like in the
2.x series:
ftp://ftp.software.ibm.com/software/globalization/icu/
(B) The data files count_word*.txt, dict_word*.txt and edit_word*.txt do not
differ much. I assume that they are adapted from the original source for
certain usages and languages.
Can someone confirm this?
(C) I have found files at [3] which correspond to these data files. The
found files are named char.txt, line.txt, sent.txt and word.txt. Thus, it
looks like that the original source of these data files is ICU. This would
mean that the license for these files seems to be the ICU license.
Can someone confirm this?
Note: Eike Rathke stated in an posting made in June 2011 that these data
files are taken from ICU and had been adpated for OOo.
Thus again, can somebody help here?
Best regards, Oliver.
[3]
http://www.opensource.apple.com/source/ICU/ICU-400.39/icuSources/data/brkitr/
and
http://www.opensource.apple.com/source/ICU/ICU-400.42/icuSources/data/brkitr/
On 01.12.2011 14:48, Oliver-Rainer Wittmann wrote:
Hi,
looking at our IP clearance wiki page showed that there is an entry for
which I
was volunteering, but which get out of my focus. Now, it gets back to my
attention.
It is the issue regarding the license headers for the data files in module
i18npool - see [1].
Status update:
- Most data files are covered by Oracle's SGA
- The data files in folder i18npool/source/breakiterator/data/ which have
an IBM
copyright does not have a proper license header.
I will look at ICU [2] for an appropriate replacement.
[1] https://cwiki.apache.org/confluence/display/OOOUSERS/IP_Clearance
[2] http://site.icu-project.org/