[lingu-dev] Help needed - bulk extraction of words

Leif Lodahl Thu, 07 Feb 2008 14:19:33 -0800

Hi all,

The Danish project has been so fortunate to receive a bunch of articlesfrom a news magazine. These are odt files and we would like to extractthe words from these documents. We have programs for this purpose, butwe usually get donations one document at the time. This time we haveseveral thousand documents and I believe it would take about a year toload these documents one by one.


Do any of you have a program that can extract words from several documents ?

The words will be loaded into our workflow for linguistic processing andat the end be a part of the Danish spelling directory.


Thanks in advance.

--
Med venlig hilsen - best regards,

Leif Lodahl
Native-Language coordinator DA.OpenOffice.org
Mail: [EMAIL PROTECTED]
Blog: http://lodahl.blogspot.com/

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[lingu-dev] Help needed - bulk extraction of words

Reply via email to