Jonathan, you will have to remove current words with the same language from the database first.
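
A minimal sketch of that cleanup step, not Matterhorn's actual schema: the table name `dictionary` and its columns are hypothetical, and an in-memory SQLite database stands in for the real one, so check your own table layout before running anything like this. The 'es' rows reuse the values from the log output quoted below; the 'en' row is made-up demo data.

```python
import sqlite3


def remove_language(conn, language, table="dictionary"):
    """Delete every word stored for one language and return the number of rows removed.

    `table` is a hypothetical name; substitute the table your installation uses.
    """
    # Table names cannot be bound as SQL parameters, so only the language is bound.
    cur = conn.execute(f"DELETE FROM {table} WHERE language = ?", (language,))
    conn.commit()
    return cur.rowcount


# Stand-in database with an assumed layout (text, language, count, weight).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE dictionary (text TEXT, language TEXT, count INTEGER, weight REAL)"
)
conn.executemany(
    "INSERT INTO dictionary VALUES (?, ?, ?, ?)",
    [
        ("THE", "en", 10, 0.5),  # invented demo row
        ("ESTA", "es", 441402, 0.005096136023035724),
        ("COMO", "es", 1618214, 0.01868283029614893),
    ],
)

removed = remove_language(conn, "en")
remaining = conn.execute("SELECT COUNT(*) FROM dictionary").fetchone()[0]
print(f"removed {removed} 'en' rows, {remaining} rows left")
```

Only the rows for the given language are touched, so dictionaries for other languages stay in place while the new language pack is loaded.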

Tobias

On 20.12.2012, at 20:36, Jonathan Felder <[email protected]> wrote:

> So I'm following the instructions here:
> http://opencast.jira.com/wiki/display/MHTRUNK/Configure+Text+Analysis
>
> To load a dictionary for text analysis.
>
> I successfully created the list with the following stats:
> Statistics for language: en
> -------------------------------------
> Parsing complete in: 02:32:10
> All documents parsed:12961996
> Number of unique words:12648667
> Number of all words:2303801228
>
> I deleted the current dictionary that comes with the default install from the
> database and from the dictionaries directory. Then I copied the new
> dictionary over and noted this in the logs:
>
> 3639:2012-12-19 11:56:39 INFO (DictionaryScanner:70) - Loading language pack
> from /opt/matterhorn/1.4.msub_RC01/etc/dictionaries/en.csv
>
> However, nothing happens. No new entries are added to the database and it
> has been almost 24hrs with no further entries added to the log.
>
> As a sanity check I did the same thing with the Spanish dictionary that also
> comes with the default install. It worked fine:
>
> 3878:2012-12-19 12:07:04 INFO (DictionaryScanner:70) - Loading language pack
> from /opt/matterhorn/1.4.msub_RC01/etc/dictionaries/es.csv
> 3879:2012-12-19 12:07:04 DEBUG (DictionaryServiceJpaImpl:200) - Added 'ESTA'
> to the es dictionary with count 441402 and weight 0.005096136023035724
> 3880:2012-12-19 12:07:04 DEBUG (DictionaryServiceJpaImpl:200) - Added 'COMO'
> to the es dictionary with count 1618214 and weight 0.01868283029614893
> 3881:2012-12-19 12:07:04 DEBUG (DictionaryServiceJpaImpl:200) - Added 'ERES'
> to the es dictionary with count 4077 and weight 4.707034985323276E-5
> .
> .
> .
>
> Am I missing something?
> _______________________________________________
> Matterhorn-users mailing list
> [email protected]
> http://lists.opencastproject.org/mailman/listinfo/matterhorn-users
