On Mon, Sep 12, 2011 at 9:42 AM, Stefane Fermigier <[email protected]> wrote:
>
> On Sep 12, 2011, at 8:35 AM, Rupert Westenthaler wrote:
>
>> Took me some time, but finally I know the reason for this.
>>
>> In short the stanbol.data.opennlp.lang.en [1] and
>> stanbol.data.opennlp.ner.en [2] bundles do include invalid versions of
>> the OpenNLP models (all files are only 2kByte in size).
>> Looking at the contents of such files gives you the following information
>>
>> <html>
>> <head>
>> <title>SourceForge - This project has been temporarily blocked for
>> exceeding its bandwidth threshold</title>
>> ...
>>
>> I have not known that SourceForge does such things. Maybe we should
>> host the OpenNLP models on the IKS dev server until they are available
>> via apache.org.
>
> Yes. It's always a bad idea to depend on third-party servers.

Add them to svn? The files aren't that big.

A similar issue, though maybe a bit harder to solve, is the downloaded DBpedia data. I don't think the released version should depend on third-party servers for compiling.

Cheers,
Reto
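
For what it's worth, a rough sketch of a sanity check that could catch the 2 kB HTML pages described above before they get packaged. It assumes the OpenNLP models are zip packages (as the 1.5 .bin files appear to be) and that anything under 10 kB is suspect. The function name `looks_like_model` and the threshold are made up for illustration and are not part of Stanbol or OpenNLP.

```python
import zipfile
from pathlib import Path

# Assumption: real models are far larger than the ~2 kB error pages.
MIN_MODEL_BYTES = 10 * 1024


def looks_like_model(path):
    """Return True if the file plausibly is a model and not an HTML error page."""
    p = Path(path)
    # A missing file or a tiny file cannot be a usable model.
    if not p.is_file() or p.stat().st_size < MIN_MODEL_BYTES:
        return False
    # Assumption: models are zip containers; an HTML page is not.
    return zipfile.is_zipfile(str(p))
```

A build step could call this on each downloaded file and fail early instead of shipping a bundle with broken models.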
