Hi
The referenced OpenNLP models, which you can list with
find . -name "*.bin" | grep "downloads"
are downloaded from:
http://dev.iks-project.eu/downloads/opennlp/models-1.5/{name}
with two exceptions:
* sv-pos-maxent.bin is named se-pos-maxent.bin on the server
* sv-sent.bin is named se-sent.bin on the server
For those two files you will need to rename the downloaded file to the
sv-* name in the target directory.
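A minimal sketch (Python, not part of the Stanbol build) of how a local file name maps to its download URL, including the two Swedish exceptions. The names BASE_URL, SERVER_NAMES and model_url are made up for illustration:

```python
# Sketch only: map a local model file name to the URL it is downloaded from.
# BASE_URL is the server directory given above; everything else is hypothetical.
BASE_URL = "http://dev.iks-project.eu/downloads/opennlp/models-1.5/"

# Local file name -> file name on the server (the two exceptions).
SERVER_NAMES = {
    "sv-pos-maxent.bin": "se-pos-maxent.bin",
    "sv-sent.bin": "se-sent.bin",
}


def model_url(local_name: str) -> str:
    """Return the download URL for a model file, applying the renames."""
    return BASE_URL + SERVER_NAMES.get(local_name, local_name)
```

For example, model_url("sv-sent.bin") points at se-sent.bin on the server, and the downloaded file then has to be saved as sv-sent.bin.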
The build also downloads the DBpedia default data index, which you can locate with
find . -name "*.solrindex.zip" | grep "downloads"
The file is located on the server at
http://dev.iks-project.eu/downloads/stanbol-indices/dbpedia_43k.solrindex.zip
best
Rupert
Below are the paths of the downloaded files, relative to {trunk}:
./opennlp/lang/da/downloads/resources/org/apache/stanbol/data/opennlp/da-pos-maxent.bin
./opennlp/lang/da/downloads/resources/org/apache/stanbol/data/opennlp/da-sent.bin
./opennlp/lang/de/downloads/resources/org/apache/stanbol/data/opennlp/de-pos-maxent.bin
./opennlp/lang/de/downloads/resources/org/apache/stanbol/data/opennlp/de-sent.bin
./opennlp/lang/en/downloads/resources/org/apache/stanbol/data/opennlp/en-chunker.bin
./opennlp/lang/en/downloads/resources/org/apache/stanbol/data/opennlp/en-pos-maxent.bin
./opennlp/lang/en/downloads/resources/org/apache/stanbol/data/opennlp/en-sent.bin
./opennlp/lang/en/downloads/resources/org/apache/stanbol/data/opennlp/en-token.bin
./opennlp/lang/es/downloads/resources/org/apache/stanbol/data/opennlp/es-pos-maxent.bin
./opennlp/lang/es/downloads/resources/org/apache/stanbol/data/opennlp/es-sent.bin
./opennlp/lang/nl/downloads/resources/org/apache/stanbol/data/opennlp/nl-pos-maxent.bin
./opennlp/lang/nl/downloads/resources/org/apache/stanbol/data/opennlp/nl-sent.bin
./opennlp/lang/nl/downloads/resources/org/apache/stanbol/data/opennlp/nl-token.bin
./opennlp/lang/pt/downloads/resources/org/apache/stanbol/data/opennlp/pt-pos-maxent.bin
./opennlp/lang/pt/downloads/resources/org/apache/stanbol/data/opennlp/pt-sent.bin
./opennlp/lang/pt/downloads/resources/org/apache/stanbol/data/opennlp/pt-token.bin
./opennlp/lang/sv/downloads/resources/org/apache/stanbol/data/opennlp/sv-pos-maxent.bin
./opennlp/lang/sv/downloads/resources/org/apache/stanbol/data/opennlp/sv-sent.bin
./opennlp/ner/en/downloads/resources/org/apache/stanbol/data/opennlp/en-ner-location.bin
./opennlp/ner/en/downloads/resources/org/apache/stanbol/data/opennlp/en-ner-organization.bin
./opennlp/ner/en/downloads/resources/org/apache/stanbol/data/opennlp/en-ner-person.bin
./opennlp/ner/es/downloads/resources/org/apache/stanbol/data/opennlp/es-ner-location.bin
./opennlp/ner/es/downloads/resources/org/apache/stanbol/data/opennlp/es-ner-organization.bin
./opennlp/ner/es/downloads/resources/org/apache/stanbol/data/opennlp/es-ner-person.bin
./opennlp/ner/nl/downloads/resources/org/apache/stanbol/data/opennlp/nl-ner-location.bin
./opennlp/ner/nl/downloads/resources/org/apache/stanbol/data/opennlp/nl-ner-organization.bin
./opennlp/ner/nl/downloads/resources/org/apache/stanbol/data/opennlp/nl-ner-person.bin
./sites/dbpedia/downloads/resources/org/apache/stanbol/data/site/dbpedia/default/index/dbpedia_43k.solrindex.zip
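To find out which of these files is corrupted, something like the following might help. It is only a sketch: it assumes that the OpenNLP .bin models and the .solrindex.zip file are ZIP archives, so that a truncated or damaged download fails the integrity check of Python's zipfile module. find_corrupted is a hypothetical helper, not part of Stanbol.

```python
import zipfile
import zlib
from pathlib import Path


def find_corrupted(root):
    """Return downloaded archives below root that fail a ZIP integrity check."""
    bad = []
    for path in sorted(Path(root).rglob("*")):
        # Only consider files inside a "downloads" directory, like the
        # find commands above.
        if not path.is_file() or "downloads" not in path.parts:
            continue
        if not (path.name.endswith(".bin") or path.name.endswith(".solrindex.zip")):
            continue
        try:
            with zipfile.ZipFile(path) as archive:
                # testzip() returns the first damaged member, or None.
                if archive.testzip() is not None:
                    bad.append(path)
        except (zipfile.BadZipFile, zlib.error, EOFError, OSError):
            # The file could not be opened or read as a ZIP archive at all.
            bad.append(path)
    return bad
```

Calling find_corrupted(".") from {trunk} and printing the result should list the files to delete and download again. A file that passes this check is not guaranteed to be a valid model, it is only a readable archive.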
On Fri, Mar 1, 2013 at 4:22 AM, wuhaijiang <[email protected]> wrote:
> I am so sorry to burden you so much, but after I downloaded this file
> manually, there is another corrupted file. Is there an efficient way to find
> which file is corrupted, and where to download it? Thank you a lot!
>
>> I tried three more times, but the problem still exists. Is there any
>> other place where I can download it in advance?
>>>>
>>>> and I built it successfully by running "mvn clean install -DskipTests=true",
>>>> but it failed when I ran "mvn clean install". Here is the error
>>>> information
>>>
>>> As part of the build, OpenNLP language models are downloaded, and those
>>> models are used by the unit tests. The reported exception suggests that
>>> the downloaded files somehow got corrupted. So while the build with
>>> -DskipTests=true does succeed, the corrupted files will also be
>>> included in the launcher and will cause similar errors at runtime.
>>>
>>> To get rid of the corrupted files by forcing a (re)download of
>>> those files, you need to run "mvn clean install -Ddownloads" as
>>> described in [1].
>>>
>>> best
>>> Rupert
>>>
>>>
>>> [1] http://markmail.org/message/mnppqm5gt5ppuvho
>>>
>>> On Thu, Feb 28, 2013 at 3:43 AM, wuhaijiang <[email protected]>
>>> wrote:
>>>>
>>>> I downloaded the source code by running "svn co
>>>> http://svn.apache.org/repos/asf/stanbol/trunk/",
>>>> and I built it successfully by running "mvn clean install -DskipTests=true",
>>>> but it failed when I ran "mvn clean install". Here is the error
>>>> information
>>>>
>>>>
>>>> at java.lang.reflect.Method.invoke(Method.java:597)
>>>> at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
>>>> at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>>>> at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
>>>> at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>>>> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
>>>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
>>>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
>>>> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
>>>> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
>>>> at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
>>>> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
>>>> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
>>>> at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>>>> at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
>>>> at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:234)
>>>> at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:133)
>>>> at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:114)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>>>> at java.lang.reflect.Method.invoke(Method.java:597)
>>>> at org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray(ReflectionUtils.java:188)
>>>> at org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:166)
>>>> at org.apache.maven.surefire.booter.ProviderFactory.invokeProvider(ProviderFactory.java:86)
>>>> at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:101)
>>>> at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:74)
>>>> Caused by: org.xml.sax.SAXParseException: Premature end of file.
>>>> at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1231)
>>>> at opennlp.tools.dictionary.serializer.DictionarySerializer.create(DictionarySerializer.java:221)
>>>> ... 48 more
>>>> [Fatal Error] :29623:16: Premature end of file.
>>>> 815 [main] INFO org.apache.stanbol.commons.opennlp.TextAnalyzer - Unable to load POS Model for language 'en'
>>>> opennlp.tools.util.InvalidFormatException: The profile data stream has an invalid format!
>>>> at opennlp.tools.dictionary.serializer.DictionarySerializer.create(DictionarySerializer.java:224)
>>>> at opennlp.tools.postag.POSDictionary.create(POSDictionary.java:282)
>>>> at opennlp.tools.postag.POSModel$POSDictionarySerializer.create(POSModel.java:48)
>>>> at opennlp.tools.postag.POSModel$POSDictionarySerializer.create(POSModel.java:44)
>>>> at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:135)
>>>> at opennlp.tools.postag.POSModel.<init>(POSModel.java:93)
>>>> at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>>>> at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
>>>> at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
>>>> at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
>>>> at org.apache.stanbol.commons.opennlp.OpenNLP.initModel(OpenNLP.java:491)
>>>> at org.apache.stanbol.commons.opennlp.OpenNLP.initModel(OpenNLP.java:436)
>>>> at org.apache.stanbol.commons.opennlp.OpenNLP.getPartOfSpeachModel(OpenNLP.java:232)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer.getPosTagger(TextAnalyzer.java:234)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer$AnalysedText.<init>(TextAnalyzer.java:453)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer$AnalysedText.<init>(TextAnalyzer.java:435)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer$AnalysedText.<init>(TextAnalyzer.java:404)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer.analyseSentence(TextAnalyzer.java:344)
>>>> at org.apache.commons.opennlp.TextAnalyzerTest.testSingleSentenceNoChunkerConfig(TextAnalyzerTest.java:103)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>>>> at java.lang.reflect.Method.invoke(Method.java:597)
>>>> at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
>>>> at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>>>> at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
>>>> at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>>>> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
>>>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
>>>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
>>>> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
>>>> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
>>>> at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
>>>> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
>>>> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
>>>> at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>>>> at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
>>>> at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:234)
>>>> at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:133)
>>>> at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:114)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>>>> at java.lang.reflect.Method.invoke(Method.java:597)
>>>> at org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray(ReflectionUtils.java:188)
>>>> at org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:166)
>>>> at org.apache.maven.surefire.booter.ProviderFactory.invokeProvider(ProviderFactory.java:86)
>>>> at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:101)
>>>> at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:74)
>>>> Caused by: org.xml.sax.SAXParseException: Premature end of file.
>>>> at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1231)
>>>> at opennlp.tools.dictionary.serializer.DictionarySerializer.create(DictionarySerializer.java:221)
>>>> ... 48 more
>>>> [Fatal Error] :29623:16: Premature end of file.
>>>> 843 [main] INFO org.apache.stanbol.commons.opennlp.TextAnalyzer - Unable to load POS Model for language 'en'
>>>> opennlp.tools.util.InvalidFormatException: The profile data stream has an invalid format!
>>>> at opennlp.tools.dictionary.serializer.DictionarySerializer.create(DictionarySerializer.java:224)
>>>> at opennlp.tools.postag.POSDictionary.create(POSDictionary.java:282)
>>>> at opennlp.tools.postag.POSModel$POSDictionarySerializer.create(POSModel.java:48)
>>>> at opennlp.tools.postag.POSModel$POSDictionarySerializer.create(POSModel.java:44)
>>>> at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:135)
>>>> at opennlp.tools.postag.POSModel.<init>(POSModel.java:93)
>>>> at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>>>> at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
>>>> at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
>>>> at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
>>>> at org.apache.stanbol.commons.opennlp.OpenNLP.initModel(OpenNLP.java:491)
>>>> at org.apache.stanbol.commons.opennlp.OpenNLP.initModel(OpenNLP.java:436)
>>>> at org.apache.stanbol.commons.opennlp.OpenNLP.getPartOfSpeachModel(OpenNLP.java:232)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer.getPosTagger(TextAnalyzer.java:234)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer$AnalysedText.<init>(TextAnalyzer.java:453)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer$AnalysedText.<init>(TextAnalyzer.java:435)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer$AnalysedText.<init>(TextAnalyzer.java:404)
>>>> at org.apache.stanbol.commons.opennlp.TextAnalyzer.analyseSentence(TextAnalyzer.java:344)
>>>> at org.apache.commons.opennlp.TextAnalyzerTest.testSingleSentenceChunkerConfig(TextAnalyzerTest.java:88)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>>>> at java.lang.reflect.Method.invoke(Method.java:597)
>>>> at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
>>>> at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>>>> at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
>>>> at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>>>> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
>>>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
>>>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
>>>> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
>>>> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
>>>> at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
>>>> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
>>>> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
>>>> at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>>>> at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
>>>> at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:234)
>>>> at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:133)
>>>> at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:114)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>>>> at java.lang.reflect.Method.invoke(Method.java:597)
>>>> at org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray(ReflectionUtils.java:188)
>>>> at org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:166)
>>>> at org.apache.maven.surefire.booter.ProviderFactory.invokeProvider(ProviderFactory.java:86)
>>>> at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:101)
>>>> at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:74)
>>>> Caused by: org.xml.sax.SAXParseException: Premature end of file.
>>>> at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1231)
>>>> at opennlp.tools.dictionary.serializer.DictionarySerializer.create(DictionarySerializer.java:221)
>>>> ... 48 more
>>>> Tests run: 5, Failures: 4, Errors: 0, Skipped: 0, Time elapsed: 1.107 sec <<< FAILURE!
>>>>
>>>> Results :
>>>>
>>>> Failed tests:
>>>> testMultipleSentenceDefaultConfig(org.apache.commons.opennlp.TextAnalyzerTest)
>>>> testSingleSentenceDefaultConfig(org.apache.commons.opennlp.TextAnalyzerTest)
>>>> testSingleSentenceNoChunkerConfig(org.apache.commons.opennlp.TextAnalyzerTest)
>>>> testSingleSentenceChunkerConfig(org.apache.commons.opennlp.TextAnalyzerTest)
>>>>
>>>> Tests run: 5, Failures: 4, Errors: 0, Skipped: 0
>>>>
>>>>
--
| Rupert Westenthaler [email protected]
| Bodenlehenstraße 11 ++43-699-11108907
| A-5500 Bischofshofen