Hi,
I wasn't on the list when this discussion happened, so I hope this will thread
correctly in archives. I linked to the archive below and tried to include
enough here to ensure searchers can find it if this won't thread.
I was getting an error with Nutch 1.17. I never used 1.16, but upgraded from
1.15 recently.
java.lang.Exception: java.lang.IllegalStateException: text width is less than
1, was <-26>
at
org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:492)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:559)
Caused by: java.lang.IllegalStateException: text width is less than 1, was <-26>
at org.apache.commons.lang3.Validate.validState(Validate.java:829)
at
de.vandermeer.skb.interfaces.transformers.textformat.Text_To_FormattedText.transform(Text_To_FormattedText.java:215)
at
de.vandermeer.asciitable.AT_Renderer.renderAsCollection(AT_Renderer.java:250)
at de.vandermeer.asciitable.AT_Renderer.render(AT_Renderer.java:128)
at de.vandermeer.asciitable.AsciiTable.render(AsciiTable.java:191)
at org.apache.nutch.indexer.IndexWriters.describe(IndexWriters.java:326)
at
org.apache.nutch.indexer.IndexerOutputFormat.getRecordWriter(IndexerOutputFormat.java:45)
at
org.apache.hadoop.mapred.ReduceTask$NewTrackingRecordWriter.<init>(ReduceTask.java:542)
at
org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:615)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:390)
at
org.apache.hadoop.mapred.LocalJobRunner$Job$ReduceTaskRunnable.run(LocalJobRunner.java:347)
at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
This looks like the error that Markus Jelsma described in the earlier
discussion, though the invalid test width in my case was -26. I eliminated it
when I updated the index-writers.xml for the solr_indexer_1 to use only a
single URL. I don't know where the -26 comes from or the -41 Marcus was
getting, but the fact that they were different values told me that the issue
would be in the site-specific difference in our configs.
Adding the link in the archive were I found the earlier discussion:
http://mail-archives.apache.org/mod_mbox/nutch-user/201910.mbox/%[email protected]%3E
Adding the only potentially relevant Jira link I found while searching:
https://issues.apache.org/jira/browse/NUTCH-2602
It seems potentially relevant because Marcus started getting the error after
migrating to 1.16 & I started getting it when I went from 1.15 to 1.17.
Thanks. Stay safe, stay healthy,
Joe