*SOLVED:*
Unfortunately, stderr log in Hadoop's Resource Manager UI was not useful
since it just reported "... Lost executor XX on workerYYY...". Therefore, I
dumped locally the whole app-related logs: /yarn logs -applicationId
application_1458320004153_0343   > ~/application_1458320004153_0343.txt/.
That log is split by container and I've noticed that each "workerYYY" had a
WARN and an sudden crash after a bit. 

Some "Untokenizable" chars in the /email.body/ made CoreNLP throw a WARN at
first and then crash/loose the executor after a few seconds. This happened
systematically and I think it is a bug in CoreNLP 3.4.1. 
The solution was just sanitise the input fed to the Annotation constructor.





--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Stanford-CoreNLP-sentiment-extraction-lost-executor-tp26536p26541.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to