Github user mohangadm commented on the pull request:

    https://github.com/apache/spark/pull/2323#issuecomment-56032431
  
    I have experienced the same kind of problem when using Avro with spark 
streaming API.
    If avro message is simple, its fine. but if the avro message has 
Union/Arrays its failing with the exception Below:
    ERROR scheduler.JobScheduler: Error running job streaming job 1411043845000 
ms.0
    org.apache.spark.SparkException: Job aborted due to stage failure: 
Exception while getting task result: com.esotericsoftware.kryo.KryoException: 
java.lang.NullPointerException
    Serialization trace:
    value (com.globallogic.goliath.model.Datum)
    data (com.globallogic.goliath.model.ResourceMessage)
        at 
org.apache.spark.scheduler.DAGScheduler.org$apache$spark$scheduler$DAGScheduler$$failJobAndIndependentStages(DAGScheduler.scala:1185)
        at 
org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1174)
        at 
org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1173)
        at 
scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59)
        at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:47)
        at 
org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:1173)
        at 
org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:688)
        at 
org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:688)
        at scala.Option.foreach(Option.scala:236)
        at 
org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:688)
        at 
org.apache.spark.scheduler.DAGSchedulerEventProcessActor$$anonfun$receive$2.applyOrElse(DAGScheduler.scala:1391)
        at akka.actor.ActorCell.receiveMessage(ActorCell.scala:498)
        at akka.actor.ActorCell.invoke(ActorCell.scala:456)
        at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:237)
        at akka.dispatch.Mailbox.run(Mailbox.scala:219)
        at 
akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:386)
        at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
        at 
scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
        at 
scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
        at 
scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
    
    
    
    Above exception shows up when used output operations.
    
    below is the avro message.
    {"version": "01", "sequence": "00001", "resource": "sensor-001", 
"controller": "002", "controllerTimestamp": "1411038710358", "data": {"value": 
[{"name": "Temperature", "value": "30"}, {"name": "Speed", "value": "60"}, 
{"name": "Location", "value": ["+401213.1", "-0750015.1"]}, {"name": 
"Timestamp", "value": "2014-09-09T08:15:25-05:00"}]}}
    
    message is been successfully decoded in decoder, but throws exception for 
output operation.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to