Any change to the JVM classes will make recovery from the checkpoint fail.
By checkpointing the data, do you mean using checkpoint with
updateStateByKey? Here's a similar earlier discussion that should clear up
your doubts:
http://mail-archives.us.apache.org/mod_mbox/spark-user/201507.mbox/%3CCA+AHuK=xoy8dsdaobmgm935goqytaaqkpqsvdaqpmojottj...@mail.gmail.com%3E
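
To make the external-storage suggestion from earlier in this thread more
concrete, here is a rough sketch, not a definitive implementation. It
assumes PySpark and uses a local JSON file as a stand-in for a DB or
ZooKeeper. All names (load_state, save_state, update_count, build_context,
main), the paths, and the socket source on localhost:9999 are hypothetical.
As far as I know, StreamingContext.getOrCreate only calls the setup
function when no usable checkpoint exists, so the saved state is loaded on
a fresh start and not again on an automatic restart from the checkpoint.

```python
import json
from pathlib import Path

# Hypothetical locations, for illustration only.
CHECKPOINT_DIR = "/tmp/wordcount-checkpoint"
STATE_PATH = "/tmp/wordcount-state.json"


def load_state(path):
    """Return the saved per-key state, or an empty dict if none exists."""
    p = Path(path)
    return json.loads(p.read_text()) if p.exists() else {}


def save_state(path, state):
    """Write the state to a temp file, then rename it over the target."""
    tmp = Path(path).with_suffix(".tmp")
    tmp.write_text(json.dumps(state))
    tmp.replace(path)


def update_count(new_values, running):
    """Update function in the shape updateStateByKey expects."""
    return sum(new_values) + (running or 0)


def build_context(checkpoint_dir, state_path, host, port):
    """Setup function: runs only when there is no checkpoint to recover."""
    # Imported here so the helpers above work without Spark installed.
    from pyspark import SparkContext
    from pyspark.streaming import StreamingContext

    sc = SparkContext(appName="StatefulCounts")
    ssc = StreamingContext(sc, 10)  # 10-second batches
    ssc.checkpoint(checkpoint_dir)

    # Seed the state from the external store on a fresh start.
    saved = load_state(state_path)
    initial = sc.parallelize(list(saved.items())) if saved else None

    words = ssc.socketTextStream(host, port).flatMap(lambda line: line.split())
    counts = words.map(lambda w: (w, 1)).updateStateByKey(
        update_count, initialRDD=initial
    )

    # Persist the state after every batch. collect() pulls everything to
    # the driver, so this only suits small state.
    counts.foreachRDD(lambda rdd: save_state(state_path, dict(rdd.collect())))
    return ssc


def main():
    from pyspark.streaming import StreamingContext

    ssc = StreamingContext.getOrCreate(
        CHECKPOINT_DIR,
        lambda: build_context(CHECKPOINT_DIR, STATE_PATH, "localhost", 9999),
    )
    ssc.start()
    ssc.awaitTermination()
```

With this layout, deploying new code would mean starting with an empty
checkpoint directory and calling main(); the counts then resume from the
saved file instead of from the old checkpoint.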

Thanks
Best Regards

On Thu, Sep 17, 2015 at 10:01 AM, Bin Wang <wbi...@gmail.com> wrote:

> And here is another question. If I load the DStream from the database
> every time I start the job, will the data also be loaded when the job
> fails and restarts automatically? If so, both the checkpoint data and the
> database data are loaded. Won't this be a problem?
>
>
>
> On Wed, Sep 16, 2015 at 8:40 PM, Bin Wang <wbi...@gmail.com> wrote:
>
>> Will StreamingContext.getOrCreate do this? What kind of code change
>> will prevent the checkpoint from loading?
>>
>> On Wed, Sep 16, 2015 at 8:20 PM, Akhil Das <ak...@sigmoidanalytics.com> wrote:
>>
>>> You can't really recover from a checkpoint if you alter the code. A better
>>> approach would be to use some sort of external storage (like a DB or
>>> ZooKeeper) to keep the state (the indexes, etc.), so that when you deploy
>>> new code it can be easily recovered.
>>>
>>> Thanks
>>> Best Regards
>>>
>>> On Wed, Sep 16, 2015 at 3:52 PM, Bin Wang <wbi...@gmail.com> wrote:
>>>
>>>> I'd like to know if there is a way to recover a DStream from a checkpoint.
>>>>
>>>> Because I store state in a DStream, I'd like the state to be recovered
>>>> when I restart the application and deploy new code.
>>>>
>>>
>>>
