Use the FUSE interface. The Gluster volume is directly accessible as local
storage on all nodes, but throughput is only 200 Mb/s. That is more than
enough for notebooks. For data, prefer Tachyon/Alluxio on top of Gluster...
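
A minimal sketch, not part of the setup described above, of how one might
check that the Zeppelin notebook directory really sits on the Gluster FUSE
mount on a given node. It parses a mount table in /proc/mounts format and
looks for the "fuse.glusterfs" filesystem type. The host name, volume name,
and paths in the sample are hypothetical; the helper names are made up for
illustration.

```python
import os


def find_mount(path, mounts_text):
    """Return (mount_point, fs_type) for the longest mount point containing path.

    mounts_text is expected in /proc/mounts format:
    "<device> <mount_point> <fs_type> <options> <dump> <pass>".
    Mount points with escaped spaces (\\040) are not handled in this sketch.
    """
    path = os.path.normpath(path)
    best = None
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        mount_point, fs_type = fields[1], fields[2]
        prefix = mount_point.rstrip("/") + "/"
        if path == mount_point or path.startswith(prefix):
            # Prefer the most specific (longest) matching mount point.
            if best is None or len(mount_point) > len(best[0]):
                best = (mount_point, fs_type)
    return best


def is_on_gluster_fuse(path, mounts_text):
    """True if path lives on a mount whose type is fuse.glusterfs."""
    mount = find_mount(path, mounts_text)
    return mount is not None and mount[1] == "fuse.glusterfs"


# Hypothetical mount table: host "gluster1", volume "notebooks",
# and mount point "/mnt/zeppelin" are assumptions for illustration.
SAMPLE_MOUNTS = """\
/dev/sda1 / ext4 rw,relatime 0 0
gluster1:/notebooks /mnt/zeppelin fuse.glusterfs rw,relatime,user_id=0 0 0
"""

if __name__ == "__main__":
    # On a real Linux node, read the live table instead:
    #   with open("/proc/mounts") as f: mounts_text = f.read()
    print(is_on_gluster_fuse("/mnt/zeppelin/notebook", SAMPLE_MOUNTS))
```

Running the same check on every Mesos node before starting Zeppelin there
would catch a node where the mount is missing and notebooks would silently
land on local disk.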
On Apr 8, 2016 6:35 AM, "ashish rawat" <dceash...@gmail.com> wrote:

> Thanks Eran and Vincent.
> Eran, I would definitely like to try it out, since it won't add to the
> complexity of my deployment. I will look at the S3 implementation to
> figure out how complex it would be.
>
> Vincent,
> I haven't explored GlusterFS at all. Would it also require writing an
> implementation of the storage interface, or can Zeppelin work with it out
> of the box?
>
> Regards,
> Ashish
>
> On Wed, Apr 6, 2016 at 12:53 PM, vincent gromakowski <
> vincent.gromakow...@gmail.com> wrote:
>
>> For 1, Marathon on Mesos restarts the Zeppelin daemon in case of failure.
>> For 2, a GlusterFS FUSE mount lets us share notebooks across all Mesos
>> nodes.
>> For 3, nothing is available right now in our design, but a manual restart
>> from the Zeppelin config page is acceptable for us.
>> On Apr 6, 2016 8:18 AM, "Eran Witkon" <eranwit...@gmail.com> wrote:
>>
>>> Yes, this is correct.
>>> For HA disk, if you don't have HA storage and have no access to S3, then
>>> AFAIK you don't have any other option at the moment.
>>> If you would like to save notebooks to Elasticsearch, I suggest you look
>>> at the storage interface and its implementations for Git and S3, and
>>> implement that yourself. It does sound like an interesting feature.
>>> Best
>>> Eran
>>> On Wed, 6 Apr 2016 at 08:57 ashish rawat <dceash...@gmail.com> wrote:
>>>
>>>> Thanks Eran. So 3 seems to be something external to Zeppelin, and
>>>> hopefully 1 only means running "zeppelin-daemon.sh start" on a slave
>>>> machine when the master becomes inaccessible. Is that correct?
>>>>
>>>> My main concern is still on the storage front. I don't really have
>>>> high availability disks or even HDFS in my setup. I have been using an
>>>> Elasticsearch cluster for data high availability, and was hoping that
>>>> Zeppelin could save notebooks to Elasticsearch (like Kibana) or maybe
>>>> to a document store.
>>>>
>>>> Any idea if anything is planned in that direction? I don't want to
>>>> fall back to 'rsync'-like options.
>>>>
>>>> Regards,
>>>> Ashish
>>>>
>>>>
>>>>
>>>> On Tue, Apr 5, 2016 at 11:17 PM, Eran Witkon <eranwit...@gmail.com>
>>>> wrote:
>>>>
>>>>> For 1 you need to have both Zeppelin web HA and Zeppelin daemon HA.
>>>>> For 2 I guess you can use HDFS if you implement the storage interface
>>>>> for HDFS, but I am not sure.
>>>>> For 3 I mean that if you connect to an external cluster, for example a
>>>>> Spark cluster, you need to make sure that cluster is HA. Otherwise
>>>>> Zeppelin will keep running but your notebooks will fail, because no
>>>>> Spark cluster is available.
>>>>> HTH
>>>>> Eran
>>>>>
>>>>>
>>>>> On Tue, 5 Apr 2016 at 20:20 ashish rawat <dceash...@gmail.com> wrote:
>>>>>
>>>>>> Thanks Eran for your reply.
>>>>>> For 1) I am assuming that it would be similar to HA of any other web
>>>>>> application, i.e. running multiple instances and switching to the
>>>>>> backup server when the master is down. Is that not the case?
>>>>>> For 2) is it also possible to save notebooks on HDFS?
>>>>>> Can you please explain 3? Are you referring to the interpreter config?
>>>>>> If I am using the Spark interpreter and submitting jobs to it, and the
>>>>>> Zeppelin master node goes down, then what could be the problem with a
>>>>>> slave node pointing to the same cluster and submitting jobs?
>>>>>>
>>>>>> Regards,
>>>>>> Ashish
>>>>>>
>>>>>> On Tue, Apr 5, 2016 at 10:08 PM, Eran Witkon <eranwit...@gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>>> I would say you need to account for these things:
>>>>>>> 1) availability of the Zeppelin daemon
>>>>>>> 2) availability of the notebook files
>>>>>>> 3) availability of the interpreters used
>>>>>>>
>>>>>>> For 1, I don't know of an out-of-the-box solution.
>>>>>>> For 2, any HA storage will do: S3 or any HA externally mounted disk.
>>>>>>> For 3, it is up to the interpreter and your big data HA solution.
>>>>>>>
>>>>>>> On Tue, 5 Apr 2016 at 19:29 ashish rawat <dceash...@gmail.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Hi,
>>>>>>>>
>>>>>>>> Is there a suggested architecture for running Zeppelin in high
>>>>>>>> availability mode? The only option I could find was saving notebooks
>>>>>>>> to S3. Are there any options if one is not using AWS?
>>>>>>>>
>>>>>>>> Regards,
>>>>>>>> Ashish
>>>>>>>>
>>>>>>>
>>>>>>
>>>>
>
