Thanks Josef and Victor. It is working.

Thanks!

2015-11-04 12:36 GMT+01:00 Josef A. Habdank <jahabd...@gmail.com>:

> It worked for me.
>
> I have put a small help, which I might post to some blog at some point
> (how to setup S3 access for Zeppelin on EMR):
>
> To have default EMR instance with Zeppelin store notebooks in S3
>
> # in S3 create the following bucket:
>
> /my-bucket-name/zeppelin/notebook/
>
> # edit the /etc/zeppelin/conf.dist/zeppelin-env.sh
>
> sudo vim /etc/zeppelin/conf.dist/zeppelin-env.sh
>
> # add:
>
> export ZEPPELIN_NOTEBOOK_S3_BUCKET=my-bucket-name
>
> export ZEPPELIN_NOTEBOOK_S3_USER=zeppelin
>
> # create and edit the zeppelin-site.xml
>
> sudo cp /etc/zeppelin/conf.dist/zeppelin-site.xml.template
> /etc/zeppelin/conf.dist/zeppelin-site.xml
>
> sudo vim /etc/zeppelin/conf.dist/zeppelin-site.xml
>
> # uncomment:
>
> <property>
>
>  <name>zeppelin.notebook.storage</name>
>
>  <value>org.apache.zeppelin.notebook.repo.S3NotebookRepo</value>
>
>  <description>notebook persistence layer implementation</description>
>
> </property>
>
>
> # comment:
>
> <property>
>
>  <name>zeppelin.notebook.storage</name>
>
>  <value>org.apache.zeppelin.notebook.repo.VFSNotebookRepo</value>
>
>  <description>notebook persistence layer implementation</description>
>
> </property>
>
> # remember to put the notebooks in the S3 :)
>
> # can download from here: https://github.com/Leemoonsoo?tab=repositories
>
> # restart zeppelin
>
> sudo /usr/lib/zeppelin/bin/zeppelin-daemon.sh stop
>
> sudo /usr/lib/zeppelin/bin/zeppelin-daemon.sh start
>
> # should work :)
>
> On 4 November 2015 at 12:21, Pablo Torre <pablotorr...@gmail.com> wrote:
>
>> Thanks Victor, I appreciate your help, I will give a try.
>>
>>
>>
>>
>> 2015-11-04 12:11 GMT+01:00 Victor Manuel Garcia <victor.gar...@beeva.com>
>> :
>>
>>> yes, the username is the same in zeppelin-site.xml and the bucket you
>>> need specify only the name.
>>>
>>> 2015-11-04 12:03 GMT+01:00 Pablo Torre <pablotorr...@gmail.com>:
>>>
>>>> Ok, because I am trying to test it from my localhost using a bucket in
>>>> Amazon S3. So if I am right I need to set up those two environment
>>>> variables in my localhost:
>>>>
>>>> ENV AWS_ACCESS_KEY=key
>>>> ENV AWS_SECRET_KEY=secret
>>>>
>>>> The other question that I have is about those two variables:
>>>>
>>>> export ZEPPELIN_NOTEBOOK_S3_BUCKET=bucket
>>>> export ZEPPELIN_NOTEBOOK_S3_USER=username
>>>>
>>>> The username I guess is the same user that you specify in
>>>> zeppelin-site.xml, and the bucket is only the name, I don't need to specify
>>>> s3://bucket....
>>>>
>>>> Please let me know if I am wrong
>>>>
>>>> Best,
>>>> Pablo
>>>>
>>>>
>>>> 2015-11-04 11:59 GMT+01:00 Victor Manuel Garcia <
>>>> victor.gar...@beeva.com>:
>>>>
>>>>> Hi Pablo,
>>>>> Yes, you need the AWS environment variables o credentials file in .aws
>>>>> folder in your home...
>>>>>
>>>>> 2015-11-04 11:34 GMT+01:00 Pablo Torre <pablotorr...@gmail.com>:
>>>>>
>>>>>> Hi Victor,
>>>>>>
>>>>>> Taking a look at your suggestion, I was wondering if we need to
>>>>>> export the AWS Credentials.....
>>>>>>
>>>>>> Best,
>>>>>> Pablo
>>>>>>
>>>>>> 2015-11-03 10:46 GMT+01:00 Victor Manuel Garcia <
>>>>>> victor.gar...@beeva.com>:
>>>>>>
>>>>>>> Hi Josef,
>>>>>>> For use S3 you need:
>>>>>>>
>>>>>>> Folder structure needs S3:
>>>>>>> bucket/username/notebook/
>>>>>>>
>>>>>>> set the enviroment variable:
>>>>>>> export ZEPPELIN_NOTEBOOK_S3_BUCKET=bucket
>>>>>>> export ZEPPELIN_NOTEBOOK_S3_USER=username
>>>>>>>
>>>>>>> and in zeppelin-site.xml:
>>>>>>> <!-- If used S3 to storage the notebooks, it is necessary the
>>>>>>> following folder structure bucketname/username/notebook/ -->
>>>>>>> <!--
>>>>>>> <property>
>>>>>>>   <name>zeppelin.notebook.s3.user</name>
>>>>>>>   <value>user</value>
>>>>>>>   <description>user name for s3 folder structure</description>
>>>>>>> </property>
>>>>>>> <property>
>>>>>>>   <name>zeppelin.notebook.s3.bucket</name>
>>>>>>>   <value>zeppelin</value>
>>>>>>>   <description>bucket name for notebook storage</description>
>>>>>>> </property>
>>>>>>>
>>>>>>> Uncomment:
>>>>>>> <property> <name>zeppelin.notebook.storage</name>
>>>>>>> <value>org.apache.zeppelin.notebook.repo.S3NotebookRepo</value>
>>>>>>> <description>notebook persistence layer implementation</description>
>>>>>>> </property>
>>>>>>>
>>>>>>> Comment:
>>>>>>> <property>
>>>>>>>   <name>zeppelin.notebook.storage</name>
>>>>>>>   <value>org.apache.zeppelin.notebook.repo.VFSNotebookRepo</value>
>>>>>>>   <description>notebook persistence layer
>>>>>>> implementation</description>
>>>>>>> </property>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> 2015-11-03 10:11 GMT+01:00 Josef A. Habdank <jahabd...@gmail.com>:
>>>>>>>
>>>>>>>> Hello,
>>>>>>>>
>>>>>>>> on the presentation on Spark Summit Amsterdam Moon showcased that
>>>>>>>> it is possible to change a location of the storage of the notebooks to 
>>>>>>>> S3
>>>>>>>> (so they are not stored locally on the zeppelin machine).
>>>>>>>>
>>>>>>>> I am running a standard Amazon EMR cluster with Zeppelin 0.6
>>>>>>>> SNAPSHOT (default Amazon installation).
>>>>>>>>
>>>>>>>> I tried to modify the /etc/zeppelin/conf.dist/zeppelin-env.sh file
>>>>>>>> changing
>>>>>>>> export ZEPPELIN_NOTEBOOK_DIR=/var/lib/zeppelin/notebook
>>>>>>>>
>>>>>>>> to the following:
>>>>>>>> export ZEPPELIN_NOTEBOOK_DIR=s3:/bucket/folder
>>>>>>>> export ZEPPELIN_NOTEBOOK_DIR=s3://bucket/folder
>>>>>>>> export ZEPPELIN_NOTEBOOK_DIR=s3n://bucket/folder
>>>>>>>>
>>>>>>>> and each time when I restart Zeppelin using
>>>>>>>> sudo /usr/lib/zeppelin/bin/zeppelin-daemon.sh stop
>>>>>>>> sudo /usr/lib/zeppelin/bin/zeppelin-daemon.sh start
>>>>>>>>
>>>>>>>> I get error:
>>>>>>>> "Notebook dir doesn't exist, create"
>>>>>>>>
>>>>>>>> when looking in to the zeppelin-env.sh I noticed it only checks for
>>>>>>>> the local folder.
>>>>>>>>   if [[ ! -d "${ZEPPELIN_NOTEBOOK_DIR}" ]]; then
>>>>>>>>     echo "Notebook dir doesn't exist, create
>>>>>>>> ${ZEPPELIN_NOTEBOOK_DIR}"
>>>>>>>>     $(mkdir -p "${ZEPPELIN_NOTEBOOK_DIR}")
>>>>>>>>   fi
>>>>>>>>
>>>>>>>> So I assume it is not the way :)
>>>>>>>>
>>>>>>>> Can you please tell me how can I do it? I want to store all my
>>>>>>>> notebooks on s3 folder, rather than locally.
>>>>>>>>
>>>>>>>> Thanks a lot!
>>>>>>>>
>>>>>>>> Josef
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>> *Victor Manuel Garcia Martinez*
>>>>>>> *Ingeniero de Software
>>>>>>>                                *
>>>>>>>
>>>>>>> *+34 672104297 <%2B34%20672104297>  | victor.gar...@beeva.com
>>>>>>> <marta.ta...@beeva.com>*
>>>>>>>              *              | victormanuel.garcia.marti...@bbva.com
>>>>>>> <marta.ta...@bbva.com>*
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> <http://www.beeva.com/>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>> Pablo Torre.
>>>>>> Freelance software engineer and Ruby on Rails developer.
>>>>>> Oleiros (Coruña)
>>>>>> *Personal site <http://www.pablotorrerodriguez.com>*
>>>>>> My blog <http://www.aboutechnologies.com>
>>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> *Victor Manuel Garcia Martinez*
>>>>> *Ingeniero de Software
>>>>>                              *
>>>>>
>>>>> *+34 672104297 <%2B34%20672104297>  | victor.gar...@beeva.com
>>>>> <marta.ta...@beeva.com>*
>>>>>              *              | victormanuel.garcia.marti...@bbva.com
>>>>> <marta.ta...@bbva.com>*
>>>>>
>>>>>
>>>>>
>>>>> <http://www.beeva.com/>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> Pablo Torre.
>>>> Freelance software engineer and Ruby on Rails developer.
>>>> Oleiros (Coruña)
>>>> *Personal site <http://www.pablotorrerodriguez.com>*
>>>> My blog <http://www.aboutechnologies.com>
>>>>
>>>
>>>
>>>
>>> --
>>> *Victor Manuel Garcia Martinez*
>>> *Ingeniero de Software
>>>                            *
>>>
>>> *+34 672104297 <%2B34%20672104297>  | victor.gar...@beeva.com
>>> <marta.ta...@beeva.com>*
>>>              *              | victormanuel.garcia.marti...@bbva.com
>>> <marta.ta...@bbva.com>*
>>>
>>>
>>>
>>> <http://www.beeva.com/>
>>>
>>>
>>>
>>
>>
>> --
>> Pablo Torre.
>> Freelance software engineer and Ruby on Rails developer.
>> Oleiros (Coruña)
>> *Personal site <http://www.pablotorrerodriguez.com>*
>> My blog <http://www.aboutechnologies.com>
>>
>
>


-- 
Pablo Torre.
Freelance software engineer and Ruby on Rails developer.
Oleiros (Coruña)
*Personal site <http://www.pablotorrerodriguez.com>*
My blog <http://www.aboutechnologies.com>

Reply via email to