[ 
https://issues.apache.org/jira/browse/BAHIR-161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16414600#comment-16414600
 ] 

Esteban Laver commented on BAHIR-161:
-------------------------------------

Hi [~zaheer ud din], sql-cloudant supports loading all documents in a database 
using these endpoints: _all_docs, _changes, _view, _search (Cloudant Search), 
_find (Cloudant Query).

What you can do is load all docs from a database, create a Spark DataFrame with 
just the one document you need, and then save the DataFrame back into Cloudant.

See the [python examples|https://github.com/apache/bahir/tree/master/sql-cloudant/examples/python]
for how to load docs and save the Spark DataFrame, as well as how to use 
operations like filter/select to create a DataFrame with the document(s) you 
need.
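
As a rough illustration of the load / filter / save steps described above, here is a minimal sketch in the style of those Python examples. It assumes the sql-cloudant package is already on the Spark classpath. The helper names (cloudant_conf, build_session, copy_document), the app name, and the database and document names in the usage note are hypothetical placeholders. Dropping _rev before saving is also an assumption, made so the target database assigns its own revision.

```python
# Sketch only: assumes sql-cloudant is on the Spark classpath and that the
# caller supplies real connection details. Helper names are hypothetical.

CLOUDANT_FORMAT = "org.apache.bahir.cloudant"  # data source name used in the Bahir examples


def cloudant_conf(host, username, password):
    """Return the Spark config entries sql-cloudant reads for the connection."""
    return {
        "cloudant.host": host,
        "cloudant.username": username,
        "cloudant.password": password,
    }


def build_session(conf):
    """Create a SparkSession carrying the Cloudant connection settings."""
    # Imported lazily so the other helpers can be used without Spark installed.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName("copy-cloudant-document")
    for key, value in conf.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()


def copy_document(spark, source_db, target_db, doc_id):
    """Load source_db, keep the document whose _id matches, save it to target_db."""
    # Load all docs from the source database into a DataFrame.
    df = spark.read.load(source_db, CLOUDANT_FORMAT)
    # Keep only the one document that is needed.
    wanted = df.filter(df["_id"] == doc_id)
    # Assumption: drop the source revision so the target assigns a fresh one.
    wanted = wanted.drop("_rev")
    # Save back into Cloudant, creating the target database if it is missing.
    wanted.write.save(target_db, CLOUDANT_FORMAT, createDBOnSave="true")
    return wanted
```

In a notebook this might be called as copy_document(build_session(cloudant_conf("ACCOUNT.cloudant.com", "USERNAME", "PASSWORD")), "source_db", "target_db", "my_doc_id"), once per source database.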

 

> How to access all available databases from cloudant ?
> -----------------------------------------------------
>
>                 Key: BAHIR-161
>                 URL: https://issues.apache.org/jira/browse/BAHIR-161
>             Project: Bahir
>          Issue Type: Wish
>          Components: Spark SQL Data Sources, Spark Streaming Connectors
>            Reporter: Zaheer Ud Din
>            Priority: Major
>
> Hi,
> I want to get a specific document from all existing databases available and 
> then put that into another database.
> I want to run this job from a notebook (Python 3.5, Spark 2.1), and it should run 
> periodically so that databases created in the future are also covered.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
