I was able to connect to the endpoint by setting the property
'fs.s3a.endpoint' to the appropriate url  'https://storage.xxx.com:8181'


I am now able to query the data in the bucket. However, as soon as I enable
the S3 plugin - the response from Drill becomes extremely slow. This is
true even if I am not querying the S3 bucket. As an example, just issuing a
'use' command takes forever:

with the S3 plugin disabled:

0: jdbc:drill:zk=local> use cp;

+-------+---------------------------------+

|  ok   |             summary             |

+-------+---------------------------------+

| true  | Default schema changed to [cp]  |

+-------+---------------------------------+

1 row selected (0.543 seconds)


with the S3 plugin enabled:


0: jdbc:drill:zk=local> use cp;

+-------+---------------------------------+

|  ok   |             summary             |

+-------+---------------------------------+

| true  | Default schema changed to [cp]  |

+-------+---------------------------------+

1 row selected (221.293 seconds)


The S3 bucket configured in the plugin has approximately 20,000 objects. My
assumption is that there is some sort of metadata scan that occurs anytime
a command is executed? Any suggestions on how to improve performance?


Thanks




On Wed, May 24, 2017 at 3:14 PM, Abhishek Girish <[email protected]> wrote:

> I'm not sure if anyone has ever tried that. Connecting to S3 buckets (AWS)
> works via the S3a library. You could file a enhancement request on JIRA
> [1].
>
> If someone has any experience with it, they can share details on the JIRA,
> or work on it. You are welcome to contribute yourself.
>
> [1] https://issues.apache.org/jira/browse/DRILL
>
> On Wed, May 24, 2017 at 12:01 PM, Raz Baluchi <[email protected]>
> wrote:
>
> > Where would I specify to use SSL since the endpoint is https?
> >
> > On Wed, May 24, 2017 at 1:13 PM, Gautam Parai <[email protected]> wrote:
> >
> > > Hi Raz,
> > >
> > >
> > > Please see here for an example https://drill.apache.org/docs/
> > > s3-storage-plugin/
> > >
> > > Gautam
> > >
> > >
> > > ________________________________
> > > From: [email protected] <[email protected]> on behalf of Raz
> > > Baluchi <[email protected]>
> > > Sent: Wednesday, May 24, 2017 7:03:12 AM
> > > To: [email protected]
> > > Subject: S3 configuration for ceph or atmos
> > >
> > > Is there a guide for configuring the S3 storage plugin for non AWS S3
> > > storage?
> > >
> > > As and example, we have Ceph storage that is accessible via the S3 API
> at
> > > an endpoint like: "https://storage.xxx.com:8181"; and bucket:"xyz"
> > >
> > > How would I go about configuring the S3 storage plugin?
> > >
> > > Thanks
> > >
> >
>

Reply via email to