[ 
https://issues.apache.org/jira/browse/HADOOP-15364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16726369#comment-16726369
 ] 

Sameer Choudhary commented on HADOOP-15364:
-------------------------------------------

Hi Steve,

 

I am an Software Engineer on Amazon S3 team and am working on OSS contribution 
of S3 Select Pushdown support for Presto 
(https://github.com/prestodb/presto/pull/11970). I will be working with 
[~yuzhousun] on the reviews of design and implementation of this feature. Could 
you please provide me with a link to the PR for code review? I would start 
reviewing 
[https://github.com/steveloughran/hadoop/commit/875062e43d6144a66eac12e911902fa7ba6befde]
 and other related commits in the meantime.

 

Best,

Sameer

> Add support for S3 Select to S3A
> --------------------------------
>
>                 Key: HADOOP-15364
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15364
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>            Priority: Major
>         Attachments: HADOOP-15364-001.patch, HADOOP-15364-002.patch, 
> HADOOP-15364-004.patch
>
>
> Expect a PoC patch for this in a couple of days; 
> * it'll depend on an SDK update to work, plus a couple of of other minor 
> changes
> * Adds command line option too 
> {code}
> hadoop s3guard select -header use -compression gzip -limit 100 
> s3a://landsat-pds/scene_list.gz" \
> "SELECT s.entityId FROM S3OBJECT s WHERE s.cloudCover = '0.0' "
> {code}
> For wider use we'll need to implement the HADOOP-15229 so that callers can 
> pass down the expression along with any other parameters



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to