[ 
https://issues.apache.org/jira/browse/AVRO-3261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael A. Smith updated AVRO-3261:
-----------------------------------
    Description: boto3 python library has [s3 
get_object|https://boto3.amazonaws.com/v1/documentation/api/1.21.32/reference/services/s3.html#S3.Client.get_object]
 that accepts a byte range (to avoid downloading a whole file but just download 
selected byte range). if my Avro file is 100mb, can Avro library do some byte 
range seek to only download the s3 object partially (ie the parts that contain 
header.schema metadata and row count)?  (was: boto3 python library has s3 
get_object that accepts a byte range (to avoid downloading a whole file but 
just download selected byte range). if my Avro file is 100mb, can Avro library 
do some byte range seek to only download the s3 object partially (ie the parts 
that contain header.schema metadata and row count)?)

> s3 byte range for just schema or row count
> ------------------------------------------
>
>                 Key: AVRO-3261
>                 URL: https://issues.apache.org/jira/browse/AVRO-3261
>             Project: Apache Avro
>          Issue Type: Wish
>          Components: community, misc, python, tools
>    Affects Versions: 1.11.0
>            Reporter: t oo
>            Priority: Major
>
> boto3 python library has [s3 
> get_object|https://boto3.amazonaws.com/v1/documentation/api/1.21.32/reference/services/s3.html#S3.Client.get_object]
>  that accepts a byte range (to avoid downloading a whole file but just 
> download selected byte range). if my Avro file is 100mb, can Avro library do 
> some byte range seek to only download the s3 object partially (ie the parts 
> that contain header.schema metadata and row count)?



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to