[jira] [Commented] (HADOOP-16317) ABFS: improve random read performance

Thomas Marquardt (JIRA) Tue, 28 May 2019 14:42:06 -0700


    [ 
https://issues.apache.org/jira/browse/HADOOP-16317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16850169#comment-16850169
 ]


Thomas Marquardt commented on HADOOP-16317:
-------------------------------------------

We can be smarter about choosing random vs sequential mode in the driver, but 
can we be smarter than the application layer?  At least HBase knows if it is 
reading randomly or sequentially, so if a Filesystem API were exposed and HBase 
was updated it could call the correct API.  This could be a flag or hint 
indicating the caller's intention to read randomly or sequentially.

> ABFS: improve random read performance
> -------------------------------------
>
>                 Key: HADOOP-16317
>                 URL: https://issues.apache.org/jira/browse/HADOOP-16317
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/azure
>    Affects Versions: 3.2.0
>            Reporter: Da Zhou
>            Priority: Major
>
> Improving random read performance is an interesting topic. ABFS doesn't 
> perform well when reading column format files as the process involves with 
> many seek operations which make the readAhead no use, and if readAhead is 
> used unwisely it would lead to unnecessary data request.
> Hence creating this Jira as a reminder to track the investigation and 
> progress of the work.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (HADOOP-16317) ABFS: improve random read performance

Reply via email to