[ 
https://issues.apache.org/jira/browse/FLUME-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14173393#comment-14173393
 ] 

Hudson commented on FLUME-2502:
-------------------------------

UNSTABLE: Integrated in flume-trunk #677 (See 
[https://builds.apache.org/job/flume-trunk/677/])
FLUME-2502. Improve Spool Directory Source's performance by not listing files 
each time. (hshreedharan: 
http://git-wip-us.apache.org/repos/asf/flume/repo?p=flume.git&a=commit&h=f979b2683fc48d85806ae7593ee0e393bd812260)
* 
flume-ng-core/src/main/java/org/apache/flume/client/avro/ReliableSpoolingFileEventReader.java
* 
flume-ng-core/src/test/java/org/apache/flume/client/avro/TestReliableSpoolingFileEventReader.java


> Spool source's directory listing is inefficient
> -----------------------------------------------
>
>                 Key: FLUME-2502
>                 URL: https://issues.apache.org/jira/browse/FLUME-2502
>             Project: Flume
>          Issue Type: Improvement
>          Components: Sinks+Sources
>    Affects Versions: v1.5.0
>            Reporter: Prateek Rungta
>            Assignee: Prateek Rungta
>         Attachments: FLUME-2502-0.patch, FLUME-2502-1.patch, 
> FLUME-2502-2.patch, FLUME-2502-3.patch, FLUME-2502-final.patch
>
>
> As mentioned in 
> [FLUME-2309|https://issues.apache.org/jira/browse/FLUME-2309], the directory 
> listing can it self become the bottleneck when accessing directories with a 
> large number of files (>1M). The fix in that JIRA added in the ability to 
> specify `RANDOM` as a Consume-Order to avoid sorting large lists.
> The slowness of the directory listing is still un-addressed. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to