-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/17867/
-----------------------------------------------------------
Review request for Flume and Hari Shreedharan.


Repository: flume-git


Description
-------

This patch provides:

* A consume order feature in the spooling directory source, which lets users state explicitly the order in which files are consumed from the spooling directory: oldest first, youngest first, or random.

* A fix for the old implementation of selecting the next file from the spooling directory. Previously, each file to be consumed was selected by sorting the candidate files, which can become extremely time-consuming when there are many files (on the order of 10K or more). The new implementation instead does a single linear scan when the consume order is oldest or youngest.

* Corresponding updates to the Flume user guide.


Diffs
-----

Diff: https://reviews.apache.org/r/17867/diff/


Testing
-------

Tested without specifying a consume order; oldest was used by default.
Tested with the oldest, youngest, and random consume orders specified.
Tested with a large number of files; random order was the fastest.


Thanks,

Muhammad Ehsan ul Haque
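
For illustration only, the selection strategy described above (a single linear scan for oldest or youngest, and a direct pick for random) could look roughly like the sketch below. This is not the code from the patch: the names `ConsumeOrderSketch`, `ConsumeOrder`, `Candidate` and `selectNext` are hypothetical, and a simple name-plus-timestamp holder stands in for the real files so the example stays self-contained.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Random;

class ConsumeOrderSketch {

    /** Hypothetical enum; the identifiers used in the actual patch may differ. */
    enum ConsumeOrder { OLDEST, YOUNGEST, RANDOM }

    /** Minimal stand-in for a spooled file: a name plus a modification time. */
    static final class Candidate {
        final String name;
        final long lastModified;

        Candidate(String name, long lastModified) {
            this.name = name;
            this.lastModified = lastModified;
        }
    }

    /**
     * Picks the next candidate without sorting the whole list.
     * OLDEST and YOUNGEST cost one pass over the list (O(n));
     * RANDOM is a single indexed lookup.
     * Returns null when there is nothing to consume.
     */
    static Candidate selectNext(List<Candidate> candidates, ConsumeOrder order, Random rng) {
        if (candidates.isEmpty()) {
            return null;
        }
        if (order == ConsumeOrder.RANDOM) {
            return candidates.get(rng.nextInt(candidates.size()));
        }
        Candidate best = candidates.get(0);
        for (Candidate c : candidates) {
            // Strict comparison: on a timestamp tie the first one seen is kept.
            boolean better = (order == ConsumeOrder.OLDEST)
                    ? c.lastModified < best.lastModified
                    : c.lastModified > best.lastModified;
            if (better) {
                best = c;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<Candidate> files = Arrays.asList(
                new Candidate("b.log", 200L),
                new Candidate("a.log", 100L),
                new Candidate("c.log", 300L));
        Random rng = new Random();
        System.out.println(selectNext(files, ConsumeOrder.OLDEST, rng).name);   // a.log
        System.out.println(selectNext(files, ConsumeOrder.YOUNGEST, rng).name); // c.log
        System.out.println(selectNext(files, ConsumeOrder.RANDOM, rng).name);   // any of the three
    }
}
```

Sorting costs O(n log n) for every selection, whereas the scan is O(n) and the random pick needs no comparison of timestamps at all, which is consistent with random order being the fastest in the large-directory test.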
