GitHub user tdas opened a pull request:
https://github.com/apache/spark/pull/14094
[SPARK-16430][SQL][STREAMING] Add option maxFilesPerTrigger
## What changes were proposed in this pull request?
An option that limits the file stream source to read 1 file at a time
enables rate limiting. It has the additional convenience that a static set of
files can be used like a stream for testing as this will allows those files to
be considered one at a time.
This PR adds option `maxFilesPerTrigger`.
## How was this patch tested?
New unit test
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/tdas/spark SPARK-16430
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/14094.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #14094
----
commit ddd9426281e743af205f2a3f56be3535cd584b2d
Author: Tathagata Das <[email protected]>
Date: 2016-07-07T20:45:38Z
Add option maxFilesPerTrigger
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]