[ 
https://issues.apache.org/jira/browse/SPARK-10555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-10555:
---------------------------------
    Labels: bulk-closed  (was: )

> Add INotifyDStream to Spark Streaming
> -------------------------------------
>
>                 Key: SPARK-10555
>                 URL: https://issues.apache.org/jira/browse/SPARK-10555
>             Project: Spark
>          Issue Type: New Feature
>          Components: DStreams
>            Reporter: Vinoth Chandar
>            Priority: Major
>              Labels: bulk-closed
>
> Currently, spark streaming has support for fileStreams, and while this is 
> super useful in general, it has its limitations - such as only being able to 
> process new files under each folder. 
> There are certain use cases (such as monitoring a root folder for incoming 
> data, and registering the files into HIVE & performing file level replication 
> across HDFS clusters) where taking actions based on multi level nested 
> uploads is useful. 
> We have a POC version of INotifyDStream that we are currently using in 
> Staging environment at Uber. Would love to contribute that back, if it makes 
> sense for Spark. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to