Arvind Ramachandran created SPARK-24791:
-------------------------------------------

             Summary: Spark Structured Streaming randomly does not process batch
                 Key: SPARK-24791
                 URL: https://issues.apache.org/jira/browse/SPARK-24791
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 2.1.1
            Reporter: Arvind Ramachandran


I have developed an application that writes small CSV files to a specific HDFS 
folder . Spark Structured Streaming reads the HDFS folder . On a random basis i 
see that it does not process a CSV File , the only case this occurs is the 
batch size is one CSV file again random in nature not consistent.I cannot 
guarantee the size of the batch will be greater than one because the 
requirement is low latency processing but volume is low.

I can seeĀ  that the commits , offset and source folders has the batch 
information but the csv file is not processed when i look at the logs.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to