Richard Platania created SINGA-160:
--------------------------------------

             Summary: Reading larger records from HDFS
                 Key: SINGA-160
                 URL: https://issues.apache.org/jira/browse/SINGA-160
             Project: Singa
          Issue Type: Bug
            Reporter: Richard Platania
            Priority: Minor


The current code for HDFSFile::Next in hdfsfile.cc reads data in at most two
calls to hdfsRead(). Depending on the user's configuration of hdfs-client.xml,
more than two calls may be needed, for example when
input.localread.default.buffersize is less than half the size of the data
being read. I've found this to be a common occurrence when reading image mean
files for larger datasets. One solution is to wrap hdfsRead() in a loop that
runs until the total amount read reaches the size of the data.
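
A minimal sketch of such a loop, not the actual SINGA code. ReadFully and
read_some are hypothetical names. read_some stands in for hdfsRead() and is
assumed to follow the same contract: it returns the number of bytes read
(possibly fewer than requested), 0 at end of file, or -1 on error.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <limits>

// Hypothetical helper, not part of SINGA or libhdfs.
// read_some(dst, n) is assumed to behave like hdfsRead(): it returns the
// number of bytes actually read (possibly fewer than n), 0 at end of file,
// or -1 on error.
template <typename ReadFn>
bool ReadFully(ReadFn read_some, char* buf, std::size_t len) {
  assert(buf != nullptr || len == 0);
  // A single request is capped at what a 32-bit signed length can express.
  const std::size_t kMaxChunk =
      static_cast<std::size_t>(std::numeric_limits<std::int32_t>::max());
  std::size_t total = 0;
  while (total < len) {
    const std::int32_t want =
        static_cast<std::int32_t>(std::min(len - total, kMaxChunk));
    const std::int32_t got = read_some(buf + total, want);
    if (got <= 0) return false;  // error (-1) or premature end of file (0)
    total += static_cast<std::size_t>(got);
  }
  return true;  // exactly len bytes were read into buf
}
```

Inside HDFSFile::Next, read_some could be a lambda that forwards to
hdfsRead() with the object's own filesystem and file handles. The loop then
issues as many calls as the configured buffer size requires, and a short
read at end of file is reported as a failure instead of returning a
truncated record.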



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
