kjchalup opened a new issue #11884: im2rec documentation is lacking / buggy
URL: https://github.com/apache/incubator-mxnet/issues/11884
 
 
   
[im2rec.py](https://github.com/apache/incubator-mxnet/blob/master/tools/im2rec.py) has no docstrings. The online documentation that describes the RecordIO format ([here](https://mxnet.incubator.apache.org/architecture/note_data_loading.html)) doesn't explain how to use the script. [Another page](https://mxnet.incubator.apache.org/faq/recordio.html) does, but it needs a complete real-world example, and it should either be merged with the other doc page or at least be linked from it.
   
   In addition, the im2rec.py script writes class labels as floating-point numbers:
   ```
   def write_list(path_out, image_list):
       with open(path_out, 'w') as fout:
           for i, item in enumerate(image_list):
               line = '%d\t' % item[0]
               for j in item[2:]:
                   line += '%f\t' % j ### <---- THIS IS THE FLOATING POINT LABEL
               line += '%s\n' % item[1]
               fout.write(line)
   ```
   However, the example on the [doc page](https://mxnet.incubator.apache.org/faq/recordio.html) uses integer labels. When I wrote a test .lst file by hand with integer labels, it did not work with mx.io.ImageRecordIter. It would be great to get clarification on the exact .lst file format required.
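
   For reference, here is a minimal sketch of the row layout that the quoted `write_list` produces: an integer index, one or more labels formatted with `%f`, and the image path, all tab-separated. The helper names `format_lst_line` and `parse_lst_line` and the path `cat/001.jpg` are made up for illustration and are not part of im2rec.py. The sketch only mirrors what the script writes; it does not establish which formats mx.io.ImageRecordIter accepts.

```python
def format_lst_line(index, labels, rel_path):
    """Build one .lst row the same way the quoted write_list does."""
    line = '%d\t' % index              # integer index
    for label in labels:
        line += '%f\t' % label         # each label written as a float
    line += '%s\n' % rel_path          # image path goes last
    return line


def parse_lst_line(line):
    """Split a row back into (index, labels, path). Hypothetical helper."""
    fields = line.rstrip('\n').split('\t')
    index = int(fields[0])
    labels = [float(x) for x in fields[1:-1]]
    return index, labels, fields[-1]


if __name__ == '__main__':
    # An integer label of 1 is written as 1.000000 by the '%f' format.
    row = format_lst_line(0, [1], 'cat/001.jpg')
    print(repr(row))  # '0\t1.000000\tcat/001.jpg\n'
    print(parse_lst_line(row))
```

   So a hand-written file that follows the script's own output would contain `0<TAB>1.000000<TAB>cat/001.jpg`, whereas the doc page example shows integer labels in the same column.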
   
   
