kjchalup opened a new issue #11884: im2rec documentation is lacking / buggy
URL: https://github.com/apache/incubator-mxnet/issues/11884

[im2rec.py](https://github.com/apache/incubator-mxnet/blob/master/tools/im2rec.py) doesn't have docstrings. The online doc that describes the RecordIO format ([here](https://mxnet.incubator.apache.org/architecture/note_data_loading.html)) doesn't describe how to use the script. [Another page](https://mxnet.incubator.apache.org/faq/recordio.html) does, but it could really use a full-fledged real-world example, and should either be unified with or at least linked to from the other doc page.

In addition, the im2rec.py script encodes class labels as floating point numbers:

```
def write_list(path_out, image_list):
    with open(path_out, 'w') as fout:
        for i, item in enumerate(image_list):
            line = '%d\t' % item[0]
            for j in item[2:]:
                line += '%f\t' % j  ### <---- THIS IS THE FLOATING POINT LABEL
            line += '%s\n' % item[1]
            fout.write(line)
```

But the [doc page](https://mxnet.incubator.apache.org/faq/recordio.html) has integer labels in the example. When trying to make my own test .lst file by hand I used integer labels which in fact didn't work with mx.io.ImageRecordIter. It'd be great to get a clarification on the exact required .lst file format.
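For reference, here is a minimal sketch of the line layout that the quoted `write_list` produces: an integer index, one or more labels written with `%f`, and the image path, all tab-separated. The helper names `format_lst_line` and `parse_lst_line` and the sample paths are hypothetical and are not part of im2rec.py. The sketch only mirrors what the script writes; it does not settle which formats `mx.io.ImageRecordIter` accepts.

```python
def format_lst_line(index, labels, rel_path):
    """Build one .lst line the same way the quoted write_list does.

    index    -- integer image index (item[0] in the quoted code)
    labels   -- iterable of numeric labels (item[2:] in the quoted code)
    rel_path -- image path (item[1] in the quoted code)
    """
    line = '%d\t' % index
    for label in labels:
        # Labels are formatted with %f, so 1 is written as 1.000000.
        line += '%f\t' % float(label)
    line += '%s\n' % rel_path
    return line


def parse_lst_line(line):
    """Split a tab-separated .lst line back into (index, labels, path)."""
    parts = line.rstrip('\n').split('\t')
    return int(parts[0]), [float(x) for x in parts[1:-1]], parts[-1]


# Hypothetical entries, used only for illustration.
entries = [
    (0, [0], 'cat/img_0001.jpg'),
    (1, [1], 'dog/img_0002.jpg'),
]
lines = [format_lst_line(i, labels, path) for i, labels, path in entries]
```

Comparing a hand-written .lst line against the output of `format_lst_line` is one way to check whether the file differs from what the script would have generated, for example in label formatting or in the use of tabs rather than spaces.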
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

With regards,
Apache Git Services
